XML PARSE supports XML documents in national data items, in alphanumeric data items, and in IFS files encoded in UCS-2 or single-byte CCSIDs. Documents in national data items must be encoded in the Unicode UCS-2 CCSID specified by the National CCSID compiler option or the NTLCCSID PROCESS option. Documents in alphanumeric data items must be encoded in one of the explicitly supported single-byte EBCDIC CCSIDs listed in Table 1 (Supported EBCDIC CCSIDs for XML documents) or one of the ASCII CCSIDs listed in Table 2 (Supported ASCII CCSIDs for XML documents).

Table 1. Supported EBCDIC CCSIDs for XML documents

| CCSID | Description |
|---|---|
| 1140, 37 | USA, Canada, etc.: Euro Country Extended Code Page (ECECP), Country Extended Code Page (CECP) |
| 1141, 273 | Austria, Germany ECECP, CECP |
| 1142, 277 | Denmark, Norway ECECP, CECP |
| 1143, 278 | Finland, Sweden ECECP, CECP |
| 1144, 280 | Italy ECECP, CECP |
| 1145, 284 | Spain, Latin America (Spanish) ECECP, CECP |
| 1146, 285 | UK ECECP, CECP |
| 1147, 297 | France ECECP, CECP |
| 1148, 500 | International ECECP, CECP |
| 1149, 871 | Iceland ECECP, CECP |
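
Each row of Table 1 pairs an ECECP with its CECP counterpart; the euro variants carry the euro sign in place of the international currency sign. The following is a minimal sketch of that difference for the 1140/37 pair. It is written in Python rather than ILE COBOL, and it assumes that Python's `cp1140` and `cp037` codecs are acceptable stand-ins for CCSIDs 1140 and 37. The helper name `differing_code_points` is hypothetical.

```python
# Illustration only: Python's cp037 and cp1140 codecs stand in for
# CCSIDs 37 and 1140. This is not ILE COBOL code.
CECP = "cp037"    # Country Extended Code Page (CCSID 37)
ECECP = "cp1140"  # Euro Country Extended Code Page (CCSID 1140)


def differing_code_points(codec_a, codec_b):
    """Return the byte values that the two codecs decode to different characters."""
    return [
        value
        for value in range(256)
        if bytes([value]).decode(codec_a) != bytes([value]).decode(codec_b)
    ]


# Print each differing position with the character each codec assigns to it.
for value in differing_code_points(CECP, ECECP):
    raw = bytes([value])
    print(f"0x{value:02X}: {raw.decode(CECP)!r} vs {raw.decode(ECECP)!r}")
```

A document that never uses the currency or euro sign is byte-for-byte the same under either CCSID of such a pair, which is why the pairs are listed together.
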

Table 2. Supported ASCII CCSIDs for XML documents

| CCSID | Description |
|---|---|
| 813 | ISO 8859-7 Greek / Latin |
| 819 | ISO 8859-1 Latin 1 / Open Systems |
| 920 | ISO 8859-9 Latin 5 (ECMA-128, Turkey TS-5881) |
When you parse ASCII XML documents, the document fragments passed to the processing procedure in special register XML-TEXT are encoded in ASCII. Because ILE COBOL operations such as move and comparison rely on EBCDIC encoding or on national characters for proper operation, you must convert the document fragments before using them. To do this when the XML document is in a COBOL program, first convert from the ASCII CCSID of the XML document to national characters using the MOVE statement. Then, if necessary, convert the result from national characters to EBCDIC using the MOVE statement.
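
The two-step conversion described above (ASCII to national characters, then national characters to EBCDIC) can be sketched outside COBOL to show why it is needed. This is an illustration in Python, not ILE COBOL: the codecs `iso8859_1`, `utf_16_be`, and `cp037` are assumed stand-ins for CCSID 819, the UCS-2 national encoding, and CCSID 37, and the function name `ascii_fragment_to_ebcdic` is hypothetical. In an ILE COBOL program the MOVE statement performs these conversions.

```python
# Illustration only: Python codecs stand in for the CCSID conversions
# that the MOVE statement performs in ILE COBOL.
ASCII_CODEC = "iso8859_1"     # stand-in for ASCII CCSID 819
NATIONAL_CODEC = "utf_16_be"  # stand-in for the UCS-2 national encoding
EBCDIC_CODEC = "cp037"        # stand-in for EBCDIC CCSID 37


def ascii_fragment_to_ebcdic(fragment):
    """Convert an ASCII fragment to EBCDIC by way of national characters.

    Returns a (national_bytes, ebcdic_bytes) tuple.
    """
    # Step 1: ASCII bytes -> national (Unicode) characters.
    text = fragment.decode(ASCII_CODEC)
    national_bytes = text.encode(NATIONAL_CODEC)
    # Step 2: national characters -> EBCDIC bytes.
    ebcdic_bytes = national_bytes.decode(NATIONAL_CODEC).encode(EBCDIC_CODEC)
    return national_bytes, ebcdic_bytes


fragment = b"<name>"  # a fragment as it might arrive in ASCII
national, ebcdic = ascii_fragment_to_ebcdic(fragment)
print("ASCII   :", fragment.hex())
print("national:", national.hex())
print("EBCDIC  :", ebcdic.hex())
```

The same characters have different byte values in each encoding, so comparing an unconverted ASCII fragment with an EBCDIC literal would not match even when the text is identical.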
XML documents in a COBOL program that are encoded in other CCSIDs can be parsed by first converting them to national characters using the MOVE statement. The individual pieces of document text passed to the processing procedure in special register XML-NTEXT can then be converted back to the original CCSID, as necessary, using the MOVE statement.
See the ILE COBOL Programmer's Guide for details on specifying the document encoding and how the parser determines encoding.