XML documents must be encoded with one of the supported code pages described below.
XML documents generated or parsed in national data items must be encoded with Unicode UTF-16, CCSID 1200.
For XML GENERATE statements, documents generated in alphanumeric data items must be encoded with Unicode UTF-8, CCSID 1208, or one of the single-byte EBCDIC code pages shown in the table below. You can code any CCSID shown in the table in the ENCODING phrase of the XML GENERATE statement.
For XML PARSE statements, documents in alphanumeric data items must be encoded with:
You can code any supported CCSID, as described above for XML PARSE, in the ENCODING phrase of the XML PARSE statement.
You can code any CCSID shown in this table, except 1208, in the CODEPAGE compiler option for use with XML GENERATE and XML PARSE.
| CCSID | Description |
|---|---|
| 1208 | Unicode UTF-81 |
| 1047 | Latin 1 / Open Systems |
| 1140, 37 | USA, Canada, . . . Euro Country Extended Code Page (ECECP), Country Extended Code Page (CECP) |
| 1141, 273 | Austria, Germany ECECP, CECP |
| 1142, 277 | Denmark, Norway ECECP, CECP |
| 1143, 278 | Finland, Sweden ECECP, CECP |
| 1144, 280 | Italy ECECP, CECP |
| 1145, 284 | Spain, Latin America (Spanish) ECECP, CECP |
| 1146, 285 | UK ECECP, CECP |
| 1147, 297 | France ECECP, CECP |
| 1148, 500 | International ECECP, CECP |
| 1149, 871 | Iceland ECECP, CECP |
|
|
To parse XML documents that are encoded in unsupported code pages, first convert the documents to national character data (Unicode UTF-16) by using the NATIONAL-OF intrinsic function. You can convert the individual pieces of document text that are passed to the processing procedure in special register XML-NTEXT back to the original code page by using the DISPLAY-OF intrinsic function.
related references
CODEPAGE