XML Character Encoding Set
HL assumes all XML documents use the "ISO-8859-1" character set (also called Latin-1), with a number of exceptions specific to the Windows platform. These exceptions occupy codes 128 through 159 and are listed in the table below.
Code  Character      Code  Character
128   undefined      144   undefined
129   undefined      145   ‘
130   ‚              146   ’
131   ƒ              147   “
132   „              148   ”
133   …              149   •
134   †              150   –
135   ‡              151   —
136   ˆ              152   ˜
137   ‰              153   ™
138   Š              154   š
139   ‹              155   ›
140   Œ              156   œ
141   undefined      157   undefined
142   undefined      158   undefined
143   undefined      159   Ÿ
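The mapping in the table above can be sketched as a small decoder. This is only an illustration, not HL's actual implementation: the names HL_EXCEPTIONS and decode_hl are hypothetical, and substituting U+FFFD for the undefined codes is an assumption, since the table does not say how HL treats them.

```python
# Exceptions to ISO-8859-1 in the 128-159 range, copied from the table above.
# Codes absent from this dict are the ones the table lists as undefined.
HL_EXCEPTIONS = {
    130: "\u201a", 131: "\u0192", 132: "\u201e", 133: "\u2026",
    134: "\u2020", 135: "\u2021", 136: "\u02c6", 137: "\u2030",
    138: "\u0160", 139: "\u2039", 140: "\u0152",
    145: "\u2018", 146: "\u2019", 147: "\u201c", 148: "\u201d",
    149: "\u2022", 150: "\u2013", 151: "\u2014", 152: "\u02dc",
    153: "\u2122", 154: "\u0161", 155: "\u203a", 156: "\u0153",
    159: "\u0178",
}


def decode_hl(data: bytes, undefined: str = "\ufffd") -> str:
    """Decode bytes as ISO-8859-1 plus the Windows exceptions for 128-159.

    Hypothetical helper: replacing undefined codes with U+FFFD is an
    assumption made for this sketch, not documented HL behavior.
    """
    out = []
    for b in data:
        if 128 <= b <= 159:
            out.append(HL_EXCEPTIONS.get(b, undefined))
        else:
            # ISO-8859-1 maps every byte to the code point of the same value.
            out.append(chr(b))
    return "".join(out)
```

For example, decode_hl(b"\x93Hi\x94") yields the text Hi wrapped in curly double quotes, while bytes outside the 128-159 range decode exactly as they would under plain ISO-8859-1.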
The XML declaration at the top of every XML file should specify an encoding of "ISO-8859-1" for completeness. If no encoding is given, HL assumes ISO-8859-1. An example is given below:
<?xml version="1.0" encoding="ISO-8859-1" ?>
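As a rough sketch of the fallback rule described above, the following reads the encoding from a declaration like this one and falls back to ISO-8859-1 when none is given. The function name declared_encoding and the regular expression are assumptions made for illustration; they do not describe how HL itself parses the declaration.

```python
import re

# Matches an XML declaration at the very start of the data and captures
# the value of its encoding attribute, if present.
_DECL = re.compile(
    rb'^<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def declared_encoding(data: bytes, default: str = "ISO-8859-1") -> str:
    """Return the declared encoding, or the default when none is declared."""
    m = _DECL.match(data)
    return m.group(1).decode("ascii") if m else default
```

A file beginning with <?xml version="1.0"?> and no encoding attribute would therefore be treated as ISO-8859-1 under this sketch.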
NOTE! There is an unofficial XML encoding named "Windows-1252" that accurately reflects the Windows ANSI character set and is often used. However, because of its unofficial nature, various XML parsers do not recognize it. In the interest of maximum compatibility, HL uses the modified Latin-1 set described above instead.