XML Character Encoding Set

From HLKitWiki
Jump to: navigation, search

Context: HL KitKit Reference 

HL assumes all XML documents utilize the "ISO-8859-1" character set (also called Latin-1), with a number of exceptions specific to the Windows platform. The list of exceptions is detailed in the table below.

128 undefined
129 undefined
130
131 ƒ
132
133
134
135
136 ˆ
137
138 Š
139
140 Œ
141 undefined
142 undefined
143 undefined
144 undefined
145
146
147
148
149
150
151
152 ˜
153
154 š
155
156 œ
157 undefined
158 undefined
159 Ÿ

The identity element at the top of all XML files should specify an encoding of "ISO-8859-1" for completeness. If no encoding is given, ISO-8859-1 is assumed. An example is given below:

<?xml version="1.0" encoding="ISO-8859-1" ?>

NOTE! There is an unofficial XML encoding named "Windows-1252" that properly reflects the Windows ANSI character set and is often used. However, various XML parsers do not recognize this encoding set due to its unofficial nature. In the interest of maximum compatibility, the modified Latin-1 set is used instead.