Encoding
Encode XML files as UTF-8. If needed, you can encode numeric character references (NCR) in hexadecimal format.
You can use hexadecimal NCRs but it is not required. If used, they must be in the form
&#xnnnn(n);, where nnnn is the Unicode code point value. E.g.:
x00C1 (U+00C1 Latin Capital A with Acute)
x1D4A9 (U+1D4A9 Mathematical Script Capital N)
For accented characters, use a single Unicode character where possible and not a plain character with combining diacritic, e.g. x1F00 (U+1F00 Greek small letter alpha with psili) and not x03B1x0313.
Capture fractions that do not have a unicode equivalent as MathML.
Capture musical notes that are individual characters as inline images (not all fonts display these unicode characters on certain browsers).