Oxford University Press Text Capture Instructions

 

Encoding

Encode XML files as UTF-8.

Hexadecimal numeric character references (NCRs) are permitted but not required. If used, they must take the form &#xnnnn(n);, where nnnn(n) is the hexadecimal Unicode code point value. For example:

&#x00C1; (U+00C1 Latin Capital Letter A with Acute)

&#x1D4A9; (U+1D4A9 Mathematical Script Capital N)
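
As a minimal sketch of the NCR form described above, the following Python snippet builds a hexadecimal NCR from a single character. The helper name to_hex_ncr is hypothetical and is not part of these instructions; it assumes the code point should be padded to at least four uppercase hexadecimal digits, as in the examples.

```python
def to_hex_ncr(char: str) -> str:
    """Return the hexadecimal NCR for one character (hypothetical helper)."""
    if len(char) != 1:
        raise ValueError("expected exactly one character")
    # Pad to at least four uppercase hex digits; longer code points keep all digits.
    return f"&#x{ord(char):04X};"


print(to_hex_ncr("\u00C1"))      # &#x00C1;
print(to_hex_ncr("\U0001D4A9"))  # &#x1D4A9;
```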

For accented characters, use a single precomposed Unicode character where one exists rather than a base character followed by a combining diacritic, e.g. &#x1F00; (U+1F00 Greek Small Letter Alpha with Psili) and not &#x03B1;&#x0313;.
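
One way to obtain the single-character form is Unicode canonical composition (NFC). The instructions do not name a normalization form, so treating NFC as the intended result is an assumption. The sketch below uses Python's standard unicodedata module to compose the example sequence.

```python
import unicodedata

# Alpha followed by COMBINING COMMA ABOVE (the psili), i.e. the discouraged form.
decomposed = "\u03B1\u0313"

# NFC composes the pair into a single precomposed character where one exists.
composed = unicodedata.normalize("NFC", decomposed)

print(len(decomposed), len(composed))  # 2 1
print(f"U+{ord(composed):04X}")        # U+1F00
```

Sequences with no precomposed equivalent are left unchanged by NFC, which matches the "where possible" wording of the rule.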

Capture fractions that have no Unicode equivalent as MathML.
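
The decision can be sketched as a lookup with a MathML fallback. Everything in this example is an assumption made for illustration: the helper name capture_fraction, the lookup table (which should be checked against the current Unicode charts), and the exact MathML markup, since these instructions do not show the required element or namespace conventions.

```python
# Unicode vulgar fraction characters keyed by (numerator, denominator).
# Assumption: verify this table against the Unicode charts before relying on it.
VULGAR_FRACTIONS = {
    (1, 4): "\u00BC", (1, 2): "\u00BD", (3, 4): "\u00BE",
    (1, 7): "\u2150", (1, 9): "\u2151", (1, 10): "\u2152",
    (1, 3): "\u2153", (2, 3): "\u2154",
    (1, 5): "\u2155", (2, 5): "\u2156", (3, 5): "\u2157", (4, 5): "\u2158",
    (1, 6): "\u2159", (5, 6): "\u215A",
    (1, 8): "\u215B", (3, 8): "\u215C", (5, 8): "\u215D", (7, 8): "\u215E",
}


def capture_fraction(num: int, den: int) -> str:
    """Return a Unicode fraction character if one exists, else MathML (hypothetical helper)."""
    char = VULGAR_FRACTIONS.get((num, den))
    if char is not None:
        return char
    # No single-character equivalent: fall back to a MathML fraction.
    return (
        '<math xmlns="http://www.w3.org/1998/Math/MathML">'
        f"<mfrac><mn>{num}</mn><mn>{den}</mn></mfrac>"
        "</math>"
    )


print(capture_fraction(1, 2))
print(capture_fraction(3, 7))
```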

Capture musical notes that are individual characters as inline images, because not all fonts and browsers display these Unicode characters reliably.

Release ID: 20261202
ID: OUP_Structured_Text_TCI_topic_2_5_1
Author: dunnm
Last changed: Wed, 09 Mar 2016
Modified by: buckmasm
Revision#: 2962