Oxford University Press Text Capture Instructions

 

OCL: Oxford Competition Law TCIs

Additional information for capturing the different document types in the Oxford Competition Law product (OCL) to their relevant DTDs, such as the OxLawML DTD and the OxChapML DTD.

The OUP generally provide source documents for OCL as PDF, MS Word, or desktop publishing application files.

The OCL data set is made up of three law textbooks (referred to below as ‘Faull & Nikpay (F&N), Roth & Rose (R&R) and Whish, after the editors of each work), a set of OxLawML materials (which form the basis of another book, Macnab), and some law reports.

The layout of the text may vary in terms of formatting used to distinguish headings, indentation used to distinguish sub-sections, etc. Do not preserve this variation. Apply the same structural markup no matter what visual cues are used in the source document to present the structure.

For some content types, OUP provide metadata in a headnote file:

Case Report
OUP provide headnotes as an MS Word document consisting of a table with labels and fields.
The Macnab materials
OUP provide a single MS Excel spreadsheet as a headnote; each row denotes a final xml deliverable, and each column denotes a potential metadata field. However, you generate the majority of the metadata from the original document.
Legal Textbooks
The required metadata is included in the text of the book and in the TCI instructions. These do not have headnotes.
Update Notes
OUP provide an MS Word document consisting of a table with labels and fields and the content of the note.

Capture the title and full text of the document, together with any introductory text specific to the document, the footnotes, the appendices and the schedules.

Where it is not clear what text to capture, ask your contact at OUP.

Release ID:
20261202
ID:
OCL_TCI_topic_1
Author:
hillmant
Last changed:
Tue, 26 Mar 2013
Modified by:
hillmant
Revision#:
603