Oxford University Press Text Capture Instructions

 

Contents lists

With the exception of some legal documents, you should capture the table of contents. Capture the table of contents inside one miscMatter element with attribute class="contents" within frontMatter.

The following typecodes are used in tables of contents.

FMCT:CONT

Table of Contents

CONT1

Content first-level list

CONT2

Content second-level list
CONT3 Content third-level list
CONT:AN Contents page annotation
FM_tocA Table of contents A head
FM_tocB Table of contents B head
FM_tocC Table of contents C head
FM_tocChapter Table of contents chapter
FM_tocContributor Table of contents contributor
FM_tocEndmatter Table of contents endmatter
FM_tocPart Table of contents part title
FM_tocPrelims Table of contents prelims
FMCT:BTOC Brief table of contents
//miscMatter[@class="contents"]//list[@class="contents"]

Capture the book contents in a miscMatter element with class attribute value ‘contents’. Capture the list itself in a list element with class attribute value of ‘contents’.

Capture the contents as three separate sub-lists; one for headings within frontMatter, one for headings in the main text and one for headings in the endMatter.

//item1[@role]/list2

Capture each sub-list using list2 within an item1 element. To indicate which section the headings are contained in, give the item1 element a role attribute with a value of the name of the top-level element ("frontMatter", "mainText", or "endMatter").

Capture titles, excluding enumerators using xref.

Capture authors using nameGrp.

Capture page numbers per the general instructions.

Use a single space to separate titles from page numbers. Do not capture typographic elements such as dotted lines.

In multi-authored books, the authors of each chapter are usually listed beneath the chapter title in the contents list. Capture the chapter authors using nameGrp within a separate p element.

Capture annotated text for contents lists in a separate p element with the itemN element.

As a check for the division of the list into front matter, main text, and end matter sections, the cross-references within each should point to targets within the corresponding element of the text, e.g. a cross-reference within item1 role="frontMatter" should have a target within the frontMatter element.

Release ID:
20261202
ID:
OUP_Structured_Text_TCI_topic_4_4
Author:
dunnm
Last changed:
Wed, 04 Jun 2025
Modified by:
buckmasm
Revision#:
4400