Oxford University Press Text Capture Instructions

 

Specific inline content within entries (span)

Some areas of information within the content must be identified using span tags. Use the role attribute to indicate the type of information.
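As a minimal sketch of the pattern, a span with a role attribute wraps exactly the inline content it identifies. The sentences below are invented for illustration; only the span element and the role values (measurement, money) come from these instructions.

```xml
<!-- Illustrative sentences only; the role values are taken from the span[@role] table -->
<p>The tower stands <span role="measurement">324 metres</span> tall.</p>
<p>Admission originally cost <span role="money">5 francs</span>.</p>
```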

WRK Work (inline)

ETY Etymologies

PRO Pronunciation

OCC Occupation

MEAS Measurement (general)

MON Measurement (currency)

CTX Context (used to distinguish headwords)

span[@role]

Below is a table listing all the possible types of information that should be identified throughout the content.

If any of the defined span content is found alongside the headword, capture it within the headwordInfo p section for that headword.
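A hedged sketch of this rule is shown below. It assumes that headwordInfo sits inside headwordGroup after the headword, which these instructions do not confirm; the pronunciation value is also invented for illustration. Check the headwordInfo section of the instructions for the actual placement.

```xml
<headwordGroup>
  <headword>ace</headword>
  <!-- Assumption: headwordInfo is placed inside headwordGroup, after the headword -->
  <headwordInfo>
    <!-- Span content found alongside the headword goes in the headwordInfo p -->
    <p><span role="pronunciation">/eɪs/</span></p>
  </headwordInfo>
</headwordGroup>
```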

| Content Type | role | Details |
| --- | --- | --- |
| Contextual Information | context | Usually appears directly after the headword group and qualifies the headword itself (e.g. Brit, Informal, US, UK, etc.). |
| Encyclopaedic Information | ency | See Encyclopaedic Information. |
| Headword/content Origin | etymology | Content describing where the headword or term came from (usually a country and/or region). |
| Example | example | An example of the headword used within a phrase. Capture the whole phrase in the span tags. |
| Headword Person Gender | gender | Only found alongside the headword. |
| Contributor list name/initials | emphasis | Used to identify the main name or word(s) within list items. Specific sections describe its use. |
| Inflections | inflection | Inflected forms of the headword. |
| Inflection Group | inflectionGroup | Contains inflections (span role="inflection") and other content relating to the inflections. |
| Headword Keyword | keyword | Used within headwords to identify the word that the headword should be ordered by when it is not the first word. |
| Measurement (currency) | money | Used within the main text of an article to cover measurements of currency indicated in the content. |
| Measurement (general) | measurement | Used within the main text of an article to cover general measurements indicated in the content. |
| Person's Nationality | nationality | Found within encyclopaedic information and sense content. Identifies the nationality of a person. Creates a span within a span. |
| Person's Occupation | occupation | Found within encyclopaedic information and sense content. Identifies the occupation of a person. Creates a span within a span. |
| Pronunciation | pronunciation | Usually associated with different languages or the headword. Content that defines how to pronounce a particular word (sometimes as phonetics). |
| Repeated Headword | repeatedForm | Only for when the headword is repeated within the sense content. |
| Sense-numbered link | senseNum | Cross-references to sense numbers. |
| Short Work | shortWork | A short work such as an article or short story. |
| Synonym of headword | synonym | Usually individually listed (comma-separated) synonyms of the headword found within sense content. |
| Technical Notes Group | tecg | A group of technical notes. |
| Technical Note | technical | Individual technical notes. Creates a span within a span. |
| Title (in the sense of a social rank or office) | title | E.g. "fifth Baron St John of Bletso" |
| Variant Group | varGroup | Contains variants. |
| Work | work | Identifies a work, such as a work of art, literature, or musical composition or recording. |
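Several roles in the table create a span within a span. The fragment below is a sketch of how such nesting might look, not a confirmed capture. The sample text is invented, and it is an assumption that nationality and occupation nest directly inside an ency span and that a label such as "plural" is the kind of "other content" held in an inflectionGroup.

```xml
<!-- Assumption: nationality and occupation are nested inside the encyclopaedic (ency) span -->
<p>
  <span role="ency">
    <span role="nationality">French</span>
    <span role="occupation">painter</span>
  </span>
</p>

<!-- inflectionGroup wraps the inflection span plus related content (here, an assumed label) -->
<p>
  <span role="inflectionGroup">plural <span role="inflection">aces</span></span>
</p>
```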

Context of headword

CTX Context of the headword

Used for defining contextual descriptions of a headword (giving information on the sense or type of headword, e.g. Cinderella (musical) and Cinderella (film)).

Example

media/771978_image39.png

Becomes:


<e doi="10.1093/acrefore/9780199975839.013.527" id="acrefore-9780199975839-e-527">
<headwordGroup><headword>ace</headword><partOfSpeech>noun</partOfSpeech>
</headwordGroup>
<section role="main"><textMatter>
<div1>
<p>
<enumerator>1</enumerator>
<span role="context">US</span> A dollar, or a one-dollar
bill. 1925-.</p>
</div1>
<div1>
<p>
<enumerator>2</enumerator>
<span role="example">aces</span>
<span role="context">US</span> Someone or something outstandingly good. 1931-.</p>
<p>
<displayText class="prosequote">
<p>
<sc>american speech</sc> That broad (female) is aces with me (1943).</p>
</displayText>
</p>
</div1>
<!--...--></textMatter>
</section>
</e>
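The example above shows context spans in sense content. For the headword-distinguishing case mentioned earlier (Cinderella (musical) and Cinderella (film)), a hedged sketch might look like the following. The placement of headwordInfo inside headwordGroup and the decision to keep the parentheses outside the span are both assumptions made for illustration.

```xml
<headwordGroup>
  <headword>Cinderella</headword>
  <!-- Assumption: the distinguishing context is captured in the headwordInfo p -->
  <headwordInfo>
    <p>(<span role="context">musical</span>)</p>
  </headwordInfo>
</headwordGroup>
```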
Release ID: 20261202
ID: OUP_A-Z_Reference_Works_OxEncyclML_TCI_topic_3_6
Author: dunnm
Last changed: Wed, 04 Jun 2025
Modified by: buckmasm
Revision#: 4400