Common Re-keying and OCR Errors
Particular care needs to be taken when re-keying content or capturing text using OCR.
In some typefaces, the digit "1" looks like a capital (serif) letter "i".
The lower-case letter "L" can also look like a digit "1" or a capital letter "i". Confusion is especially likely when "l." and "ll." appear next to digits as abbreviations for "line" and "lines" respectively.
Accents and Greek breathing marks are easily confused. If in doubt, seek advice from your OUP contact.
Do not capture Greek breathing marks as apostrophes.
In Greek, υ (upsilon) and ν (nu) are often confused.
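Some of the confusions above can be flagged automatically for manual review. The following is a minimal sketch, not part of any OUP tooling: the pattern names and the `flag_suspects` function are hypothetical, and the patterns assume that "l." or "ll." is separated from the line number by a space and that a misread breathing mark appears as an apostrophe directly before a Greek letter. It only flags candidates; it cannot decide whether a capture is actually wrong.

```python
import re

# Hypothetical patterns for illustration only.
# "1." / "11." / "I." / "II." followed by whitespace and a digit, where the
# abbreviation "l." or "ll." (line, lines) may have been intended.
LINE_ABBREV = re.compile(r"\b(?:1{1,2}|I{1,2})\.\s+(?=\d)")

# An apostrophe-like character directly before a Greek letter
# (Greek and Coptic block, Greek Extended block).
GREEK_APOSTROPHE = re.compile(r"['\u2019\u02BC](?=[\u0370-\u03FF\u1F00-\u1FFF])")


def flag_suspects(text):
    """Return sorted (offset, matched_text, reason) tuples for manual review."""
    hits = []
    for m in LINE_ABBREV.finditer(text):
        hits.append((m.start(), m.group().strip(),
                     "possible 'l.' or 'll.' captured as a digit or capital I"))
    for m in GREEK_APOSTROPHE.finditer(text):
        hits.append((m.start(), m.group(),
                     "apostrophe before a Greek letter; possible breathing mark"))
    return sorted(hits)


if __name__ == "__main__":
    for offset, found, reason in flag_suspects("see 1. 23 and ll. 40-45"):
        print(offset, repr(found), reason)
```

Requiring whitespace after the full stop keeps decimals such as "1.5" from being flagged. Apostrophes followed by a space, as in Greek elision, are left alone.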
Common OCR Errors
When using optical character recognition (OCR), check the output for common errors.
The following character errors are encountered most often:
- s becomes f
- h becomes li
- h becomes b
- e becomes c
- sh becomes m
- ni or in becomes m
- rn becomes m
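The substitutions listed above can be reversed mechanically to suggest corrections for words that a spell-check rejects. The sketch below is illustrative only: `CONFUSIONS`, `candidates`, `suggest`, and the `lexicon` argument (any set of known-good words) are hypothetical names, and the approach assumes one misreading per word. Suggestions still need to be checked against the source page.

```python
# OCR confusions from the list above, as (correct, misread) pairs.
CONFUSIONS = [
    ("s", "f"),
    ("h", "li"),
    ("h", "b"),
    ("e", "c"),
    ("sh", "m"),
    ("ni", "m"),
    ("in", "m"),
    ("rn", "m"),
]


def candidates(word):
    """Yield each distinct word made by undoing one confusion at one position."""
    seen = set()
    for correct, misread in CONFUSIONS:
        start = word.find(misread)
        while start != -1:
            fixed = word[:start] + correct + word[start + len(misread):]
            if fixed not in seen:
                seen.add(fixed)
                yield fixed
            start = word.find(misread, start + 1)


def suggest(word, lexicon):
    """Return lexicon words reachable from an unknown word by one reversal."""
    if word in lexicon:
        return []  # already a known word; nothing to suggest
    return sorted(c for c in candidates(word) if c in lexicon)


if __name__ == "__main__":
    lexicon = {"the", "some", "burnt"}  # stand-in for a real word list
    for token in ["tlie", "fome", "bumt"]:
        print(token, suggest(token, lexicon))
```

A word that is valid but wrong in context (for example "modem" captured for "modern") passes the lexicon check and is not flagged, so this kind of check supplements proofreading rather than replacing it.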