Albin O. Kuhn Library & Gallery - Staff Wiki
OCR for Compound Object - Street Life in London
Remember: all of the pages are images. So, the text of the book is saved as images. OCR only the text pages (not the photographs -- their captions are already transcribed).
Open the metadata file: I:\SpecColl\CDM_Metadata\Street Life\Street_Life_MD.xlsx
Open the program ABBYY FineReader (Start Menu>All Programs>ABBYY FineReader 9.0>ABBYY FineReader 9.0 Professional Edition)
In the ABBYY program, click on the Icon to "Convert Photo to Microsoft Word"
Open the next image to convert. Get these images from the external hard drive: F:\Street_Life\JPEGs
A Word document will open.
Check to make sure that the OCR has captured the text correctly. Note: it will be hard for the program to pick up different fonts. Refer to the source image to ensure that the text is correct. Make any necessary corrections. Note: it is sometimes easier to make the corrections in the Exel file after pasting.
Select all (Ctrl A)
Copy all (Ctrl C)
Go to the metadata Excel file
Go to the "Transcript" field for the image you just OCR'd. Click on the top function bar, paste the text (Ctrl P).
Save the metadata file.
Close the Word file.
Begin the process in ABBYY again with the next page.
Albin O. Kuhn Library & Gallery . University of Maryland, Baltimore County . 1000 Hilltop Circle . Baltimore MD 21250
(410) 455-2232. Questions and comments to: Web Services Librarian