OCR Tips: Difference between revisions

391 bytes added, 2 October 2012
no edit summary
Line 1: Line 1:
== FineReader tips  ==
= FineReader tips  =


SG: Consider image size/color: 
==== '''What works best:'''  ====


What works best:<br>  
'''Size, Color/Grayscale, and Resolution:'''<br>  


Size, Color/Grayscale, and Resolution:<br>
Recommended Image Resolution: 300 dpi for typical texts (printed in fonts of size 10pt or larger), 400–600 dpi for texts printed in smaller fonts (9pt or smaller). For best OCR results vertical and horizontal resolutions must be the same. See User's Guide for additional information.<br>  
 
Recommended Image Resolution: 300 dpi for typical texts (printed in fonts of size 10pt or larger), 400–600 dpi for texts printed in smaller fonts (9pt or smaller). For best OCR results vertical and horizontal resolutions must be the same. See [http://finereader.abbyy.com/guide/ User's Guide] for additional information.<br>  
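
The resolution guideline above can be sketched as a small check. This is only an illustration, not part of FineReader: the function names are hypothetical, and treating any font size below 10pt as "small" is an assumption, since the guideline does not say what to do between 9pt and 10pt.

```python
from typing import List, Tuple


def recommended_dpi(font_size_pt: float) -> Tuple[int, int]:
    """Return the (min, max) recommended scan resolution in dpi.

    Assumption: any font size below 10pt is treated as "small".
    """
    if font_size_pt >= 10:
        return (300, 300)   # typical texts
    return (400, 600)       # texts printed in smaller fonts


def check_scan(x_dpi: int, y_dpi: int, font_size_pt: float) -> List[str]:
    """List the ways a scan departs from the guideline (empty if none)."""
    problems = []
    if x_dpi != y_dpi:
        problems.append("horizontal and vertical resolutions differ")
    low, _high = recommended_dpi(font_size_pt)
    if min(x_dpi, y_dpi) < low:
        problems.append(f"resolution below the recommended {low} dpi")
    return problems


if __name__ == "__main__":
    # Hypothetical scan: 300 x 200 dpi image of a label set in 8pt type.
    for problem in check_scan(300, 200, 8):
        print(problem)
```
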


*Full-size, color JPEG images of herbarium specimens are approximately 7-15 MB in size and take about 2 minutes each to process.  
*Full-size, grayscale JPEG images of full herbarium sheets are about 1 MB in size and take about 1 minute each to process.  
*JPEG images of primary collection label only are about 300-600 KB in size and take seconds to process each.
*Cropped JPEG images containing primary collection label only are about 300-600 KB in size and take 6-10 seconds to process each.<br>
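
The per-image timings listed above can be turned into a rough batch estimate. The sketch below is an assumption-laden illustration: the category names and the function are hypothetical, and the figures are simply the approximate timings quoted in the list, which will vary with hardware and image content.

```python
# Approximate per-image processing time in seconds (low, high),
# taken from the figures quoted in the list above.
SECONDS_PER_IMAGE = {
    "color_full": (120, 120),      # full-size color sheet, about 2 minutes
    "grayscale_full": (60, 60),    # full-size grayscale sheet, about 1 minute
    "label_crop": (6, 10),         # cropped primary label, 6-10 seconds
}


def estimate_batch_minutes(kind: str, n_images: int):
    """Return a (low, high) estimate in minutes for OCR of n_images."""
    low, high = SECONDS_PER_IMAGE[kind]
    return (low * n_images / 60, high * n_images / 60)


if __name__ == "__main__":
    # Hypothetical batch of 600 images, for comparison across image types.
    for kind in SECONDS_PER_IMAGE:
        low, high = estimate_batch_minutes(kind, 600)
        print(f"{kind}: {low:.0f}-{high:.0f} minutes")
```

For a large batch, the difference between full-size color images and cropped labels is the difference between many hours and roughly an hour, which is the practical reason to consider cropping or converting to grayscale first.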
 
Pattern training:&nbsp;If using this tool, be sure to train the tool on an image that is the same resolution as the other images you wish to OCR.


<br>  
==== '''What to look out for:''' ====


<br>
*Setting the resolution too high (over 600 dpi) increases the recognition time. Increasing the resolution does not yield substantially improved recognition results. Setting an extremely low resolution (less than 150 dpi) adversely affects OCR quality. See [http://finereader.abbyy.com/guide/ User's Guide] for additional information.


What to look out for:<br>
*'''Pattern training:''' If using this tool, be sure to train the tool on an image that is the same resolution as the other images you wish to OCR. This tool can be useful when running the software on many labels with the same format/fonts.


*Setting the resolution too high (over 600 dpi) increases the recognition time. Increasing the resolution does not yield substantially improved recognition results. Setting an extremely low resolution (less than 150 dpi) adversely affects OCR quality. See [http://finereader.abbyy.com/guide/ User's Guide] for additional information.<br>
*'''Hot Folder:''' Using the Hot Folder allows the OCR software to run on batches of images. However, it does not scan barcodes in an image, only human-readable text. When running the software on individual images (i.e., not using the Hot Folder), one can choose to scan the barcode as well as detect human-readable text.


== Recognition Server  ==