Technical Issues: Difference between revisions

From iDigBio
Jump to navigation Jump to search
(Created page with "== Resolution == Image resolution needed for OCR return is a tricky subject. The general rule that one commonly hears is that the preferred resolution of 300 dpi (dots per inch)...")
 
Line 1: Line 1:
== Resolution ==
== Resolution ==


Image resolution needed for OCR return is a tricky subject. The general rule that one commonly hears is that the preferred resolution of 300 dpi (dots per inch) is needed for good OCR results. However, this generally applies to images of text obtained from a scanner and dpi values can be misleading when a camera is used to image text. DPI is only relevant if the document size is 1:1 and with a camera, this ratio can vary depending on camera placement and distance between camera and text document. Font size is another factor that can lead to poor OCR output. For instance, a 16pt font at 200 dpi will return better OCR results than an 8pt font at 300 dpi. Therefore, a better measure of image resolution for OCR purposes is obtained by counting the x-height of the text. X-height is the pixel height of a lower case x within the document. According to [http://en.wikipedia.org/wiki/Tesseract Tesseract], an x-height of 20 pixels or better is preferred. See [http://code.google.com/p/tesseract-ocr/wiki/FAQ Tesseract's FAQ] for more information concerning this issue.  
The image resolution needed for OCR return is a tricky subject. The general rule commonly heard is that the preferred resolution of 300 dpi (dots per inch) is necessary for good OCR return. However, this generally applies to images obtained from a scanner and dpi values can be misleading when a camera is used to image text. Dpi is only relevant if the document ratio is 1:1 and with a camera, this can vary depending on camera placement and distance between camera and text document. Font size is another factor that can lead to poor OCR output. For instance, a 16pt font at 200 dpi will return better OCR results than an 8pt font at 300 dpi. Therefore, a better measure of image resolution for OCR purposes is obtained by counting the x-height of the text. X-height is the pixel height of a lower case x within the document. According to [http://en.wikipedia.org/wiki/Tesseract Tesseract], an x-height of 20 pixels or better is preferred. See [http://code.google.com/p/tesseract-ocr/wiki/FAQ Tesseract's FAQ] for more information concerning this issue.  


----
----

Revision as of 11:20, 8 August 2012

Resolution

The image resolution needed for OCR return is a tricky subject. The general rule commonly heard is that the preferred resolution of 300 dpi (dots per inch) is necessary for good OCR return. However, this generally applies to images obtained from a scanner and dpi values can be misleading when a camera is used to image text. Dpi is only relevant if the document ratio is 1:1 and with a camera, this can vary depending on camera placement and distance between camera and text document. Font size is another factor that can lead to poor OCR output. For instance, a 16pt font at 200 dpi will return better OCR results than an 8pt font at 300 dpi. Therefore, a better measure of image resolution for OCR purposes is obtained by counting the x-height of the text. X-height is the pixel height of a lower case x within the document. According to Tesseract, an x-height of 20 pixels or better is preferred. See Tesseract's FAQ for more information concerning this issue.