OCR SaaS

From iDigBio
Revision as of 15:12, 31 January 2013 by Mikegiddens (talk | contribs)

Needs:
- Accept an incoming request and return a refId for the job
- Process the image with the available OCR engines
- Support ZBar for barcode detection
- Support language detection
- Support calling endpoints
- Support round-robin features to share the service evenly
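The round-robin requirement could be sketched as follows. This is only an illustration: the page does not say what is being rotated (OCR engines, service instances, or something else), so the `createRoundRobin` helper and the worker names below are hypothetical.

```javascript
// Hypothetical helper: cycles through a fixed list of workers so that
// each one receives jobs in turn. All names here are illustrative.
function createRoundRobin(workers) {
  if (!Array.isArray(workers) || workers.length === 0) {
    throw new Error("at least one worker is required");
  }
  let next = 0; // index of the worker that gets the next job
  return function pick() {
    const worker = workers[next];
    next = (next + 1) % workers.length; // wrap around at the end of the list
    return worker;
  };
}

// Usage: three assumed workers, five incoming jobs.
const pickWorker = createRoundRobin(["workerA", "workerB", "workerC"]);
const assigned = [];
for (let job = 0; job < 5; job++) {
  assigned.push(pickWorker());
}
console.log(assigned.join(", "));
```

A plain counter keeps the rotation stateless apart from one index. It does not account for workers that are busy or down, which a real deployment would probably need to handle.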

Future needs:
- Socket.io support
- Cleaning techniques
- Techniques to determine whether the text is handwritten or typed
- Support ImageMagick adjustment techniques for better results
- Support custom training files
- Support ABBYY and OmniPage
- Support Evernote
- Support OpenCV image detection features

Current Plans:
- Use NodeJS along with some

Possible Routes:
- imageAdd
- imageRemove
- imageStatus
- queueInfo
- ping


imageAdd( uri, [id], [endpoint] ) returns:

 { 
  success: bool
, refId: uuid 
 }
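A minimal sketch of how imageAdd might produce this response in Node.js. Several details are assumptions, not part of this page: the in-memory `jobs` map, the reading of `id` as a caller-supplied label and `endpoint` as a URL to call when processing finishes, the `status` field, and the shape of the failure response.

```javascript
const crypto = require("node:crypto");

// Assumed in-memory job table keyed by refId; a real service would
// likely use a persistent queue instead.
const jobs = new Map();

// Sketch of imageAdd( uri, [id], [endpoint] ).
// `id` and `endpoint` are optional; their meaning is assumed here.
function imageAdd(uri, id, endpoint) {
  if (typeof uri !== "string" || uri.length === 0) {
    return { success: false }; // assumed failure shape: no refId
  }
  const refId = crypto.randomUUID(); // version 4 UUID string
  jobs.set(refId, {
    uri,
    id: id ?? null,
    endpoint: endpoint ?? null,
    status: "queued", // hypothetical status value
  });
  return { success: true, refId };
}

// Usage with a placeholder image URI.
const result = imageAdd("http://example.org/specimen.jpg");
console.log(result.success, jobs.has(result.refId));
```

Returning the refId immediately, before any OCR runs, matches the first item under Needs: the caller gets a handle to the job and can check on it later.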

imageRemove( refId ) returns:

 {
  success: bool
 }
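imageRemove could be sketched in the same spirit. The `jobs` map and the refId value below are made-up stand-ins, and reporting `success: false` for an unknown refId is an assumption; the page does not say how that case should behave.

```javascript
// Stand-in job table; the refId is an invented example value.
const jobs = new Map([
  ["3f2b8c1e-0000-4000-8000-000000000001", { uri: "http://example.org/specimen.jpg" }],
]);

// Sketch of imageRemove( refId ): success is true only if a job
// with that refId existed and was removed.
function imageRemove(refId) {
  // Map.prototype.delete returns true when the key was present.
  return { success: jobs.delete(refId) };
}

// Usage: the second call fails because the job is already gone.
const first = imageRemove("3f2b8c1e-0000-4000-8000-000000000001");
const second = imageRemove("3f2b8c1e-0000-4000-8000-000000000001");
console.log(first.success, second.success);
```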