mimeograph
CoffeeScript lib for PDF OCR and text extraction.
Extend HTMLDocument and Element with hOCR query and property helpers
Extend jsdom with hOCR query and property helpers
A simple wrapper for the Tesseract OCR package
Convert hOCR to JSON