1.1.1 • Published 6 years ago

document-phrase-occurrence-parser v1.1.1

Weekly downloads: 2
License: MIT
Repository: github
Last release: 6 years ago

Document Phrase Occurrence Parser

Finds the number of occurrences of one or more phrases in a directory of .doc, .docx, and .pdf files.

Installation

npm install --global document-phrase-occurrence-parser

Usage

dpop --phrases "laser, shirt, the"
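
To illustrate what the command above is asked to do, here is a minimal sketch of splitting a comma-separated phrase list and counting each phrase in a block of already extracted text. This is not the package's actual implementation: the function names (parsePhrases, countOccurrences, countPhrases) are hypothetical, and case-insensitive substring matching is an assumption that the tool itself may not share.

```javascript
// Hypothetical helpers for illustration; not taken from the package source.

// Split a comma-separated --phrases value into trimmed, non-empty phrases.
function parsePhrases(raw) {
  return raw
    .split(',')
    .map((phrase) => phrase.trim())
    .filter((phrase) => phrase.length > 0);
}

// Count non-overlapping occurrences of `phrase` in `text`.
// Assumption: matching is case-insensitive and substring-based.
function countOccurrences(text, phrase) {
  const haystack = text.toLowerCase();
  const needle = phrase.toLowerCase();
  if (needle.length === 0) return 0;
  let count = 0;
  let index = haystack.indexOf(needle);
  while (index !== -1) {
    count += 1;
    index = haystack.indexOf(needle, index + needle.length);
  }
  return count;
}

// Build a { phrase: count } map for one document's extracted text.
function countPhrases(text, rawPhrases) {
  const counts = {};
  for (const phrase of parsePhrases(rawPhrases)) {
    counts[phrase] = countOccurrences(text, phrase);
  }
  return counts;
}

const sample = 'The laser cut the shirt. Then the laser stopped.';
console.log(countPhrases(sample, 'laser, shirt, the'));
```

Because this sketch matches substrings, "the" is also counted inside "Then". Matching on word boundaries instead would need a regular expression or a tokenizer.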

Extraction Requirements

Textract is used to extract text from the files. Depending on the file types you want to parse and your OS, you may need to install external dependencies.

  • PDF extraction requires pdftotext to be installed.
  • DOC extraction requires antiword to be installed, except on OSX, where textutil (installed by default) is used instead.
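
Before running the tool, it can help to confirm that the external programs listed above are available. The following is a minimal sketch for Unix-like systems, assuming the tools are expected on PATH. The helper names (findOnPath, requiredTools) are hypothetical and are not part of this package or of Textract.

```javascript
const fs = require('fs');
const path = require('path');

// Return the full path of `command` if an executable file with that name
// exists in one of the PATH directories, otherwise null.
// Assumption: Unix-like system (no .exe suffix handling for Windows).
function findOnPath(command, pathEnv = process.env.PATH || '') {
  for (const dir of pathEnv.split(path.delimiter)) {
    if (!dir) continue;
    const candidate = path.join(dir, command);
    try {
      fs.accessSync(candidate, fs.constants.X_OK);
      if (fs.statSync(candidate).isFile()) return candidate;
    } catch (err) {
      // Missing or not executable in this directory; try the next one.
    }
  }
  return null;
}

// Tools named in the requirements above: textutil replaces antiword on macOS.
function requiredTools(platform = process.platform) {
  return platform === 'darwin'
    ? ['pdftotext', 'textutil']
    : ['pdftotext', 'antiword'];
}

for (const tool of requiredTools()) {
  const location = findOnPath(tool);
  console.log(`${tool}: ${location || 'not found on PATH'}`);
}
```

A tool reported as "not found on PATH" only means this check could not locate it; how Textract itself locates these programs is not covered here.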