DEPRECATED.
THIS LIBRARY MIGHT STILL WORK FOR CERTAIN USE CASES BUT ISN'T BEING ACTIVELY DEVELOPED OR MAINTAINED ANYMORE.
Purell: The Hand Sanitizer for HTML
Purell extracts clean HTML from any kind of flat file format and prepares it for conversion into a raw Superbook. It is a pure CLI utility that turns the low-quality markup found in ordinary files into high-quality, Superbook-compatible Markdown that follows Bubblin-approved standards.
How to use
Purell is designed primarily to be used at the root of a Bookiza app. However, it can also serve as a transition tool between older file formats and the Superbook format, like so:
- MS Word ⭌ Extract ugly HTML ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
- ePub ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
- PDF ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
- Webpage (Scroll) ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
Vocabulary
$ pure --help
$ pure fetch <url> // Fetches original.html from the source URL
$ pure defile <path to file> // Extracts original.html from the source file
$ pure sanitize // Produces sanitized, Markdown-compatible HTML
Paginating sanitized.html into a Superbook is the responsibility of h2s.
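As a rough sketch of how the commands above might be chained, the helper below picks `pure fetch` for a URL and `pure defile` for a local file, then runs `pure sanitize`. The function name `run_purell` and the `DRY_RUN` switch are hypothetical and not part of Purell. The sketch assumes `pure` is on your `PATH` and that you run it from the root of a Bookiza app.

```shell
#!/bin/sh
# Hypothetical wrapper around the documented commands; not shipped with Purell.
# Set DRY_RUN=1 to print the commands instead of executing them.
run_purell() {
  source="$1"
  case "$source" in
    http://*|https://*) step="fetch" ;;   # remote page: pure fetch <url>
    *)                  step="defile" ;;  # local file:  pure defile <path to file>
  esac
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "pure $step $source"
    echo "pure sanitize"
  else
    # Only sanitize if the extraction step succeeded.
    pure "$step" "$source" && pure sanitize
  fi
}
```

For example, `DRY_RUN=1 run_purell ./manuscript.docx` prints the two commands that would run without invoking `pure`, which is a cheap way to check the routing before touching real files.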
License
Blue Oak Model License 1.0.0