2.0.7 • Published 9 months ago

paper2web v2.0.7

paper2web WIP

A simple Node.js package for converting PDF documents into structured HTML while preserving basic text formatting.

Features

  • Extracts text content from PDFs.
  • Preserves basic text formatting (bold, italic).
  • Extracts unordered lists into HTML lists.
  • Converts each page into structured HTML.
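
The features above can be illustrated with a small sketch. This is not paper2web's actual implementation; it only shows one way bold/italic runs and bullet lines could be mapped to HTML. The input shape (`{ text, bold, italic }` runs grouped into lines) and the names `escapeHtml`, `renderRun`, `renderPage`, and `BULLET` are assumptions made for the example.

```javascript
// Hypothetical input: a page is an array of lines, and each line is an
// array of runs shaped like { text: string, bold?: boolean, italic?: boolean }.

// Escape characters that have special meaning in HTML.
function escapeHtml(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Wrap one run in <strong> and/or <em> according to its flags.
function renderRun(run) {
  let html = escapeHtml(run.text);
  if (run.italic) html = `<em>${html}</em>`;
  if (run.bold) html = `<strong>${html}</strong>`;
  return html;
}

// A leading bullet marker ("•", "-", or "*") followed by whitespace.
const BULLET = /^\s*[•\-*]\s+/;

// Render one page: consecutive bullet lines share a <ul>,
// every other non-empty line becomes a <p>.
function renderPage(lines) {
  const out = [];
  let inList = false;
  for (const runs of lines) {
    if (runs.length === 0) continue;
    const isBullet = BULLET.test(runs[0].text);
    if (isBullet && !inList) {
      out.push("<ul>");
      inList = true;
    }
    if (!isBullet && inList) {
      out.push("</ul>");
      inList = false;
    }
    if (isBullet) {
      // Strip the bullet marker from the first run only.
      const first = { ...runs[0], text: runs[0].text.replace(BULLET, "") };
      const body = [first, ...runs.slice(1)].map(renderRun).join("");
      out.push(`<li>${body}</li>`);
    } else {
      out.push(`<p>${runs.map(renderRun).join("")}</p>`);
    }
  }
  if (inList) out.push("</ul>");
  return `<section class="page">\n${out.join("\n")}\n</section>`;
}

// Example: one formatted paragraph followed by a two-item list.
const html = renderPage([
  [{ text: "Results ", bold: true }, { text: "at a glance" }],
  [{ text: "• first item" }],
  [{ text: "• second " }, { text: "item", italic: true }],
]);
console.log(html);
```

Wrapping each page in its own `<section>` keeps the page boundary visible in the output, which matches the "each page" wording in the feature list; a real converter may well structure this differently.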

Installation

Install via npm:

npm install paper2web

Dependencies

This package relies on the following libraries:

  • pdf2json

Acknowledgments

This project uses pdf2json by modesty to extract data from PDF files for further processing.
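
As a rough illustration of the "further processing" step, the sketch below flattens parsed PDF data into plain text runs. It assumes the data has the shape commonly described for pdf2json output: a `Pages` array whose `Texts` entries carry runs in `R`, with URI-encoded text in `T` and style flags in `TS` (`[fontFaceId, fontSize, bold, italic]`). That shape is an assumption here and may differ between pdf2json versions; `decodeText` and `extractRuns` are hypothetical helper names, not part of either package.

```javascript
// Assumed input shape (not confirmed by this README):
// { Pages: [ { Texts: [ { x, y, R: [ { T: "URI%20encoded", TS: [face, size, bold, italic] } ] } ] } ] }

// Decode URI-encoded text, falling back to the raw string if it is malformed.
function decodeText(encoded) {
  try {
    return decodeURIComponent(encoded);
  } catch {
    return encoded;
  }
}

// Return one array of runs per page, each run carrying its text,
// position, and bold/italic flags.
function extractRuns(pdfData) {
  const pages = pdfData.Pages ?? [];
  return pages.map((page) =>
    (page.Texts ?? []).flatMap((textBlock) =>
      (textBlock.R ?? []).map((run) => ({
        text: decodeText(run.T ?? ""),
        bold: run.TS?.[2] === 1,
        italic: run.TS?.[3] === 1,
        x: textBlock.x,
        y: textBlock.y,
      }))
    )
  );
}

// Example with hand-written data in the assumed shape.
const sample = {
  Pages: [
    {
      Texts: [
        { x: 1, y: 2, R: [{ T: "Hello%20world", TS: [0, 12, 1, 0] }] },
        { x: 1, y: 3, R: [{ T: "100%", TS: [0, 12, 0, 1] }] },
      ],
    },
  ],
};
console.log(extractRuns(sample));
```

Keeping the `x` and `y` coordinates on each run leaves room for grouping runs into lines later, which a converter would need before it can detect paragraphs or lists.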

Usage

import { convertPdfToHtml } from "paper2web";

// Path to the PDF file to convert
const pdfPath = "path/to/input.pdf";

// convertPdfToHtml returns a promise; handle success and failure
convertPdfToHtml(pdfPath)
  .then(() => console.log("Conversion successful!"))
  .catch((error) => console.error("Error:", error));
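
When several files need converting, the same promise-based call can be wrapped in a small helper. The sketch below is not part of paper2web; `convertAll` is a hypothetical name, and it only assumes that the converter passed in takes a path and returns a promise, as `convertPdfToHtml` does in the usage above.

```javascript
// Run `convert` on every path and report per-file success or failure.
// `convert` is any function that takes a PDF path and returns a promise.
async function convertAll(pdfPaths, convert) {
  // The async wrapper turns a synchronous throw into a rejected promise,
  // so one bad file never aborts the whole batch.
  const results = await Promise.allSettled(
    pdfPaths.map(async (pdfPath) => convert(pdfPath))
  );
  return results.map((result, index) => ({
    path: pdfPaths[index],
    ok: result.status === "fulfilled",
    error: result.status === "rejected" ? result.reason : null,
  }));
}

// Intended use with the package (shown as comments so this block
// runs without paper2web installed):
//
//   import { convertPdfToHtml } from "paper2web";
//   const report = await convertAll(["a.pdf", "b.pdf"], convertPdfToHtml);
//   report.filter((r) => !r.ok).forEach((r) => console.error(r.path, r.error));
```

`Promise.allSettled` is used instead of `Promise.all` so that a failure in one file still lets the remaining conversions finish and be reported.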

License

This project is licensed under the MIT License.

Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request!

Future of this package

  • Add support for images
  • Add CLI support

Author

Developed by Malte Harms

Version history

  • 2.0.7: 9 months ago
  • 2.0.6: 9 months ago
  • 2.0.5: 10 months ago
  • 2.0.4: 10 months ago
  • 2.0.3: 10 months ago
  • 2.0.2: 10 months ago
  • 2.0.1: 10 months ago
  • 2.0.0: 10 months ago
  • 1.0.2: 10 months ago
  • 1.0.1: 10 months ago
  • 1.0.0: 10 months ago