office-text-extractor
Yet another library to extract text from MS Office and PDF files
This package fixes MS Excel sheet names by limiting them to 31 characters, handling empty sheet names, and removing illegal characters such as :\/?*[] and more.
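The rules named in that description can be sketched in a few lines. This is only an illustration, not the package's actual code: the function name `sanitize_sheet_name` and the `"Sheet1"` fallback are hypothetical, and the sketch covers only the rules listed above (illegal characters, empty names, the 31-character limit).

```python
import re

# Excel's limit on worksheet name length, as stated in the description.
MAX_SHEET_NAME_LENGTH = 31

# The characters the description lists as illegal: : \ / ? * [ ]
ILLEGAL_CHARS = re.compile(r"[:\\/?*\[\]]")


def sanitize_sheet_name(name: str, fallback: str = "Sheet1") -> str:
    """Hypothetical helper: strip illegal characters, fill in empty names,
    and truncate to the maximum length."""
    # Drop every illegal character, then trim surrounding whitespace.
    cleaned = ILLEGAL_CHARS.sub("", name).strip()
    # An empty result is not a usable sheet name, so substitute the
    # fallback (an assumed default, not taken from the package).
    if not cleaned:
        cleaned = fallback
    # Enforce the 31-character limit last, so it applies to the final name.
    return cleaned[:MAX_SHEET_NAME_LENGTH]


print(sanitize_sheet_name("Q1/Q2: [draft]*?"))
```

Truncating last keeps the length limit in force even when the fallback name is used.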
Convert MS-Office (Word/Excel/PowerPoint) documents to PDF files via Office Online (and OneDrive).
Work with Office filename extensions. Check whether a file is an Office file.
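An extension-based check of this kind might look like the sketch below. It assumes that "Office file" means one of the common Word, Excel, and PowerPoint extensions. The set `OFFICE_EXTENSIONS` and the function `is_office_file` are hypothetical names, not this package's API, and the check looks only at the file name, not the file contents.

```python
from pathlib import Path

# Assumed set of common Office extensions; the real package may cover more.
OFFICE_EXTENSIONS = {".doc", ".docx", ".xls", ".xlsx", ".ppt", ".pptx"}


def is_office_file(path: str) -> bool:
    """Hypothetical helper: True if the file name ends in a known Office extension."""
    # Path.suffix returns the final extension including the dot, e.g. ".docx".
    return Path(path).suffix.lower() in OFFICE_EXTENSIONS


print(is_office_file("report.DOCX"))
print(is_office_file("notes.txt"))
```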
Converts a DOCX file to HTML.