Open source Python library for converting PDF to DOCX.
-
Updated
Jun 11, 2024 - Python
Open source Python library for converting PDF to DOCX.
Enjoy reading with your favorite style.
Telegram Bot that helps you to convert Images to pdf, pdf to images, 45+ file formats to pdf, more features Soon..
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Python tool and library for decrypting and encrypting MS Office files using passwords or other keys
Best PDF Converter! PDF to any format, pdf2word/excel/xml/html/txt...
Automate your Microsoft Learn Student Ambassadors event certificate with Python
Security Assessment Report geNerated Automatically
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Extracts tables from .docx files and saves them as .csv or .xls files
Convert text, Docs, Ppt, Excel, and images to pdf files easily.
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
Generate word documents from sphinx documentation
python: selenium + sqlite3 爬虫,实现将淘宝网站数据、1688网站数据的爬取,淘宝爬虫\1688爬虫;并保存到数据库中
Add a description, image, and links to the docx topic page so that developers can more easily learn about it.
To associate your repository with the docx topic, visit your repo's landing page and select "manage topics."