Libreoffice files generator programmatically with python and Libreoffice server instances
-
Updated
Jul 10, 2024 - Python
Libreoffice files generator programmatically with python and Libreoffice server instances
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
This repository contains my team's internship project work at Flexbox Technologies. We have developed a system that fills the patient details form automatically with the patient data extracted from pdf file.
Telegram bot for generating docx documents!
Command line tool to extract review changes from a docx file as plain text with HTML tags <ins> and <del>.
Python program which extracts some data from a specific Word document used in my company. Without this program data used to be extracted manually, opening hundred of Word documents one by one to copy/past some informations on an Excel file. Now it is fully automatic.
Telegram Bot that helps you to convert Images to pdf, pdf to images, 45+ file formats to pdf, more features Soon..
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Docx tracked change redlines for the Python ecosystem.
Python SDK to communicate with the GroupDocs.Signature REST API. Add, Remove or Search for Signatures in documents.
Remove Metadata from Microsoft Office Files
Open source Python library for converting PDF to DOCX.
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
Utilizing state-of-the-art AI-driven components, MemoGen handles the writing, outlining, and reviewing of memos. The final output is a detailed and well-structured document in DOCX format.
Wrapper scripts and pandoc filters to convert LaTeX documents to Word docx files
Add a description, image, and links to the docx topic page so that developers can more easily learn about it.
To associate your repository with the docx topic, visit your repo's landing page and select "manage topics."