Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
-
Updated
Jun 11, 2022 - Python
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
A collection of scripts to "help" you with your programming exams and assignments.
Simple and Useful Automation Tools built with the help of modules available with Python published at PyPI.
✔️ A Python Flask API to manage PDF files.
Newspaper mining and the analysis of the results using python. Cleaning the text using OCR.
Data Center Advanced Walkthrough. Insert data from a PDF file into MySQL database
Python library and Web service based on Poppler Pdftotext utility and Tesseract OCR for extracting text from PDF documents
A lightweight Python-based Software Package for daily use
Add a description, image, and links to the pdf2text topic page so that developers can more easily learn about it.
To associate your repository with the pdf2text topic, visit your repo's landing page and select "manage topics."