NLP model for extracting Chinese data from documents
Updated Apr 29, 2024 - Python
This repository contains an AI Resume Analyzer that uses PDF parsing, database management, SQL-Python integration, and data extraction from PDFs. It recommends skills and suggests videos and lectures for building them, with the aim of improving resume quality and job prospects.
A PDF-to-TXT converter that converts multiple files in a specified directory.
CLI program for searching inside text and tables in PDF documents and displaying results in HTML.
Text extraction from multiple and large PDF documents.
📑🧐 Python project for extracting text from resumes in .pdf, .doc and .docx formats based on the article by Omkar Pathak at https://omkarpathak.in/2018/12/18/writing-your-own-resume-parser
This project extracts text from PDF files using various Python libraries. It is designed to be flexible: you can choose among different text extraction libraries, and it supports both a single PDF file and a directory containing multiple PDF files.
Create a Gephi Citation Graph based on Text Analysis of PDFs from Zotero
In this project, a user can upload a resume PDF to learn its strengths and weaknesses, get suggestions for improvement, find the right domain, and search for jobs in that domain.
[2023-01] A Python Flask API to extract metadata and text from PDF files. Asynchronous tasks are executed with a Celery queue and Redis workers. SQLite storage is managed by SQLAlchemy. Code is kept clean with Flake8 and isort. Coverage is tested with pytest-cov. See the documentation in Readme.md and check the API contract with Swagger.
DouFinder: Script for searching for and alerting on terms in the Diário Oficial da União (DOU).
Code for the automated download and OCR of FOIA files.
OCR built for the specific use case of extracting COVID information from images, PDFs, and text.