Medical Document Extractor

This is the major project in the codebasics.io Python For Beginner course.

Abstract

This project extracts the required data from medical documents.
It uses OpenCV, Tesseract OCR (through pytesseract), and regular expressions to extract text from the documents,
with FastAPI for the backend, Airtable for the database, and Streamlit for the frontend.
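
As an illustration of the regex step, here is a minimal sketch of pulling labelled fields out of OCR text. The sample text, the field names, and the patterns are all hypothetical and are not taken from the project; real documents will need patterns matched to their own layout.

```python
import re

# Hypothetical OCR output; real documents will have different layouts.
SAMPLE_TEXT = """Name: Jane Doe
Date: 5/12/2020
Phone: (279) 329-8663
"""

# Illustrative patterns only; each captures the value that follows a label.
FIELD_PATTERNS = {
    "name": r"Name:\s*(.+)",
    "date": r"Date:\s*(\d{1,2}/\d{1,2}/\d{4})",
    "phone": r"Phone:\s*(\(\d{3}\)\s*\d{3}-\d{4})",
}


def extract_fields(text: str) -> dict:
    """Return the first match for each field, or None when a field is missing."""
    fields = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = re.search(pattern, text)
        fields[field] = match.group(1).strip() if match else None
    return fields


if __name__ == "__main__":
    print(extract_fields(SAMPLE_TEXT))
```

Returning None for a missing field, rather than raising, lets a caller decide how to handle documents where OCR missed a label.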

Algorithms

An in-depth video explains the algorithms.

Installation

You need to install Tesseract-OCR and Poppler.
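
One way to check that both are reachable before running the project is sketched below. It assumes the executables are on PATH under the names `tesseract` and `pdftoppm` (the Poppler utility that pdf2image calls); the helper name `missing_tools` is hypothetical. On some systems, Windows in particular, the tools may be installed but not on PATH, in which case this check will report them as missing.

```python
import shutil


def missing_tools(tools=("tesseract", "pdftoppm")):
    """Return the names of required executables that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("Tesseract and Poppler look installed.")
```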

Tools

OpenCV (cv2) is used to recover text from shadowed areas
numpy is used with cv2.adaptiveThreshold
pytesseract extracts text from images
FastAPI provides the backend
pdf2image converts PDF pages to images
re extracts the required text
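
The tools above could fit together roughly as follows. This is a minimal sketch under stated assumptions, not the project's actual code: the function names (`odd_block_size`, `preprocess_page`, `pdf_to_text`) and the threshold parameters (block size 61, constant 11) are chosen for illustration. It assumes `opencv-python`, `numpy`, `pytesseract`, and `pdf2image` are installed, along with Tesseract-OCR and Poppler.

```python
def odd_block_size(size: int) -> int:
    """Return a valid blockSize for cv2.adaptiveThreshold (odd and at least 3)."""
    size = max(3, int(size))
    return size if size % 2 == 1 else size + 1


def preprocess_page(pil_image, block_size: int = 61, c: int = 11):
    """Binarize a page so that text in shadowed regions stays readable."""
    # Imported lazily so the helper above works without the OCR stack installed.
    import cv2
    import numpy as np

    # pdf2image yields RGB PIL images; OpenCV thresholding needs grayscale.
    gray = cv2.cvtColor(np.array(pil_image), cv2.COLOR_RGB2GRAY)
    # The threshold is computed per neighbourhood, so uneven lighting
    # (for example a shadow across the page) does not wipe out the text.
    return cv2.adaptiveThreshold(
        gray, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY,
        odd_block_size(block_size), c,
    )


def pdf_to_text(pdf_path: str) -> str:
    """Convert each PDF page to an image, clean it up, and run OCR on it."""
    from pdf2image import convert_from_path  # requires Poppler
    import pytesseract  # requires Tesseract-OCR

    pages = convert_from_path(pdf_path)
    return "\n".join(
        pytesseract.image_to_string(preprocess_page(page), lang="eng")
        for page in pages
    )
```

The block size and constant usually need tuning per document type; larger blocks tolerate broader shadows but can blur fine print.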

Working on

Streamlit for the frontend
Airtable for the database

