Data-Extractor

Data Extraction from images

Problem

It is difficult to assemble or organise the health deatils files(for eg. lab test report ).Doctor also took time to examine the reports.

Solution

Combining both the senarios and coming with automate and digital solution. User just to upload the lab reports (images ) in the application .All the data will be extract and will give the digital reports. Benifits:

1: Get rid of oraganising the bundle of reports.

2: Secure Digital data analysing.

3: Patient can also track his or her health care by distinct parameters.

4: Doctor will also get a simple methodology to examine the reports of perticular patient.

How to run

python run.py

Detailed process of installation of packages in mentioned below

About Project

Initial Handlings and Packages

Programing Language: Python Methods: Image Processing , Tesseract-OCR , Flask(For UI Interface)

Inititial installations:

Python :3.7 Softwares: Anaconda,Atom,Spyder,PyCharm (Any One)

Packages

Flask : for UI Interface

Install Commands:

pip install flask

Other packages inside flask:

pip install flask-wtf : Direct forms interface with flask

pip install flask-sqlalchemy : Database connectivity with flask

For Image Processing:

Opencv : pip install opencv

NumPy : as usually installed

Imutils : pip install imutils

Argparse : pip install argparse

Skimage : pip install scikit-image

PIL : pip install pillow

Data Extractiong from Iamges and PDFs

Pytesseract: pip install pytesseract

Procedure

Phase 1- Developed Scanner using OpenCv

Building a scanner with OpenCV can be accomplished in just three simple steps:

Step 1: Detect edges.

Step 2: Use the edges in the image to find the contour (outline) representing the piece of paper being scanned.

Step 3: Apply a perspective transform to obtain the top-down view of the document.

Phase 2- Convert scanned image into text file

Using Tessereact, all the data is extracted from processed image and stored in text file for data mining and analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Edge 2.PNG		Edge 2.PNG
Edge detection.PNG		Edge detection.PNG
Output Image2.PNG		Output Image2.PNG
README.md		README.md
__init__.py		__init__.py
data_extractor.py		data_extractor.py
forms.py		forms.py
models.py		models.py
original Image.PNG		original Image.PNG
routes.py		routes.py
site.db		site.db
temp.cpython-37.pyc		temp.cpython-37.pyc
temp.py		temp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data-Extractor

Problem

Solution

How to run

About Project

Inititial installations:

Packages

Data Extractiong from Iamges and PDFs

Procedure

Phase 1- Developed Scanner using OpenCv

Phase 2- Convert scanned image into text file

Phase 3- Desiging the User Interface for better Intersection and Visualization

Phase 4 :

Key Points:

About

Releases

Packages

Languages

Anjali1751/Extracting-data-of-scanned-images

Folders and files

Latest commit

History

Repository files navigation

Data-Extractor

Problem

Solution

How to run

About Project

Inititial installations:

Packages

Data Extractiong from Iamges and PDFs

Procedure

Phase 1- Developed Scanner using OpenCv

Phase 2- Convert scanned image into text file

Phase 3- Desiging the User Interface for better Intersection and Visualization

Phase 4 :

Key Points:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages