Document-Classification-using-Deep-Learning

To download the dataset : https://www.cs.cmu.edu/~aharley/rvl-cdip/

Problem:

Lack of intelligent document classification system for Customer onboarding.

Pain Points :

High labour costs , infrastructure maintenance costs and error and rework costs.
Poor Agility and inability to launch new products rapidly.
Delayed response time.
Lack of seamless experience.

Target User :

Primary user : Backend operations teams in Banks.

Dataset Acquisition and Description :

The RVL-CDIP (Ryerson Vision Lab Complex Document Information Processing) dataset consists of 400,000 grayscale images in 16 classes, with 25,000 images per class. There are 320,000 training images, 40,000 validation images, and 40,000 test images. The images are sized so their largest dimension does not exceed 1000 pixels.
To download the dataset : https://www.cs.cmu.edu/~aharley/rvl-cdip/
Paper : https://www.cs.cmu.edu/~aharley/icdar15/

The 16 classes are as follows :

letter, form , email, handwritten, advertisement, scientific report, scientific publication, specification, file folder, news article, budget, invoice, presentation, questionnaire, resume, memo

Observation :

We got some good results using just 10,000 records each train, test and cv. ACCURACY = 88.9%
This can be increased further using better modelling techniques like InceptionNet , ResNets and thus building deep neural network models which would contribute to better accuracy.

About me

Piyush Pathak

PORTFOLIO

GITHUB

BLOG

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
document_classification.ipynb		document_classification.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document-Classification-using-Deep-Learning

Problem:

Pain Points :

Target User :

Dataset Acquisition and Description :

Observation :

About me

📫 Follw me:

About

Releases

Packages

Languages

piyushpathak03/document-classification-using-DL

Folders and files

Latest commit

History

Repository files navigation

Document-Classification-using-Deep-Learning

Problem:

Pain Points :

Target User :

Dataset Acquisition and Description :

Observation :

About me

📫 Follw me:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages