Skip to content

nokeym/InvoiceOCRer

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

InvoiceOCRer

A tiny app for OCR of Chinese VAT invoice, and to pick up the key information to Excel.

HOW TO START

  1. Install the necessary module by pip install.They are: paddlepaddle, paddleocr, re, PIL, pandas, PyQt5, fitz.

  2. Go to the PaddleOCR website https://github.com/PaddlePaddle/PaddleOCR, and donwload the inference model under the header 'Detection model','Direction classifier','Recognition model', subject to the 'Chinese and English general OCR model (143.4M)'. Then you may get 3 tar files. Unzip them and place them in the folder named 'cls','det','rec',which are usually in the following path:'your python parth'\Lib\site-packages\paddleocr\2.1\

3.Open OCRInvoice.py by whatever application. Find the following code:

self.ActOCR = PaddleOCR(rec_model_dir=r'E:\Python\Python38\Lib\site-packages\paddleocr\2.1\rec\ch', cls_model_dir=r'E:\Python\Python38\Lib\site-packages\paddleocr\2.1\cls', det_model_dir=r'E:\Python\Python38\Lib\site-packages\paddleocr\2.1\det\ch')

Replace the path with the actual path in the setp 2.

  1. Run OCRWindow.py to get what you want.

About

a tiny app for OCR of Chinese commercial invoice, and to pick up the key information to Excel

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%