GitHub

#OCR Style

##Optical Character Recognition

##背景本程序最初是想用于发票识别的，目前可以通过ocrengine和imageprocess动态库进行二次开发。

##功能

图片识别文字的软件。
支持截图。
支持选择识别的区域。
支持选择识别的语言。
支持倾斜纠正。
支持保存识别的文字。
支持多种语言的识别，内置了中文和英文，可以根据需要下载其它语言。

##编译环境

ocrstyle 使用的Qt4.8开发的。
ocrengine 使用Visual C++ Express 2008开发的。这个工程依赖third-party目录下的Tesseract OCR开发库，目前是google维护的，当时开发的时候官方只有VC2008的开发库，于是就用的VC2008开发了。项目地址：http://code.google.com/p/tesseract-ocr/。
imageprocess 使用的Visual C++ Express 2010开发的。这个工程依赖于OpenCV。项目地址：http://opencv.org/
buildscripts/ocrstyle.nsi 打包安装包使用的NSIS。

##动态库的接口

ImageProcessAPI.h 主要功能：图像的加载、灰度处理、倾斜校正、保存等。
OCREngine.h 主要功能：配置识别的语言、传入要识别的图像数据、获得文本。

##思路

扫描的电子发票一般都有点倾斜，所以要对图片进行倾斜校正。为了方便识别需要进行灰度处理。
一般是要识别发票上的部分内容，所以要想办法把要识别的文本区域的图像数据传给Tesseract OCR，然后获得文本。

##截图 ###主界面

###英文图片倾斜纠正并识别

###中文发票进行区域选择并识别

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
bin		bin
buildscripts		buildscripts
doc		doc
include		include
lib		lib
milestone		milestone
snapshots		snapshots
src		src
third-party/tesseract		third-party/tesseract
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
ocrstyle.ico		ocrstyle.ico

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bin

bin

buildscripts

buildscripts

doc

doc

include

include

lib

lib

milestone

milestone

snapshots

snapshots

src

src

third-party/tesseract

third-party/tesseract

.gitattributes

.gitattributes

.gitignore

.gitignore

README.md

README.md

ocrstyle.ico

ocrstyle.ico

Repository files navigation

About

Releases

Packages

Languages

cnsuhao/ocrstyle

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages