TessOCR

TessOCR is a web based application, which recognizes text from images for the user. This is achieved using the Tesseract API.

This application was built as a part of the recruitment process for the Web and App Development SIG of Web Club NITK

A live demo of the app can be found here

The basic functionality of the application is demonstrated in this video

Limitations

The accuracy of the application is limited by the underlying Tesseract Engine. Even though Tesseract is one of the most accurate OCR engines, it is not perfect. For example, if the image contains text in a font that is unrecognisable to Tesseract, the application fails to load a result. The application might also fail to produce an accurate result if the image contains text in different orientations. The prediction is also affected by the lighting and contrast in the image.

Usage

Prerequisites

tesseract
leptonica
GraphicsMagick
node with Angular CLI installed globally. Detailed instructions on setting up the build environment can be found in the CONTRIBUTING.md file.

Installing Dependencies

Clone the repository to your machine.
Run npm install in the root of the project.
Change to public directory and build the Angular app.

cd public
npm install
ng build --prod

This builds the Angular app in /dist directory.

Running the Server

To start the server run npm start.
The app is now available at http://localhost:8080

Contributing

The detailed instructions on how to contribute to the project can be found in the CONTRIBUTING.md file.

License

This project is licensed under the MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
dist		dist
lib/node-tesseract		lib/node-tesseract
public		public
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
Procfile		Procfile
README.md		README.md
api.js		api.js
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dist

dist

lib/node-tesseract

lib/node-tesseract

public

public

.gitignore

.gitignore

CONTRIBUTING.md

CONTRIBUTING.md

LICENSE.md

LICENSE.md

Procfile

Procfile

README.md

README.md

api.js

api.js

index.js

index.js

package-lock.json

package-lock.json

package.json

package.json

Repository files navigation

TessOCR

Limitations

Usage

Prerequisites

Installing Dependencies

Running the Server

Contributing

License

About

Releases

Packages

Languages

License

MJ10/TessOCR

Folders and files

Latest commit

History

Repository files navigation

TessOCR

Limitations

Usage

Prerequisites

Installing Dependencies

Running the Server

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages