Skip to content

Harshit26042004/Intel_OneAPI_OCR

Repository files navigation

Intel OneAPI


oneAPI is an open, cross-industry, standards-based, unified, multiarchitecture, multi-vendor programming model that delivers a common developer experience across accelerator architectures – for faster application performance, more productivity, and greater innovation. The oneAPI initiative encourages collaboration on the oneAPI specification and compatible oneAPI implementations across the ecosystem.

One of the main advantages of using Intel OneAPI is its performance. By optimizing code for specific hardware, developers can achieve significant performance improvements compared to running code on a generic platform. This is particularly important for applications that require high-performance computing, such as machine learning and scientific simulations.

Intel_OneAPI_OCR

Digitization the Handwritten or Photo characters was a manual process in before days. This was a time consuming thing and it is manually expensive. Such handwritten or image characters are difficult to read by visual-impaired people. This Traditional method can be overcome with the help of this OCR (Optical Character Recognition) System.In this project,we propose a deep learning- based OCR system which will be more accurate and efficient with the help of Intel OneAPI platform.
The proposed system uses a convolutional neural network(CNN)model to detect the characters from the image. The model is trained on a large Dataset of labelled images using the OneAPI Platform.
The Image is collected and organised for the classification of characters and letters.The characters are detected with CNN models and trained with OneAPI platform in accurate and efficient manner , Where text characters are extracted separately.
Further , The trained model can predict the sample data images in accurate and time-efficient way based on Intel machines , including CPU and GPU's.With Add-On feature , we will also try to implement audio as an output , which is very useful for visual- impaired people.
In conclusion, This OCR system using Intel OneAPI has potential to digitalize the handwritten and predict it in a higher accuracy and time-efficiency.

Problem statement


Design and develop an OCR system that can accurately recognize and convert printed or handwritten text from scanned images into editable digital text format, while maintaining the original format and layout of the document. The system should be able to handle a variety of fonts, styles, and sizes, and have a high level of accuracy and speed in processing large volumes of documents.

Trained with Intel


With the help of Intel OneAPI we trained our model .And below is the snap of the code we developed in Intel devcloud with TENSORFLOW AI TOOLKIT

sample-notebook

Deployment


We deployed our model with the help of Streamlit as a web app. Users can browse their files,Upload it ,And model will automatically detect and recognise the characters in that image.The sample of the deployment page is attached below

deployment-img

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published