Image2Text Analyzer is a Streamlit web application that allows users to upload images containing text, extract the text using Tesseract OCR, perform spell-checking, sentiment analysis, and extract keywords from the extracted text.
- Text Extraction: Upload an image and extract text using Tesseract OCR.
- Spell Checking: Perform spell-checking on the extracted text to correct any misspelled words.
- Sentiment Analysis: Analyze the sentiment of the extracted and corrected text (positive, negative, or neutral).
- Keyword Extraction: Extract keywords from the corrected text based on word frequencies.
- Upload Image: Click on the "Choose an image..." button to upload an image containing text.
- Extract Text: After uploading the image, click on the "Extract Text" button to extract text from the image.
- View Results: Once the text is extracted, the sentiment analysis result will be displayed, and you can download the extracted text, corrected text, and extracted keywords.
- Clone the repository: git clone https://github.com/ravalsmit/image2text-analyzer.git
- Navigate to the project directory: cd Image2Text-Analyzer
- Install dependencies: pip install -r requirements.txt
To run the application, execute the following command: streamlit run main.py
- Streamlit: For building the web application.
- PyTesseract: For OCR.
- TextBlob: For sentiment analysis.
streamlit
: 0.87.0pytesseract
: 0.3.8pillow
: 8.4.0scikit-learn
: 0.24.2textblob
: 0.15.3