MarathiPDFSummarizer is a web-based application developed in Python that allows users to upload Marathi language PDF documents and generates a summary of the content along with extracting a list of important keywords or key phrases from the PDF. These keywords represent the main topics discussed in the document.
- User-friendly Interface: Intuitive frontend interface allowing users to easily upload Marathi PDF documents.
- Automatic Summarization: Automated summarization of Marathi PDF content, providing users with concise summaries.
- Keyword Extraction: Extraction of key topics and keywords from Marathi PDFs, aiding in content understanding.
- Language-specific NLP: Utilization of language-specific NLP techniques for accurate text processing and analysis.
- Translation Option: Option for translating the generated output (summary and keywords) into English for enhanced accessibility.
- Clone the repository:
git clone https://github.com/your-username/MarathiPDFSummarizer.git
- Navigate to the project directory:
cd MarathiPDFSummarizer
- Install the required dependencies:
pip install -r requirements.txt
- Run the application:
python app.py
-
Access the application through a web browser at
http://localhost:5000. -
Upload Marathi PDF documents using the provided interface.
-
View the generated summary and extracted keywords for the uploaded PDF.
This project is licensed under the MIT License - see the LICENSE file for details.