Mahabharata_Text_Classification_Using_Natural_Language_Processing(NLP)

In this research project, a Naive Bayes classifier was utilized to identify the context of Mahabharata articles in Bengali and Hindi languages using Natural Language Processing (NLP). The classifier was trained on a labeled news article dataset, enabling it to predict categories such as sports, entertainment, economy, opinion, lifestyle, technology, and education. The analysis revealed that 63 out of 100 articles yielded consistent prediction results between the two languages. The study highlights the potential for using machine learning techniques to understand the contextual representation of ancient texts across different languages.

This project was done by me at Summer Research Internship(SURE) at IIT Hyderabad, 2023 Report Link: https://docs.google.com/document/d/17KzLr3WbzdNN67IoqBaJFleZjcAGwVlkqKkqbDcTWUw/edit?usp=sharing

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
Testing Dataset.txt		Testing Dataset.txt
Training Dataset.txt		Training Dataset.txt
compare-result.ipynb		compare-result.ipynb
naive-bayes-classifier-bengali.ipynb		naive-bayes-classifier-bengali.ipynb
naive-bayes-classifier-hindi.ipynb		naive-bayes-classifier-hindi.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mahabharata_Text_Classification_Using_Natural_Language_Processing(NLP)

About

Releases

Packages

Languages

soumyadeep5star/Mahabharata_Text_Classification_Using_NLP

Folders and files

Latest commit

History

Repository files navigation

Mahabharata_Text_Classification_Using_Natural_Language_Processing(NLP)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages