Apr 2025 – Jun 2025 · NullClass Edtech Pvt. Ltd.
Two NLP systems built end-to-end during a 2-month Data Science internship focused on real-world chatbot architecture, dynamic knowledge management, and interface development.
Project 1 (Summarization): takes long-form text input and extracts the most important sentences to produce a concise summary. Built with traditional NLP preprocessing and a TensorFlow classification model.
- Text preprocessing — tokenization, stopword removal, lemmatization (NLTK)
- Feature extraction — Bag of Words representation
- Sentence scoring — TensorFlow model ranks sentences by importance
- GUI — Tkinter desktop interface for input/output
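The preprocessing, Bag of Words, and sentence-ranking steps listed above can be sketched roughly as follows. This is a minimal standard-library illustration, not the notebook's code: the regex tokenizer and the tiny `STOPWORDS` set stand in for NLTK, lemmatization is skipped, and a simple word-frequency score stands in for the TensorFlow model. All function names here are hypothetical.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (assumption); the project uses NLTK instead.
STOPWORDS = {"the", "a", "an", "is", "are", "of", "and", "to", "in",
             "it", "on", "for", "that", "this", "with"}


def split_sentences(text):
    """Split text into sentences on ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase, keep alphabetic tokens, drop stopwords (no lemmatization)."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower())
            if w not in STOPWORDS]


def bag_of_words(sentences):
    """Return (sorted vocabulary, one count vector per sentence)."""
    tokenized = [tokenize(s) for s in sentences]
    vocab = sorted({w for toks in tokenized for w in toks})
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        vectors.append([counts[w] for w in vocab])
    return vocab, vectors


def summarize(text, k=2):
    """Keep the k highest-scoring sentences, in their original order."""
    sentences = split_sentences(text)
    _, vectors = bag_of_words(sentences)
    # Document-level term frequency: column sums of the BoW matrix.
    doc_freq = [sum(col) for col in zip(*vectors)]
    scores = []
    for vec in vectors:
        length = sum(vec)
        # Placeholder scorer (stand-in for the trained model):
        # average document frequency of the words in the sentence.
        weighted = sum(c * f for c, f in zip(vec, doc_freq))
        scores.append(weighted / length if length else 0.0)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


text = ("Cats sleep a lot. Cats hunt mice at night. "
        "The weather is nice. Mice fear cats.")
print(summarize(text, k=2))
```

In the actual project, the placeholder scorer would be replaced by the trained model's per-sentence prediction; the selection step (take the top-ranked sentences, keep source order) stays the same.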
87% summarization accuracy on held-out test data
Python · TensorFlow · NLTK · Bag of Words · Tkinter
Project 2 (Self-updating chatbot): a Q&A chatbot that can ingest new documents and update its knowledge base dynamically, with no retraining required. Uses vector search for retrieval and basic entity recognition to keep responses grounded.
- Document ingestion — XML/CSV files parsed and chunked
- Vector indexing — FAISS stores and retrieves embeddings
- Dynamic updates — new documents added to the index at runtime via LangChain
- Entity recognition — basic NER to improve response grounding
- UI — Streamlit interface for chat interaction
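The ingest, chunk, index, and update-at-runtime flow described above might look something like the sketch below. It is a standard-library stand-in, assuming nothing about the real `updateknowledge.py`: `TinyIndex` is a hypothetical in-memory replacement for the FAISS index, `embed` uses word counts instead of a real embedding model, and the CSV column name `answer` is made up for the example.

```python
import csv
import io
import math
import re
from collections import Counter


def chunk_text(text, max_words=40):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words])
            for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: sparse word-count vector (stand-in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


class TinyIndex:
    """In-memory stand-in for a vector index that accepts runtime additions."""

    def __init__(self):
        self.chunks = []
        self.vectors = []

    def add_documents(self, texts, max_words=40):
        """Chunk and index new texts; existing entries are left untouched."""
        for text in texts:
            for chunk in chunk_text(text, max_words):
                self.chunks.append(chunk)
                self.vectors.append(embed(chunk))

    def search(self, query, k=1):
        """Return the k stored chunks most similar to the query."""
        q = embed(query)
        ranked = sorted(range(len(self.chunks)),
                        key=lambda i: cosine(q, self.vectors[i]),
                        reverse=True)
        return [self.chunks[i] for i in ranked[:k]]


def load_csv_column(csv_text, column="answer"):
    """Read one text column from CSV content (column name is hypothetical)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row[column] for row in reader if row.get(column)]


index = TinyIndex()
index.add_documents(load_csv_column("question,answer\nq1,The library opens at nine.\n"))
# Later, at runtime: new knowledge is appended without rebuilding anything.
index.add_documents(["The pool closes at six."])
print(index.search("When does the pool close?"))
```

The point of the design is that adding a document only appends new vectors, so earlier entries and any model weights are untouched; that is what makes "no retraining required" possible for a retrieval-based chatbot.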
90% correct response retrieval on unseen queries
Python · FAISS · LangChain · NLTK · Streamlit
Tasks/
├── task1/
│   ├── SummaryTask1.ipynb    # Extractive summarization model (TF + BoW)
│   └── requirements.txt
├── taskno3/
│   ├── app.py                # Streamlit UI
│   ├── gui.py                # Tkinter interface
│   ├── intentsMed.json       # Knowledge base / intents data
│   ├── requirement.txt
│   └── updateknowledge.py    # Dynamic FAISS knowledge base updates
└── README.md
git clone https://github.com/Debangana13/Tasks
cd Tasks
# Project 1 — Summarization
pip install -r task1/requirements.txt
jupyter notebook task1/SummaryTask1.ipynb
# Project 2 — Self-updating chatbot
pip install -r taskno3/requirement.txt
streamlit run taskno3/app.py
Part of NullClass Edtech Pvt. Ltd. Data Science Internship · Certificate issued Jun 2025
Built by Debangana Ghosh