This is a project to learn various NLP and data mining techniques while doing something fun and interesting. We will explore different language feature extraction methods, and a variety of ML classification or clustering methods. Through this project, we hope to become familiar with the basics of NLP techniques as well as gain exposure to a variety of NLP and ML toolkits for python.
- Parse whatsapp chat logs
- Tagger + feature extraction
- ML supervised and/or unsupervised learning with various combinations of data
- Analysis & Comparison between models
- Extra: IBM BlueMix Speaker Classification
gtadiparthi's Whatsapp parser was tweaked to fit current whatsapp chatlog format