An assignment on preprocessing of text including tokenization, stop word removal, filtering noise, and then finding out unanswered question and suggesting related answered questions.
- Assignment_2.pdf: assignment document
- SO-Java.zip, SO-Javascript.zip, SO-Python.zip: zip archive of export of 50K questions asked on StackOverflow questions related to Java, JavaScript, and Python
- Solution.ipynb and Solution.html: solution notebook and its HTML export