An assignment on preprocessing of text including tokenization, stop word removal, filtering noise, and then finding out frequent keywords.
- Assignment_1.pdf: assignment document
- SO-Java.zip, SO-Javascript.zip, SO-Python.zip: zip archive of export of 50K questions asked on StackOverflow questions related to Java, JavaScript, and Python
- file.txt: dummy file used for Question 1
- Solution.ipynb and Solution.html: solution notebook and its HTML export