Document Clustering: A Case Study on HRC’s Emails

process.py contains the code I wrote for clustering emails. NLPProject.pdf is a write-up of my experimental process and results. All other files contain the data used and are formatted according to the documentation here.
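
For readers new to the topic, here is a minimal sketch of what clustering short documents can look like. It is not taken from process.py and may differ from the approach described in NLPProject.pdf. It assumes TF-IDF weighting and a simple spherical k-means, uses only the standard library, and every name in it (`tokenize`, `tfidf_vectors`, `kmeans`, and the toy `emails` strings) is hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def normalise(vec):
    """Scale a sparse vector (term -> weight) to unit length."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def dot(a, b):
    """Dot product of two sparse vectors; cosine similarity for unit vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def tfidf_vectors(docs):
    """Return one unit-length TF-IDF vector per document (smoothed IDF)."""
    token_lists = [tokenize(d) for d in docs]
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for tokens in token_lists for t in set(tokens))
    vectors = []
    for tokens in token_lists:
        tf = Counter(tokens)
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0)
               for t, c in tf.items()}
        vectors.append(normalise(vec))
    return vectors


def kmeans(vectors, k, iterations=20):
    """Spherical k-means with deterministic farthest-first initialisation."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        # Pick the document least similar to all centroids chosen so far.
        centroids.append(
            min(vectors, key=lambda v: max(dot(v, c) for c in centroids)))
    labels = None
    for _ in range(iterations):
        new_labels = [max(range(k), key=lambda j: dot(v, centroids[j]))
                      for v in vectors]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                total = Counter()
                for v in members:
                    total.update(v)  # adds weights term by term
                centroids[j] = normalise(total)
    return labels


# Made-up strings for illustration only; not drawn from the data files.
emails = [
    "meeting schedule for monday staff call",
    "please confirm the staff meeting schedule",
    "draft press statement for review",
    "revised press statement draft attached",
]
labels = kmeans(tfidf_vectors(emails), k=2)
print(labels)
```

On this toy input the two scheduling messages land in one cluster and the two press-statement messages in the other. A real run would read the message text from the data files instead of the hard-coded list, and would probably need stop-word removal and a more careful choice of `k`.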