The first problem looks at logistic regression for text classification, the second explores the small-world phenomenon in "close" vs. "distant" friend networks, and the third studies how the structure of an email network changes as we remove weak ties from it.
Your code and a brief report with your results are to be submitted electronically in one zipped (or tarball-ed) file through the CourseWorks site. All code should be contained in plain text files and should produce the exact results you provide in your writeup. Code should be written in bash / R and should not have complex dependencies on non-standard libraries. The report should simply present your answers to the questions in an organized format as either a plain text or pdf file. All work should be your own and done individually.