The goal of this project is to perform a text mining analysis of Shakespeare’s plays. The question we are trying to answer is whether the gender of the characters in Shakespeare’s plays is distinguishable.
The project has two main parts. The first part covers data retrieval and data preprocessing. The second part applies suitable machine learning tools to produce statistically convincing figures that answer the target question.
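To make the two parts concrete, here is a minimal sketch of how such a pipeline could look. It is an illustration under stated assumptions, not the project's actual code: the input format (pairs of character name and spoken text), the function names, and the choice of a multinomial Naive Bayes classifier are all hypothetical, and the sample tokens are placeholders rather than text from the plays.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and return its word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def group_by_character(lines):
    """Part 1 (preprocessing): merge each character's lines into one token list.

    `lines` is an iterable of (character_name, spoken_text) pairs; this input
    format is an assumption made for the sketch.
    """
    tokens = defaultdict(list)
    for name, text in lines:
        tokens[name].extend(tokenize(text))
    return dict(tokens)


def train_naive_bayes(docs, labels):
    """Part 2 (modelling): collect the counts needed by multinomial Naive Bayes."""
    word_counts = defaultdict(Counter)  # label -> token frequencies
    doc_counts = Counter()              # label -> number of training documents
    vocab = set()
    for toks, label in zip(docs, labels):
        word_counts[label].update(toks)
        doc_counts[label] += 1
        vocab.update(toks)
    return word_counts, doc_counts, vocab


def predict(model, toks):
    """Return the label with the highest log-posterior, using add-one smoothing."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        total = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)
        for tok in toks:
            if tok in vocab:  # tokens never seen in training are ignored
                count = word_counts[label][tok] + 1
                score += math.log(count / (total + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Placeholder tokens and labels, invented purely for illustration.
model = train_naive_bayes(
    [["aa", "bb", "aa"], ["cc", "dd", "cc"]],
    ["female", "male"],
)
print(predict(model, ["aa", "aa"]))
```

In a real analysis, each character's grouped tokens would form one document, and the model would be scored on characters held out from training, so that the reported figure reflects how well gender can be predicted for unseen characters.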