MSSR Special Assignment
Group 14:
Philippe D. Misteli - 4932129
Jan Gerling - 4807367
- python 3
- gensim
- matplotlib
- sklearn
- stop_words
- nltk
- numpy
- Import the Java Project in the extractor folder into Eclipse
- Run App.java with the JVM Arguments -Xms4g -Xmx8g -XX:UseG1GC
- Open the python project in Intellij Idea
- Install all dependencies in an environment and set this as the execution environment for the project
- Open Experiment.py in src
- Define the characteristics of your experiment in experiment function:
- Define the locations for the data corpi
- Define a name for your experiment (make sure there is no equally named folder in results)
- (Optional) in the pipeline function you can change the topic counts, distance metric etc.
- Run the experiment in Intellij Idea
- You can find the results of your experiment in: results/[your experiment name]