This script uses the Jaccard index to find the topic name that best matches a given set of top terms.
I used it for my CS thesis project, which you can find in dblp-journals.
- Install Python 3.7
- Install Pipenv
- Clone this project
- Run `pipenv shell`
- Run `pipenv install`
- Edit `jaccard.py`
- Update the values in `topics.csv`. The "Topic" column holds the name of each topic, and the "TopTerms" column holds the list of top terms to compare using the Jaccard index
- Use the output from dblp-journals to replace the data in the `input` folder
- Run `python jaccard.py`
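
For readers unfamiliar with the technique, here is a minimal sketch of how Jaccard-based topic matching can work. It is not the code in `jaccard.py`: the helper names (`jaccard`, `load_topics`, `best_topic`), the whitespace-separated format of "TopTerms", and the sample rows are all assumptions made for illustration. Only the "Topic" and "TopTerms" column names come from `topics.csv` as described above.

```python
import csv
import io


def jaccard(a, b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two collections, treated as sets."""
    a, b = set(a), set(b)
    union = a | b
    # Two empty sets have no defined overlap; return 0.0 by convention here.
    return len(a & b) / len(union) if union else 0.0


def load_topics(csv_file):
    """Read rows with "Topic" and "TopTerms" columns into {topic: set of terms}."""
    topics = {}
    for row in csv.DictReader(csv_file):
        # Assumption: terms inside "TopTerms" are separated by whitespace.
        topics[row["Topic"]] = set(row["TopTerms"].lower().split())
    return topics


def best_topic(terms, topics):
    """Return (topic name, score) for the topic whose top terms overlap most."""
    query = {t.lower() for t in terms}
    scored = ((name, jaccard(query, top)) for name, top in topics.items())
    return max(scored, key=lambda pair: pair[1])


# Made-up sample data standing in for topics.csv (hypothetical rows).
SAMPLE = """Topic,TopTerms
Databases,query index transaction storage
Machine Learning,learning neural network training
"""

topics = load_topics(io.StringIO(SAMPLE))
name, score = best_topic(["neural", "network", "image", "training"], topics)
print(name, round(score, 2))  # prints: Machine Learning 0.6
```

In this example the query shares three terms with the "Machine Learning" row and the union of both sets has five terms, so the score is 3/5 = 0.6.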