GTUT-HT20
-
formatdata.py • This processes the raw data and generates some data files required for the project
-
resolve_data_conflicts.py • This is used for resolving some data conflicts, and also for generating the bigraph file (used in the next step for biclique detection)
-
Generate bicliques using the existing bi-clique detection algorithm as mentioned in the paper
-
format_bicliques.py • Using this code, the bicliques detected in the previous step can be further customized/formatted for more granular analysis
-
scoring_bicliques.py
-
build_similarity_matrix.py
-
build_final_similarity_matrix.py
-
article_n2v_representations.py
-
label_articles.py