Skip to content

Algorithmic Methods of Data Mining (Sc.M. in Data Science) Academic year 2023–2024. Homewok 5

Notifications You must be signed in to change notification settings

lauraaconcari/ADM_HW5

Repository files navigation

ADM_HW5 - The eternal significance of publications and citations!

Algorithmic Methods of Data Mining (Sc.M. in Data Science) Academic year 2023–2024. Homewok 5

Group Members:

The Group is composed by the following members:

Files in the Repository:

  • Main.ipynb: this file contains all the answers to the Research Questions, including AQ.
  • output_commandline.png: this file contains images required to answer the Command Line Question (CLQ)
  • CommandLine.sh: shell script used to answer the first part of the Command Line Question (CLQ)
  • CommandLine_files: This folder contains three files (betweenness_centrality.csv, degree_of_citation.csv, path_lengths.csv). These files were created using the functions in the main.ipynb file at '[4] Command Line Question (CLQ)'

This repository represents the submission of Group 8 for the ADM-HW5 of the Data Science course for the academic year 2023/2024 at Sapienza - University of Rome. The primary purpose of the code is to execute steps 1 and 2 of the homework assignment. The dataset utilized contains citations of scientific articles. Two distinct graphs have been constructed to fulfill the assignment requirements for visualization. The implementation relies on various libraries, including Pandas, Numpy, and SciKit-Learn. These libraries are instrumental in data manipulation and graph analysis of citation and collaboration patterns. Additionally, it is worth noting that a preprocessing step involves the conversion of a JSON (JavaScript Object Notation) file to a CSV (Comma-Separated Values) format.

About

Algorithmic Methods of Data Mining (Sc.M. in Data Science) Academic year 2023–2024. Homewok 5

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published