Data scientists increasingly play a critical role in data-driven endeavors in today’s world. Our team uses Network Science to understand research collaboration among data scientists over time and to select a subset of them for various tasks.

AlvinTang011/Data-Scientists-Collaboration-Network-Analaysis

Create a conda environment with the dependencies:

conda env create -f environment.yml

Run Streamlit to interactively visualize the network at (cumulative) year granularity:

streamlit run visualize_by_years.py
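As a rough illustration of what "cumulative year granularity" means, the sketch below builds a network from every collaboration published up to and including a chosen year. It is not taken from visualize_by_years.py; the edge format, the sample names, and the function `cumulative_network` are assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical co-authorship records: (author_a, author_b, publication_year).
EDGES = [
    ("Alice", "Bob", 2015),
    ("Bob", "Carol", 2018),
    ("Alice", "Carol", 2020),
]


def cumulative_network(edges, year):
    """Return an adjacency map of all collaborations published up to `year`."""
    adjacency = defaultdict(set)
    for a, b, edge_year in edges:
        # Cumulative view: keep every edge from this year or earlier.
        if edge_year <= year:
            adjacency[a].add(b)
            adjacency[b].add(a)
    return dict(adjacency)


if __name__ == "__main__":
    for year in (2015, 2018, 2020):
        print(year, cumulative_network(EDGES, year))
```

Choosing a later year can only add edges, so each yearly view contains the previous one.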

To run the entire analysis:

python "Data Scientists Collaboration Network Analysist.py"

Note: if the link to the HTML file appears broken, the file is saved in the 'Results' folder. Open it from there to view the network.

All property plots are output as PDF, except the network itself, which is saved as HTML for interactivity.

The script asks the user for the maximum node degree, as an integer, to obtain the desired K(max) for the network.

For the input file, ensure that it follows the format established in the DataScientists.xls file, with the columns name, country, institution, dblp, expertise.
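A quick way to check an input file's header against this format is sketched below. The helper `missing_columns` is hypothetical and not part of the repository, and matching column names case-insensitively is an assumption of this sketch rather than documented behavior of the analysis script.

```python
# Column names listed above for DataScientists.xls.
REQUIRED_COLUMNS = ["name", "country", "institution", "dblp", "expertise"]


def missing_columns(header):
    """Return the required columns that are absent from `header`.

    Assumption: names are compared after trimming whitespace and lowercasing.
    """
    normalized = {str(col).strip().lower() for col in header}
    return [col for col in REQUIRED_COLUMNS if col not in normalized]


if __name__ == "__main__":
    # Example header row, as it might be read from the first row of the sheet.
    header = ["name", "country", "dblp"]
    print("Missing columns:", missing_columns(header))
```

The header row can come from any spreadsheet reader; the check only needs the list of column names.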

The output file includes every analysis used to identify the relationships between the data scientists in the given input file.

An analysis of the current relationships among the data scientists is presented in the report.
