This repository contains the code used for the research I did as part of the TU Delft CSE Research Project (CSE3000), replicating and extending the work of Kelty et al. on the influence of superstar researchers. The Go code is used for downloading and preprocessing the S2AG dataset, the Elixir Livebook was used for early data exploration and some supplementary analysis and visualisation, and the Python scripts in the w*/ folders contain the key computations and visualisations.
Some smaller computed datasets are included in the w*/ folders. The structure of the datasets/ folder is replicated but with empty files, as these datasets are excessively large.