The goal of this project is to gather information of best 1000 Computer Science researchers from this website.
Later we utilized the scraped data to understand the following demographics and correlations using Tableau Dashboard:
- A barchart of countries with average publications.
- European countries with the number of scientists in a map (excluding Russia)
- Which Middle Eastern universities are good at research? (using citations as metric)
- Which column is directly correlated with World Rank column? We wanted to understand how the ranking was done.
You can visit the public dashboard here.
Findings and Observations from the Dashboard
- Brazilian scientists have the highest average publications.
- Researchers from King Abdullah University of Science and Technology (KAUST), Saudi Arabia have the highest number of average citations.
- Among European countries, United Kingdom (UK) has the highest number of scientists among the top 1000.
- The ranking was done most probably using H-Index.
- Clone the repo
git clone https://github.com/msi1427/Demographics-of-Best-CS-Scientists-Worldwide.git
- Intialize and activate virtual environment
virtualenv --no-site-packages venv
source venv/bin/activate
- Install dependencies
pip install -r requirements.txt
- Download Chrome WebDrive from https://chromedriver.chromium.org/downloads
- Run the scraper
python selenium_scraper/scraper.py --chromedriver_path <path_to_chromedriver>
- You will get a file named
best_cs_scientist_details.csv
containing all the required fields. Alternatively, check our scraped data here: https://github.com/msi1427/Demographics-of-Best-CS-Scientists-Worldwide/blob/main/selenium_scraper/best_cs_scientist_details.csv