The Nobel Prize has been among the most prestigious international awards since 1901. Each year, awards are bestowed in chemistry, literature, physics, physiology or medicine, economics, and peace. In addition to the honor, prestige, and substantial prize money, the recipient also gets a gold medal with an image of Alfred Nobel (1833 - 1896), who established the prize.

![](Nobel_Prize.png)

The Nobel Foundation has made available a dataset of all prize winners from the first awards in 1901 through 2023. The dataset used in this project comes from the Nobel Prize API and is available in the `nobel.csv` file in the `data` folder.

In this project, you'll get a chance to explore and answer several questions about this prizewinning data. We also encourage you to explore further questions that interest you!

In [78]:
# Loading in required libraries
import pandas as pd
import seaborn as sns
import numpy as np

In [79]:
nobel = pd.read_csv("data/nobel.csv")
print(nobel.head(40))
print(nobel.columns)

    year    category  ...      death_city             death_country
0   1901   Chemistry  ...          Berlin                   Germany
1   1901  Literature  ...        Châtenay                    France
2   1901    Medicine  ...         Marburg                   Germany
3   1901       Peace  ...          Heiden               Switzerland
4   1901       Peace  ...           Paris                    France
5   1901     Physics  ...          Munich                   Germany
6   1902   Chemistry  ...          Berlin                   Germany
7   1902  Literature  ...  Charlottenburg                   Germany
8   1902    Medicine  ...    Putney Heath            United Kingdom
9   1902       Peace  ...            Bern               Switzerland
10  1902       Peace  ...            Bern               Switzerland
11  1902     Physics  ...             NaN               Netherlands
12  1902     Physics  ...       Amsterdam               Netherlands
13  1903   Chemistry  ...       Stockholm       

In [80]:
# Most commonly awarded gender and birth country
top_gender = nobel['sex'].value_counts().idxmax()
top_country = nobel['birth_country'].value_counts().idxmax()

print(f"Top Gender: {top_gender}")
print(f"Top Birth Country: {top_country}")

Top Gender: Male
Top Birth Country: United States of America
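
`value_counts()` drops missing values by default, so the top category is chosen among rows with a recorded value only. The sketch below uses a small made-up series (not rows from `nobel.csv`) to show how `normalize=True` and `dropna=False` change the reported share; whether the real `sex` column contains missing entries is an assumption here.

```python
import pandas as pd

# Hypothetical toy column; None stands in for a row with no recorded sex
sex = pd.Series(["Male", "Male", "Female", None], name="sex")

# Default behaviour: missing values are dropped before counting
share_known = sex.value_counts(normalize=True)

# dropna=False keeps the missing rows in the denominator
share_all = sex.value_counts(normalize=True, dropna=False)

print(share_known["Male"])
print(share_all["Male"])
```

The most frequent label is the same either way; only the shares differ.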


In [81]:
# Decade with Highest Ratio of US-Born Nobel Prize Winners
# 'decade' stores year // 10 (e.g. 190 for the 1900s); multiply by 10 to get the label
nobel['decade'] = nobel['year'] // 10
usa_born = nobel[nobel['birth_country'] == 'United States of America']
usa_ratio = usa_born.groupby('decade').size() / nobel.groupby('decade').size()
max_decade_usa = usa_ratio.idxmax() * 10

print(f"Decade with Highest US Ratio: {max_decade_usa}")

Decade with Highest US Ratio: 2000
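
An alternative way to get the same kind of ratio is to take the mean of a boolean column per decade, since the mean of `True`/`False` values is the share of `True`. The sketch below runs on a few made-up rows (not taken from `nobel.csv`); the column name `usa_born` is introduced only for illustration.

```python
import pandas as pd

# Hypothetical toy rows, not taken from nobel.csv
toy = pd.DataFrame({
    "year": [1901, 1905, 1912, 1918, 1919, 1923],
    "birth_country": [
        "Germany", "United States of America", "France",
        "United States of America", "United States of America", "Sweden",
    ],
})

# Store the decade directly as its starting year (1900, 1910, ...)
toy["decade"] = (toy["year"] // 10) * 10

# Boolean flag; its per-decade mean is the share of US-born rows
toy["usa_born"] = toy["birth_country"] == "United States of America"
usa_share = toy.groupby("decade")["usa_born"].mean()

top_decade = usa_share.idxmax()
print(usa_share)
print(top_decade)
```

One difference from dividing two `size()` results: a decade with no US-born rows gets a share of 0 here rather than `NaN`.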


In [82]:
# Decade and Category with Highest Proportion of Female Laureates
female_only = nobel[nobel['sex'] == 'Female']
total_counts = nobel.groupby(['decade', 'category']).size()
female_counts = female_only.groupby(['decade', 'category']).size()
# Decade/category pairs with no female laureates become NaN, which idxmax() skips
female_proportion = female_counts / total_counts
max_proportion_index = female_proportion.idxmax()
max_female_dict = {max_proportion_index[0] * 10: max_proportion_index[1]}

print(f"Decade and Category with Highest Female Proportion: {max_female_dict}")

Decade and Category with Highest Female Proportion: {2020: 'Literature'}
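
The same boolean-mean idea extends to two grouping keys. The following is a minimal sketch on made-up rows (not from `nobel.csv`); the column name `is_female` is hypothetical. Grouping by two columns produces a MultiIndex, so `idxmax()` returns a `(decade, category)` tuple.

```python
import pandas as pd

# Hypothetical toy rows, not taken from nobel.csv
toy = pd.DataFrame({
    "year": [2001, 2003, 2004, 2012, 2015, 2018],
    "category": ["Physics", "Physics", "Peace", "Peace", "Peace", "Physics"],
    "sex": ["Male", "Female", "Male", "Female", "Female", "Male"],
})

toy["decade"] = (toy["year"] // 10) * 10
toy["is_female"] = toy["sex"] == "Female"

# Share of female rows for every (decade, category) pair
female_share = toy.groupby(["decade", "category"])["is_female"].mean()

# idxmax on a MultiIndex returns a tuple of index labels
best_decade, best_category = female_share.idxmax()
result = {int(best_decade): best_category}
print(result)
```

If several pairs tie for the highest share, `idxmax()` returns only the first one in index order, so a small group with a single female laureate can outrank a larger group.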


In [83]:
# First Woman to Receive a Nobel Prize and Category
first_woman = nobel[nobel['sex'] == 'Female'].sort_values('year').iloc[0]
first_woman_name = first_woman['full_name']
first_woman_category = first_woman['category']

print(f"First Woman Nobel Laureate: {first_woman_name}, Category: {first_woman_category}")

First Woman Nobel Laureate: Marie Curie, née Sklodowska, Category: Physics
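
When several rows share the earliest year, which one ends up first after sorting can depend on the sort algorithm. A possible alternative, sketched here on made-up rows (names and values are hypothetical), is to look up the row with `idxmin()`, which returns the label of the first occurrence of the minimum.

```python
import pandas as pd

# Hypothetical toy rows, not taken from nobel.csv
toy = pd.DataFrame({
    "year": [1950, 1920, 1920, 1975],
    "sex": ["Male", "Female", "Female", "Female"],
    "full_name": ["Person One", "Person Two", "Person Three", "Person Four"],
    "category": ["Physics", "Peace", "Chemistry", "Literature"],
})

women = toy[toy["sex"] == "Female"]

# idxmin() gives the index label of the earliest year (first occurrence on ties)
first = women.loc[women["year"].idxmin()]
print(first["full_name"], first["category"])
```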


In [84]:
# Individuals or Organizations with More Than One Nobel Prize
prize_counts = nobel.groupby('full_name').size()
repeat_list = prize_counts[prize_counts > 1].index.tolist()

print(f"Laureates with Multiple Prizes: {repeat_list}")

Laureates with Multiple Prizes: ['Comité international de la Croix Rouge (International Committee of the Red Cross)', 'Frederick Sanger', 'John Bardeen', 'Linus Carl Pauling', 'Marie Curie, née Sklodowska', 'Office of the United Nations High Commissioner for Refugees (UNHCR)']
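
To also see how many prizes each repeat winner holds, the counts can be kept alongside the names. This is a minimal sketch on made-up names (not from `nobel.csv`), using `value_counts()` and a sort on the index to keep the alphabetical order shown above.

```python
import pandas as pd

# Hypothetical toy column, not taken from nobel.csv
names = pd.Series([
    "Laureate A", "Laureate B", "Laureate A",
    "Organization C", "Organization C", "Organization C",
    "Laureate D",
], name="full_name")

# Count rows per name, keep names that appear more than once
counts = names.value_counts()
repeat_counts = counts[counts > 1].sort_index().to_dict()
print(repeat_counts)
```

Matching on `full_name` assumes that each laureate's name is spelled identically on every row.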
