# Demographic Data Analysis
This notebook analyzes demographic data extracted from the 1994 Census database using pandas.

We will answer the following questions:
1. How many people of each race are represented in this dataset?
2. What is the average age of men?
3. What is the percentage of people who have a Bachelor's degree?
4. What percentage of people with advanced education (Bachelors, Masters, or Doctorate) make more than 50K?
5. What percentage of people without advanced education make more than 50K?
6. What is the minimum number of hours a person works per week?
7. What percentage of the people who work the minimum number of hours per week have a salary of more than 50K?
8. Which country has the highest percentage of people who earn >50K, and what is that percentage?
9. Identify the most popular occupation for those who earn >50K in India.


In [None]:
# Import necessary libraries
import pandas as pd

# Load the dataset
df = pd.read_csv('adult.data.csv')
df.head()

## Question 1: How many people of each race are represented in this dataset?

In [None]:
# Count people by race
race_count = df['race'].value_counts()
race_count

## Question 2: What is the average age of men?

In [None]:
# Mean age of the rows where sex is 'Male', rounded to one decimal place
average_age_men = df[df['sex'] == 'Male']['age'].mean().round(1)
average_age_men

## Question 3: What is the percentage of people who have a Bachelor's degree?

In [None]:
# The mean of a boolean Series is the fraction of True values
percentage_bachelors = ((df['education'] == 'Bachelors').mean() * 100).round(1)
percentage_bachelors

## Question 4: Percentage with advanced education who earn >50K

In [None]:
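# Boolean mask: True for people with a Bachelors, Masters, or Doctorate
advanced_education = df['education'].isin(['Bachelors', 'Masters', 'Doctorate'])
# Share of >50K earners among the people selected by the mask
percentage_advanced_education_rich = ((df[advanced_education]['salary'] == '>50K').mean() * 100).round(1)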
percentage_advanced_education_rich

## Question 5: Percentage without advanced education who earn >50K

In [None]:
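# Boolean mask: True for people without a Bachelors, Masters, or Doctorate
no_advanced_education = ~df['education'].isin(['Bachelors', 'Masters', 'Doctorate'])
# Share of >50K earners among the people selected by the mask
percentage_non_advanced_education_rich = ((df[no_advanced_education]['salary'] == '>50K').mean() * 100).round(1)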
percentage_non_advanced_education_rich
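
Questions 4 and 5 split the dataset with the same education test, so both percentages can also be obtained in one pass by grouping on that test. The following is a minimal sketch on a small hypothetical DataFrame (`toy` and its rows are invented for illustration; only the column names mirror the dataset):

```python
import pandas as pd

# Hypothetical toy data; only the column names match the notebook's dataset.
toy = pd.DataFrame({
    'education': ['Bachelors', 'Masters', 'HS-grad', 'HS-grad', 'Doctorate', '11th'],
    'salary':    ['>50K',      '<=50K',   '<=50K',   '>50K',    '>50K',      '<=50K'],
})

advanced = toy['education'].isin(['Bachelors', 'Masters', 'Doctorate'])
is_rich = toy['salary'] == '>50K'

# Group the boolean "rich" flag by the boolean "advanced" flag; the mean of
# each group is the fraction of >50K earners in that group.
rich_share = (is_rich.groupby(advanced).mean() * 100).round(1)

# Plain dict keyed by the mask value: True = advanced, False = not advanced.
result = rich_share.to_dict()
print(result)
```

The two separate cells above stay easier to read. The grouped form mainly helps when the same split is reused for several statistics.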

## Question 6: Minimum work hours per week

In [None]:
# Smallest value in the hours-per-week column
min_work_hours = df['hours-per-week'].min()
min_work_hours

## Question 7: Percentage of minimum-hour workers with salary >50K

In [None]:
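# Despite its name, num_min_workers is a boolean mask (not a count) selecting
# the people who work min_work_hours, the value computed in Question 6
num_min_workers = df['hours-per-week'] == min_work_hours
rich_percentage = ((df[num_min_workers]['salary'] == '>50K').mean() * 100).round(1)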
rich_percentage

## Question 8: Country with highest percentage of people earning >50K

In [None]:
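# Number of >50K earners per country, and total number of people per country
countries_salary_count = df[df['salary'] == '>50K']['native-country'].value_counts()
countries_count = df['native-country'].value_counts()
# Percentage of >50K earners per country, computed once and reused below
rich_percentage_by_country = (countries_salary_count / countries_count) * 100
highest_earning_country = rich_percentage_by_country.idxmax()
highest_earning_country_percentage = rich_percentage_by_country.max().round(1)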
highest_earning_country, highest_earning_country_percentage
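
Dividing two `value_counts()` results leaves `NaN` for any country that has no >50K earners, because that country is missing from the numerator. `idxmax()` skips those entries, so the answer is unaffected, but a `groupby` gives an explicit 0 instead. The following is a sketch of that alternative on hypothetical data (`toy` and the country labels `A`, `B`, `C` are invented for illustration):

```python
import pandas as pd

# Hypothetical toy data; country "C" has no >50K earners.
toy = pd.DataFrame({
    'native-country': ['A', 'A', 'B', 'B', 'B', 'C'],
    'salary':         ['>50K', '<=50K', '>50K', '>50K', '<=50K', '<=50K'],
})

# Mean of the boolean flag per country = fraction of >50K earners there.
rich_pct = (toy['salary'] == '>50K').groupby(toy['native-country']).mean() * 100

top_country = rich_pct.idxmax()
top_pct = round(rich_pct.max(), 1)
print(top_country, top_pct)
```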

## Question 9: Most popular occupation for >50K earners in India

In [None]:
# mode() returns a Series of the most frequent value(s); take the first one
top_IN_occupation = df[(df['native-country'] == 'India') & (df['salary'] == '>50K')]['occupation'].mode()[0]
top_IN_occupation
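
Indexing `mode()` with `[0]` assumes that at least one row matches the filter. As a hedged sketch, the same lookup can be wrapped in a helper that returns `None` when nothing matches. The function name `top_occupation` and the `toy` data are hypothetical and not part of the original notebook:

```python
import pandas as pd

# Hypothetical toy data; only the column names match the notebook's dataset.
toy = pd.DataFrame({
    'native-country': ['India', 'India', 'India', 'Japan'],
    'salary':         ['>50K', '>50K', '>50K', '>50K'],
    'occupation':     ['Prof-specialty', 'Sales', 'Prof-specialty', 'Sales'],
})

def top_occupation(frame, country):
    """Return the most common occupation among >50K earners, or None."""
    selected = frame[(frame['native-country'] == country) & (frame['salary'] == '>50K')]
    # value_counts() sorts by frequency, most frequent first.
    counts = selected['occupation'].value_counts()
    return counts.index[0] if not counts.empty else None

india_top = top_occupation(toy, 'India')
missing_top = top_occupation(toy, 'France')
print(india_top, missing_top)
```

When two occupations are tied, this helper and `mode()[0]` may pick different winners, so the choice between them matters only if ties are possible in the data.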