# **Analyzing the "Census Adult Income" Dataset**

# Step 1: Import Libraries and Explore the Dataset

## **Objectives**
- Load the dataset and examine its structure.
- Check the first few rows of the data.
- Assess the column data types and summary statistics.
- Identify any missing values.
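
The checks listed above can be sketched on a small stand-in frame before running them on the full query result. This is a minimal sketch under stated assumptions: the `sample` rows are invented for illustration and are not taken from the real table, `summarize` is a hypothetical helper name, and the code assumes missing values may appear either as real nulls or as the placeholder string `?` (the convention used in the UCI release of the Adult dataset).

```python
import pandas as pd

# Hypothetical sample rows standing in for the query result.
# The values are illustrative only, not real data from the table.
sample = pd.DataFrame(
    {
        "age": [39, 77, 38, 28],
        "workclass": ["Private", " ?", "Private", None],
        "hours_per_week": [34, 10, 40, 45],
        "income_bracket": ["<=50K", "<=50K", ">50K", "<=50K"],
    }
)


def summarize(frame: pd.DataFrame) -> dict:
    """Collect the structural checks listed in the objectives."""
    cleaned = frame.copy()
    for col in cleaned.columns:
        if pd.api.types.is_numeric_dtype(cleaned[col]):
            continue
        # Assumption: "?" (possibly padded with spaces) marks a missing value.
        stripped = cleaned[col].str.strip()
        cleaned[col] = stripped.mask(stripped == "?")
    return {
        "shape": cleaned.shape,                       # (rows, columns)
        "dtypes": cleaned.dtypes,                     # data type per column
        "numeric_summary": cleaned.describe(),        # stats for numeric columns
        "missing_per_column": cleaned.isna().sum(),   # null count per column
    }


report = summarize(sample)
print(sample.head())
print(report["dtypes"])
print(report["numeric_summary"])
print(report["missing_per_column"])
```

Treating the `?` placeholder as missing before counting nulls matters because `isna()` alone would report such cells as present. Whether the BigQuery copy of the table uses this placeholder should be verified against the actual data.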

## **Code**


In [16]:
# Import required libraries
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
from google.cloud import bigquery

# Initialize the BigQuery client; the query job runs under this project
client = bigquery.Client(project="central-catcher-444011-s7")

In [17]:
# Select the columns needed for the analysis
query = """
    SELECT 
        age,
        workclass,
        education,
        education_num,
        marital_status,
        occupation,
        relationship,
        race,
        sex,
        capital_gain,
        capital_loss,
        hours_per_week,
        native_country,
        income_bracket
    FROM `bigquery-public-data.ml_datasets.census_adult_income`
"""
query_job = client.query(query)
# Wait for the query to finish; result() returns an iterator over the rows
results = query_job.result()
# Convert each row to a dict and load the rows into a pandas DataFrame
df = pd.DataFrame([dict(row) for row in results])
# Preview the first five rows
print(df.head())


   age workclass education  education_num       marital_status  \
0   39   Private       9th              5   Married-civ-spouse   
1   77   Private       9th              5   Married-civ-spouse   
2   38   Private       9th              5   Married-civ-spouse   
3   28   Private       9th              5   Married-civ-spouse   
4   37   Private       9th              5   Married-civ-spouse   

           occupation relationship    race      sex  capital_gain  \
0       Other-service         Wife   Black   Female          3411   
1     Priv-house-serv         Wife   Black   Female             0   
2       Other-service         Wife   Black   Female             0   
3     Protective-serv         Wife   Black   Female             0   
4   Machine-op-inspct         Wife   Black   Female             0   

   capital_loss  hours_per_week  native_country income_bracket  
0             0              34   United-States          <=50K  
1             0              10   United-States          <