# U.S. Medical Insurance Costs

# Project Overview
For this project, you will investigate a medical insurance costs dataset stored in a .csv file,  
using the Python skills that you have developed.  
The dataset and its parameters will seem familiar if you have  
done any of the previous Python projects in the data science path.

# Project Goals:
* Work locally on your own computer
* Import a dataset into your program
* Analyze a dataset by building out functions or class methods
* Use libraries to assist in your analysis
* Optional: Document and organize your findings
* Optional: Make predictions about a dataset’s features based on your findings

# Step 1. Look at the data in *insurance.csv*

In [16]:
# Import csv library
import csv

# Read the insurance.csv file
with open("insurance.csv", newline="") as insurance_obj:
    insurance_reader = csv.DictReader(insurance_obj)
    for row in insurance_reader:
        print(row)
    

{'age': '19', 'sex': 'female', 'bmi': '27.9', 'children': '0', 'smoker': 'yes', 'region': 'southwest', 'charges': '16884.924'}
{'age': '18', 'sex': 'male', 'bmi': '33.77', 'children': '1', 'smoker': 'no', 'region': 'southeast', 'charges': '1725.5523'}
{'age': '28', 'sex': 'male', 'bmi': '33', 'children': '3', 'smoker': 'no', 'region': 'southeast', 'charges': '4449.462'}
{'age': '33', 'sex': 'male', 'bmi': '22.705', 'children': '0', 'smoker': 'no', 'region': 'northwest', 'charges': '21984.47061'}
{'age': '32', 'sex': 'male', 'bmi': '28.88', 'children': '0', 'smoker': 'no', 'region': 'northwest', 'charges': '3866.8552'}
{'age': '31', 'sex': 'female', 'bmi': '25.74', 'children': '0', 'smoker': 'no', 'region': 'southeast', 'charges': '3756.6216'}
{'age': '46', 'sex': 'female', 'bmi': '33.44', 'children': '1', 'smoker': 'no', 'region': 'southeast', 'charges': '8240.5896'}
{'age': '37', 'sex': 'female', 'bmi': '27.74', 'children': '3', 'smoker': 'no', 'region': 'northwest', 'charges': '7281.

The data includes each patient's age, sex, BMI, number of children, smoker status, region, and charges.
The data is stored in a .csv file with a comma (",") as the delimiter.
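
To check the column names and row count without printing every row, a minimal sketch along these lines may help. It assumes the header order matches the key order in the output above, and it reads three of those rows from an in-memory string so it runs without the file. `SAMPLE_CSV` and `describe_csv` are hypothetical names introduced for illustration.

```python
import csv
import io

# Three rows copied from the printed output above (assumption: the real
# header has the same column order as the dictionary keys shown there).
SAMPLE_CSV = """age,sex,bmi,children,smoker,region,charges
19,female,27.9,0,yes,southwest,16884.924
18,male,33.77,1,no,southeast,1725.5523
28,male,33,3,no,southeast,4449.462
"""


def describe_csv(file_obj):
    """Return the column names and the number of data rows in a CSV file object."""
    reader = csv.DictReader(file_obj)  # the delimiter defaults to ","
    row_count = sum(1 for _ in reader)  # consume the reader, counting data rows
    return reader.fieldnames, row_count


columns, n_rows = describe_csv(io.StringIO(SAMPLE_CSV))
print(columns)
print(n_rows)
```

With the real file, the same function could be called as `describe_csv(open("insurance.csv", newline=""))` inside a `with` block.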

# Step 2. Define the scope of the analysis  
Potential Questions:
* What is the average age of the patients?
* What is the average cost (charge) per patient?
* What is the ratio between the sexes of the patients?
* How do the different variables affect the charges within the dataset?
    * Is there a difference in average charge when grouped by:
        * Sexes?
        * Smoker status?
        * Region?
        * Number of children?
        * NB. These variables are likely confounded!
    * Is there a difference in average BMI when grouped by:
        * Sexes?
        * Smoker status?
        * Number of children?
        * Region?
        * NB. These variables are likely confounded!
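
The questions above mostly reduce to three operations: an overall average, a count per category, and an average per group. The sketch below shows one possible way to write them with the standard library. It runs on four rows copied from the Step 1 output, so the numbers it produces describe that tiny sample only, not the dataset. The helper names (`average`, `average_by_group`) are hypothetical.

```python
import csv
import io
from collections import Counter, defaultdict

# Four rows copied from the Step 1 output; far too few to draw conclusions from.
SAMPLE_CSV = """age,sex,bmi,children,smoker,region,charges
19,female,27.9,0,yes,southwest,16884.924
18,male,33.77,1,no,southeast,1725.5523
28,male,33,3,no,southeast,4449.462
33,male,22.705,0,no,northwest,21984.47061
"""


def average(rows, column):
    """Mean of a numeric column (values are read from the CSV as strings)."""
    values = [float(row[column]) for row in rows]
    return sum(values) / len(values)


def average_by_group(rows, group_column, value_column):
    """Mean of value_column for each distinct value of group_column."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_column]].append(float(row[value_column]))
    return {name: sum(vals) / len(vals) for name, vals in groups.items()}


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

mean_age = average(rows, "age")
sex_counts = Counter(row["sex"] for row in rows)
charges_by_smoker = average_by_group(rows, "smoker", "charges")

print(mean_age)
print(sex_counts)
print(charges_by_smoker)
```

Because the grouping variables are likely confounded, a difference between group averages from `average_by_group` does not by itself show that the grouping variable causes the difference.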

# Step 3. Import the *insurance.csv* dataset

In [3]:
# Import libraries
import csv

# Read in the insurance.csv file as a file object and store
# {row number: {"Age": ..., "Sex": ...}} in insurance_dict
with open("insurance.csv", newline="") as insurance_csv:
    insurance_reader = csv.DictReader(insurance_csv)
    # Map each zero-based row number to that patient's age and sex
    # (values stay as strings, exactly as read from the file)
    insurance_dict = {}
    for key, row in enumerate(insurance_reader):
        insurance_dict[key] = {"Age": row["age"], "Sex": row["sex"]}

{0: {'Age': '19', 'Sex': 'female'},
 1: {'Age': '18', 'Sex': 'male'},
 2: {'Age': '28', 'Sex': 'male'},
 3: {'Age': '33', 'Sex': 'male'},
 4: {'Age': '32', 'Sex': 'male'},
 5: {'Age': '31', 'Sex': 'female'},
 6: {'Age': '46', 'Sex': 'female'},
 7: {'Age': '37', 'Sex': 'female'},
 8: {'Age': '37', 'Sex': 'male'},
 9: {'Age': '60', 'Sex': 'female'},
 10: {'Age': '25', 'Sex': 'male'},
 11: {'Age': '62', 'Sex': 'female'},
 12: {'Age': '23', 'Sex': 'male'},
 13: {'Age': '56', 'Sex': 'female'},
 14: {'Age': '27', 'Sex': 'male'},
 15: {'Age': '19', 'Sex': 'male'},
 16: {'Age': '52', 'Sex': 'female'},
 17: {'Age': '23', 'Sex': 'male'},
 18: {'Age': '56', 'Sex': 'male'},
 19: {'Age': '30', 'Sex': 'male'},
 20: {'Age': '60', 'Sex': 'female'},
 21: {'Age': '30', 'Sex': 'female'},
 22: {'Age': '18', 'Sex': 'male'},
 23: {'Age': '34', 'Sex': 'female'},
 24: {'Age': '37', 'Sex': 'male'},
 25: {'Age': '59', 'Sex': 'female'},
 26: {'Age': '63', 'Sex': 'female'},
 27: {'Age': '55', 'Sex': 'female'},
 2

In [7]:
insurance_dict[1]

{'Age': '18', 'Sex': 'male'}
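
The dictionary above keeps only age and sex, and every value is still a string. Answering the Step 2 questions would also need the other columns in numeric form. The following is one possible sketch, not the notebook's own code: `load_records` and the extra key names (`"BMI"`, `"Children"`, `"Smoker"`, `"Region"`, `"Charges"`) are assumptions made for illustration, and the sample rows are copied from the Step 1 output.

```python
import csv
import io

# Two rows copied from the Step 1 output, used in place of insurance.csv.
SAMPLE_CSV = """age,sex,bmi,children,smoker,region,charges
19,female,27.9,0,yes,southwest,16884.924
18,male,33.77,1,no,southeast,1725.5523
"""


def load_records(file_obj):
    """Build {row number: record} with numeric columns converted from strings."""
    records = {}
    for key, row in enumerate(csv.DictReader(file_obj)):
        records[key] = {
            "Age": int(row["age"]),
            "Sex": row["sex"],
            "BMI": float(row["bmi"]),
            "Children": int(row["children"]),
            "Smoker": row["smoker"] == "yes",  # store smoker status as a bool
            "Region": row["region"],
            "Charges": float(row["charges"]),
        }
    return records


records = load_records(io.StringIO(SAMPLE_CSV))
print(records[1])
```

Converting once at load time means later functions can do arithmetic on `"Age"`, `"BMI"`, and `"Charges"` directly instead of calling `int()` or `float()` at every use.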