Skip to content

marimor62/Lab-Data_wrangling-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

Lab | Data cleaning and wrangling

For this lab, we will be using the Marketing Customer Value Analysis database from before (marketing_customer_analysis.csv). An auto insurance company has collected some data about its customers including their demographic data, education, employment, policy details, vehicle information, insurance policy and claim amounts.

Instructions

This lab will focus on data cleaning and wrangling, this is a crucial step in the EDA process.

  1. Remove the outliers in the dataset using one of the methods we've discussed by defining a function and applying it to the dataframe.
  2. Create a copy of the dataframe for the data wrangling.
  3. Normalize the continuous variables.
  4. Encode the categorical variables.
  5. Transform the time variables (day, week and month) to integers.
  6. Since the model will only accept numerical data, check and make sure that every column is numerical, if some are not, change it using encoding.

About

LAB Ironhack - Data cleaning and wrangling

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published