Skip to content

rushofcode/Data-Cleaning-Task1

Repository files navigation

medical-show-cleaning

Task 1: Data Cleaning and Preprocessing

Dataset: Medical Appointment No Shows (Kaggle)

Objective

Clean and prepare the raw dataset by handling missing values, duplicates, inconsistent text, and incorrect formats.

Steps Performed

  1. Removed missing values and duplicates
  2. Standardized column names (lowercase + underscores)
  3. Cleaned text formats for gender and no-show columns
  4. Converted dates to datetime format
  5. Fixed data types (age as int, IDs as string)
  6. Removed invalid ages (<0 or >115)

Tools Used

  • Python 3.11
  • Pandas

Output

A cleaned dataset ready for analysis: cleaned_medical_appointments.csv

Author

Rushikesh Palekar

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published