Karthik15906/DataCleaner

🧹 DataCleaner

A simple and reusable Python library to automate data cleaning for machine learning workflows.


🚀 Overview

While working on machine learning models, I noticed that I was repeatedly writing the same Pandas code to clean datasets.

To solve this, I built DataCleaner, a lightweight library that handles common data preprocessing tasks in a single pass.


✨ Features

  • 📂 Load data from:

    • CSV
    • Excel
    • JSON
  • 🧼 Handle missing values:

    • Drop missing values
    • Fill with mean
    • Fill with median
  • 🔁 Remove duplicate rows (optional)

  • 📊 Logging support to track cleaning steps
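The features above can be sketched with plain pandas. This is only an illustration, not DataCleaner's actual API: the names `load_data`, `clean`, `strategy`, and `remove_duplicates` are hypothetical, and the sketch assumes that mean/median filling applies to numeric columns only.

```python
import logging
from pathlib import Path

import pandas as pd

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("datacleaner_sketch")  # hypothetical logger name

# Hypothetical mapping from file extension to the matching pandas reader.
READERS = {".csv": pd.read_csv, ".xlsx": pd.read_excel, ".json": pd.read_json}


def load_data(path):
    """Pick a pandas reader based on the file extension."""
    suffix = Path(path).suffix.lower()
    if suffix not in READERS:
        raise ValueError(f"Unsupported file type: {suffix}")
    logger.info("Loading %s", path)
    return READERS[suffix](path)


def clean(df, strategy="drop", remove_duplicates=True):
    """Handle missing values, then optionally drop duplicate rows."""
    if strategy == "drop":
        out = df.dropna()
    elif strategy in ("mean", "median"):
        # Assumption: only numeric columns are filled; others are left as-is.
        numeric = df.select_dtypes(include="number").columns
        fill_values = getattr(df[numeric], strategy)()
        out = df.fillna(fill_values)
    else:
        raise ValueError(f"Unknown strategy: {strategy}")
    logger.info("Missing values handled with strategy=%s", strategy)

    if remove_duplicates:
        before = len(out)
        out = out.drop_duplicates()
        logger.info("Removed %d duplicate rows", before - len(out))
    return out
```

Under these assumptions, a call such as `clean(load_data("data.csv"), strategy="median")` would load a file, fill numeric gaps with each column's median, drop duplicate rows, and log each step along the way.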


⚙️ Installation

Make sure Python is installed, then install the required dependency:

pip install pandas

If Excel files are loaded through pandas, reading .xlsx files also needs an Excel engine such as openpyxl:

pip install openpyxl
