Nagul71/AI-data-Cleaning-Website

Data Cleaning Platform

A powerful web-based platform for automated data cleaning, analysis, and visualization. This tool helps data scientists and analysts streamline their data preprocessing workflow with an intuitive interface and smart cleaning recommendations.

Features

🧹 Smart Data Cleaning

  • Automated detection and handling of missing values
  • Intelligent duplicate row removal
  • Outlier detection and handling
  • Column correlation analysis and redundancy removal
  • Special character handling and standardization
  • Smart date format detection and conversion
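
The README does not say how these cleaning steps are implemented. As a rough illustration only, three of them (duplicate removal, missing-value filling, and outlier detection) can be sketched with the Python standard library. The function names, the median fill strategy, and the 1.5 × IQR outlier rule are assumptions chosen for this example, not the platform's documented behavior.

```python
from statistics import median, quantiles


def drop_duplicate_rows(rows):
    """Keep the first occurrence of each row (a dict), preserving order."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(sorted(row.items()))  # hashable fingerprint of the row
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def fill_missing_with_median(rows, column):
    """Replace None in `column` with the median of the values that are present."""
    present = [r[column] for r in rows if r[column] is not None]
    if not present:
        return [dict(r) for r in rows]  # nothing to compute a median from
    fill = median(present)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def iqr_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule, k=1.5)."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


rows = [
    {"id": 1, "age": 30},
    {"id": 1, "age": 30},    # exact duplicate of the first row
    {"id": 2, "age": None},  # missing value
    {"id": 3, "age": 40},
]
cleaned = fill_missing_with_median(drop_duplicate_rows(rows), "age")
print(cleaned)
print(iqr_outliers([10, 12, 11, 13, 12, 11, 100]))
```

Whether an outlier should be dropped, capped, or kept depends on the dataset, so this sketch only reports candidates and leaves the decision to the caller.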

📊 Data Profiling

  • Comprehensive data quality assessment
  • Column type detection (numeric, categorical, datetime)
  • Missing value analysis
  • Data quality scoring system with detailed feedback
  • Duplicate entry detection
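
One possible way to approach column type detection and missing value analysis is sketched here for a column of raw text values, using only the standard library. The `profile_column` helper and the `DATE_FORMATS` tuple are hypothetical names introduced for this example; the README does not list which date formats or detection rules the platform actually uses.

```python
from datetime import datetime

# Assumed list of date formats to try; the platform's real list is not documented.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%m/%d/%Y")


def _is_number(text):
    try:
        float(text)
        return True
    except ValueError:
        return False


def _is_date(text):
    for fmt in DATE_FORMATS:
        try:
            datetime.strptime(text, fmt)
            return True
        except ValueError:
            continue
    return False


def profile_column(values):
    """Classify a column of strings and report its share of missing entries."""
    present = [v for v in values if v not in (None, "")]
    missing_ratio = 1 - len(present) / len(values) if values else 0.0
    if present and all(_is_number(v) for v in present):
        kind = "numeric"
    elif present and all(_is_date(v) for v in present):
        kind = "datetime"
    else:
        kind = "categorical"  # fallback for mixed or free-text columns
    return {"type": kind, "missing_ratio": missing_ratio}


print(profile_column(["1", "2.5", "", None]))
print(profile_column(["2024-11-20", "20/11/2024"]))
print(profile_column(["red", "green", "red"]))
```

Requiring every present value to parse is a deliberately strict rule; a more tolerant detector might accept a column as numeric when, say, most values parse.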

📈 Data Visualization

  • Interactive visualization dashboard
  • Multiple plot types supported:
    • Line plots
    • Bar plots
    • Scatter plots
    • Histograms
    • Box plots
    • Violin plots
    • Correlation heatmaps
    • Pie charts
    • Density plots
  • Customizable plot options and parameters
  • Plot download functionality

💾 File Support

  • Input formats: CSV, Excel (.xlsx, .xls)
  • Export options: CSV, Excel, JSON
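
To illustrate the import and export round trip, the following sketch reads CSV text into records and writes them back out as JSON or CSV using only the standard library. The helper names are hypothetical. Excel input and output are left out because the standard library has no Excel support, and the README does not name the library the platform uses for it.

```python
import csv
import io
import json


def csv_text_to_records(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def records_to_json(records):
    """Serialize records as a JSON array of objects."""
    return json.dumps(records, indent=2)


def records_to_csv(records):
    """Serialize records back to CSV text, using the first record's keys as the header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(records[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


records = csv_text_to_records("name,age\nAda,36\n")
print(records_to_json(records))
print(records_to_csv(records))
```

Note that `csv.DictReader` returns every field as a string, so type conversion is a separate step.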

🤖 Smart Recommendations

  • Automated cleaning recommendations based on data analysis
  • Priority-based suggestion system
  • Detailed descriptions for each recommended operation
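
A priority-based suggestion system of this kind could look roughly like the sketch below. Everything in it is an assumption made for illustration: the `Recommendation` fields, the shape of the `stats` dictionary, the 50% missing-value cutoff, and the convention that priority 1 is the most urgent. The README does not describe the platform's actual rules.

```python
from dataclasses import dataclass


@dataclass
class Recommendation:
    priority: int      # assumed convention: 1 is the most urgent
    operation: str     # short identifier for the cleaning step
    description: str   # human-readable explanation shown to the user


def recommend(stats):
    """Turn summary statistics into recommendations, most urgent first."""
    recs = []
    duplicates = stats.get("duplicate_rows", 0)
    if duplicates > 0:
        recs.append(Recommendation(
            2, "remove_duplicates", f"{duplicates} duplicate rows found."))
    for column, ratio in stats.get("missing_ratio", {}).items():
        if ratio > 0.5:  # assumed cutoff for suggesting removal
            recs.append(Recommendation(
                1, "drop_column",
                f"Column '{column}' is {ratio:.0%} missing; consider dropping it."))
        elif ratio > 0:
            recs.append(Recommendation(
                3, "fill_missing",
                f"Column '{column}' is {ratio:.0%} missing; consider filling it."))
    return sorted(recs, key=lambda r: r.priority)


stats = {"duplicate_rows": 4, "missing_ratio": {"age": 0.1, "notes": 0.8, "id": 0.0}}
for rec in recommend(stats):
    print(rec.priority, rec.operation, rec.description)
```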

Data Operations

  • Forward and backward fill for missing values
  • Categorical variable encoding
  • Numeric column normalization
  • Column name standardization
  • Constant column removal
  • High-correlation column analysis and removal
  • Customizable correlation threshold settings

Quality Assessment

  • Overall data quality scoring
  • Detailed quality deduction explanations
  • Continuous quality monitoring during cleaning process
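
A score with explained deductions might be computed along these lines. The 100-point scale, the two weights, and the `quality_score` signature are all hypothetical; the README does not document the platform's actual scoring formula.

```python
# Assumed maximum deductions, in points, for fully missing / fully duplicated data.
MISSING_WEIGHT = 40
DUPLICATE_WEIGHT = 30


def quality_score(total_rows, duplicate_rows, total_cells, missing_cells):
    """Return (score, deductions), where each deduction is (reason, points)."""
    deductions = []
    if total_cells and missing_cells:
        points = round(MISSING_WEIGHT * missing_cells / total_cells, 2)
        deductions.append(
            (f"{missing_cells} of {total_cells} cells are missing", points))
    if total_rows and duplicate_rows:
        points = round(DUPLICATE_WEIGHT * duplicate_rows / total_rows, 2)
        deductions.append(
            (f"{duplicate_rows} of {total_rows} rows are duplicates", points))
    score = max(0.0, round(100.0 - sum(points for _, points in deductions), 2))
    return score, deductions


score, deductions = quality_score(
    total_rows=100, duplicate_rows=10, total_cells=500, missing_cells=50)
print(score)
for reason, points in deductions:
    print(f"-{points}: {reason}")
```

Calling a function like this again after each cleaning step is one simple way to track quality as cleaning progresses.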

This platform is designed to make data cleaning and preparation more efficient and accessible, whether you're a data scientist, analyst, or researcher working with datasets that need preprocessing before analysis.
