Skip to content

Hypernap/data-helper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Here's a complete GitHub README for the Data Helper project:

Data Helper

Data Helper is a powerful Python application designed to streamline your data cleaning and exploration process. With its intuitive graphical user interface (GUI), this tool empowers you to effortlessly load and cleanse your data files, leaving you with a pristine dataset ready for analysis.

Features

  • Seamless Data Ingestion: Support for popular file formats like CSV and Excel, ensuring compatibility with your existing data sources.
  • Intelligent Data Cleaning: Automatically removes rows with missing values in the specified key column, ensuring data integrity.
  • Comprehensive Summary Statistics: Gain valuable insights with overall descriptive statistics and column-wise detailed summaries.
  • Data Visualization: Explore your cleaned dataset through an interactive tabular view with scrolling capabilities.
  • User-Friendly GUI: Enjoy a smooth and intuitive experience with the sleek Tkinter-based interface.
  • Leveraging Industry-Standard Libraries: Harness the power of Pandas, NumPy, and other renowned data analysis libraries.
  • Robust and Extensible: Built with a modular architecture, ensuring scalability and future enhancements.
  • Error Handling and Input Validation: Implemented robust error handling and input validation mechanisms for reliable operation.

Video Example

Watch the video

Click the image above to watch the video on YouTube.

Installation

  1. Clone the repository:
git clone https://github.com/your-username/data-helper.git
  1. Navigate to the project directory:
cd data-helper
  1. Install the required dependencies:
pip install -r requirements.txt

Usage

  1. Place your data files (CSV or Excel) in the data directory within the project.

  2. Run the application:

python main.py
  1. The application will prompt you to select a file from the data directory.

  2. Choose the key column you want to use for data cleaning.

  3. The application will display the cleaned data summary, including overall and column-wise statistics, along with an interactive tabular view.

Contributing

Contributions are welcome! If you have any suggestions, bug reports, or feature requests, please open an issue or submit a pull request.

License

This project is licensed under the MIT License.

Acknowledgements

  • Pandas - A powerful data analysis and manipulation library for Python.
  • NumPy - A fundamental library for scientific computing in Python.
  • Tkinter - Python's standard GUI library.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages