This is a simple library to help you clean your textual data
-
Updated
Jan 2, 2023 - Python
This is a simple library to help you clean your textual data
Analyze Diwali Sales data using Pandas, NumPy, Matplotlib, and Seaborn Libraries to Improve customer experience and also sales.
Small data analysis test of Investing.com comments, Natural Gas Futures. Currently implementing machine learning to the Training_Set data
Tool for preparing a dataset for publishing by dropping, renaming, scaling, and obfuscating columns defined in a recipe.
A python program that takes an Excel or CSV based input file, and cleans the data and exports to multiple tabs based on specified unique values
This is a pandas test for a data science job. The solution here is in form of a notebook beside the main Python file.
Tool that allows you to safely delete multimedia files, without the possibility of recovering the content of the file.
A repository for my big data class's midterm exam.
animal-behavior-preprocessing is a Python repository to preprocess animal behavior data. It works on the output spreadsheets from video-tracking of animal body parts with LEAP or DeepLabCut. It applies a Median Filter, an Ensemble Kalman Filter, transforms data to joint angles and computes their Morlet Wavelet Spectra.
A small scrip to split the output file of the Nihon Codhen MEB-9600 in to different files with each of the sweeps for post data analysis.
A simple Python script to clean files from directories based on their extensions.
Some stats analysis and visualisation of a dataset I did for a friend, not the cleanest code, but works for the use case. Only showcasing the process as not going to show the original dataset as it is not mine to share.
Joining, Cleaning, Querying, Performing ETL on Twitter Posts Dataset.
A helper environment/library for cleaning & querying the CER Smart Meter Trials 2009-2011 datasets via pandas, dask, pandas and Google Colaboratory
This web application automates data cleaning, enabling users to upload CSV or XLSX files. It processes the data to manage missing values and remove duplicates, then allows users to download the cleaned dataset. Built with Flask and Pandas, it simplifies the data preparation process.
This features a dataset cleaner for E-Sport Matches datasets from https://oracleselixir.com/tools/downloads, it works on all the datasets from Elixir, but the cleaning is for training ML with focus on results, it maybe adapted to clean results, and operate on damage, drakes etc.
Rainfall Data Analysis performed in python using the pandas library
Add a description, image, and links to the cleaning-data topic page so that developers can more easily learn about it.
To associate your repository with the cleaning-data topic, visit your repo's landing page and select "manage topics."