-
Updated
Jun 1, 2024
data-cleaning
Here are 2,812 public repositories matching this topic...
This project aim to analyse key opportunities that will help to improve the satisfaction of employees and their productivity in the organization
-
Updated
Jun 1, 2024
Wikidata and Wikipedia language data extraction
-
Updated
Jun 1, 2024 - Python
This repo contains projects, tasks and other code which I have developed on a Data Scientist course at SkillFactory.
-
Updated
Jun 1, 2024 - Jupyter Notebook
R Language course from University. Emphasis on visualization, cleaning and manipulation of data. *ongoing
-
Updated
Jun 1, 2024 - HTML
An ML-based project designed to accurately classify email messages as either spam or ham (non-spam)
-
Updated
Jun 1, 2024 - Jupyter Notebook
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
-
Updated
Jun 1, 2024 - Python
A light-weight, flexible, and expressive statistical data testing library
-
Updated
Jun 1, 2024 - Python
The open-source tool for building high-quality datasets and computer vision models
-
Updated
May 31, 2024 - Python
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
-
Updated
Jun 1, 2024 - C++
Client interface for all things Cleanlab Studio
-
Updated
Jun 1, 2024 - Python
Scrapedin is a Python module designed to simplify the process of gathering job listings from LinkedIn and performing data analysis on the scraped data to develop an understanding for the required skills and more needed in the market in real-time!
-
Updated
May 31, 2024 - Python
This report explores potential insights that can be derived from the Employees Data Set.
-
Updated
May 31, 2024
My first portfolio project performed using Excel tools
-
Updated
May 31, 2024
Repo for PSRC's Regional Travel Studies, 2014 onward
-
Updated
May 31, 2024 - HTML
Data was downloaded through Kaggle
-
Updated
May 31, 2024 - Jupyter Notebook
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
-
Updated
May 31, 2024 - Go
Prepping tables for machine learning
-
Updated
May 31, 2024 - Python
R package to clean and standardize epidemiological data
-
Updated
May 31, 2024 - R
Improve this page
Add a description, image, and links to the data-cleaning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-cleaning topic, visit your repo's landing page and select "manage topics."