Data preprocessing is a data mining technique that involves transforming raw data into an understandable format.
Daily exercise notebooks from the Python Machine Learning A-Z course on Udemy.
Classification of breast cancer diagnoses with Support Vector Machines in Python using Sklearn
100 Days of ML Code, including all machine learning related algorithms and processes, explained with Jupyter notebooks.
This document forms the basis of several workshops/talks that get into everyday programming with R, but also includes mirrored code in Python as Jupyter notebooks.
Data processing repository for my Bachelor's thesis. In search of a more convenient data processing flow and better data visualization than existing tools offer, I use Jupyter Notebook.
Machine Learning notebooks for refreshing concepts.
December 2021 - Final project of the 4th engineering year for the Python for Data Analysis module at ESILV | Blocks Classification & Seoul Bikes Rent Prediction
Python Data Understanding and Data Cleaning for Sales Project Notebook
Python Preprocessing for Sales Project Notebook
Python Data Visualization and Time Series for Sales Project Notebook
Analyzing employee data through Jupyter Notebook using Pandas and Plotly libraries to provide insights about performance, talent, demographics and so on.
The repo contains a variety of data insight techniques, with an intermediate version that includes Data Analysis Notebooks, Data Visualization Notebooks, and Data Pre-processing Notebooks.
This repository consists of the Jupyter notebook files of the first project I worked on. It covers data processing, descriptive analysis, and linear regression modeling to predict the total claim amount of the customers of an insurance company.
Churn Prediction Model with EDA, Feature Engineering, and a comparison of different models. Based on a Kaggle notebook.
This is a project based on Python and Jupyter Notebook for analysing consumer data collected during Diwali. The aim of this project is to improve customer experience and increase the firm's revenue by analyzing the sales data.
This repository contains a data science mini project focused on exploring and analyzing an IMDB dataset. The project utilizes Python and popular data science libraries. Through a series of Jupyter Notebooks, the project demonstrates various data preprocessing techniques, EDA, and the application of ML algorithms.
An innovative and collaborative solution for setting up and executing Jupyter Notebooks on High-Performance Computing (HPC) clusters, tailored for neuroscience data processing workflows.
Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks
Reference blog for notebooks, various studies, book exercises, tutorials, and other platforms.