Analyze Diwali Sales data using Pandas, NumPy, Matplotlib, and Seaborn Libraries to Improve customer experience and also sales.
-
Updated
Aug 22, 2023 - Python
Analyze Diwali Sales data using Pandas, NumPy, Matplotlib, and Seaborn Libraries to Improve customer experience and also sales.
This is a pandas test for a data science job. The solution here is in form of a notebook beside the main Python file.
Small data analysis test of Investing.com comments, Natural Gas Futures. Currently implementing machine learning to the Training_Set data
Tool for preparing a dataset for publishing by dropping, renaming, scaling, and obfuscating columns defined in a recipe.
A python program that takes an Excel or CSV based input file, and cleans the data and exports to multiple tabs based on specified unique values
A small program that will rename a column within a csv without opening it.
A repository for my big data class's midterm exam.
A simple Python script to clean files from directories based on their extensions.
A program that will parse and encode a select column from a csv.
Some stats analysis and visualisation of a dataset I did for a friend, not the cleanest code, but works for the use case. Only showcasing the process as not going to show the original dataset as it is not mine to share.
Joining, Cleaning, Querying, Performing ETL on Twitter Posts Dataset.
This web application automates data cleaning, enabling users to upload CSV or XLSX files. It processes the data to manage missing values and remove duplicates, then allows users to download the cleaned dataset. Built with Flask and Pandas, it simplifies the data preparation process.
This features a dataset cleaner for E-Sport Matches datasets from https://oracleselixir.com/tools/downloads, it works on all the datasets from Elixir, but the cleaning is for training ML with focus on results, it maybe adapted to clean results, and operate on damage, drakes etc.
Rainfall Data Analysis performed in python using the pandas library
the process of cleaning, exploring, analyzing, and visualizing the data, followed by creating a time-series model, is aimed at extracting meaningful insights from the Craigslist Vehicles Dataset.
Helper functions in python
Full Scale portfolio project used to aggregate sales data from the GSMLS (quarterly), clean and transform the data and store in a SQL database for future machine learning analysis
Add a description, image, and links to the cleaning-data topic page so that developers can more easily learn about it.
To associate your repository with the cleaning-data topic, visit your repo's landing page and select "manage topics."