Simple ETL pipeline to extract information from CSV, LOG, JSON files and load it into MySQL database using Python and SQL language.
-
Updated
Feb 23, 2024 - Python
Simple ETL pipeline to extract information from CSV, LOG, JSON files and load it into MySQL database using Python and SQL language.
Exploratory Data Analysis to uncover factors data lead to employee attrition.
Movies data analysis to produce visuals and insights about the data-set of 10,000 movies.
Wrangling the WeRateDogs datasets to showcase data gathering, assessing, cleaning, and documentation skills.
This project, carried out in Jupyter Notebook, aims to explore the main Data Analysis techniques with Python tools. Pandas, Numpy, Seaborn, Matplotlib, Plotly and sklearn are used. Divided into three notebooks, I separate the data cleaning, data analysis and machine learning part. For more details and goals, see README
Capstone project of Udacity Data Analyst Nanodegree. Focus on advanced visualizations to explore data and to communicate insights and patterns. Final slide deck is made with Jupyter notebook with interactive HTML slides (based on reveal.js).
Machine learning, signal processing pipeline used to identify song name from user input (hum/whistle to song).
Ford GoBike 2019 Dataset is a dataset for the bikeshare system, in this study I have presented the data on the slides file as a part of the visualization Learning process of the Data Analysis Nanodegree of Udacity.
project in Udacity Data Analyst Nanodegree. This project focused on advanced data gathering (several sources incl twitter API), wrangling and cleaning of data. Plus 2 reports.
A total package of what data science is all about. from dashboard building to data wrangling, sql, data collection, vizualization, webscrapping to presentaion.
Predictive Model for BRENT price movements
I study how reported economic growth was undergoing change in 2004 to 2019 testing for the number of criminal records and other characteristics.
Bike Share analysis using R
Coursera Data Science Specialization Capstone Project
data wrangling
This is a repository of #rstats solutions to the Preppin' Data challenges published at preppindata.com
Data Wrangling Project from the Udacity Data Analytics Nano Degree
Investigate Ford GoBike Project
Add a description, image, and links to the wrangling-cleaning topic page so that developers can more easily learn about it.
To associate your repository with the wrangling-cleaning topic, visit your repo's landing page and select "manage topics."