Skip to content

jjean95/66daysofdata

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 

Repository files navigation

66 days of data

Data Log of 66 days of data challenge, capturing day-to-day data science/ analytics learning journey and accountability

Day 1 - 5/11/22

Statistics 📈

  • Descriptive statistics fundamentals
    • Types of data
    • Levels of measurement
    • Categorical variables
    • Numerical variables
    • Histogram
    • Mean, median, mode
    • Skewness
    • Variance
    • Standard deviation, coefficient of variation
    • Covariance
    • Correlation

*Platform: 365 data science

Day 2 - 6/11/22

Statistics 📈

  • Inferential statistics fundamentals
    • Normal distribution
    • Standard normal distribution

*Platform: 365 data science

Day 3 - 7/11/22

Statistics 📈

  • Inferential statistics fundamentals

    • Central Limit Theorem
    • Standard error
    • Estimators & estimates
  • Exam - Descriptive statistics fundamentals 📝

  • Exam - Inferential statistics fundamentals 📝

  • Confidence Intervals

    • Confidence intervals
    • z-score

*Platform: 365 data science

Day 4 - 8/11/22

Statistics 📈

  • Confidence Intervals

    • Student's T Distribution
    • t-score
    • Margin of error
    • Confidence intervals: Dependent samples, Independent samples
  • Exam - Confidence Intervals 📝

*Platform: 365 data science

Day 5 - 9/11/22

Statistics 📈

  • Hypothesis testing

    • Null vs Alternative
    • Rejection region & significance level
    • Type 1 error vs type 2 error
    • Test for mean, population variance known & unknown
    • p-value
    • Test for mean, dependent & independent samples
  • Exam - Hypothesis testing 📝

*Platform: 365 data science

Day 6 - 10/11/22

Probability 📊

  • The basics of probability
    • Probability formula
    • Expected values
    • Probability Frequency distribution
    • Complements

*Platform: 365 data science

Day 7 - 11/11/22

Probability 📊

  • Combinatorics
    • Permutations
    • Factorals
    • Variations

*Platform: 365 data science

Day 8 - 12/11/22

Probability 📊

  • Bayesian Inference
    • Sets & events

*Platform: 365 data science

Day 9 - 13/11/22

Probability 📊

  • Bayesian Inference
    • Intersection
    • Union
    • Mutually Exculsive

*Platform: 365 data science

Day 10 - 14/11/22

Probability 📊

  • Bayesian Inference
    • Dependent & Independent events
    • Conditional probability
    • Law of total probability
    • Additive Law
    • Multiplication rule
    • Bayes theorem

*Platform: 365 data science

Day 11 - 15/11/22

Probability 📊

  • Discrete distributions
    • Types of distributions
    • Discrete distributions
    • Uniform distributions
    • Bernoulli distribution
    • Binomial distribution
    • Poisson distribution

Data Cleaning & Preprocessing with Pandas 🐼

  • Introduction
    • Attributes
    • Index

*Platform: 365 data science

Day 12 - 16/11/22

Data Cleaning & Preprocessing with Pandas 🐼

  • Introduction

    • Index - Label-based / Position-based
    • Methods - Numpy: sum(), min(), max(), idxmax(), idxmin()
    • Methods - Pandas: head(), tail()
    • Parameters vs Arguments
    • Documentations
    • DataFrames
  • Data Cleaning & Preprocessing

*Platform: 365 data science

Day 13 - 17/11/22

Probability 📊

  • Continuous distributions
    • Normal distribution
    • Chi-square distribution
    • Exponential distribution
    • Logistic distribution

Data Cleaning & Preprocessing with Pandas 🐼

  • Data Cleaning & Preprocessing

*Platform: 365 data science

Day 14 - 18/11/22

Introduction to Data & Data Science

  • Data Science fields

*Platform: 365 data science

Day 15 - 19/11/22

Fashion Analytics with Tableau 👗

  • Introduction

*Platform: 365 data science

Day 16 - 21/11/22

Fashion Analytics with Tableau 👗

  • Creating Tableau Dashboard

Link

*Platform: 365 data science

Day 17 - 22/11/22

Python quick start 🐍

  • Fundamentals: Data, Functions, Sequences, Conditional Statements, Iteration, Recursion

*Platform: LinkedIn Learning

Day 18 - 23/11/22

Python data analysis 🐍

  • Loops
  • Lists, tuples
  • Dictionaries
  • Comprehensions

*Platform: LinkedIn Learning

Day 19 - 24/11/22

Data analysis with Google Sheet 📊

  • Data summary
  • Data aggregation

*Platform: TalentLabs

Python data analysis 🐍

  • Data structures: Loops

*Platform: LinkedIn Learning

Day 20 - 25/11/22

Data analysis with Google Sheet 📊

  • Data summary
  • Data aggregation

*Platform: TalentLabs

Power BI Dashboard for Real Estate & Property Management 🏘️

  • Remove columns
  • Create custom columns
  • Create new tables
  • Merge queries
  • Model table relationships
  • Design dashboard using PowerPoint

*Platform: YouTube

Day 21 - 26/11/22

Power BI Dashboard for Real Estate & Property Management 🏘️

  • Creating Measurement

*Platform: YouTube

Day 22 - 28/11/22

Python Data Analysis 🐍

  • Wordplay: Anagrams and Palindromes

*Platform: LinkedIn Learning

Day 23 - 29/11/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Create and Upload dataset to Kaggle
  • Writing dataset description

*Platform: Kaggle, Github dataset

Python Data Analysis 🐍

  • Arrays with NumPy

*Platform: LinkedIn Learning

Day 24 - 30/11/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Create and Upload dataset to Kaggle
  • Writing dataset description

*Platform: Kaggle, Github dataset

Data Visualization with Tableau 🗺️

  • Maps and Spatial Visualizations

*Platform: DataCamp

Day 25 - 1/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Writing dataset description
  • Writing questions in Kaggle Notebook
  • Tableau geojson file

*Platform: Kaggle, Github dataset

Data Visualization with Tableau 🗺️

  • Putting all together

*Platform: DataCamp

Day 26 - 2/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Writing dataset description
  • Writing questions in Kaggle Notebook
  • Exploartory Data Analysis
  • To do: Reupload the csv files (some dataset was renamed (Aman to Sri Aman), wait for Padang Serai results)

*Platform: Kaggle, Github dataset

Case Study: Analyzing Job Market Data in Tableau 💼

  • Exploratory Analysis
  • Analyzing Market Trends
  • Dashboards and Insights

*Platform: DataCamp

Statistical techniques in Tableau 📉

  • Univariate (a type of data which consists of observations on only a single characteristic or attribute) exploratory data analysis

*Platform: DataCamp

Day 27 - 3/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Exploartory Data Analysis
  • Data cleaning & wrangling
  • To do: Reupload the csv files (wait for Padang Serai results)

*Platform: Kaggle, Github dataset

Day 28 - 4/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Exploartory Data Analysis
  • Data wrangling
  • Matplotlib
  • To do: Reupload the csv files (wait for Padang Serai results)

*Platform: Kaggle, Github dataset

Calculations in Tableau 📉

  • IF and CASE
  • ISNULL
  • Calculated field

*Platform: DataCamp

Data Science - Learn Python For Data Science by Doing Several Projects (video) 🐍

  • Introduction

*Platform: GitHub, YouTube

Day 28 - 5/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Exploartory Data Analysis
  • Data wrangling
  • Matplotlib
  • To do: Reupload the csv files (wait for Padang Serai results)

*Platform: Kaggle, Github dataset

Calculations in Tableau 📉

  • INCLUDE
  • EXCLUDE

*Platform: DataCamp

Data Science - Learn Python For Data Science by Doing Several Projects (video) 🐍

  • Twitter sentiment analysis

*Platform: GitHub, YouTube

Day 29 - 6/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Tableau Data Visualization

*Platform: Tableau

Day 30 - 7/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Tableau Data Visualization
  • To do: Resize the squares for generation and parties, Tooltip for total votes (map) - Tableau
  • To do: redo the visualization chart - Matplotlib

*Platform: Tableau

Day 31 - 8/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Tableau Data Visualization

*Platform: Tableau

Day 32 - 9/12/22

Statistical techniques in Tableau 📉

  • Measures of spread and confidence intervals

*Platform: DataCamp

Day 33 - 10/12/22

Statistical techniques in Tableau 📉

  • Measures of spread and confidence intervals

*Platform: DataCamp

Data Science - Learn Python For Data Science by Doing Several Projects (video) 🐍

  • Recommendation Systems

*Platform: GitHub, YouTube

Kaggle Tour De France 🚴

  • Outliers by Country

*Platform: Kaggle

Day 34 - 11/12/22

Kaggle Tour De France 🚴

  • Outliers by Country

*Platform: Kaggle

Day 35 - 12/12/22

Data Science - Learn Python For Data Science by Doing Several Projects (video) 🐍

  • Predicting Stock Prices

*Platform: GitHub, YouTube

Kaggle - Pandas 🐼

  • Revise all chapters
  • Summary Functions and Maps

*Platform: Kaggle

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Post the Tableau link on LinkedIn

Day 36 - 13/12/22

Kaggle - Pandas 🐼

  • Revise all chapters
  • Grouping and sorting

*Platform: Kaggle

Statistical techniques in Tableau 📉

  • Forecasts

*Platform: DataCamp

Day 37 - 14/12/22

Kaggle - Pandas 🐼

  • Data Types and Missing Values
  • Renaming and joining data

*Platform: Kaggle

Python Data Analysis 🐍

  • Use case: Weather data

*Platform: LinkedIn Learning

Day 38 - 15/12/22

Data analyst portfolio 🖼️

  • Create data analyst portfolio for free

*Platform: carrd.co

Calculations in Tableau 📉

*Platform: DataCamp

Day 39 - 16/12/22

Data analyst portfolio 🖼️

  • Edit portfolio

*Platform: carrd.co

Kaggle - Intro to Machine learning ⚙️

  • Building ML model

*Platform: Kaggle

Day 40 - 17/12/22

Kaggle - Intro to Machine learning ⚙️

  • Model validation

*Platform: Kaggle

Day 41 - 18/12/22

Kaggle - Intro to Machine learning ⚙️

  • Underfitting and Overfitting
  • Random forests

*Platform: Kaggle

Day 42 - 19/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Introduction

*Platform: Kaggle

Data analyst portfolio 🖼️

  • Edit portfolio

*Platform: carrd.co

Day 43 - 20/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Introduction

*Platform: Kaggle

Day 44 - 21/12/22

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Edit and revise the bar charts

*Platform: Kaggle

Day 45 - 22/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Categorical Variables

*Platform: Kaggle

Day 46 - 23/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Categorical Variables

*Platform: Kaggle

Day 47 - 24/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Pipelines

*Platform: Kaggle

Day 48 - 25/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Cross-validation

*Platform: Kaggle

Day 49 - 26/12/22

Kaggle - Intermediate Machine learning ⚙️

  • XGBoost

*Platform: Kaggle

Day 50 - 27/12/22

Kaggle - Intermediate Machine learning ⚙️

  • Data Leakage

*Platform: Kaggle

Kaggle - Feature Engineering ⚙️

  • Introduction

*Platform: Kaggle

DataCamp - Calculations in Tableau ⏲️

  • Time Series Analysis

*Platform: DataCamp

Day 51 - 28/12/22

Data Analyst Portfolio

  • Credit Risk

Day 52 - 29/12/22

Kaggle - Feature Engineering ⚙️

  • Mutual Information

*Platform: Kaggle

Day 53 - 30/12/22

Kaggle - Feature Engineering ⚙️

  • Creating Features

*Platform: Kaggle

Day 54 - 1/1/23

Kaggle - Feature Engineering ⚙️

  • Clustering with K-means

*Platform: Kaggle

Day 55 - 2/1/23

Kaggle - Feature Engineering ⚙️

  • Clustering with K-means

*Platform: Kaggle

Day 56 - 3/1/23

Kaggle - Feature Engineering ⚙️

  • Principal Component Analysis

*Platform: Kaggle

Day 57 - 4/1/23

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Update the Padang Serai data
  • Update GE-15 Kaggle dataset descriptions
  • Pending to upload the data to Tableau

*Platform: Kaggle

Kaggle - Feature Engineering ⚙️

  • Principal Component Analysis

*Platform: Kaggle

Day 58 - 5/1/23

Kaggle - Feature Engineering ⚙️

  • Target Encoding

*Platform: Kaggle

Day 59 - 6/1/23

Kaggle - Feature Engineering ⚙️

  • Target Encoding

*Platform: Kaggle

Day 60 - 16/1/23

Malaysia GE dataset Data Analysis using Kaggle 🗳️

  • Update the Padang Serai data
  • Update GE-15 Kaggle dataset descriptions
  • Pending to upload the data to Tableau

*Platform: Kaggle

Releases

No releases published

Packages

No packages published