Data Log of 66 days of data challenge, capturing day-to-day data science/ analytics learning journey and accountability
- Descriptive statistics fundamentals
- Types of data
- Levels of measurement
- Categorical variables
- Numerical variables
- Histogram
- Mean, median, mode
- Skewness
- Variance
- Standard deviation, coefficient of variation
- Covariance
- Correlation
*Platform: 365 data science
- Inferential statistics fundamentals
- Normal distribution
- Standard normal distribution
*Platform: 365 data science
-
Inferential statistics fundamentals
- Central Limit Theorem
- Standard error
- Estimators & estimates
-
Exam - Descriptive statistics fundamentals 📝
-
Exam - Inferential statistics fundamentals 📝
-
Confidence Intervals
- Confidence intervals
- z-score
*Platform: 365 data science
-
Confidence Intervals
- Student's T Distribution
- t-score
- Margin of error
- Confidence intervals: Dependent samples, Independent samples
-
Exam - Confidence Intervals 📝
*Platform: 365 data science
-
Hypothesis testing
- Null vs Alternative
- Rejection region & significance level
- Type 1 error vs type 2 error
- Test for mean, population variance known & unknown
- p-value
- Test for mean, dependent & independent samples
-
Exam - Hypothesis testing 📝
*Platform: 365 data science
- The basics of probability
- Probability formula
- Expected values
- Probability Frequency distribution
- Complements
*Platform: 365 data science
- Combinatorics
- Permutations
- Factorals
- Variations
*Platform: 365 data science
- Bayesian Inference
- Sets & events
*Platform: 365 data science
- Bayesian Inference
- Intersection
- Union
- Mutually Exculsive
*Platform: 365 data science
- Bayesian Inference
- Dependent & Independent events
- Conditional probability
- Law of total probability
- Additive Law
- Multiplication rule
- Bayes theorem
*Platform: 365 data science
- Discrete distributions
- Types of distributions
- Discrete distributions
- Uniform distributions
- Bernoulli distribution
- Binomial distribution
- Poisson distribution
- Introduction
- Attributes
- Index
*Platform: 365 data science
-
Introduction
- Index - Label-based / Position-based
- Methods - Numpy: sum(), min(), max(), idxmax(), idxmin()
- Methods - Pandas: head(), tail()
- Parameters vs Arguments
- Documentations
- DataFrames
-
Data Cleaning & Preprocessing
*Platform: 365 data science
- Continuous distributions
- Normal distribution
- Chi-square distribution
- Exponential distribution
- Logistic distribution
- Data Cleaning & Preprocessing
*Platform: 365 data science
- Data Science fields
*Platform: 365 data science
- Introduction
*Platform: 365 data science
- Creating Tableau Dashboard
*Platform: 365 data science
- Fundamentals: Data, Functions, Sequences, Conditional Statements, Iteration, Recursion
*Platform: LinkedIn Learning
- Loops
- Lists, tuples
- Dictionaries
- Comprehensions
*Platform: LinkedIn Learning
- Data summary
- Data aggregation
*Platform: TalentLabs
- Data structures: Loops
*Platform: LinkedIn Learning
- Data summary
- Data aggregation
*Platform: TalentLabs
- Remove columns
- Create custom columns
- Create new tables
- Merge queries
- Model table relationships
- Design dashboard using PowerPoint
*Platform: YouTube
- Creating Measurement
*Platform: YouTube
- Wordplay: Anagrams and Palindromes
*Platform: LinkedIn Learning
- Create and Upload dataset to Kaggle
- Writing dataset description
*Platform: Kaggle, Github dataset
- Arrays with NumPy
*Platform: LinkedIn Learning
- Create and Upload dataset to Kaggle
- Writing dataset description
*Platform: Kaggle, Github dataset
- Maps and Spatial Visualizations
*Platform: DataCamp
- Writing dataset description
- Writing questions in Kaggle Notebook
- Tableau geojson file
*Platform: Kaggle, Github dataset
- Putting all together
*Platform: DataCamp
- Writing dataset description
- Writing questions in Kaggle Notebook
- Exploartory Data Analysis
- To do: Reupload the csv files (some dataset was renamed (Aman to Sri Aman), wait for Padang Serai results)
*Platform: Kaggle, Github dataset
- Exploratory Analysis
- Analyzing Market Trends
- Dashboards and Insights
*Platform: DataCamp
- Univariate (a type of data which consists of observations on only a single characteristic or attribute) exploratory data analysis
*Platform: DataCamp
- Exploartory Data Analysis
- Data cleaning & wrangling
- To do: Reupload the csv files (wait for Padang Serai results)
*Platform: Kaggle, Github dataset
- Exploartory Data Analysis
- Data wrangling
- Matplotlib
- To do: Reupload the csv files (wait for Padang Serai results)
*Platform: Kaggle, Github dataset
- IF and CASE
- ISNULL
- Calculated field
*Platform: DataCamp
- Introduction
- Exploartory Data Analysis
- Data wrangling
- Matplotlib
- To do: Reupload the csv files (wait for Padang Serai results)
*Platform: Kaggle, Github dataset
- INCLUDE
- EXCLUDE
*Platform: DataCamp
- Twitter sentiment analysis
- Tableau Data Visualization
*Platform: Tableau
- Tableau Data Visualization
- To do: Resize the squares for generation and parties, Tooltip for total votes (map) - Tableau
- To do: redo the visualization chart - Matplotlib
*Platform: Tableau
- Tableau Data Visualization
*Platform: Tableau
- Measures of spread and confidence intervals
*Platform: DataCamp
- Measures of spread and confidence intervals
*Platform: DataCamp
- Recommendation Systems
- Outliers by Country
*Platform: Kaggle
- Outliers by Country
*Platform: Kaggle
- Predicting Stock Prices
- Revise all chapters
- Summary Functions and Maps
*Platform: Kaggle
- Post the Tableau link on LinkedIn
- Revise all chapters
- Grouping and sorting
*Platform: Kaggle
- Forecasts
*Platform: DataCamp
- Data Types and Missing Values
- Renaming and joining data
*Platform: Kaggle
- Use case: Weather data
*Platform: LinkedIn Learning
- Create data analyst portfolio for free
*Platform: carrd.co
*Platform: DataCamp
- Edit portfolio
*Platform: carrd.co
- Building ML model
*Platform: Kaggle
- Model validation
*Platform: Kaggle
- Underfitting and Overfitting
- Random forests
*Platform: Kaggle
- Introduction
*Platform: Kaggle
- Edit portfolio
*Platform: carrd.co
- Introduction
*Platform: Kaggle
- Edit and revise the bar charts
*Platform: Kaggle
- Categorical Variables
*Platform: Kaggle
- Categorical Variables
*Platform: Kaggle
- Pipelines
*Platform: Kaggle
- Cross-validation
*Platform: Kaggle
- XGBoost
*Platform: Kaggle
- Data Leakage
*Platform: Kaggle
- Introduction
*Platform: Kaggle
- Time Series Analysis
*Platform: DataCamp
- Credit Risk
- Mutual Information
*Platform: Kaggle
- Creating Features
*Platform: Kaggle
- Clustering with K-means
*Platform: Kaggle
- Clustering with K-means
*Platform: Kaggle
- Principal Component Analysis
*Platform: Kaggle
- Update the Padang Serai data
- Update GE-15 Kaggle dataset descriptions
- Pending to upload the data to Tableau
*Platform: Kaggle
- Principal Component Analysis
*Platform: Kaggle
- Target Encoding
*Platform: Kaggle
- Target Encoding
*Platform: Kaggle
- Update the Padang Serai data
- Update GE-15 Kaggle dataset descriptions
- Pending to upload the data to Tableau
*Platform: Kaggle