
kaopanboonyuen/2110446_DataScience_2021s2

2110446 Data Science Course at Chula 2022

Support-Ukraine

Short links for exercises:

Week1: Intro to Numpy, Pandas

  1. Numpy: Open In Colab

  2. Pandas: Open In Colab

  3. Pandas with Youtube stat data: Open In Colab

  4. (Advanced) Pandas with Youtube stat data: Open In Colab

Assignment (Pandas with Youtube stat data): Open In Colab

Week2: Data Preparation

  1. EDA: Open In Colab

  2. Impute Missing Value: Open In Colab

  3. Split Train/Test: Open In Colab

  4. Outliers with Log: Open In Colab

  5. Outliers with Log (Titanic DataSet): Open In Colab
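The idea behind the "Outliers with Log" exercises can be sketched in a few lines. This is a minimal illustration, not the notebooks' own code: it assumes non-negative values, uses only the Python standard library (the notebooks may well use NumPy or pandas instead), and the `views` data and both helper names are hypothetical.

```python
import math
import statistics


def log_transform(values):
    """Apply log1p, i.e. log(1 + x), to each value.

    Assumption: values are non-negative, so log1p is always defined.
    """
    if any(v < 0 for v in values):
        raise ValueError("this sketch assumes non-negative values")
    return [math.log1p(v) for v in values]


def max_to_median_ratio(values):
    """Crude indicator of how far the largest value sits from the typical one."""
    return max(values) / statistics.median(values)


# Hypothetical view counts with one extreme value.
views = [120, 150, 180, 200, 220, 250, 1_000_000]
transformed = log_transform(views)

# The extreme value dominates the raw scale far more than the log scale.
print(max_to_median_ratio(views))
print(max_to_median_ratio(transformed))

# log1p is invertible with expm1, so original values can be recovered.
recovered = [math.expm1(t) for t in transformed]
```

The transform compresses large values much more than small ones, so a single extreme point no longer stretches the whole scale; `math.expm1` maps results back to the original units when needed.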

Assignment: Open In Colab

Week3: Statistical Analysis

  1. Basic Stat: Open In Colab

  2. Intermediate Stat: Open In Colab

Assignment (Stat): Open In Colab

Week4: Regression

Assignment: Open In GitHub

Week5: Traditional ML

  1. Decision Trees: Open In Colab

  2. Linear Regression: Open In Colab

  3. Logistic Regression: Open In Colab

  4. Neural Network: Open In Colab

  5. K Nearest Neighbors: Open In Colab

  6. SVM: Open In Colab

  7. Save and Load Model: Open In Colab

  8. K-Means: Open In Colab

  9. Market-Basket Analysis: Open In Colab

Assignment (Safe to eat or deadly poison?): Open In GitHub

Week6: Intro to Deep Learning

  1. Image classification (basic): flower classification Open In Colab

  2. Image classification (advanced): flower classification Open In Colab

  3. Semantic Segmentation (UNET): The Oxford-IIIT pet dataset Open In Colab

  4. LSTM: Stock price prediction Open In Colab

  5. SARIMAX: PM2.5 forecasting Open In Colab

Assignment (Fashion MNIST): Open In Colab

Week7: Data Extraction

  1. Basic Webpage Scraping: Open In Colab

  2. Wikipedia Page Data Extraction: Open In Colab

  3. REST API Data Extraction: Open In Colab

  4. Twitter Data Extraction: Open In Colab

  5. Selenium: Open In Colab

Assignment: Open In Colab

Week8: Data Ingestion

All code and scripts are here: this link

  1. Kafka Sample Producer: Open In NB
  2. Kafka Sample Consumer: Open In NB
  3. Kafka Sample Producer with AVRO: Open In NB
  4. Kafka Sample Consumer with AVRO: Open In NB
  5. Sensor FileWriter Consumer: Open In NB
  6. Sensor Counter Consumer: Open In NB

Data Set:

Sample (AVSC): Open In GitHub

Alternatively, you can download all the source code for week8_dataingestion through this link (week8_dataingestion.zip).

Week9: Spark

  1. Basic Spark: Open In Colab
  2. Spark SQL: Open In Colab
  3. Spark ML: Open In Colab

Data Set:

  1. Bank: Open In GitHub
  2. Star Wars: Open In GitHub

Week10: Spark Streaming

  1. Basic Spark Streaming: Open In Colab
  2. Spark Streaming Window Operations: Open In Colab
  3. Basic Structured Streaming: Open In Colab
  4. Structured Streaming Window Operations: Open In Colab
  5. Structured Streaming and Kafka: Open In Colab

Data Set:

Star Wars: Open In GitHub

Week11: Airflow

All code is here: this link

Alternatively, you can download all the source code for week11_airflow_and_fastapi through this link (week11_airflow_and_fastapi.zip).

** Updated Python code and notebooks will be posted here shortly before each lecture.

Reference:

  1. https://www.kaggle.com/code
  2. https://www.tensorflow.org/tutorials
  3. https://github.com/topics/machine-learning
  4. https://archive.ics.uci.edu/ml/datasets.php
  5. https://colab.research.google.com/notebooks/

About

Data Science Course at Dept. of Computer Engineering, Chula 2022
