Werayootk/2110531_DataScience_2022s1

2110531 Data Science and Data Engineering Tools @Chula 2022

Short links for exercises:

Week1: Intro to Numpy, Pandas

  1. Numpy: Open In Colab

  2. Pandas: Open In Colab

  3. Pandas with Youtube stat data: Open In Colab

  4. (Advanced) Pandas with Youtube stat data: Open In Colab

Assignment (Pandas with Youtube stat data): Open In Colab

Week2: Data Preparation

  1. EDA: Open In Colab

  2. Impute Missing Value: Open In Colab

  3. Split Train/Test: Open In Colab

  4. Outliers with Log: Open In Colab

  5. Outliers with Log (Titanic DataSet): Open In Colab

Assignment: Open In Colab
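
As a rough illustration of the idea behind the "Outliers with Log" exercises above (this is not the notebooks' code), the sketch below applies a log transform to a small right-skewed sample. The data and the `spread_ratio` helper are hypothetical, and only the standard library is used.

```python
import math
import statistics

# Hypothetical right-skewed sample (for example, view counts).
# These numbers are made up for illustration, not taken from the course data.
values = [120, 150, 180, 200, 240, 310, 95_000]

# log1p(x) = log(1 + x): defined at x = 0 and compresses large values.
logged = [math.log1p(v) for v in values]


def spread_ratio(xs):
    """Ratio of the maximum to the median: a crude measure of how far
    the largest value sits from the bulk of the data."""
    return max(xs) / statistics.median(xs)


raw_ratio = spread_ratio(values)
log_ratio = spread_ratio(logged)
print(f"max/median before log: {raw_ratio:.1f}")
print(f"max/median after log:  {log_ratio:.2f}")

# expm1 inverts log1p, so values handled on the log scale can be mapped back.
restored = [math.expm1(x) for x in logged]
```

The transform does not remove the extreme value; it only reduces its leverage, which is often enough for models that are sensitive to scale.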

Week3-4: Traditional ML

  1. Decision Trees: Open In Colab

  2. Linear Regression: Open In Colab

  3. Logistic Regression: Open In Colab

  4. Neural Network: Open In Colab

  5. K Nearest Neighbors: Open In Colab

  6. SVM: Open In Colab

  7. Save and Load Model: Open In Colab

  8. K-Means: Open In Colab

  9. Market-Basket Analysis: Open In Colab

Assignment for Week3 (Safe to eat or deadly poison?): Open In GitHub
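
For the Market-Basket Analysis topic listed above, the three usual rule metrics (support, confidence, lift) can be sketched in plain Python. The transactions and function names here are hypothetical and are not taken from the course notebook, which may well use a dedicated library instead.

```python
# Hypothetical shopping baskets, one set of items per transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "eggs"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimate of P(consequent | antecedent).

    Assumes the antecedent occurs at least once; otherwise this divides by zero.
    """
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / support(antecedent, transactions)


def lift(antecedent, consequent, transactions):
    """Confidence relative to the consequent's baseline frequency.

    Values above 1 suggest the items co-occur more often than chance.
    """
    return confidence(antecedent, consequent, transactions) / support(
        consequent, transactions
    )


for a, c in [({"bread"}, {"milk"}), ({"butter"}, {"bread"})]:
    print(sorted(a), "->", sorted(c),
          f"support={support(a | c, transactions):.2f}",
          f"confidence={confidence(a, c, transactions):.2f}",
          f"lift={lift(a, c, transactions):.2f}")
```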

Week5-6: Intro to Deep Learning

  1. Image classification (basic): flower classification Open In Colab

  2. Image classification (advanced): flower classification Open In Colab

  3. Semantic Segmentation (UNET): The Oxford-IIIT pet dataset Open In Colab

  4. LSTM: Stock price prediction Open In Colab

  5. SARIMAX: PM2.5 forecasting Open In Colab

Assignment (Fashion MNIST): Open In Colab

Week8: Data Storage with Redis

Redis Example using local data

Assignment (connect to redis server)

Week9: Data Extraction (Web Scraping and APIs)

  1. Basic Webpage Scraping Open In Colab

  2. Wikipedia Data Extraction Open In Colab

  3. Settrade Rest API Open In Colab

  4. Twitter Data Extraction Open In Colab

  5. Selenium Open In Colab

Assignment (Counting Buddhist holy days, วันพระ): Open In Colab
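
To give a flavour of the basic scraping exercise in this week, here is a minimal sketch that pulls link targets and link text out of an HTML string. It uses the standard library's `html.parser` rather than whatever library the notebook itself uses. The `LinkCollector` class and the sample HTML are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, text) pairs for every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> currently being read, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


# Hypothetical page content. In practice the HTML would be downloaded first,
# for example with urllib.request.urlopen(url).read().decode().
SAMPLE_HTML = """
<html><body>
  <h1>Course notebooks</h1>
  <ul>
    <li><a href="/week1/numpy">Numpy</a></li>
    <li><a href="/week1/pandas">Pandas <b>basics</b></a></li>
    <li><a>No target</a></li>
  </ul>
</body></html>
"""

collector = LinkCollector()
collector.feed(SAMPLE_HTML)
links = collector.links
for href, text in links:
    print(href, "->", text)
```

Anchors without an `href` are skipped, and nested tags inside a link contribute their text to the link label.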

Week10: Data Ingestion with Kafka

  1. Several simple examples, including both producer and consumer, in the simple folder

  2. Complex example in complex folder

  3. AVRO Producer Open In Colab and Consumer Open In Colab

  4. Group example in group folder

Assignment (Transaction Verifier): Open In Colab

Note: Do not forget to upload the following schema files to your Colab
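
One way to guard against a forgotten upload, sketched under assumptions: check the working directory for the expected files and, when running inside Colab, open the upload dialog with `google.colab.files.upload()`. The file names below are hypothetical placeholders, not the actual schema files for this course, and the helper names are made up for this sketch.

```python
from pathlib import Path

# Hypothetical file names; replace with the files the notebook actually needs.
EXPECTED_FILES = ["example_key.avsc", "example_value.avsc"]


def missing_files(expected, directory="."):
    """Return the names in `expected` that are not regular files in `directory`."""
    base = Path(directory)
    return [name for name in expected if not (base / name).is_file()]


def ensure_uploaded(expected):
    """Prompt for an upload in Colab if files are missing from the working directory.

    Returns the list of files that are still missing afterwards.
    """
    missing = missing_files(expected)
    if not missing:
        return []
    try:
        # This module only exists inside a Colab runtime.
        from google.colab import files
    except ImportError:
        print("Not running in Colab; copy these files manually:", missing)
        return missing
    # Opens a file picker and saves the chosen files to the working directory.
    files.upload()
    return missing_files(expected)


still_missing = ensure_uploaded(EXPECTED_FILES)
if still_missing:
    print("Still missing:", still_missing)
```

Running this cell before the producer or consumer code makes a missing file fail early with a readable message instead of a later file-not-found error.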

Week11: Big Data Processing with Spark

  1. Basic Spark Open In Colab

Note: Do not forget to upload the following data file to your Colab

  2. Spark SQL Open In Colab

Note: Do not forget to upload the following data file to your Colab

  3. Spark ML Open In Colab

Note: Do not forget to upload the following data file to your Colab

Assignment (Analyze IMDB): Open In Colab

Note: Do not forget to upload the following data file to your Colab

Week12: Ops Stars

  1. Several Airflow examples in the [airflow folder](https://github.com/kaopanboonyuen/2110531_DataScience_2022s1/tree/main/code/week12_orchestration/airflow)

  2. Several FastAPI examples in the [fastapi folder](https://github.com/kaopanboonyuen/2110531_DataScience_2022s1/tree/main/code/week12_orchestration/fastapi)

Reference:

  1. https://www.kaggle.com/code
  2. https://www.tensorflow.org/tutorials
  3. https://github.com/topics/machine-learning
  4. https://archive.ics.uci.edu/ml/datasets.php

About

Data Science Tools Course at Dept. of Computer Engineering, Chula 2022
