Skip to content
View Olliang's full-sized avatar
😄
😄
  • University of Minnesota
  • Los Angeles, California
Block or Report

Block or report Olliang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. Time-Series-Forcasting-Seq2Seq Time-Series-Forcasting-Seq2Seq Public

    A time series forecasting project from Kaggle that uses Seq2Seq + LSTM technique to forecast the headcounts. Detailed explanation on how the special neural network structure works is provided.

    Jupyter Notebook 21 4

  2. All-About-Movie-Data All-About-Movie-Data Public

    A hub that stores data science and analytics done on movie related data. The techniques used include EDA, NLP topic analysis, Recommender System, and advanced visualization in Tableau

    Jupyter Notebook 2

  3. Causal-Inference-Experiment Causal-Inference-Experiment Public

    An A/B testing project done with survey to examine whether people are more likely to click on thumbnails that contain their own racial features.

    1

  4. Statistical-Similarity-Measurement Statistical-Similarity-Measurement Public

    A methodology designed to validate the statistical similarity of synthetic data generated by GAN models. The metrics contain Auto-encoder, PCA, t-SNE, KL-divergence, Clustering, and Cosine Similarity.

    Jupyter Notebook 9

  5. AWS-Click-Prediction AWS-Click-Prediction Public

    A big data project that utilizes E3, Athena, EMR, SageMaker and QuickSight on AWS to build Random Forest and xgBoost model in Spark and SQL that predict the CTR of ads on a large relational database.

    HTML 1

  6. COVID-19-Forecasting COVID-19-Forecasting Public

    A self-driven project utilizing ARIMA, Seq2Seq, and XGBoost to help design the COVID19 forecasting algorithm. Data sources are from Kaggle Competition and JHU CSSE.

    Jupyter Notebook 2