  • Unemployed
  • Kuala Lumpur
seemun-yum/README.md

Welcome to my page! 👋

My entire data journey so far, to remind myself how far I've come and what I still have to learn 😄

Coming up: a project of my own for each concept listed.

🌱 Currently dipped my feet into

  1. Python libraries: pandas, NumPy, Matplotlib, seaborn, Plotly, Altair, scikit-learn, Streamlit, Prophet, NeuralProphet, etc.
  2. Visualization: graph types and interactivity customization with Altair, converting Altair visuals to HTML for deployment, and Streamlit for building quick web applications.
  3. Machine learning: implemented an ensemble time series model that predicts ADV with 90% accuracy, a sentiment analyzer for formal and informal English and Malay, and a customer segmentation model for targeted marketing.
  4. Web scraping: implemented an end-to-end web scraper on AWS that scrapes data on an hourly basis through a VPN.
  5. Building data pipelines: Python, AWS Lambda, AWS Redshift, cron scheduling, and encrypting data columns that contain personally identifiable information (PII).
  6. Web development: HTML, CSS, JavaScript, AWS (compute, storage, network routing, authorization).

💬 Working on

  1. Algorithms and data structures: binary search, linear search, how arrays and linked lists work, big O notation, selection sort, stack and queue data structures, quicksort (divide and conquer), hash tables and how they work (collisions, load factor, hash functions).
  2. Maths: Bayesian statistics, hypothesis testing, probability sampling, statistical significance, designing tests, inferential statistics.
  3. Others: data mining, processing text data, understanding APIs, file types (JSON, Parquet, HTML, pickle, ...).
  4. Rust, MLOps, DevOps, cloud computing, handling big data with Hadoop and Spark.
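
As a small illustration of the first item above, here is a minimal binary search sketch in Python. The function name and the sample list are illustrative only, not taken from any of my projects, and the input is assumed to be sorted in ascending order.

```python
from typing import Optional, Sequence


def binary_search(items: Sequence[int], target: int) -> Optional[int]:
    """Return the index of target in a sorted sequence, or None if absent."""
    low, high = 0, len(items) - 1
    while low <= high:
        mid = (low + high) // 2  # middle of the remaining search window
        if items[mid] == target:
            return mid
        if items[mid] < target:
            low = mid + 1   # target can only be in the right half
        else:
            high = mid - 1  # target can only be in the left half
    return None


if __name__ == "__main__":
    sample = [1, 3, 5, 7, 9]  # hypothetical sorted data
    print(binary_search(sample, 7))
    print(binary_search(sample, 4))
```

Each pass halves the search window, which is why binary search runs in O(log n) comparisons, while a linear search over the same list needs O(n).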

Pinned

  1. Malaysia-COVID-Streamlit (Public)

    Interactive web application showing all available data on relevant COVID metrics for Malaysia

    Jupyter Notebook

  2. KMeansImageCompression (Public)

    Reducing the number of colors in an image using K-means clustering

    Jupyter Notebook

  3. KmeansApp (Public)

    Learn the K-means clustering algorithm through a Streamlit app.

    Python

  4. WebsiteDesign (Public)

    HTML/CSS/JavaScript front-end development projects

    HTML 1