Skip to content
View selengetu's full-sized avatar

Block or report selengetu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
selengetu/README.md

Well hello there! ✨

I am Selenge Tulga (Sam), a data engineer with experience in handling complex data projects. Previously, I worked as a software engineer for 6 years in the railway industry, where I developed robust software and data solutions honed my technical skills.

LinkedIn Email

Skills and Technologies πŸ±β€πŸ’»

  • Programming Languages: Python, SQL, R, Java, PHP
  • Data Processing: Apache Spark, AWS Glue, ETL Processes, Apache Airflow, Apache Kafka
  • Databases: MySQL, MSSQL, Oracle, PostgreSQL, MongoDB, Redshift, DynamoDB
  • Cloud Computing: AWS (EC2, S3, Lambda, DynamoDB, Athena, Step Functions), Azure, Google Cloud Platform
  • Data Visualization & Analytics: Tableau, Power BI, Amazon QuickSight
  • Machine Learning Libraries: Pandas, NumPy, TensorFlow

More about me πŸͺ†

  • πŸ‘©β€πŸ’Ό Data Science master's student at UofR and certified by AWS and Databricks, applying data engineering skills to rehabilitation research.
  • 🌱 I’m currently learning system design, focusing on scalable architecture and microservices.
  • πŸ”­ I’m currently working on Data engineer projects and AWS certifications.

Pinned Loading

  1. End-to-end-Sentiment-Data-Project End-to-end-Sentiment-Data-Project Public

    Forked from lpalum/dscc202-402-spring2024

    A real-time data streaming and sentiment analysis pipeline using Apache Spark on Databricks, structured into bronze, silver, and gold stages. It features MLflow for model tracking and demonstrates …

    Jupyter Notebook

  2. Motion_Lab_Data_Integration Motion_Lab_Data_Integration Public

    Developed a robust database and analytics platform using Python and SQL to manage and analyze complex biomechanical data, including EEG, EMG, and kinematic measurements.

    Jupyter Notebook

  3. EDA-Open-Asteroid-Dataset EDA-Open-Asteroid-Dataset Public

    Forked from Shakleen/EDA-Open-Asteroid-Dataset

    EDA of the open asteroid dataset utilizes R and statistical methods

  4. TwoWars TwoWars Public

    Employing the Reddit Praw API, extracted comments, formulated prompts for stance analysis, utilized Llama2 for both training and prediction purposes.

    Jupyter Notebook