Skip to content
View mbalos16's full-sized avatar
🚧
Work in progress...
🚧
Work in progress...

Block or report mbalos16

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mbalos16/README.md

Hello πŸ‘©β€πŸ’»!

Fancy seeing you here 😌 !

I am Maria Balos, a data scientist and user-centric designer based in Cambridge, UK. You can find me most of the time behind a screen or next to a coffee. Welcome to this small corner of my work!

Right now I am involved into:

  • Completing daily coding problems in Leedcode, please check my Leetcode profile
  • Master in Deep Learning and Generative AI by DATAMECUM - October 2024 - April 2025
  • Training my first Convolutional Neural Network model - Emotion detection in Images
  • Finishing the last 16/100 projects of the course 100 Days Of Code in Python by Angela Yu on Udemy.

Last achievements:

  • 1st of November: I reached 200 solved Leetcode problems: Check it out in my Leetcode profile
  • 30th of October: Ryanair time-capsule, reverse-engineering the Ryanair API to collect daily flight prices and train machine learning models to forecast price changes.
  • 17th of October: Completed NLP HuggingFace course.
  • 28th of May: Completed the first part of the "Practical Deep Learning" course by fast.ai
  • 17th of May: Completed the "Advanced Learning Algorithm Course" by Andrew Ng in Coursera
  • 9th of May: Winner of the DATAMECUM Datathon 3rd promotion competition.

Last Medium post:

Skills 🌟

Please check out the next sections to see these skills applied in projects.

  • Exploratory Data Analysis: understanding the data, identifying missing values, approach duplicated values, handling ambiguous values,identifying outliers and anomalies, and correlation detection.

  • Unsupervised Machine Learning

  • Supervised Machine Learning

    • Generalised Linear Models
    • Support Vector Machines
    • K-Nearest Neighbors
    • Decision Stumps: Kaggle notebook
    • Decision Trees:
    • Random Forest: Datamecum Datathon
    • XGBoost: Datamecum Datathon
    • Ensemble models: Winner of the Datamecum Datathon capstone project competition with an ensemble of the Random Forest and XGBoost predictions, please check out the presentation video.
  • Python libraries for data science

    • Data processing: Pandas, NumPy
    • ML & stats: Scikit-Learn, Statsmodels
    • Data visualisation: Matplotlib, Seaborn, Plotly

Projects πŸ“œ

Exploratory Data Analysis

  • Space Mission Analysis is a data exploration and data visualisation project where I applied most of the data visualisation libraries.
  • Mohs Hardness Exploratory Data Analysis: Decision Stump (one layer decision tree) for a Kaggle competition, this placed me in position 598/1632 at the end of the competition. A decision stump presentation has been created to introduce Datamecum students to decision stumps.
  • Datamecum Datathon - Capstone project competition between the third promotion students of the Intensive Program in Data Science by DATAMECUM consisting of building a supervised model to predict a binary class. The exploratory data analysis consisted of:
    • checking for missing values.
    • handling duplicated values and ambiguous data.
    • exploring the relation between missing values and the target variable.
    • Self Organizing Maps and correlation matrix used for correlation checks.

Machine Learning

Unsupervised

Supervised

  • Datamecum Dataton - Capstone project competition between the third promotion students of the Intensive Program in Data Science by DATAMECUM consisting of building a supervised model to predict a binary class.

Web / App Development

Automation

Final Notes & Contact ☎️

Thank you for visiting my GitHub! Feel free to have a deeper look in my repositories to find more specific projects. Please share any feedback, suggestions, or tips that you believe could help me grow and improve!

I am always happy for a coffee, a chit-chat or a discussion of any possible collaboration. Please drop me an email at mariabalos16@gmail.com or send me a message through my LinkedIn if you fancy any of those.

Pinned Loading

  1. ryanair_timecapsule ryanair_timecapsule Public

    Ryanair's API was reverse-engineered to collect daily flight prices and train machine learning models to forecast price changes.

    Python 1

  2. linkedin_toggler linkedin_toggler Public

    Selenium script in Python that automate repetitive LinkedIn maintenance tasks.

    Python

  3. python_100_days_of_code python_100_days_of_code Public

    This repository showcases my Python learning journey and includes 100+ solved exercises utilizing various libraries.

    Jupyter Notebook 1

  4. leetcode leetcode Public

    My solution to leetcode coding exercises.

    Python