Skip to content
View majobasgall's full-sized avatar

Block or report majobasgall

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
majobasgall/README.md

Hi there, I'm María José!

I'm a PhD holder in Computer Science, specializing in Data Science, with expertise in Machine Learning, statistics, and data visualization on large-scale datasets. I have experience performing as a Data Science Lead and as a Data Engineer in Switzerland. Skilled in Apache Spark, Scala, Python, SQL, and DevOps. I'm looking for opportunities to further apply my proficiency in Machine Learning, Data Science or Data Engineering roles.

I thrive in collaborative environments, both sharing my skills and embracing new ones!

Languages and Tools:

Programming: Scala Python SQL Spark Hadoop Scikit-Learn Pandas Matplotlib NumPy

Cloud and BI tools: Microsoft Azure Google Cloud Platform Google BigQuery Tableau Power BI Apache Airflow

Others: Docker GitLab CI/CD GitHub Actions Bash JIRA Jupyter Notebook GNU/Linux Scrum Streamlit

Certifications

DataCamp Badge

DataCamp Badge

Google Cloud Badge

DeepLearning.AI Badge

EPFL Badge

EPFL Badge

Let's connect:

LinkedIn

And get to know about my projects, research papers, and even a glimpse into my personal interests by visiting my website:

Website

Pinned Loading

  1. smote-bd smote-bd Public

    SMOTE-BD: A distributed Synthetic Minority Oversampling Technique (SMOTE) for Big Data.

    Scala 9 1

  2. smote-mr smote-mr Public

    SMOTE-MR: A distributed Synthetic Minority Oversampling Technique (SMOTE) for Big Data which applies a MapReduce based-approach. SMOTE-MR is categorized as an `approximated/ non exact` solution. Al…

    Scala 3

  3. big_data_reduction_recommender big_data_reduction_recommender Public

    FDR2-BD: A Fast Data Reduction Recommendation Tool for Tabular Big Data Classification Problems

    Scala

  4. bayesian-optimization bayesian-optimization Public

    Bayesian Optimization: Approximate to the optimum of an expensive Black Box system by using a cheaper surrogate model

    Python

  5. bash_scripts_potpourri bash_scripts_potpourri Public

    A little bit of everything: cleanup Arch-based systems, extraction, system info, git config, project structuring, spark parameters, data backup, webcam control, and more!

    Shell

  6. majobasgall.github.io majobasgall.github.io Public archive

    My personal website.

    HTML 1