lealcastillo1996/README.md

Enrique Leal

Applied Data Science MSc

Git, Python, R, React, FastAPI, MongoDB, MySQL, PostgreSQL, Heroku, Flask, Amazon Web Services, Docker, macOS, PyTorch, TensorFlow

I am an engineer ⚙️ and data scientist 🧑🏻‍🔬. I decided to pursue a master's degree in Applied Data Science in the Netherlands because I am fascinated by the diverse ways data science can inform decision-making across industries and improve people's lives. I am seeking to combine my technical and analytical skills to excel in the wonderful field of data science.

  • 🌍  I'm based in the Netherlands
  • ✉️  You can contact me at lealcastillo1996@gmail.com
  • 🤝  I'm open to collaborating on all things related to data that are interesting

Main Projects

  • Question answering system with LLMs (June 2023) [NLP, AI, ML]


Description: A question answering (QA) system for a real-world domain: cloud computing, with a focus on Kubernetes. The system uses the public Kubernetes documentation and real-time Google searches as its knowledge sources. Performance is evaluated with a machine-trained evaluation score (MTES) called the estimated human label (EHL), which is computed by an ML classification model trained on N-gram-based metrics. A carefully balanced dataset, labeled by human experts, covers various question categories. By combining human expertise with MTES, the research aims to improve OS-powered QA systems and provide insight into the factors that drive their performance.

Project: https://github.com/lealcastillo1996/Thesis_LLMs

Research paper: https://studenttheses.uu.nl/bitstream/handle/20.500.12932/44283/final_thesis_JELC.pdf?sequence=1
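The N-gram-based metrics mentioned in the description can be illustrated with a small sketch. This is not the thesis code: the function names and the choice of clipped-overlap F1 are assumptions made for illustration, showing one way to turn a candidate answer and a reference answer into numeric features that a classifier could consume.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_f1(candidate, reference, n=1):
    """F1 of the clipped n-gram overlap between a candidate and a reference answer."""
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    overlap = sum((cand & ref).values())  # Counter intersection clips repeated n-grams
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def overlap_features(candidate, reference, max_n=2):
    """Hypothetical feature vector: [F1 at n=1, ..., F1 at n=max_n]."""
    return [ngram_f1(candidate, reference, n) for n in range(1, max_n + 1)]
```

A vector such as `overlap_features(answer, reference)` could then be paired with a human label to train a classifier that estimates that label for unseen answers.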

  • Mexico City real estate price determinants (April 2023) [GEO, ML]


Description: This study aims to identify the key determinants of property sale prices in Mexico City and to understand how they vary across geographic locations. It addresses the following research questions: what are the key determinants of house prices in Mexico City according to Spatial Random Forest (SRF), Geographically Weighted Regression (GWR) and Multiscale Geographically Weighted Regression (MGWR)? Specifically, which are the main determinants for each method, and how do the results compare with each other?

Project: https://github.com/EwoutvanderVelde/SpatialCourse

Research Paper: https://github.com/EwoutvanderVelde/SpatialCourse/blob/main/Final_Report%20(2).pdf

  • Streaming platform recommender system (April 2023) [NLP, RS]

Description: A recommender system for a streaming platform, developed from scratch. It combines collaborative filtering and content-based filtering to deliver tailored and varied suggestions. An interactive interface lets users adjust the diversity of their recommendations, so that the suggestions match their individual preferences.


Project: https://github.com/iabrilvzqz/personalisation-for-public-media

Research Paper: https://github.com/iabrilvzqz/personalisation-for-public-media/blob/master/report%20INFOPPM.pdf
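As a rough illustration of how collaborative and content-based scores might be blended and then re-ranked with a user-controlled diversity setting, here is a minimal sketch. It is not taken from the project: the function names, the `weight` and `diversity` parameters, and the use of a genre label as the diversity signal are all assumptions.

```python
def hybrid_scores(cf_scores, content_scores, weight=0.5):
    """Blend collaborative and content-based scores (both dicts: item -> score)."""
    items = set(cf_scores) | set(content_scores)
    return {
        item: weight * cf_scores.get(item, 0.0)
        + (1 - weight) * content_scores.get(item, 0.0)
        for item in items
    }


def rerank_with_diversity(scores, genres, k, diversity=0.0):
    """Greedily pick k items, penalising items whose genre was already picked.

    diversity=0 ranks purely by score; higher values favour unseen genres.
    """
    remaining = dict(scores)
    picked, seen_genres = [], set()
    while remaining and len(picked) < k:
        def adjusted(item):
            penalty = diversity if genres[item] in seen_genres else 0.0
            return remaining[item] - penalty
        # Iterate in sorted order so that ties are broken deterministically.
        best = max(sorted(remaining), key=adjusted)
        picked.append(best)
        seen_genres.add(genres[best])
        del remaining[best]
    return picked
```

With `diversity=0.0` the list is a plain top-k by blended score; raising `diversity` makes a second item from an already-picked genre less likely to appear, which is one simple way to expose a diversity slider in an interface.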

  • FIFA World Cup tweets sentiment analysis (Feb 2023) [NLP]


Description: A natural language processing (NLP) study on a dataset of more than 300,000 tweets, using LDA (Latent Dirichlet Allocation) and open-source Hugging Face Transformer models.

Project: https://discord.com/channels/1127677457030455437/1127677458578153494/1135889833940750419

Popular repositories

  1. Housing-Price-Estimator (Public)

    The objective of this project is to create an accurate sale price calculation tool for a real estate company, developing a full data science project in the process.

    Jupyter Notebook 1

  2. Web-scraper-example (Public)

    This repository shows basic code for scraping web pages that contain tables, along with an applied example on a sports web page.

    Jupyter Notebook 1

  3. Baseball-Win-Predictor-Classificator- (Public)

    Will try to make a baseball win predictor

    1

  4. SQL-DataExtraction (Public)

    Boolean queries and relational algebra in SQL, plus Python with the SQLite library: data collection and extraction in SQL, and data extraction using Python

    Jupyter Notebook 1

  5. Data-Integration (Public)

    Advanced SQL + Integrity Constraints, Functional Dependency + Data Integration, Entity Linkage

    Jupyter Notebook 1

  6. Data-Preparation (Public)

    [Python] Data Quality and Cleaning + Transformation, Data Reduction + Normalization

    Jupyter Notebook 1
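
The normalization step named in the Data-Preparation repository can be sketched in a few lines. This is an illustrative example using only the Python standard library, not code from that repository; the function names are assumptions.

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: no spread to rescale
    return [(v - lo) / (hi - lo) for v in values]


def z_score(values):
    """Standardise values to zero mean and unit (population) standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


print(min_max_scale([10, 20, 30]))  # [0.0, 0.5, 1.0]
```

Min-max scaling keeps the original shape of the distribution but is sensitive to outliers, while z-scores are usually preferred when features with different units feed the same model.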