Skip to content
View marcos-mansur's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report marcos-mansur

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
marcos-mansur/README.md

Hi!

I'm Marcos Mansur, a Data Scientist with a bachelor in Production Engeneering from Brazil.

LinkedIn: https://www.linkedin.com/in/mmansur/

Summary

Some of my study projects:

  • Energy consumption forecast with RNN - My monography developed to acquire the Bachelor title in Production Engineering from UFF. A recurrent neural network model (based on tensorflow) to forecast the energy consumption of the southeast region of Brazil for the 5 next weeks (outputs a sequence). The model was trained on public data from ONS (Operador Nacional do Sistema Elétrico) and proposes a solution to the energy forecasts that ONS demmands weekly from companies operating in energy distribuition. The study compares the performance from models trained to forecast 5 next values from a time series in one shot vs forecasting the next value and auto regressively feeding each prediction as a data point for the next prediction. Uses DVC and MLFlow for data versioning and experiment tracking.

  • Titanic survival prediction model - Machine Learning model to predict survivability of Titanic passangers. CI basic structure on yaml file to print performance metrics as comments in pull requests.

  • Time series Regression Model for VLabs Challange - ElasticNet Regressor to predict how much each client is going to spend in the next 90 days based on data from 14 months of purchases. 5 diferent models to predict customer lifetime value by channel of sale. Scored second place in the leaderboard out of 17 teams. (Notebooks in portuguese-BR)

  • Tabular Data Sep21 Analysis- Kaggle competition - LightGBM regressor tuned with bayesian optimization (optuna) for the tabular data competition. The dataset has 118 unnamed numeric features and the missing values are significantly relevant for prediction.

  • IDHM prediction of brazilian cities - Voting regressor of 4 tree-based algorithms to predict de IDHM (Municipal Human Development Index) of brazilian cities

  • Deployed Web App to compare routes on a map - A Streamlit based deployed web app that plot multiple routes on a map for comparison and geospatial analysis. You can play with the app at: https://share.streamlit.io/marcos-mansur/plot_route_app/main/MyApp.py

  • Web Scrapping learning projects - a robot to download book covers from https://inventwithpython.com/ and another that plays simple instructions and record the board of each play at 2048 with selenium and bs4.

Pinned Loading

  1. load-forecast load-forecast Public

    Forecasting eletric load using autoregressive recurrent networks (tensorflow)

    Jupyter Notebook

  2. pd-case-ps pd-case-ps Public

    User profile analysis and clustering for a Internship application at PasseiDireto (I got the internship).

    Jupyter Notebook

  3. Kaggle_Titanic Kaggle_Titanic Public

    Forked from thiago-ouverney/Kaggle_Titanic

    TOP 3% model at Titanic Kaggle competition's leaderboard. The model is a Voting classifier of LogisticRegression, RandomForest and GradientBoosting Classifier. Continuos Integration structure.

    Jupyter Notebook

  4. vlabs-challenge vlabs-challenge Public

    Predict the Lifetime Value of clients fot the next 90 days based on data of 14 months of sales. One linear model (ElasticNet) trained for each of the 5 sales chanels. The final prediction is genera…

    Jupyter Notebook 2 1

  5. TPS-sep21 TPS-sep21 Public

    This is my project for the tabular data competition of september from kaggle. It consists of a LGBMClassifier model tuned with Optuna with roc_auc score of 81.58% in Kaggle's public leaderboard.

    Jupyter Notebook

  6. plot_route_app plot_route_app Public

    An streamlit based web app to plot multiple routes from geospatial data on a map.

    Jupyter Notebook