Skip to content

DiogoRibeiro7/diogoribeiro7

Repository files navigation

Hi there 👋, I'm Diogo Ribeiro

Profile Views

Continuous integration

I'm a veteran Data Scientist with over two decades of experience, hailing from the picturesque country of Portugal. My journey through computer science, economy, management, medicine, natural sciences, engineering, pure mathematics, and applied mathematics has been a continuous source of fascination and inspiration. Welcome to my GitHub profile!

I've delved into the intricate domains of supply chain, logistics, sustainability, finance, and health, where data-driven decision-making optimizes operations and promotes environmental responsibility.

🔭 In research, I'm deeply immersed in applying machine learning and statistics to health. My work harnesses data to advance healthcare solutions and outcomes.

🔭 My research also extends to mathematics, focusing on differential equations and partial differential equations. These tools apply to diverse fields such as epidemiology, economics, and sociology, helping unravel complex phenomena.

🔭 I'm passionate about graph theory and its applications in social networks, where I uncover patterns and connections that provide valuable insights into human interactions.

📈 Beyond these areas, I'm intrigued by the potential of big data analytics in marketing, where customer behaviors and preferences can be decoded to enhance business strategies.

📊 I'm also fascinated by the emerging field of quantum computing and its potential to revolutionize problem-solving in cryptography and optimization.

📐 My research in statistics and probability involves developing models that accurately predict outcomes and assess risks. These models are crucial in fields like finance, insurance, and public policy, where they inform decision-making and strategy.

🌍 Sustainability is another key interest, particularly in developing algorithms that promote renewable energy use and reduce carbon footprints.

🔧 I have extensive experience in statistical modeling in management, including survival analyses, cohort analysis, and production analysis and modeling. My work in optimization of industrial processes ensures efficiency and productivity.

🔬 I enjoy working with Raspberry Pi and sensors, exploring their applications in various fields such as IoT (Internet of Things), automation, and environmental monitoring. This hands-on approach allows me to integrate hardware and software solutions for innovative projects.

Thank you for visiting my profile, and I look forward to connecting and collaborating!

🔗  Connect with me

Medium

GitHub Pages

Buy Me a Coffee

Support on Dagshub

Follow on Dev.to

Connect on LinkedIn

Tools and skills 🎓

Area Tool
OS Linux macOS
Languages Python Node.js TypeScript R MATLAB C C++ Ruby Fortran Apache Spark
Databases PostgreSQL SQLite MongoDB DynamoDB MySQL Microsoft SQL Server Neo4j GraphQL BigQuery
Datalake Apache Iceberg Apache Hudi
Infrastructure Docker GitHub Actions AWS Datadog Prometheus Jenkins
Command Line Bash Git curl wget
Cloud Services Azure GCP AWS
Typesetting Tools LaTeX Markdown R Markdown
Web Development Frameworks React Django
Streaming Apache Flink Apache Kafka Amazon Kinesis Apache Kafka
DevOps Tools Jenkins AWS CloudFormation
Data Analysis and Visualization Tableau Power BI
Data Science Jupyter RStudio Anaconda Kaggle Databricks SageMaker DataRobot H2O.ai RapidMiner Alteryx KNIME Apache Spark TensorFlow PyTorch Apache Flink Apache Kafka Snowflake BigQuery Airflow Matplotlib Plotly D3.js Tableau Power BI
Machine Learning TensorFlow PyTorch Scikit-learn Keras XGBoost LightGBM H2O.ai DataRobot RapidMiner Alteryx KNIME SageMaker Google Cloud AI Azure Machine Learning
Data Engineering Apache Spark Apache Flink Apache Kafka AWS Glue Google Cloud Dataflow Azure Data Factory
Mathematics MATLAB Wolfram Mathematica SymPy R SageMath Julia GNU Octave Scilab
Statistics R MATLAB Excel SciPy NumPy Pandas Stan
Optimization AMPL Gurobi CPLEX JuliaOpt SciPy NumPy Pandas Stan
Models Linear Regression Logistic Regression ANOVA Time Series Analysis Survival Analysis Decision Trees Random Forests Gradient Boosting SVM KNN K-Means Clustering PCA Neural Networks CNN RNN Bayesian Networks Reinforcement Learning
Statistics Linear Regression Logistic Regression ANOVA Time Series Analysis Survival Analysis PCA Cluster Analysis Bayesian Analysis Hypothesis Testing Correlation Analysis Factor Analysis Multivariate Analysis Non-parametric Tests
Mathematics Linear Algebra Calculus Differential Equations Probability Theory Statistics Discrete Mathematics Number Theory Abstract Algebra Topology Combinatorics Graph Theory Mathematical Modeling Game Theory
Data Science Data Cleaning Data Wrangling Data Exploration Feature Engineering Model Selection Model Evaluation Model Deployment Model Monitoring Data Visualization Data Storytelling Data Governance Data Privacy Data Security
Data Engineering Data Ingestion Data Processing Data Storage Data Transformation Data Integration ETL Data Pipelines Data Quality Data Orchestration

GitHub Stats

github profile contributions chart

📕  Latest Blog Posts in Medium.com

Mastering Python Dataclasses - Fri, 19 Jul 2024

Guiding a Data Scientist Towards More Effective Communication - Thu, 18 Jul 2024

What is Metadata Management? - Thu, 18 Jul 2024

Establishing Best Practices for Data Science Teams - Mon, 15 Jul 2024

Mastering Project Ownership and Management: Building Effective Teams and Processes - Sat, 13 Jul 2024

Building a Data Team in an Evolving World - Wed, 05 Jun 2024

Unsupervised Anomaly Detection - Mon, 22 Apr 2024

A Comprehensive Guide to Structural Equation Modeling with Latent Variables - Tue, 26 Mar 2024

Comparing Imputation Techniques - Fri, 08 Mar 2024

Partial Least Squares: A Comprehensive Guide to Overcoming Data Challenges - Thu, 29 Feb 2024

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages