Skip to content
View benitomartin's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report benitomartin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
benitomartin/README.md

โšก Benito โšก

"It's not about data, it's all about tangible business insights ๐Ÿ“ˆ"

:ES: :EN: :DE:

ย 

๐Ÿ˜ƒ Support me on GitHub Sponsors if you like my content! This encourages me to keep working on new projects ๐Ÿ˜Ž

And don't forget to read my profile while listening to this beautiful Spanish AI Song ย  ๐ŸŽถ ๐ŸŽท ๐ŸŽถ

๐Ÿ‘‰ CONTACT ME! ๐Ÿ‘‰ Use this form or reach out on LinkedIn! ๐Ÿš€

ย 

๐Ÿ‘จโ€๐Ÿ’ป My Profile

Innovative and dynamic Data Scientist with a proven track record in leveraging AI and Machine/Deep Learning techniques to develop impactful data-driven solutions. Incorporating a robust skill set encompassing:

ย 

  • โœ… Data Visualization/Analytics: Power BI, Looker, Tableau, Matplotlib, Seaborn, Plotly, Streamlit, Gradio
  • โœ… Web Scraping: BeautifulSoup, Scrapy, Selenium
  • โœ… Maths and Statistics: Statsmodels, SciPy
  • โœ… Data Science & ML: Python, TensorFlow, PyTorch, Scikit-learn, OpenCV, NLTK, SpaCy
  • โœ… AI: Langchain, LlamaIndex, OpenAI, Hugging Face, Transformers, Ollama, Llamafile, CrewAI, Pinecone, Qdrant, Chroma, Faiss, Ragas
  • โœ… Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Time Series, Neural Networks, Ensemble Methods, PCA, Clustering, Dimensionality Reduction, Anomaly Detection
  • โœ… Data Engineering: dbt, Terraform, SQL, PySpark
  • โœ… MLOps: MLflow, Prefect, Mage, Comet
  • โœ… APIs: Flask, FastAPI
  • โœ… Cloud Platforms: GCP, AWS, Azure

ย 

Currently working as an Independent Consultant, Teacher, and Course Developer for Data Analytics/Science ML and AI, and open for further cooperation opportunities!

If you want me to develop a project for you feel free to contact me! I always keep developing personal projects and I'm happy to implement new ideas ๐Ÿ˜ƒ.

ย 

๐Ÿ“„ Projects Portfolio

๐Ÿ’ฐ My personal end-to-end projects can be found in these repositories (feel free to click โญ if you like them ๐Ÿ˜Ž):

Project Name Main Libraries/Tools Cloud Service App
ML/MLOps
Medical Insurance Costs Prediction Scikit-learn, TensorFlow, Comet ML, Flask, CI/CD AWS
Stroke Prediction Scikit-learn, XGBoost, Comet ML, Flask, Docker AWS
Car Price Prediction Scikit-learn, TensorFlow, MLFlow, Prefect, Flask, Docker, Grafana, Terraform, CI/CD AWS
Taxi Rides Prediction Scikit-learn, TensorFlow, MLFlow, Prefect, FastAPI, Docker GCP
Music Clustering Scikit-learn, FastAPI, Docker, CI/CD AWS Streamlit
Birds Classification Pytorch Gradio
Food Prediction Scikit-learn, TensorFlow, OpenCV, FastAPI, Docker GCP Streamlit
Data Analysis + Modeling
Cryptocurrencies Analysis ARIMA, XGBoost, TensorFlow LSTM, Prophet
News Classification Scikit-learn (Multinomial Naive Bayes), Tensorflow (CNN, RNN, feedforward) Streamlit
Breast Cancer Classification Scikit-learn, Spark IBM
Bank Churn Classification Scikit-learn, LightGBM, XGBoost, CatBoost
LLM, RAG and Fine-tuning
RAG AWS LangChain Qdrant OpenAI, LangChain, Qdrant, Docker AWS Streamlit
Q&A and Summarization Hugging Face Transformers, Whisper, Langchain, ChromaDB Streamlit
RAG Llama Index OpenAI, LlamaIndex, Deep Lake
RAG LangChain Ragas OpenAI, Hugging Face Transformers, LangChain, Ragas
Agentic RAG LangChain Pinecone OpenAI, Groq, LangChain, Pinecone
CrewAI RAG LangChain Qdrant OpenAI, LangChain, Qdrant, CrewAI Agents
Fine Tuning Gemma 2B Hugging Face Transformers, PEFT, (LoRA/QLoRA) Hugging Face
Data Engineering
Hotel Reviews Prefect, Spark, SQL, dbt, Terraform, Looker, CI/CD GCP
Air Quality Switzerland Mage, dbt, Terraform, Looker, CI/CD GCP
  • ๐Ÿ’ธ Additionally, you can find my Power BI projects:

  • :basecamp: Last but not least, I also have a Tableau portfolio using groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions

ย 

๐Ÿงฎ Tech Stack

Data Science/Engineering/Analysis and ML

Visual Studio Code HTML5 CSS3 Jupyter Notebook MySQL SQLite PostgreSQL Tableau PowerBI Looker Studio Python Pandas NumPy Plotly Matplotlib scikit-learn SciPy TensorFlow PyTorch OpenCV OpenAI FastAPI Flask Docker Anaconda Linux Ubuntu Databricks Google Cloud AWS Azure Grafana Terraform Apache Spark Prefect dbt MLflow GitHub Actions Git Streamlit

Cloud Services

GCP

Cloud Storage BigQuery Cloud Run VM Vertex AI Dataproc Earth Engine Container Registry

AWS

SageMaker S3 EC2 ECR Kinesis Lambda RDS API Gateway EventBridge

Azure

Azure Databricks Data Lake Gen2 Data Factory Container Registry

ย 

๐Ÿงฎ Let's Connect!

benito

Pinned

  1. crewai-rag-langchain-qdrant crewai-rag-langchain-qdrant Public

    Agentic RAG with Langchain, Qdrant and CrewAI

    Jupyter Notebook 22 2

  2. rag-langchain-ragas rag-langchain-ragas Public

    RAG project for QA retrieval using LanChain and Ragas

    Jupyter Notebook 4

  3. mlops-car-prices mlops-car-prices Public

    MLOps Car Prices Prediction

    Python 2

  4. rag-llama-deeplake rag-llama-deeplake Public

    RAG project for QA retrieval using Llama Index

    Jupyter Notebook 2

  5. agentic-rag-langchain-pinecone agentic-rag-langchain-pinecone Public

    Agentic RAG with LangChain and Pinecone

    Jupyter Notebook 1

  6. peft-gemma-2b peft-gemma-2b Public

    PEFT of Gemma 2b model using QLoRA

    Jupyter Notebook 1