benitomartin/README.md




🎵 While you're here, why not enhance your visit with a melodious twist? Tune into this enchanting Spanish AI Song. A perfect blend of technology and art. Enjoy the vibes! 🎷🎶

🌟 Crafting each piece of content is a journey that demands both time and passion. If you enjoy my work, consider fueling my creativity by buying me a coffee ☕ or supporting me on GitHub Sponsors 🚀

💰 My website was created using Hostinger. If you want to create your own, this link (https://hostinger.com?REFERRALCODE=1BENITO83) gives you a 20% discount on the selected plan 💶


👨‍💻 My Profile

Innovative and dynamic Data Scientist with a proven track record in leveraging AI and Machine/Deep Learning techniques to develop impactful data-driven solutions. My skill set includes:


  • ✅ Data Visualization & Analytics: Power BI, Looker, Tableau, Matplotlib, Seaborn, Plotly, Streamlit, Gradio
  • ✅ Web Scraping: BeautifulSoup, Scrapy, Selenium
  • ✅ Maths and Statistics: Statsmodels, SciPy
  • ✅ Data Science & ML: Python, TensorFlow, PyTorch, Scikit-learn, OpenCV, NLTK, spaCy
  • ✅ AI: LangChain, LlamaIndex, OpenAI, Hugging Face, Transformers, Ollama, Llamafile, CrewAI, Pinecone, Qdrant, Chroma, Faiss, Ragas, LangSmith
  • ✅ Key Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Time Series, Neural Networks, Ensemble Methods, PCA, Clustering, Dimensionality Reduction, Anomaly Detection
  • ✅ Data Engineering: dbt, Terraform, SQL, BigQuery, PySpark
  • ✅ MLOps: MLflow, Prefect, Mage, Comet, Docker, Kubernetes
  • ✅ APIs: Flask, FastAPI, AWS API Gateway
  • ✅ Cloud Platforms: GCP, AWS
  • ✅ Version Control: Git

Currently working as a freelancer on end-to-end Data Analytics/Science, ML, and AI projects, and open to further collaboration opportunities!

If you want me to develop a project for you, feel free to contact me! I am always developing personal projects and am happy to implement new ideas 😃

👉 CONTACT ME! 👉 Book a Free Call, use this Form or reach out on LinkedIn! 🚀


📄 Projects Portfolio

💰 My personal end-to-end projects can be found in these repositories. Feel free to click ⭐ if you like them 😎


| Project Name | Main Libraries/Tools | Cloud Service | App | DevOps Best Practices |
|---|---|---|---|---|
| ML/MLOps | | | | |
| Medical Insurance Costs Prediction | Scikit-learn, TensorFlow, SageMaker, Comet ML, Flask | AWS | | Experiment Tracking, Model Registry, Model/Data Monitoring, Linting, Formatting, Testing, Error Handling, Coverage, CI/CD |
| Stroke Prediction | Scikit-learn, XGBoost, SageMaker, Comet ML, Flask, Docker | AWS | | Experiment Tracking, Model Registry, Model Monitoring, Containerization, Testing, Error Handling |
| Car Price Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, Flask, Docker, Grafana, Terraform | AWS | | Experiment Tracking, Model Registry, Model Monitoring, Orchestration, Containerization, Linting, Formatting, Testing, Error Handling, IaC, CI/CD |
| Taxi Rides Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, FastAPI, Docker | GCP | | Experiment Tracking, Model Registry, Model Monitoring, Orchestration, Containerization, Error Handling |
| Music Clustering | Scikit-learn, FastAPI, Docker | AWS | Streamlit | Containerization, CI/CD |
| Birds Classification | PyTorch | | Gradio | |
| Food Prediction | Scikit-learn, TensorFlow, OpenCV, FastAPI, Docker | GCP | Streamlit | Containerization |
| LLM, RAG and Fine-tuning | | | | |
| Scalable RAG with GKE, LlamaIndex and Qdrant | OpenAI, LlamaIndex, Qdrant, Docker, FastAPI, GKE | GCP | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling, CI/CD |
| RAG AWS LangChain Qdrant | OpenAI, LangChain, Qdrant, Docker, AWS API Gateway | AWS | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling |
| Q&A and Summarization | Hugging Face Transformers, Whisper, LangChain, ChromaDB | | Streamlit | Error Handling |
| Multimodal Llama Index | Gemini, LlamaIndex, Qdrant | | | |
| RAG Llama Index | OpenAI, LlamaIndex, Deep Lake | | | |
| RAG LangChain Ragas | OpenAI, Hugging Face Transformers, Faiss, LangChain, Ragas | | | |
| Agentic RAG LangChain Pinecone | OpenAI, Groq, LangChain, Pinecone | | | |
| CrewAI RAG LangChain Qdrant | OpenAI, LangChain, Qdrant, CrewAI Agents | | | |
| Fine Tuning Gemma 2B | Hugging Face Transformers, PEFT (LoRA/QLoRA) | Hugging Face | | |
| Data Analysis + Modeling | | | | |
| Cryptocurrencies Analysis | ARIMA, XGBoost, TensorFlow LSTM, Prophet | | | |
| News Classification | Scikit-learn (Multinomial Naive Bayes), TensorFlow (CNN, RNN, feedforward) | | Streamlit | |
| Breast Cancer Classification | Scikit-learn, Spark | IBM | | |
| Bank Churn Classification | Scikit-learn, LightGBM, XGBoost, CatBoost | | | |
| Data Engineering | | | | |
| Hotel Reviews | Prefect, Spark, SQL, BigQuery, dbt, Terraform, Looker | GCP | | Orchestration, Linting, Formatting, Error Handling, Pre-Commit, IaC, CI/CD |
| Air Quality Switzerland | Mage, dbt, SQL, BigQuery, Docker, Terraform, Looker | GCP | | Orchestration, IaC, Containerization, CI/CD |


💸 Additionally, you can find my Power BI projects:

:basecamp: Last but not least, I also have a Tableau portfolio using groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions


🧮 Tech Stack

Data Science/Engineering/Analysis and ML

Visual Studio Code, HTML5, CSS3, Jupyter Notebook, MySQL, SQLite, PostgreSQL, Tableau, PowerBI, Looker Studio, Python, Pandas, NumPy, Plotly, Matplotlib, scikit-learn, SciPy, TensorFlow, PyTorch, OpenCV, OpenAI, FastAPI, Flask, Docker, Kubernetes, Anaconda, Linux, Ubuntu, Databricks, Google Cloud, AWS, Azure, Grafana, Terraform, Apache Spark, Prefect, dbt, MLflow, GitHub Actions, Git, Streamlit

Cloud Services

GCP

Cloud Storage, BigQuery, Cloud Run, VM, GKE, Vertex AI, Dataproc, Earth Engine, Container Registry

AWS

SageMaker, S3, EC2, ECR, Kinesis, Lambda, Glue, Athena, Firehose, CloudWatch, RDS, API Gateway, EventBridge

Azure

Azure Databricks, Data Lake Gen2, Data Factory, Container Registry


🧮 Let's Connect!

benito

Pinned

  1. crewai-rag-langchain-qdrant (Public)

    Agentic RAG with LangChain, Qdrant and CrewAI

    Jupyter Notebook · 27 stars · 3 forks

  2. rag-aws-qdrant (Public)

    Serverless Application with AWS Lambda and Qdrant for Semantic Search

    Python · 9 stars · 1 fork

  3. rag-langchain-ragas (Public)

    RAG project for QA retrieval using LangChain and Ragas

    Jupyter Notebook · 6 stars · 2 forks

  4. mlops-car-prices (Public)

    MLOps Car Prices Prediction

    Python · 3 stars

  5. agentic-rag-langchain-pinecone (Public)

    Agentic RAG with LangChain and Pinecone

    Jupyter Notebook · 3 stars · 1 fork

  6. multimodal-youtube-recipes (Public)

    Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini

    Jupyter Notebook · 3 stars