benitomartin/benitomartin

 

Website Badge LinkedIn Badge Medium Badge Kaggle Badge

 

😃 Support me on GitHub Sponsors if you like my content! This encourages me to keep working on new projects 😎

And don't forget to read my profile while listening to this beautiful Spanish AI Song   🎶 🎷 🎶

👉 CONTACT ME! 👉 Use this form or reach out on LinkedIn! 🚀

 

👨‍💻 My Profile

Innovative and dynamic Data Scientist with a proven track record in leveraging AI and Machine/Deep Learning techniques to develop impactful data-driven solutions. My skill set includes:

 

  • Data Visualization & Analytics: Power BI, Looker, Tableau, Matplotlib, Seaborn, Plotly, Streamlit, Gradio
  • Web Scraping: BeautifulSoup, Scrapy, Selenium
  • Maths and Statistics: Statsmodels, SciPy
  • Data Science & ML: Python, TensorFlow, PyTorch, Scikit-learn, OpenCV, NLTK, spaCy
  • AI: LangChain, LlamaIndex, OpenAI, Hugging Face, Transformers, Ollama, Llamafile, CrewAI, Pinecone, Qdrant, Chroma, Faiss, Ragas, LangSmith
  • Key Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Time Series, Neural Networks, Ensemble Methods, PCA, Clustering, Dimensionality Reduction, Anomaly Detection
  • Data Engineering: dbt, Terraform, SQL, BigQuery, PySpark
  • MLOps: MLflow, Prefect, Mage, Comet, Docker, Kubernetes
  • APIs: Flask, FastAPI, AWS API Gateway
  • Cloud Platforms: GCP, AWS, Azure

 

I currently work as a freelancer on end-to-end Data Analytics, Data Science, ML, and AI projects, and I'm open to further collaboration opportunities!

If you want me to develop a project for you, feel free to contact me! I'm always developing personal projects and I'm happy to implement new ideas 😃.

 

📄 Projects Portfolio

💰 My personal end-to-end projects can be found in these repositories (feel free to click ⭐ if you like them 😎):

 

| Project Name | Main Libraries/Tools | Cloud Service | App | Best Practices |
|---|---|---|---|---|
| ML/MLOps | | | | |
| Medical Insurance Costs Prediction | Scikit-learn, TensorFlow, Comet ML, Flask | AWS | | Experiment Tracking, Model Registry, Model/Data Monitoring, Linting, Formatting, Testing, Error Handling, Coverage, CI/CD |
| Stroke Prediction | Scikit-learn, XGBoost, Comet ML, Flask, Docker | AWS | | Experiment Tracking, Model Registry/Monitoring, Containerization, Testing, Error Handling |
| Car Price Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, Flask, Docker, Grafana, Terraform | AWS | | Experiment Tracking, Model Registry/Monitoring, Orchestration, Containerization, Linting, Formatting, Testing, Error Handling, IaC, CI/CD |
| Taxi Rides Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, FastAPI, Docker | GCP | | Experiment Tracking, Model Registry/Monitoring, Orchestration, Containerization, Error Handling |
| Music Clustering | Scikit-learn, FastAPI, Docker | AWS | Streamlit | Containerization, CI/CD |
| Birds Classification | PyTorch | | Gradio | |
| Food Prediction | Scikit-learn, TensorFlow, OpenCV, FastAPI, Docker | GCP | Streamlit | Containerization |
| LLM, RAG and Fine-tuning | | | | |
| Scalable RAG with GKE, LlamaIndex and Qdrant | OpenAI, LlamaIndex, Qdrant, Docker, FastAPI, GKE | GCP | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling, CI/CD |
| RAG AWS LangChain Qdrant | OpenAI, LangChain, Qdrant, Docker, AWS API Gateway | AWS | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling |
| Q&A and Summarization | Hugging Face Transformers, Whisper, LangChain, ChromaDB | | Streamlit | Error Handling |
| RAG Llama Index | OpenAI, LlamaIndex, Deep Lake | | | |
| RAG LangChain Ragas | OpenAI, Hugging Face Transformers, Faiss, LangChain, Ragas | | | |
| Agentic RAG LangChain Pinecone | OpenAI, Groq, LangChain, Pinecone | | | |
| CrewAI RAG LangChain Qdrant | OpenAI, LangChain, Qdrant, CrewAI Agents | | | |
| Fine Tuning Gemma 2B | Hugging Face Transformers, PEFT (LoRA/QLoRA) | | Hugging Face | |
| Data Analysis + Modeling | | | | |
| Cryptocurrencies Analysis | ARIMA, XGBoost, TensorFlow LSTM, Prophet | | | |
| News Classification | Scikit-learn (Multinomial Naive Bayes), TensorFlow (CNN, RNN, feedforward) | | Streamlit | |
| Breast Cancer Classification | Scikit-learn, Spark | IBM | | |
| Bank Churn Classification | Scikit-learn, LightGBM, XGBoost, CatBoost | | | |
| Data Engineering | | | | |
| Hotel Reviews | Prefect, Spark, SQL, BigQuery, dbt, Terraform, Looker | GCP | | Orchestration, Linting, Formatting, Error Handling, Pre-Commit, IaC, CI/CD |
| Air Quality Switzerland | Mage, dbt, SQL, BigQuery, Docker, Terraform, Looker | GCP | | Orchestration, IaC, Containerization, CI/CD |

 

💸 Additionally, you can find my Power BI projects:

:basecamp: Last but not least, I also have a Tableau portfolio using groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions

 

🧮 Tech Stack

Data Science/Engineering/Analysis and ML

Visual Studio Code, HTML5, CSS3, Jupyter Notebook, MySQL, SQLite, PostgreSQL, Tableau, Power BI, Looker Studio, Python, Pandas, NumPy, Plotly, Matplotlib, scikit-learn, SciPy, TensorFlow, PyTorch, OpenCV, OpenAI, FastAPI, Flask, Docker, Kubernetes, Anaconda, Linux, Ubuntu, Databricks, Google Cloud, AWS, Azure, Grafana, Terraform, Apache Spark, Prefect, dbt, MLflow, GitHub Actions, Git, Streamlit

Cloud Services

GCP

Cloud Storage, BigQuery, Cloud Run, VM, GKE, Vertex AI, Dataproc, Earth Engine, Container Registry

AWS

SageMaker, S3, EC2, ECR, Kinesis, Lambda, RDS, API Gateway, EventBridge

Azure

Azure Databricks, Data Lake Gen2, Data Factory, Container Registry

 

🧮 Let's Connect!
