benitomartin/README.md

⚡ Benito ⚡

"It's not about data, it's all about tangible business insights 📈"

:ES: :EN: :DE:

😃 Support me on GitHub Sponsors if you like my content! This encourages me to keep working on new projects 😎

And don't forget to read my profile while listening to this beautiful Spanish AI Song 🎶 🎷 🎶

👉 CONTACT ME! 👉 Use this form or reach out on LinkedIn! 🚀

👨‍💻 My Profile

Innovative and dynamic Data Scientist with a proven track record of applying AI and Machine/Deep Learning techniques to develop impactful data-driven solutions. My skill set covers:

  • ✅ Data Visualization/Analytics: Power BI, Looker, Tableau, Matplotlib, Seaborn, Plotly, Streamlit, Gradio
  • ✅ Web Scraping: BeautifulSoup, Scrapy, Selenium
  • ✅ Maths and Statistics: Statsmodels, SciPy
  • ✅ Data Science & ML: PyTorch, TensorFlow, Scikit-learn, OpenCV, NLTK, spaCy
  • ✅ AI: LangChain, LlamaIndex, OpenAI, Hugging Face, Transformers, Ollama, Llamafile, CrewAI, Pinecone, Qdrant, Chroma, Faiss, Ragas
  • ✅ Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Time Series, Neural Networks, Ensemble Methods, PCA, Clustering, Dimensionality Reduction, Anomaly Detection
  • ✅ Data Engineering: dbt, Terraform, SQL, PySpark
  • ✅ MLOps: MLflow, Prefect, Mage
  • ✅ APIs: Flask, FastAPI
  • ✅ Cloud Platforms: GCP, AWS, Azure

Currently working as an Independent Consultant, Teacher, and Course Developer for Data Analytics/Science, ML, and AI, and open to further collaboration opportunities!

If you want me to develop a project for you, feel free to contact me! I am always developing personal projects and am happy to implement new ideas 😃.

📄 Projects Portfolio

  • 🌱 As project leader, I developed FoodScore, a Computer Vision MLOps project (summary), and its Website during the last 2 weeks of the Le Wagon Data Science Bootcamp (March 2023)

  • 🔭 After graduating, I volunteered at Omdena on the following Data Science projects: an NLP project and a GIS project (Website)

  • 💰 My personal end-to-end Data Science/Engineering/Analysis, ML, and AI projects can be found in these repositories (feel free to click ⭐ if you like them 😎):

    • MLOps:
    • Data Analysis + Modeling:
      • Cryptocurrencies Analysis: EDA and Modeling project: comparison of ARIMA, XGBoost, LSTM, and Prophet
      • News Classification: EDA, Modeling and Deployment project: comparison of several Neural Networks (CNN, RNN, feedforward) and Multinomial Naive Bayes models and deployment in Streamlit (see app)
      • Breast Cancer Classification: EDA and Modeling project: comparison of Random Forest using Sklearn and Spark, as part of the Advanced Data Science with IBM Specialization
      • Bank Churn Classification: EDA and Modeling project including univariate/bivariate analysis, feature engineering, baseline model selection, and a voting classifier (LGBMClassifier, XGBClassifier, and CatBoostClassifier)
    • LLM, RAG and Fine-tuning:
      • Q&A and Summarization: LLM project for audio and text extraction using Whisper and Langchain with app deployment using Streamlit (locally)
      • RAG Llama Index: RAG project for QA retrieval using LlamaIndex and Deep Lake
      • RAG LangChain Ragas: RAG project for QA retrieval using LangChain and evaluation with Ragas
      • Agentic RAG LangChain Pinecone: RAG project for QA retrieval using LangChain and Pinecone, adding multi-tenancy, agents, and memory
      • CrewAI RAG LangChain Qdrant: RAG project for QA retrieval using LangChain, Qdrant, and CrewAI agents (researcher and writer)
      • Fine Tuning Gemma 2B: Fine-tuning of the Gemma 2B model using quantization and LoRA adapters (QLoRA), with the model/adapters hosted in a private Hugging Face repo
    • Data Engineering:
      • Hotel Reviews: Data Engineering project using Prefect, Spark, SQL, dbt, Terraform, Looker, CI/CD and GCP
      • Air Quality Switzerland: Data Engineering project using Mage, dbt, Terraform, Looker, CI/CD and GCP
  • 💸 Additionally, you can find my Power BI projects:

  • :basecamp: Last but not least, I also have a Tableau portfolio using groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions

🧮 Tech Stack

Data Science/Engineering/Analysis and ML

Visual Studio Code, HTML5, CSS3, Jupyter Notebook, MySQL, SQLite, PostgreSQL, Tableau, PowerBI, Looker Studio, Python, Pandas, NumPy, Plotly, Matplotlib, scikit-learn, SciPy, TensorFlow, PyTorch, OpenCV, OpenAI, FastAPI, Flask, Docker, Anaconda, Linux, Ubuntu, Databricks, Google Cloud, AWS, Azure, Grafana, Terraform, Apache Spark, Prefect, dbt, MLflow, GitHub Actions, Git, Streamlit

Cloud Services

GCP

Cloud Storage, BigQuery, Cloud Run, VM, Vertex AI, Dataproc, Earth Engine, Container Registry

Azure

Azure Databricks, Data Lake Gen2, Data Factory, Container Registry

AWS

S3, EC2, ECR, Kinesis, Lambda, RDS

🧮 Let's Connect!

benito benito

Pinned

  1. crewai-rag-langchain-qdrant Public

    Agentic RAG with LangChain, Qdrant and CrewAI

    Jupyter Notebook 17 2

  2. rag-llama-deeplake Public

    RAG project for QA retrieval using Llama Index

    Jupyter Notebook 2

  3. mlops-car-prices Public

    MLOps Car Prices Prediction

    Python 2

  4. rag-langchain-ragas Public

    RAG project for QA retrieval using LangChain and Ragas

    Jupyter Notebook 2

  5. foodscore-code Public

    MLOps Computer Vision Food Classification

    Python 1

  6. peft-gemma-2b Public

    PEFT of the Gemma 2B model using QLoRA

    Jupyter Notebook