😃 Support me on GitHub Sponsors if you like my content! This encourages me to keep working on new projects 😎
And don't forget to read my profile while listening to this beautiful Spanish AI Song 🎶 🎷 🎶
👉 CONTACT ME! 👉 Use this form or reach out on LinkedIn! 🚀
Innovative and dynamic Data Scientist with a proven track record in leveraging AI and Machine/Deep Learning techniques to develop impactful data-driven solutions. My skill set includes:
- ✅ Data Visualization & Analytics: Power BI, Looker, Tableau, Matplotlib, Seaborn, Plotly, Streamlit, Gradio
- ✅ Web Scraping: BeautifulSoup, Scrapy, Selenium
- ✅ Maths and Statistics: Statsmodels, SciPy
- ✅ Data Science & ML: Python, TensorFlow, PyTorch, Scikit-learn, OpenCV, NLTK, spaCy
- ✅ AI: LangChain, LlamaIndex, OpenAI, Hugging Face, Transformers, Ollama, Llamafile, CrewAI, Pinecone, Qdrant, Chroma, Faiss, Ragas, LangSmith
- ✅ Key Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Time Series, Neural Networks, Ensemble Methods, PCA, Clustering, Dimensionality Reduction, Anomaly Detection
- ✅ Data Engineering: dbt, Terraform, SQL, BigQuery, PySpark
- ✅ MLOps: MLflow, Prefect, Mage, Comet, Docker, Kubernetes
- ✅ APIs: Flask, FastAPI, AWS API Gateway
- ✅ Cloud Platforms: GCP, AWS, Azure
Currently working as a freelancer on end-to-end Data Analytics/Science, ML, and AI projects, and open to further collaboration opportunities!
If you want me to develop a project for you, feel free to contact me! I am always developing personal projects and I'm happy to implement new ideas 😃.
💰 My personal end-to-end projects can be found in these repositories (feel free to click ⭐ if you like them 😎):
Project Name | Main Libraries/Tools | Cloud Service | App | Best Practices |
---|---|---|---|---|
ML/MLOps | ||||
Medical Insurance Costs Prediction | Scikit-learn, TensorFlow, Comet ML, Flask | AWS | | Experiment Tracking, Model Registry, Model/Data Monitoring, Linting, Formatting, Testing, Error Handling, Coverage, CI/CD |
Stroke Prediction | Scikit-learn, XGBoost, Comet ML, Flask, Docker | AWS | | Experiment Tracking, Model Registry/Monitoring, Containerization, Testing, Error Handling |
Car Price Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, Flask, Docker, Grafana, Terraform | AWS | | Experiment Tracking, Model Registry/Monitoring, Orchestration, Containerization, Linting, Formatting, Testing, Error Handling, IaC, CI/CD |
Taxi Rides Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, FastAPI, Docker | GCP | | Experiment Tracking, Model Registry/Monitoring, Orchestration, Containerization, Error Handling |
Music Clustering | Scikit-learn, FastAPI, Docker | AWS | Streamlit | Containerization, CI/CD |
Birds Classification | PyTorch | | Gradio | |
Food Prediction | Scikit-learn, TensorFlow, OpenCV, FastAPI, Docker | GCP | Streamlit | Containerization |
LLM, RAG and Fine-tuning | ||||
Scalable RAG with GKE, LlamaIndex and Qdrant | OpenAI, LlamaIndex, Qdrant, Docker, FastAPI, GKE | GCP | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling, CI/CD |
RAG AWS LangChain Qdrant | OpenAI, LangChain, Qdrant, Docker, AWS API Gateway | AWS | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling |
Q&A and Summarization | Hugging Face Transformers, Whisper, LangChain, ChromaDB | | Streamlit | Error Handling |
RAG Llama Index | OpenAI, LlamaIndex, Deep Lake | |||
RAG LangChain Ragas | OpenAI, Hugging Face Transformers, Faiss, LangChain, Ragas | |||
Agentic RAG LangChain Pinecone | OpenAI, Groq, LangChain, Pinecone | |||
CrewAI RAG LangChain Qdrant | OpenAI, LangChain, Qdrant, CrewAI Agents | |||
Fine Tuning Gemma 2B | Hugging Face Transformers, PEFT (LoRA/QLoRA) | Hugging Face | ||
Data Analysis + Modeling | ||||
Cryptocurrencies Analysis | ARIMA, XGBoost, TensorFlow LSTM, Prophet | |||
News Classification | Scikit-learn (Multinomial Naive Bayes), TensorFlow (CNN, RNN, feedforward) | | Streamlit | |
Breast Cancer Classification | Scikit-learn, Spark | IBM | ||
Bank Churn Classification | Scikit-learn, LightGBM, XGBoost, CatBoost | |||
Data Engineering | ||||
Hotel Reviews | Prefect, Spark, SQL, BigQuery, dbt, Terraform, Looker | GCP | | Orchestration, Linting, Formatting, Error Handling, Pre-Commit, IaC, CI/CD |
Air Quality Switzerland | Mage, dbt, SQL, BigQuery, Docker, Terraform, Looker | GCP | | Orchestration, IaC, Containerization, CI/CD |
💸 Additionally, you can find my Power BI projects:
- Personal Finance: Analysis and Comparison of Income, Bills, Profits and Available Money
- Product Sales Comparison: Comparison of sales across products using DAX functions
Last but not least, I also have a Tableau portfolio that uses groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced features.
Cloud Storage | BigQuery | Cloud Run | VM | GKE | Vertex AI | Dataproc | Earth Engine | Container Registry |
---|---|---|---|---|---|---|---|---|
SageMaker | S3 | EC2 | ECR | Kinesis | Lambda | RDS | API Gateway | EventBridge |
---|---|---|---|---|---|---|---|---|
Azure Databricks | Data Lake Gen2 | Data Factory | Container Registry |
---|---|---|---|