Welcome to my repository for the LLM Zoomcamp 2024! This repository houses all the work and projects I am doing for this exciting journey into Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG).
I am a strong advocate of learning in public. This repository will serve as a documentation of my progress, challenges, and accomplishments throughout the zoomcamp. Feel free to explore, provide feedback, and collaborate.
- Video: Watch on YouTube
- Code: GitHub Repository
- Overview of LLMs and RAG
- Setting up the development environment for the course
- Fundamentals of search and retrieval mechanisms
- Working with the OpenAI API for LLMs
- Building a simple RAG system with OpenAI
- Implementing text search functionalities using Elasticsearch
- Utilizing open-source models from HuggingFace Hub
- Running LLMs on a CPU with Ollama
- Building user interfaces for LLM applications using Streamlit
- Understanding vector databases and vector search
- Creating and indexing embeddings
- Implementing vector search with Elasticsearch
- Hands-on workshop on data loading techniques
- Implementing monitoring with ground-truth
- Understanding metrics (RAGAs)
- Dashboarding with Grafana for visualization
- Monitoring chat interactions
- Setting up guardrails for LLM applications
- Ingesting data with Mage
- Best practices for LLM orchestration and ingestion
- Advanced tips and best practices for building robust RAG systems
- Details on the LLM Zoomcamp 2024 competition and my participation
I will be updating this repository regularly with new content, code, and learnings from the zoomcamp. Stay tuned for more exciting updates!
Happy Learning!
Brandon S. Copeland