Alpha Tuning Llama Pack: KodaRetriever (#11311)

run-llama · Feb 26, 2024 · 65290f5 · 65290f5
1 parent 70d4a5c
commit 65290f5
Show file tree

Hide file tree

Showing 26 changed files with 6,892 additions and 0 deletions.
diff --git a/llama-index-packs/llama-index-packs-koda-retriever/.gitignore b/llama-index-packs/llama-index-packs-koda-retriever/.gitignore
@@ -0,0 +1,153 @@
+llama_index/_static
+.DS_Store
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+# C extensions
+*.so
+
+# Distribution / packaging
+.Python
+bin/
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+etc/
+include/
+lib/
+lib64/
+parts/
+sdist/
+share/
+var/
+wheels/
+pip-wheel-metadata/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+.ruff_cache
+
+# Translations
+*.mo
+*.pot
+
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+
+# Flask stuff:
+instance/
+.webassets-cache
+
+# Scrapy stuff:
+.scrapy
+
+# Sphinx documentation
+docs/_build/
+
+# PyBuilder
+target/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+notebooks/
+
+# IPython
+profile_default/
+ipython_config.py
+
+# pyenv
+.python-version
+
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow
+__pypackages__/
+
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+
+# SageMath parsed files
+*.sage.py
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+pyvenv.cfg
+
+# Spyder project settings
+.spyderproject
+.spyproject
+
+# Rope project settings
+.ropeproject
+
+# mkdocs documentation
+/site
+
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+
+# Pyre type checker
+.pyre/
+
+# Jetbrains
+.idea
+modules/
+*.swp
+
+# VsCode
+.vscode
+
+# pipenv
+Pipfile
+Pipfile.lock
+
+# pyright
+pyrightconfig.json
diff --git a/llama-index-packs/llama-index-packs-koda-retriever/BUILD b/llama-index-packs/llama-index-packs-koda-retriever/BUILD
@@ -0,0 +1,3 @@
+poetry_requirements(
+    name="poetry",
+)
diff --git a/llama-index-packs/llama-index-packs-koda-retriever/Makefile b/llama-index-packs/llama-index-packs-koda-retriever/Makefile
@@ -0,0 +1,17 @@
+GIT_ROOT ?= $(shell git rev-parse --show-toplevel)
+
+help:	## Show all Makefile targets.
+	@grep -E '^[a-zA-Z_-]+:.*?## .*$$' $(MAKEFILE_LIST) | awk 'BEGIN {FS = ":.*?## "}; {printf "\033[33m%-30s\033[0m %s\n", $$1, $$2}'
+
+format:	## Run code autoformatters (black).
+	pre-commit install
+	git ls-files | xargs pre-commit run black --files
+
+lint:	## Run linters: pre-commit (black, ruff, codespell) and mypy
+	pre-commit install && git ls-files | xargs pre-commit run --show-diff-on-failure --files
+
+test:	## Run tests via pytest.
+	pytest tests
+
+watch-docs:	## Build and watch documentation.
+	sphinx-autobuild docs/ docs/_build/html --open-browser --watch $(GIT_ROOT)/llama_index/
diff --git a/llama-index-packs/llama-index-packs-koda-retriever/README.md b/llama-index-packs/llama-index-packs-koda-retriever/README.md
@@ -0,0 +1,77 @@
+# Koda Retriever
+
+This retriever is a custom fine-tunable Hybrid Retriever that dynamically determines the optimal alpha for a given query.
+An LLM is used to categorize the query and therefore determine the optimal alpha value, as each category has a preset/provided alpha value.
+It is recommended that you run tests on your corpus of data and queries to determine categories and corresponding alpha values for your use case.
+
+![koda-retriever-mascot](https://i.imgur.com/224ocIw.jpeg)
+
+### Disclaimer
+
+_The default categories and alpha values are not recommended for production use_
+
+## Introduction
+
+Alpha tuning in hybrid retrieval for RAG models refers to the process of adjusting the weight (alpha) given to different components of a hybrid search strategy. In RAG, the retrieval component is crucial for fetching relevant context from a knowledge base, which the generation component then uses to produce answers. By fine-tuning the alpha parameter, the balance between the retrieved results from dense vector search methods and traditional sparse methods can be optimized. This optimization aims to enhance the overall performance of the system, ensuring that the retrieval process effectively supports the generation of accurate and contextually relevant responses.
+
+### Simply explained
+
+Imagine you're playing a game where someone whispers a sentence to you, and you have to decide whether to draw a picture of exactly what they said, or draw a picture of what you think they mean. Alpha tuning is like finding the best rule for when to draw exactly what's said and when to think deeper about the meaning. It helps us get the best mix, so the game is more fun and everyone understands each other better!
+
+## Usage Snapshot
+
+Koda Retriever is compatible with all other retrieval interfaces and objects that would normally be able to interact with an LI-native [retriever](https://docs.llamaindex.ai/en/stable/module_guides/querying/retriever/root.html).
+
+Please see the [examples](./examples/) folder for more specific examples.
+
+```python
+# Setup
+from llama_index.packs.koda_retriever import KodaRetriever
+from llama_index.core import VectorStoreIndex
+from llama_index.llms.openai import OpenAI
+from llama_index.embeddings.openai.base import OpenAIEmbedding
+from llama_index.core.postprocessor import LLMRerank
+from llama_index.core import Settings
+
+Settings.llm = OpenAI()
+Settings.embed_model = OpenAIEmbedding()
+vector_store = PineconeVectorStore(pinecone_index=index, text_key="summary")
+vector_index = VectorStoreIndex.from_vector_store(
+    vector_store=vector_store, embed_model=Settings.embed_model
+)
+
+reranker = LLMRerank(llm=Settings.llm)  # optional
+retriever = KodaRetriever(
+    index=vector_index, llm=Settings.llm, reranker=reranker, verbose=True
+)
+
+# Retrieval
+query = "What was the intended business model for the parks in the Jurassic Park lore?"
+
+results = retriever.retrieve(query)
+
+# Query Engine
+query_engine = RetrieverQueryEngine.from_args(retriever=retriever)
+
+response = query_engine.query(query)
+```
+
+### Prerequisites
+
+- Vector Store Index w/ hybrid search enabled
+- LLM (or any model to route/classify prompts)
+
+Please note that you will also need vector AND text representations of your data for a hybrid retriever to work. It is not uncommon for some vector databases to only store the vectors themselves, in which case an error will occur downstream if you try to run any hybrid queries.
+
+## Setup
+
+## Citations
+
+Idea & original implementation sourced from the following docs:
+
+- https://blog.llamaindex.ai/llamaindex-enhancing-retrieval-performance-with-alpha-tuning-in-hybrid-search-in-rag-135d0c9b8a00
+- https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid/ba-p/3929167
+
+## Buy me a coffee
+
+[Thanks!](https://www.buymeacoffee.com/nodice)
diff --git a/llama-index-packs/llama-index-packs-koda-retriever/examples/README.md b/llama-index-packs/llama-index-packs-koda-retriever/examples/README.md
@@ -0,0 +1,8 @@
+# Notebooks as a guide
+
+For now, the following guides exist:
+
+- [Alpha Tuning](alpha_tuning.ipynb): Showcases a (simple) example on how to provide custom categories/alpha values for alpha tuning.
+- [Quickstart](quickstart.ipynb): A quickstart notebook showing how Koda Retriever functions with default values and basic fixtures.
+
+Both are recommended to review. More notebooks will be added as this project grows.