Skip to content

Tutorials to help you get started with ChromaDB. Chroma has built-in functionality to embed text and images so you can build out your proof-of-concepts on a vector database quickly. Associated videos:

Notifications You must be signed in to change notification settings

johnnycode8/chromadb_quickstart

Repository files navigation

This repo contains companion video explanation and code walkthrough from my YouTube channel @johnnycode. If the code and video helped you, please consider: Buy Me A Coffee

Getting Started with ChromaDB - Text Semantic Search

ChromaDB is the Vector Database with the lowest learning curve. Start testing out Semantic Searches on a vector database within minutes. Everything works locally and is free. Don't need to sign up for a cloud account or learn Langchain first.

Code Reference:
YouTube Tutorial:

Getting Started with ChromaDB

Getting Started with ChromaDB - Multimodal (Image) Semantic Search

How to start using ChromaDB Multimodal (images) semantic searches on a vector database. We’ll load some images and query for objects in the images. We’ll also cover how to use the Where Metadata filter to limit the relevance of the search results.

Code Reference:
YouTube Tutorial:

Getting Started with ChromaDB Multimodal

Maximize Embedding Vectorization Speed in ChromaDB with NVidia CUDA GPU and Python Multiprocessing

How to vectorize embeddings into ChromaDB as fast as possible leveraging the power of your NVidia CUDA GPU along with Python's Multiprocessing capability. We'll use Multiprocessing to 1) launch a Python producer process on the CPU to handle the workload of reading and transforming the data and 2) launch a consumer process to vectorize the data into embeddings using the GPU.

Code Reference:
YouTube Tutorial:

Maximize Vectorization Speed ChromaDB

Set Up ChromaDB with Docker & Enable Role-Based Token Authentication

  • First, we'll get ChromaDB up and running quickly in a Docker container. ChromaDB can be accessed via a HTTP client without authentication.
  • Then, add token-based authentication, but this will only work for 1 token i.e. 1 admin user.
  • Finally, add role-based token authentication to allow for multiple users with different types of permissions to connect to ChromaDB.
  • I'll also show how to resolve the "ModuleNotFoundError: No module named hypothesis" error in version 0.5.2 of ChromaDB by building a new image with 'hypothesis' installed. This technique can also be used to install your packages into the container.
Code Reference:
  • See chroma_docker directory
YouTube Tutorial:

ChromaDB with Docker and Token Authentication

Use Gemini Pro (FREE) to Create Vector Embeddings for ChromaDB

Improve your semantic searches with vector embeddings from one of the best LLMs out there. We'll swap the ChromaDB out-of-the-box local model with the Gemini Pro embedding model with just a few code changes.

Code Reference:
YouTube Tutorial:

Getting Started with ChromaDB Multimodal

How to Work With and Persist ChromaDB Vector Database in Google Colab

Tutorial shows how to persist a ChromaDB database in Google Colab by creating the database in your Google Drive. This is part of my Recipe Database tutorial series at RecipeDB Repo.

Video Tutorial:

Get Started with ChromaDB on Colab

About

Tutorials to help you get started with ChromaDB. Chroma has built-in functionality to embed text and images so you can build out your proof-of-concepts on a vector database quickly. Associated videos:

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages