![Redis](https://redis.io/wp-content/uploads/2024/04/Logotype.svg?auto=webp&quality=85,75&width=120)
# Vector Search with redis-py
## Let's Begin!
<a href="https://colab.research.google.com/github/redis-developer/redis-ai-resources/blob/main/python-recipes/vector-search/00_redispy.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>


## Prepare data

In this example we will load a list of movie objects with the following attributes: `title`, `rating`, `description`, and `genre`.

For the vector part of our vector search, we will embed the description so that users can search for movies that best match what they're looking for.

**If you are running this notebook locally**, you may not need to perform this step at all, provided the `resources` directory is already present next to the notebook.

In [1]:
# NBVAL_SKIP
!git clone https://github.com/redis-developer/redis-ai-resources.git temp_repo
!mv temp_repo/python-recipes/vector-search/resources .
!rm -rf temp_repo

Cloning into 'temp_repo'...
remote: Enumerating objects: 738, done.
remote: Counting objects: 100% (313/313), done.
remote: Compressing objects: 100% (203/203), done.
remote: Total 738 (delta 222), reused 120 (delta 110), pack-reused 425 (from 2)
Receiving objects: 100% (738/738), 57.89 MiB | 11.15 MiB/s, done.
Resolving deltas: 100% (326/326), done.


## Packages

In [2]:
%pip install -q "redis>=5.0.5" numpy sentence-transformers

[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m268.9/268.9 kB[0m [31m8.0 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m363.4/363.4 MB[0m [31m1.5 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m13.8/13.8 MB[0m [31m94.8 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m24.6/24.6 MB[0m [31m74.9 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m883.7/883.7 kB[0m [31m44.1 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m664.8/664.8 MB[0m [31m1.4 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m211.5/211.5 MB[0m [31m5.4 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m56.3/56.3 MB[0m [31m12.7 MB/s[0m eta [36m0:00:00[0m
[2K   [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

## Install Redis Stack

Later in this tutorial, Redis will be used to store, index, and query vector
embeddings created from movie descriptions. **We need to make sure we have a Redis
instance available.**

#### For Colab
Use the shell script below to download, extract, and install [Redis Stack](https://redis.io/docs/getting-started/install-stack/) directly from the Redis package archive.

In [3]:
# NBVAL_SKIP
%%sh
curl -fsSL https://packages.redis.io/gpg | sudo gpg --dearmor -o /usr/share/keyrings/redis-archive-keyring.gpg
echo "deb [signed-by=/usr/share/keyrings/redis-archive-keyring.gpg] https://packages.redis.io/deb $(lsb_release -cs) main" | sudo tee /etc/apt/sources.list.d/redis.list
sudo apt-get update  > /dev/null 2>&1
sudo apt-get install redis-stack-server  > /dev/null 2>&1
redis-stack-server --daemonize yes

deb [signed-by=/usr/share/keyrings/redis-archive-keyring.gpg] https://packages.redis.io/deb jammy main
Starting redis-stack-server, database path /var/lib/redis-stack


#### For Alternative Environments
There are many ways to get the necessary redis-stack instance running:
1. In the cloud, deploy a [FREE instance of Redis in the cloud](https://redis.com/try-free/). Or, if you have your own version of Redis Enterprise running, that works too!
2. Per OS, [see the docs](https://redis.io/docs/latest/operate/oss_and_stack/install/install-stack/)
3. With docker: `docker run -d --name redis-stack-server -p 6379:6379 redis/redis-stack-server:latest`

### Define the Redis Connection URL

By default, this notebook connects to the local instance of Redis Stack. **If you have your own Redis Enterprise instance**, replace the `REDIS_PASSWORD`, `REDIS_HOST`, and `REDIS_PORT` values with your own.

In [4]:
import os

# Replace values below with your own if using Redis Cloud instance
REDIS_HOST = os.getenv("REDIS_HOST", "localhost") # ex: "redis-18374.c253.us-central1-1.gce.cloud.redislabs.com"
REDIS_PORT = os.getenv("REDIS_PORT", "6379")      # ex: 18374
REDIS_PASSWORD = os.getenv("REDIS_PASSWORD", "")  # ex: "1TNxTEdYRDgIDKM2gDfasupCADXXXX"

# If SSL is enabled on the endpoint, use rediss:// as the URL prefix
REDIS_URL = f"redis://:{REDIS_PASSWORD}@{REDIS_HOST}:{REDIS_PORT}"
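The URL assembly above can be sketched as a small helper. This is a minimal sketch, not part of redis-py: `build_redis_url` and its `use_ssl` flag are hypothetical names introduced here. It switches to the `rediss://` scheme when SSL is requested and percent-encodes the password, which helps when the password contains characters such as `@` or `/` (redis-py is expected to decode it again when parsing the URL).

```python
from urllib.parse import quote


def build_redis_url(host, port, password="", use_ssl=False):
    """Assemble a Redis connection URL (hypothetical helper, not part of redis-py)."""
    # rediss:// (double "s") selects an SSL/TLS connection
    scheme = "rediss" if use_ssl else "redis"
    # Percent-encode the password so characters like "@" or "/" do not break URL parsing
    auth = f":{quote(password, safe='')}@" if password else ""
    return f"{scheme}://{auth}{host}:{port}"


print(build_redis_url("localhost", 6379))
print(build_redis_url("example.host", 18374, "p@ss/word", use_ssl=True))
```

The result can be passed to `Redis.from_url(...)` in the same way as `REDIS_URL`.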

### Create redis client

In [5]:
from redis import Redis
client = Redis.from_url(REDIS_URL)
client.ping()

True

In [6]:
import json

with open("resources/movies.json", 'r') as file:
    movies = json.load(file)

In [7]:
import numpy as np
from sentence_transformers import SentenceTransformer

# load model for embedding our movie descriptions
model = SentenceTransformer('sentence-transformers/all-MiniLM-L6-v2')

def embed_text(model, text):
    return np.array(model.encode(text)).astype(np.float32).tobytes()


In [8]:
# Note: convert embedding array to bytes for storage in Redis Hash data type
movie_data = [
    {
        **movie,
        "vector": embed_text(model, movie["description"])
    } for movie in movies
]
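To make the byte conversion less opaque, here is a rough sketch of what the stored value looks like. It uses the standard-library `array` module as a stand-in for NumPy and a made-up embedding instead of a real model output, so treat both as assumptions for illustration. Each of the 384 dimensions is packed as a 4-byte float, which is what the `FLOAT32` type and `DIM` of 384 in an index schema need to agree with.

```python
from array import array

DIM = 384  # embedding size produced by all-MiniLM-L6-v2

# Stand-in embedding; in the notebook this comes from model.encode(text)
embedding = [0.001 * i for i in range(DIM)]

# Pack as 32-bit floats in native byte order, the same layout that
# astype(np.float32).tobytes() yields on the same machine
blob = array("f", embedding).tobytes()
print(len(blob))  # 4 bytes per dimension

# Unpack the bytes back into floats to confirm the round trip
restored = array("f")
restored.frombytes(blob)
print(len(restored))
```

If the byte length of a stored vector is not `4 * DIM`, Redis will not be able to index that document's vector field.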

In [None]:
movie_data[0]

{'id': 1,
 'title': 'Explosive Pursuit',
 'genre': 'action',
 'rating': 7,
 'description': 'A daring cop chases a notorious criminal across the city in a high-stakes game of cat and mouse.',
 'vector': b'\x8bf|=\xc3`\n;\xf2\x91\xb7;?\xcb~\xbd\xdfd\xce\xbb\xc7\x16J=H\xa7?=\xdfv\x95<h\xfa\x06\xbe\x11Y\xcf=)\x07p=E\xdb\r\xbd\x93\xf2H\xbdke\xc6<@\xdfa=o8\x16\xbc\xf1\xd3\x13<8\xaa\x1c=\x14\xef\x89<\xc1\xb0-<\x9d\xb2\x9f\xbc^\x0b\xc3\xbd\xa5NR=ol\xf7\xbcP>\x17\xbeA\x1e\x05\xb9Hu\xbf<B\xe3b\xba\xd8\xa6\xa8\xbd\x98\xdc\xec\xbc`c%=\x81\xe7r\xbb$OG=:(\x85=a@\xa2\xbc-Z\xd0\xbdB%K\xbd\xc8\xed\x94\xbcW\xddH=\x8e&F<\xde*\xec<\x8d\xd8\x8d\xbd\xbdZ\x98<\x14\xa3\xa3=>g3\xbd$\xcd\xbd\xbd\xa1$\xf7;\x04\xf5z=\xfc\xb4\x8c=\x89\x0e\xc6\xbdhI\x90\xbd^\x16\xbd;z\xe7\x0c\xbd\x1b3\xc9\xbc\x89\xf8\xbb\xbc\x18\'u\xbb>\x8f\xca<\x02\x80J=\x0e\xaf*=\x8dOU\xbd\xcf\xf0\x95\xbc \x02\x19=\x19\xf4K<\xc5\xc2\t=J\x83\xac=\x95\xd7\xb8\xbd\xf2\xb5\x9c\xbd=\x85\x18=\x94d&=03\xf8<\xee\xf7\x88<\x80v\xf2\xbb9=[\xbdG\xac\xee\xbb<

## Define Redis index schema

In [9]:
from redis.commands.search.field import VectorField, TagField, NumericField, TextField
from redis.commands.search.index_definition import IndexDefinition, IndexType

index_name = "movies"

schema = (
    VectorField(
        "vector",
        "HNSW",
        {
            "TYPE": "FLOAT32",
            "DIM": 384,
            "DISTANCE_METRIC": "COSINE"
        }
    ),
    NumericField("rating"),
    TagField("genre"),
    TextField("title"),
    TextField("description")
)

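from redis.exceptions import ResponseError

try:
    # FT.INFO raises a ResponseError when the index does not exist yet
    client.ft(index_name).info()
    print("Index exists!")
except ResponseError:
    # index definition: index documents stored as Redis hashes
    definition = IndexDefinition(index_type=IndexType.HASH)

    # create the index
    client.ft(index_name).create_index(fields=schema, definition=definition)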
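The schema sets `DISTANCE_METRIC` to `COSINE`, so the distances returned by vector queries are cosine distances, that is, one minus the cosine similarity. The following is a plain-Python sketch of that formula for intuition only; `cosine_distance` is a hypothetical helper, and Redis computes the metric internally.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity: 0 for same direction, 1 for orthogonal, 2 for opposite."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


print(cosine_distance([1.0, 0.0], [1.0, 0.0]))   # same direction
print(cosine_distance([1.0, 0.0], [0.0, 1.0]))   # orthogonal
print(cosine_distance([1.0, 0.0], [-1.0, 0.0]))  # opposite direction
```

Smaller values mean closer matches, which is why the queries in this notebook sort ascending by the distance field.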

## Populate index

In [10]:
def load_docs(client: Redis, data: list[dict]):
    for i, d in enumerate(data):
        client.hset(
            i,
            mapping = d
        )

def print_results(res):
    docs = [(doc.title, doc.genre, doc.rating) for doc in res.docs]
    print(f"Top {len(docs)} movies: ", docs)

In [11]:
load_docs(client, movie_data)
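For 20 movies, one `HSET` round trip per document is fine. For larger datasets, a pipeline can batch the writes. The sketch below is one possible variant, not the notebook's method: `load_docs_pipelined` and `batch_size` are names made up here, and it assumes `client` is a redis-py client whose `pipeline()` can be used as a context manager.

```python
def load_docs_pipelined(client, data, batch_size=500):
    """Write each dict as a Redis hash, sending the HSET commands in batches."""
    # transaction=False: plain batching without wrapping commands in MULTI/EXEC
    with client.pipeline(transaction=False) as pipe:
        for i, doc in enumerate(data):
            # Same keys as load_docs: the position in the list, as a string
            pipe.hset(str(i), mapping=doc)
            if (i + 1) % batch_size == 0:
                pipe.execute()  # flush a full batch to the server
        pipe.execute()  # flush whatever is left over


# Usage, assuming the client and movie_data defined earlier in the notebook:
# load_docs_pipelined(client, movie_data)
```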

In [12]:
res = client.ft(index_name).search("*")
res

Result{20 total, docs: [Document {'id': '0', 'payload': None, 'description': 'A daring cop chases a notorious criminal across the city in a high-stakes game of cat and mouse.', 'genre': 'action', 'vector': 'f|=a\n;D;\x1b~?eλ\x16J=[?=v<f\x06\x1dY=+\x07p=D\rHwe<Ba=g8\x16\x13<2\x1c=<-<Q\x0býNR=alK>\x17\x0e\x05.u<b֦Oc%=r\'OG=:(=a@\'ZнA%K피oH=&F<*<؍Z<\x10=@g3\x1dͽ$;\x14z=\x06=\x0eƽaI=\x16;u\x0c\x1b3ɼ&u5<\x7fJ=\x03*=OU\x19\x02\x19=\'K<\t=F=\u05f8ﵜ?\x18=d&=$3<\x0b<"v)=[(:Ad\x19cx;;O<1,=\x00-\x18\x06p\x1a=ο=\x13\x150=\U0005df33GK= =Ieen:\x1c=<\x14p\x16<^p<<9s<\x104&<\x1c\x18<\x0f-<N=/=\'<:L\x13<Q\x12κaܼ8\x05=\x1d<\x15R0-\\91=bV=9Gּ"S=^<\x05\x13<Ҳy|/\x06p<Н<K\x02=\x13|\U000a6ef1eQK`\x10%t9ɹhk=\r<4f]=\x18 4/ffav=#\x0cq<-\x1c\x08<nt<s\x19=rl^\x12V\x0exFU;G|^*s=\x10;\x13!\x0e\x0bԴ=\x1e&\x15W=@$:<<<\x11\x08\x00i\x16\x01<=9_|!\x02\x1cŹ>\x1bF?\x14я(O\x16<\x12=\x05Olp\'˞e\x0fs;\\a;B":=%\x16*<|\x1b=\x0eV\r=C\x06=\x11u\x16<\x16;9/=\x85=|\x0b"=i<Cc=2\x08\x12;=0WW;\x1db<ȝ<|r;azI\x00<\x0f\x1a<0K%\x13翻{\x12=

## Index loaded, now we can perform vector search

### Basic vector search

In [None]:
from redis.commands.search.query import Query

user_query = "High tech movies"

embedded_user_query = embed_text(model, user_query)

# Note: dialect 2 and above required for vector search
query = Query("(*)=>[KNN 3 @vector $vec_param AS dist]").sort_by("dist").dialect(2)

res = client.ft(index_name).search(query, query_params = {'vec_param': embedded_user_query})

print_results(res)


Top 3 movies:  [('Fast & Furious 9', 'action', '6'), ('Despicable Me', 'comedy', '7'), ('The Incredibles', 'comedy', '8')]
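The KNN query string has a fixed shape: a filter expression in parentheses, then `=>[KNN k @field $param AS alias]`. As a small illustration, a helper can assemble it so that only the filter and `k` change between queries. `knn_query_string` is a hypothetical function introduced for this sketch, not part of redis-py.

```python
def knn_query_string(k=3, filter_expr="*", vector_field="vector",
                     param_name="vec_param", alias="dist"):
    """Build a KNN query string; filter_expr '*' means no pre-filter."""
    return f"({filter_expr})=>[KNN {k} @{vector_field} ${param_name} AS {alias}]"


print(knn_query_string())
print(knn_query_string(filter_expr="@genre:{action}"))
```

The returned string can be wrapped in `Query(...)` and combined with `.sort_by("dist").dialect(2)` as in the cell above, with the embedded query passed under the `vec_param` key of `query_params`.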


### Hybrid filter vector search

Redis lets you combine vector search with filters on other fields in the index, which allows for more specific searches.

In [15]:
# Search for top 3 movies specifically in the action genre
from redis.commands.search.query import Query

user_query = "High tech movies"

embedded_user_query = embed_text(model, user_query)

# Note: genre is a tag field in our schema so the syntax is @<field_name>:{ <tag> | <tag> | ...}
query = Query("(@genre:{action})=>[KNN 3 @vector $vec_param AS dist]").sort_by('dist').dialect(2)

res = client.ft(index_name).search(query, query_params = {'vec_param': embedded_user_query})

print_results(res)

Top 3 movies:  [('Fast & Furious 9', 'action', '6'), ('Mad Max: Fury Road', 'action', '8'), ('Explosive Pursuit', 'action', '7')]


In [16]:
# Search for top 3 movies specifically in the action genre with ratings at or above a 7

user_query = "High tech movies"

embedded_user_query = embed_text(model, user_query)

query = Query("(@genre:{action} & (@rating:[7 inf]))=>[KNN 3 @vector $vec_param AS dist]").sort_by('dist').dialect(2)

res = client.ft(index_name).search(query, query_params = {'vec_param': embedded_user_query})

print_results(res)

Top 3 movies:  [('Mad Max: Fury Road', 'action', '8'), ('Explosive Pursuit', 'action', '7'), ('The Avengers', 'action', '8')]


In [17]:
# Combine vector search with full-text search for movies whose description contains both "criminal" and "mastermind"

user_query = "High tech movies"

embedded_user_query = embed_text(model, user_query)

query = Query("(@description:(criminal mastermind))=>[KNN 3 @vector $vec_param AS dist]").sort_by('dist').dialect(2)

res = client.ft(index_name).search(query, query_params = {'vec_param': embedded_user_query})

print_results(res)

Top 2 movies:  [('Despicable Me', 'comedy', '7'), ('The Dark Knight', 'action', '9')]


In [None]:
# Vector search with wildcard (prefix) match

user_query = "High tech movies"

embedded_user_query = embed_text(model, user_query)

query = Query("(@description:(crim*))=>[KNN 3 @vector $vec_param AS dist]").sort_by('dist').dialect(2)

res = client.ft(index_name).search(query, query_params = {'vec_param': embedded_user_query})

print_results(res)

Top 3 movies:  [('Despicable Me', 'comedy', '7'), ('The Incredibles', 'comedy', '8'), ('Explosive Pursuit', 'action', '7')]


In [None]:
# Vector search with fuzzy match

user_query = "High tech movies"

embedded_user_query = embed_text(model, user_query)

# Note: fuzzy match is based on Levenshtein distance, so "hero" might also match "her", for example.
# See docs for more info https://redis.io/docs/latest/develop/interact/search-and-query/advanced-concepts/query_syntax/
query = Query("(@description:%hero%)=>[KNN 3 @vector $vec_param AS dist]").sort_by('dist').dialect(2)

res = client.ft(index_name).search(query, query_params = {'vec_param': embedded_user_query})

print_results(res)

Top 3 movies:  [('The Avengers', 'action', '8'), ('Black Widow', 'action', '7'), ('The Princess Diaries', 'comedy', '6')]
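The fuzzy match above relies on Levenshtein distance: the number of single-character insertions, deletions, or substitutions needed to turn one term into another. A term wrapped in one pair of `%` matches terms at distance 1. Below is a standard dynamic-programming sketch of the metric, shown for intuition only; Redis evaluates fuzzy matches internally against its indexed terms.

```python
def levenshtein(a, b):
    """Edit distance between two strings using insertions, deletions, substitutions."""
    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute, or keep on a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("hero", "her"))     # one deletion
print(levenshtein("hero", "heroes"))  # two insertions
```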


## Range queries

Range queries let you set a predefined distance threshold (the radius) and return every document whose vector falls within it. This is helpful when you want all documents within a certain distance of the search query rather than a fixed number of nearest neighbors.

In [None]:
user_query = "Family friendly fantasy movies"

embedded_user_query = embed_text(model, user_query)

query = (
    Query("@vector:[VECTOR_RANGE $radius $vector]=>{$YIELD_DISTANCE_AS: vector_distance}")
     .sort_by("vector_distance")
     .return_fields("title", "rating", "genre", "vector_distance")
     .dialect(2)
)

# Find all vectors within 0.8 of the query vector
query_params = {
    "radius": 0.8,
    "vector": embedded_user_query
}

res = client.ft(index_name).search(query, query_params)
print_results(res)


Top 6 movies:  [('The Incredibles', 'comedy', '8'), ('Black Widow', 'action', '7'), ('Despicable Me', 'comedy', '7'), ('Shrek', 'comedy', '8'), ('Monsters, Inc.', 'comedy', '8'), ('Aladdin', 'comedy', '8')]
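The difference between a KNN query and a range query can be shown on a toy example. The distances below are invented for illustration and the helper names are hypothetical. KNN always returns a fixed number of results, while a range query returns however many documents fall inside the radius, possibly none. Treating the boundary as inclusive is an assumption of this sketch.

```python
# Hypothetical cosine distances between one query and five documents
distances = {"doc_a": 0.35, "doc_b": 0.62, "doc_c": 0.79, "doc_d": 0.81, "doc_e": 1.10}


def knn(distances, k):
    """Return the k closest documents, regardless of how far away they are."""
    return sorted(distances, key=distances.get)[:k]


def vector_range(distances, radius):
    """Return every document within the radius, closest first."""
    ranked = sorted(distances, key=distances.get)
    return [doc for doc in ranked if distances[doc] <= radius]


print(knn(distances, 2))
print(vector_range(distances, 0.8))
print(vector_range(distances, 0.3))
```

A tighter radius trades recall for relevance, so the right value depends on the embedding model and the data.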


Like the queries above, range queries can be chained with additional filters and conditional operators. The following adds an `or` condition that returns documents that are either within the defined vector range or have a rating at or above 9.

In [None]:
user_query = "Family friendly fantasy movies"

embedded_user_query = embed_text(model, user_query)

query = (
    Query("@rating:[9 +inf] | @vector:[VECTOR_RANGE $radius $vector]=>{$YIELD_DISTANCE_AS: vector_distance}")
     .sort_by("vector_distance")
     .return_fields("title", "rating", "genre", "vector_distance")
     .dialect(2)
)

# Find all vectors within 0.7 of the query vector
query_params = {
    "radius": 0.7,
    "vector": embedded_user_query
}

res = client.ft(index_name).search(query, query_params)
print_results(res)

Top 3 movies:  [('The Incredibles', 'comedy', '8'), ('The Dark Knight', 'action', '9'), ('Inception', 'action', '9')]


### Additional queries

In addition to the variety of vector queries shown above, Redis supports full-text search, aggregations, and various weighting strategies that can be mixed and matched for a wide range of search applications.

### Full text search with BM25

The following query does a pure token-based BM25 search with Redis.

In [None]:
user_input = "Criminal mastermind"  # avoid shadowing the built-in input()

# Lowercase the terms and join them with "|" (OR) so a document matching any term is returned
def tokenize(query):
    return " | ".join(query.split(" ")).lower()

user_query = Query(tokenize(user_input))\
                .scorer("BM25STD") \
                .with_scores() \
                .return_fields("title", "genre", "rating", "description") \
                .paging(0, 10) # limits the number of results to 10

res = client.ft(index_name).search(user_query)
res.docs

[Document {'id': '6', 'payload': None, 'score': 6.267822483123378, 'title': 'The Dark Knight', 'genre': 'action', 'rating': '9', 'description': 'Batman faces off against the Joker, a criminal mastermind who threatens to plunge Gotham into chaos.'},
 Document {'id': '17', 'payload': None, 'score': 5.846220066150412, 'title': 'Despicable Me', 'genre': 'comedy', 'rating': '7', 'description': 'When a criminal mastermind uses a trio of orphan girls as pawns for a grand scheme, he finds their love is profoundly changing him for the better.'},
 Document {'id': '0', 'payload': None, 'score': 3.9323951881774195, 'title': 'Explosive Pursuit', 'genre': 'action', 'rating': '7', 'description': 'A daring cop chases a notorious criminal across the city in a high-stakes game of cat and mouse.'}]
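The simple space-based split works for clean input, but repeated spaces would leave empty terms between the `|` operators, and stray punctuation would end up in the query string. A slightly more defensive variant, offered as a sketch rather than a recommended tokenizer, keeps only runs of word characters before joining them with OR.

```python
import re


def tokenize(query):
    """Lowercase the text, keep runs of word characters, and OR the terms together."""
    tokens = re.findall(r"\w+", query.lower())
    return " | ".join(tokens)


print(tokenize("Criminal mastermind"))
print(tokenize("  Criminal,   mastermind! "))
```

Both calls produce the same query string, so the search behaves the same regardless of how the user spaces or punctuates the input.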

### Weighting (boosting)

Sometimes you might want a search to lean more heavily towards one condition over another and weight it higher in the result set.

In this example, you can see that even though `The Incredibles` isn't an `action` movie it is still the top result because it ranks highly on the fuzzy search for `%superhero%`.

In [None]:
query = Query('((@genre:{action}=>{$weight: 1}) | (@description:(%superhero%)=>{$weight: 10}))') \
        .return_fields("title", "genre", "rating", "description") \
        .paging(0, 3) \
        .dialect(2)

res = client.ft(index_name).search(query)
res.docs

[Document {'id': '15', 'payload': None, 'title': 'The Incredibles', 'genre': 'comedy', 'rating': '8', 'description': "A family of undercover superheroes, while trying to live the quiet suburban life, are forced into action to save the world. Bob Parr (Mr. Incredible) and his wife Helen (Elastigirl) were among the world's greatest crime fighters, but now they must assume civilian identities and retreat to the suburbs to live a 'normal' life with their three children. However, the family's desire to help the world pulls them back into action when they face a new and dangerous enemy."},
 Document {'id': '0', 'payload': None, 'title': 'Explosive Pursuit', 'genre': 'action', 'rating': '7', 'description': 'A daring cop chases a notorious criminal across the city in a high-stakes game of cat and mouse.'},
 Document {'id': '1', 'payload': None, 'title': 'Skyfall', 'genre': 'action', 'rating': '8', 'description': 'James Bond returns to track down a dangerous new enemy who threatens global secur

In [None]:
# clean up! Note: flushall() deletes every key in every database on this Redis instance
client.flushall()

True