cocoindex-io · badmonster0 · Nov 21, 2025 · Nov 12, 2025 · Nov 12, 2025 · Nov 12, 2025
diff --git a/docs/docs/examples/examples/image_search_clip.md b/docs/docs/examples/examples/image_search_clip.md
@@ -0,0 +1,324 @@
+---
+title: Build image search and query with natural language with vision model CLIP
+description: Indexing images with CocoIndex and Vision Model CLIP for efficient image search and natural language querying
+sidebar_class_name: hidden
+slug: /examples/image_search_clip
+canonicalUrl: '/examples/image_search_clip'
+sidebar_custom_props:
+  image: /img/examples/image_search_clip/cover.png
+  tags: [vector-index, multi-modal]
+image: /img/examples/image_search_clip/cover.png
+tags: [vector-index, multi-modal]
+---
+
+import { GitHubButton, YouTubeButton, DocumentationButton } from '../../../src/components/GitHubButton';
+
+<GitHubButton url="https://github.com/cocoindex-io/cocoindex/blob/main/examples/image_search/main.py" margin="0 0 24px 0" />
+
+![Image Search](/img/examples/image_search_clip/cover.png)
+
+## Overview
+In this project, you'll create an image search system that lets you find images using natural language queries—such as "a cute animal" or "a red car". The system will automatically return the most visually relevant results, without the need for manual labeling or tagging.
+
+We are going to use multi-modal embedding model CLIP to understand and directly embed the image; and build a vector index for efficient retrieval.
+
+We are going use CocoIndex to build the indexing flow. It supports long running flow and only process changed files - we can keep adding new files to the folder and it will be indexed within a minute.
+
+
+## CLIP ViT-L/14
+[CLIP ViT-L/14](https://huggingface.co/openai/clip-vit-large-patch14) is a powerful vision-language model that can understand both images and texts. 
+It's trained to align visual and textual representations in a shared embedding space, making it perfect for our image search use case.
+
+In our project, we use CLIP to:
+1. Generate embeddings of the images directly 
+2. Convert natural language search queries into the same embedding space
+3. Enable semantic search by comparing query embeddings with caption embeddings
+
+**Alternative:** [CLIP ViT-B/32](https://huggingface.co/openai/clip-vit-base-patch32) is a lighter-weight model that runs faster than ViT-L/14. While it may not be quite as accurate, it offers improved speed and requires fewer resources.
+
+## Flow Overview
+![Flow](/img/examples/image_search_clip/flow.png)
+
+1. Ingest image files from your local directory
+2. Generate embeddings for each image using the CLIP model
+3. Save these embeddings into a vector database for efficient search and retrieval
+
+## Setup
+- [Install Postgres](https://cocoindex.io/docs/getting_started/installation#-install-postgres) if you don't have one.
+
+- Make sure Qdrant is running
+  ```
+  docker run -d -p 6334:6334 -p 6333:6333 qdrant/qdrant
+  ```
+
+
+## Flow
+
+### Define the flow and ingest the images 
+
+```python
+@cocoindex.flow_def(name="ImageObjectEmbedding")
+def image_object_embedding_flow(flow_builder: cocoindex.FlowBuilder, data_scope: cocoindex.DataScope):
+    data_scope["images"] = flow_builder.add_source(
+        cocoindex.sources.LocalFile(path="img", included_patterns=["*.jpg", "*.jpeg", "*.png"], binary=True),
+        refresh_interval=datetime.timedelta(minutes=1)  # Poll for changes every 1 minute
+    )
+    img_embeddings = data_scope.add_collector()
+```
+
+`flow_builder.add_source` will create a table with sub fields (`filename`, `content`) 
+
+<DocumentationButton url="https://cocoindex.io/docs/sources" text="Sources" margin="16px 0" />
+
+**interval**  
+The `refresh_interval` parameter in `add_source` specifies how frequently CocoIndex will check the source directory (`img`) for new, modified, or deleted images. For example, `datetime.timedelta(minutes=1)` means the system will poll for changes every 1 minute, enabling near-real-time indexing of added or updated files.
+
+
+![Ingest Images](/img/examples/image_search_clip/ingest.png)
+
+### Process each image and collect the information.
+
+#### Define Custom function to embed the image with CLIP
+
+```python
+@functools.cache
+def get_clip_model() -> tuple[CLIPModel, CLIPProcessor]:
+    model = CLIPModel.from_pretrained(CLIP_MODEL_NAME)
+    processor = CLIPProcessor.from_pretrained(CLIP_MODEL_NAME)
+    return model, processor
+```
+The `@functools.cache` decorator caches the results of a function call. In this case, it ensures that we only load the CLIP model and processor once.
+
+
+```python
+@cocoindex.op.function(cache=True, behavior_version=1, gpu=True)
+def embed_image(img_bytes: bytes) -> cocoindex.Vector[cocoindex.Float32, Literal[384]]:
+    model, processor = get_clip_model()
+    image = Image.open(io.BytesIO(img_bytes)).convert("RGB")
+    inputs = processor(images=image, return_tensors="pt")
+    with torch.no_grad():
+        features = model.get_image_features(**inputs)
+    return features[0].tolist()
+```
+
+`embed_image` is a custom function that uses the CLIP model to convert an image into a vector embedding. 
+It accepts image data in bytes format and returns a list of floating-point numbers representing the image's embedding.
+
+<DocumentationButton url="https://cocoindex.io/docs/core/custom_function" text="Custom Function Documentation" margin="16px 0" />
+
+
+The function supports caching through the `cache` parameter. 
+When enabled, the executor will store the function's results for reuse during reprocessing, 
+which is particularly useful for computationally intensive operations. 
+
+
+#### Process each image and collect the information.
+
+```python
+with data_scope["images"].row() as img:
+    img["embedding"] = img["content"].transform(embed_image)
+    img_embeddings.collect(
+        id=cocoindex.GeneratedField.UUID,
+        filename=img["filename"],
+        embedding=img["embedding"],
+    )
+```
+
+![Embed Images](/img/examples/image_search_clip/embedding.png)
+
+
+#### Collect the embeddings
+
+Export the embeddings to a table in Qdrant.
+
+```python
+img_embeddings.export(
+    "img_embeddings",
+    cocoindex.storages.Qdrant(
+        collection_name="image_search",
+        grpc_url=QDRANT_GRPC_URL,
+    ),
+    primary_key_fields=["id"],
+    setup_by_user=True,
+)
+```
+
+<DocumentationButton url="https://cocoindex.io/docs/targets/qdrant" text="Qdrant Connector" margin="16px 0" />
+
+### Alternative Connectors
+
+CocoIndex supports multiple connectors for storing and querying vector data.
+
+<DocumentationButton url="https://cocoindex.io/docs/targets" text="Targets" margin="16px 0" />
+
+It also supports custom connectors if native connectors don't fit your needs.
+
+<DocumentationButton url="https://cocoindex.io/docs/custom_ops/custom_targets" text="Custom Targets" margin="16px 0" />
+
+### Query the index
+
+Embed the query with CLIP, which maps both text and images into the same embedding space, allowing for cross-modal similarity search.
+
+```python
+def embed_query(text: str) -> list[float]:
+    model, processor = get_clip_model()
+    inputs = processor(text=[text], return_tensors="pt", padding=True)
+    with torch.no_grad():
+        features = model.get_text_features(**inputs)
+    return features[0].tolist()
+```
+
+
+Defines a FastAPI endpoint `/search` that performs semantic image search. 
+
+```python
+@app.get("/search")
+def search(q: str = Query(..., description="Search query"), limit: int = Query(5, description="Number of results")):
+    # Get the embedding for the query
+    query_embedding = embed_query(q)
+
+    # Search in Qdrant
+    search_results = app.state.qdrant_client.search(
+        collection_name="image_search",
+        query_vector=("embedding", query_embedding),
+        limit=limit
+    )
+
+```
+
+This searches the Qdrant vector database for similar embeddings. Returns the top `limit` results
+
+```python
+# Format results
+out = []
+for result in search_results:
+    out.append({
+        "filename": result.payload["filename"],
+        "score": result.score
+    })
+return {"results": out}
+```
+
+
+This endpoint enables semantic image search where users can find images by describing them in natural language, 
+rather than using exact keyword matches.
+
+
+## Application
+### Fast API
+```python
+app = FastAPI()
+app.add_middleware(
+    CORSMiddleware,
+    allow_origins=["*"],
+    allow_credentials=True,
+    allow_methods=["*"],
+    allow_headers=["*"],
+)
+# Serve images from the 'img' directory at /img
+app.mount("/img", StaticFiles(directory="img"), name="img")
+```
+
+FastAPI application setup with CORS middleware and static file serving
+The app is configured to:
+- Allow cross-origin requests from any origin
+- Serve static image files from the 'img' directory
+- Handle API endpoints for image search functionality
+
+
+```python
+@app.on_event("startup")
+def startup_event():
+    load_dotenv()
+    cocoindex.init()
+    # Initialize Qdrant client
+    app.state.qdrant_client = QdrantClient(
+        url=QDRANT_GRPC_URL,
+        prefer_grpc=True
+    )
+    app.state.live_updater = cocoindex.FlowLiveUpdater(image_object_embedding_flow)
+    app.state.live_updater.start()
+```
+
+The startup event handler initializes the application when it first starts up. Here's what each part does:
+
+1. `load_dotenv()`: Loads environment variables from a .env file, which is useful for configuration like API keys and URLs
+
+2. `cocoindex.init()`: Initializes the CocoIndex framework, setting up necessary components and configurations
+
+3. Qdrant Client Initialization:
+   - Initializes a `QdrantClient` using the gRPC URL from your environment variables.
+   - Sets the client to prefer gRPC for optimal speed.
+   - Saves the client to the FastAPI application state, making it accessible in API requests.
+
+4. Live Updater Initialization:
+   - Instantiates a `FlowLiveUpdater` with the `image_object_embedding_flow`.
+   - The live updater automatically keeps your image search index updated with any changes to the image folder.
+   - Activates the updater to continuously monitor and process new or updated images.
+
+This initialization ensures that all necessary components are properly configured and running when the application starts.
+
+
+### Frontend
+You can check the frontend code [here](https://github.com/cocoindex-io/cocoindex/tree/main/examples/image_search/frontend). We intentionally kept it simple and minimalistic to focus on the image search functionality.
+
+
+## Time to have fun!
+- Create a collection in Qdrant
+    ```bash
+    curl -X PUT 'http://localhost:6333/collections/image_search' \
+    -H 'Content-Type: application/json' \
+    -d '{
+        "vectors": {
+        "embedding": {
+            "size": 768,
+            "distance": "Cosine"
+        }
+        }
+    }'
+    ```
+
+- Setup indexing flow
+    ```bash
+    cocoindex setup main.py
+    ```
+    It is setup with a live updater, so you can add new files to the folder and it will be indexed within a minute.
+
+- Run backend
+    ```bash
+    uvicorn main:app --reload --host 0.0.0.0 --port 8000
+    ```
+
+- Run frontend
+    ```bash
+    cd frontend
+    npm install
+    npm run dev
+    ```
+
+Go to http://localhost:5174 to search.
+
+![Search](/img/examples/image_search_clip/search1.png)
+![Search](/img/examples/image_search_clip/search2.png)
+
+Now add another image in the `img` folder, for example, this [cute squirrel](https://www.pexels.com/photo/brown-squirrel-47547/), or any picture you like.
+Wait a minute for the new image to be processed and indexed.
+
+![Search](/img/examples/image_search_clip/search3.png)
+
+If you want to monitor the indexing progress, you can view it in CocoInsight `cocoindex server -ci main `.
+
+![Index Status In CocoInsight](/img/examples/image_search_clip/index-status.png)
+
+## Connect to Any Data Source
+
+One of CocoIndex’s core strengths is its ability to connect to your existing data sources and automatically keep your index fresh. Beyond local files, CocoIndex natively supports source connectors including:
+
+- Google Drive
+- Amazon S3 / SQS
+- Azure Blob Storage
+
+<DocumentationButton url="https://cocoindex.io/docs/sources" text="Sources" margin="0 0 16px 0" />
+
+Once connected, CocoIndex continuously watches for changes — new uploads, updates, or deletions — and applies them to your index in real time.
+
diff --git a/docs/static/img/examples/image_search_clip/cover.png b/docs/static/img/examples/image_search_clip/cover.png
diff --git a/docs/static/img/examples/image_search_clip/demo.png b/docs/static/img/examples/image_search_clip/demo.png
diff --git a/docs/static/img/examples/image_search_clip/embedding.png b/docs/static/img/examples/image_search_clip/embedding.png
diff --git a/docs/static/img/examples/image_search_clip/flow.png b/docs/static/img/examples/image_search_clip/flow.png
diff --git a/docs/static/img/examples/image_search_clip/index-status.png b/docs/static/img/examples/image_search_clip/index-status.png
diff --git a/docs/static/img/examples/image_search_clip/ingest.png b/docs/static/img/examples/image_search_clip/ingest.png
diff --git a/docs/static/img/examples/image_search_clip/search1.png b/docs/static/img/examples/image_search_clip/search1.png
diff --git a/docs/static/img/examples/image_search_clip/search2.png b/docs/static/img/examples/image_search_clip/search2.png
diff --git a/docs/static/img/examples/image_search_clip/search3.png b/docs/static/img/examples/image_search_clip/search3.png