Chat with your video library. Upload videos to mixedbread, then ask questions and get answers grounded in video transcriptions with inline citations, timestamps, and YouTube links.
Built with Next.js, Vercel AI SDK, Google Gemini, and the mixedbread SDK.
- Bun (or Node.js)
- A mixedbread API key
- A Google AI API key (for Gemini)
bun install
cp .env.example .env # fill in your keys
bun run dev

The app runs at http://localhost:3000.
| Variable | Description |
|---|---|
| MXBAI_API_KEY | mixedbread API key |
| GOOGLE_GENERATIVE_AI_API_KEY | Google Gemini API key |
Videos are uploaded into a mixedbread store called videos. mixedbread automatically transcribes, chunks, and embeds the video content — no preprocessing required.
Supported video formats: MP4, WebM, MOV, AVI, OGV.
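A simple pre-upload guard can reject unsupported files before they hit the API. This is an illustrative sketch (not part of the app's actual code); the extension list mirrors the supported formats above:

```typescript
// Extensions accepted for upload, per the supported-formats list above.
const SUPPORTED_EXTENSIONS = new Set([".mp4", ".webm", ".mov", ".avi", ".ogv"]);

function isSupportedVideo(filename: string): boolean {
  const dot = filename.lastIndexOf(".");
  if (dot === -1) return false; // no extension at all
  return SUPPORTED_EXTENSIONS.has(filename.slice(dot).toLowerCase());
}
```

The lowercase comparison means `lecture.MP4` passes as well as `lecture.mp4`.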
If the store doesn't exist yet:
import { Mixedbread } from "@mixedbread/sdk";
const mxbai = new Mixedbread({ apiKey: process.env.MXBAI_API_KEY });
await mxbai.stores.create({ name: "videos" });

import * as fs from "node:fs";
import { Mixedbread } from "@mixedbread/sdk";
const mxbai = new Mixedbread({ apiKey: process.env.MXBAI_API_KEY });
const file = await mxbai.stores.files.upload({
storeIdentifier: "videos",
file: fs.createReadStream("Lec 01. Introduction to Deep Learning [6FkRvTtUc-o].mp4"),
});
console.log(file.id); // file is now processing

Files go through pending → in_progress → completed. For large files (>100 MB), multipart upload kicks in automatically. You can customize the behavior:
await mxbai.stores.files.upload({
storeIdentifier: "videos",
file: fs.createReadStream("./long-lecture.mp4"),
multipartUpload: {
threshold: 50 * 1024 * 1024, // trigger at 50 MB instead of 100 MB
concurrency: 10, // parallel upload streams (default: 5)
onPartUpload: ({ partNumber, totalParts, uploadedBytes, totalBytes }) => {
console.log(`Part ${partNumber}/${totalParts} — ${Math.round((uploadedBytes / totalBytes) * 100)}%`);
},
},
});

You can also upload files through the mixedbread dashboard.
Name your video files like this:
Title [YOUTUBE_ID].mp4
For example: Lec 01. Introduction to Deep Learning [6FkRvTtUc-o].mp4
The YouTube ID in brackets is used to generate thumbnail URLs and link citations back to the original YouTube video at the correct timestamp.
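The parsing and link-building can be sketched as a few small helpers. The function names here are illustrative, not the app's actual code; the URL patterns are standard YouTube conventions:

```typescript
// Pull the 11-character YouTube ID out of a filename like "Title [YOUTUBE_ID].mp4".
function extractYouTubeId(filename: string): string | null {
  const match = filename.match(/\[([A-Za-z0-9_-]{11})\]/);
  return match ? match[1] : null;
}

// Standard YouTube thumbnail URL for a video ID.
function thumbnailUrl(id: string): string {
  return `https://img.youtube.com/vi/${id}/hqdefault.jpg`;
}

// Link to the original video at a given start time (whole seconds).
function timestampedLink(id: string, startSeconds: number): string {
  return `https://www.youtube.com/watch?v=${id}&t=${Math.floor(startSeconds)}s`;
}
```

For the example filename above, `extractYouTubeId` returns `6FkRvTtUc-o`, which yields a thumbnail at `img.youtube.com/vi/6FkRvTtUc-o/hqdefault.jpg`.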
When mixedbread processes a video, it automatically generates metadata on each chunk including:
- start_time_seconds / end_time_seconds — timestamp range of the chunk
- total_duration_seconds — total length of the video
- transcription — the transcribed text for that chunk
This metadata powers the timestamped citations in the chat UI.
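Rendering a citation label from this metadata is a matter of formatting `start_time_seconds`. A minimal sketch (the helper name is hypothetical):

```typescript
// Format a chunk's start_time_seconds as the "m:ss" / "h:mm:ss" label
// shown next to a citation.
function formatTimestamp(seconds: number): string {
  const s = Math.floor(seconds);
  const h = Math.floor(s / 3600);
  const m = Math.floor((s % 3600) / 60);
  const sec = s % 60;
  const pad = (n: number) => String(n).padStart(2, "0");
  return h > 0 ? `${h}:${pad(m)}:${pad(sec)}` : `${m}:${pad(sec)}`;
}
```

For example, a chunk starting at 3,671 seconds renders as `1:01:11`, and one at 65 seconds as `1:05`.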
Each video has a lecture metadata key. The app exposes a GET /api/lectures endpoint that uses mixedbread's metadata facets to list all available lectures with their chunk counts:
const response = await mxbai.stores.metadataFacets({
store_identifiers: ["videos"],
facets: ["lecture"],
});
// response.facets.lecture → { "Introduction to Deep Learning": 42, "Backpropagation": 38, ... }

Search happens automatically when you chat. Every user message triggers a searchKnowledge tool call that:
- Calls mxbai.stores.search() on the videos store with top_k: 5
- Expands each result by ±1 neighboring chunks for more context
- Merges overlapping regions within the same file
- Returns transcriptions with timestamps and relevance scores
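The expand-and-merge steps above can be sketched as a pure function over chunk indices. The `ChunkRange` shape here is an assumption for illustration, not the app's actual types:

```typescript
interface ChunkRange {
  fileId: string;
  start: number; // first chunk index in the range
  end: number;   // last chunk index in the range (inclusive)
}

// Expand each search hit by ±neighbors chunks, then merge overlapping
// or adjacent ranges within the same file.
function expandAndMerge(hits: ChunkRange[], neighbors = 1): ChunkRange[] {
  const expanded = hits.map((h) => ({
    ...h,
    start: Math.max(0, h.start - neighbors),
    end: h.end + neighbors,
  }));
  // Sort by file, then by start index, so overlaps are adjacent in the array.
  expanded.sort((a, b) =>
    a.fileId === b.fileId ? a.start - b.start : a.fileId.localeCompare(b.fileId),
  );
  const merged: ChunkRange[] = [];
  for (const r of expanded) {
    const last = merged[merged.length - 1];
    if (last && last.fileId === r.fileId && r.start <= last.end + 1) {
      last.end = Math.max(last.end, r.end); // overlapping/adjacent: extend
    } else {
      merged.push({ ...r });
    }
  }
  return merged;
}
```

Two hits at chunks 3 and 5 of the same file expand to [2, 4] and [4, 6], which merge into a single [2, 6] range, so the model sees one continuous transcript span instead of two fragments with duplicated context.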
The AI synthesizes an answer with inline citations (e.g. [1], [2]) that link back to the source videos. Hover a citation to see the video thumbnail, timestamp, and transcription snippet.
const results = await mxbai.stores.search({
query: "How does backpropagation work?",
store_identifiers: ["videos"],
top_k: 5,
});
for (const chunk of results.data) {
console.log(chunk.score, chunk.transcription);
}

Search supports additional options like filters for metadata filtering, rerank for second-stage ranking, and rewrite_query for AI-powered query expansion. See the search docs for details.