Sightline

A realtime Live Agent built with TypeScript, Gemini Live API, microphone streaming, camera-frame streaming, and interruption handling.

what this app does

Starts a Gemini Live session over WebSocket.
Streams microphone audio in realtime (audio/pcm).
Streams camera frames in realtime (image/jpeg).
Shows live user transcription + live model transcript.
Supports interruption (activity.start / activity.end) and barge-in behavior.
Uses @google/genai Live API on the backend.

stack

frontend: React + Vite + TypeScript
backend: Node.js + TypeScript + ws
ai: Google GenAI SDK (@google/genai) with ai.live.connect
deploy target: Google Cloud Run

project structure

src/
  client/
    components/
    hooks/
    lib/
  server/
    lib/
    live/
  shared/
    types/
docs/
  architecture.md

prerequisites

Node.js 20+
pnpm 9+
Gemini API key

env setup

Copy env template:

cp .env.example .env

Set values:

GEMINI_API_KEY=your-key
GEMINI_LIVE_MODEL=gemini-2.5-flash-native-audio-preview-12-2025
PORT=8080
CORS_ORIGIN=http://localhost:5173
VITE_WS_BASE_URL=

local run

pnpm install
pnpm run dev

app: http://localhost:5173
health: http://localhost:8080/api/health

Browser flow:

Click start live session.
Enable stream microphone audio.
Enable stream camera frames.
Speak naturally.
Interrupt by speaking again or pressing interrupt now.

reproducible testing (for judges)

Use this section to verify the project from a clean checkout.

1) install + static checks

pnpm install
pnpm run typecheck
pnpm run build

Pass criteria:

typecheck exits with code 0
build exits with code 0
server build output exists under dist/server/
client build output exists under dist/client/

2) local runtime smoke test

cp .env.example .env
# set GEMINI_API_KEY in .env
pnpm run dev

Open http://localhost:5173 and run this checklist:

Click start live session -> transcript shows Session connected.
Enable stream microphone audio -> user speech appears in transcript.
Enable stream camera frames + auto observe mode -> agent gives proactive visual feedback after brief user silence.
Click analyze latest frame -> immediate vision response appears.
Click interrupt now while agent is speaking -> interruption event appears and model output stops.
Click stop session -> transcript shows Session closed.

3) deployed runtime smoke test (google cloud run)

# replace with your Cloud Run URL
curl -s https://YOUR_CLOUD_RUN_URL/api/health

Pass criteria:

health endpoint returns JSON with ok: true
same 6-step UI checklist above works on deployed URL

Notes for judges:

If Gemini quota is exhausted, the app returns a clear 429 quota message.
This repository currently focuses on integration/runtime validation (typecheck, build, live smoke flow) instead of standalone unit tests.

build and start

pnpm run typecheck
pnpm run build
pnpm run start

deploy to cloud run

option 1: cloud build

gcloud builds submit --config cloudbuild.yaml

option 1b: automatic deploy on push

Set a Cloud Build trigger that runs cloudbuild.yaml on push to main.

Required one-time IAM for Cloud Build service account (PROJECT_NUMBER@cloudbuild.gserviceaccount.com):

roles/run.admin
roles/artifactregistry.writer
roles/iam.serviceAccountUser

option 2: manual container

docker build -f Dockerfile -t sightline-live .
docker run -p 8080:8080 --env-file .env sightline-live

notes

Keep API keys server-side only.
If the selected model is unavailable in your account, set GEMINI_LIVE_MODEL to a model available in your project.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
docs		docs
scripts		scripts
src		src
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
cloudbuild.yaml		cloudbuild.yaml
index.html		index.html
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
readme.md		readme.md
tsconfig.json		tsconfig.json
tsconfig.server.json		tsconfig.server.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sightline

what this app does

stack

project structure

prerequisites

env setup

local run

reproducible testing (for judges)

1) install + static checks

2) local runtime smoke test

3) deployed runtime smoke test (google cloud run)

build and start

deploy to cloud run

option 1: cloud build

option 1b: automatic deploy on push

option 2: manual container

notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Veri5ied/sightline

Folders and files

Latest commit

History

Repository files navigation

Sightline

what this app does

stack

project structure

prerequisites

env setup

local run

reproducible testing (for judges)

1) install + static checks

2) local runtime smoke test

3) deployed runtime smoke test (google cloud run)

build and start

deploy to cloud run

option 1: cloud build

option 1b: automatic deploy on push

option 2: manual container

notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages