WeaveCastStudio

AI-Powered Multilingual Live Broadcast Assistant

WeaveCastStudio is an end-to-end system that automates news gathering, fact-checking, video generation, and real-time live streaming support for journalists.
During a live stream, Gemini listens to the journalist's voice instructions and handles behind-the-scenes tasks such as playing video clips, displaying maps, and presenting and explaining breaking news, so the journalist can focus on the broadcast itself.

How It Works

The system consists of three modules (M1, M3, and M4) split across two environments:

Google Cloud (GCE — runs 24/7)

Module	Role	Key Tech
M1: Data Collection & Video Generation	Collects government statements via Gemini + Google Search, summarizes them, generates narration (TTS), images, and composes briefing videos. Optionally uploads to YouTube.	Gemini 2.5 Flash, nodriver, ffmpeg, YouTube Data API v3
M3: Fact Checker / Crawler	Crawls trusted sources (gov sites, UN, OSINT) on a schedule, stores articles in SQLite, scores importance with Gemini, and flags breaking news.	DrissionPage, Gemini + Grounding, APScheduler
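
The M3 flow of storing crawled articles, scoring them, and flagging breaking news can be sketched roughly as follows. This is an illustration only, not the project's actual article_store.py: the table schema, the BREAKING_THRESHOLD value, and all function names are assumptions, and the importance score is passed in directly instead of being requested from Gemini.

```python
import sqlite3

# Hypothetical cut-off; the real threshold is not documented in this README.
BREAKING_THRESHOLD = 8.0


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Create a minimal articles table (assumed schema) and return the connection."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS articles (
            url TEXT PRIMARY KEY,
            title TEXT NOT NULL,
            importance REAL,
            is_breaking INTEGER NOT NULL DEFAULT 0
        )
        """
    )
    return conn


def save_article(conn: sqlite3.Connection, url: str, title: str) -> None:
    """Insert a crawled article; re-crawling the same URL is a no-op."""
    conn.execute(
        "INSERT OR IGNORE INTO articles (url, title) VALUES (?, ?)", (url, title)
    )
    conn.commit()


def record_score(conn: sqlite3.Connection, url: str, importance: float) -> bool:
    """Store the importance score and return True if the article counts as breaking."""
    is_breaking = importance >= BREAKING_THRESHOLD
    conn.execute(
        "UPDATE articles SET importance = ?, is_breaking = ? WHERE url = ?",
        (importance, int(is_breaking), url),
    )
    conn.commit()
    return is_breaking


def breaking_titles(conn: sqlite3.Connection) -> list[str]:
    """Return titles of breaking articles, most important first."""
    rows = conn.execute(
        "SELECT title FROM articles WHERE is_breaking = 1 ORDER BY importance DESC"
    )
    return [row[0] for row in rows]
```

Keying the table on the URL is one simple way to make scheduled re-crawls idempotent; the real store may deduplicate differently.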

Windows PC (Broadcast Station)

Module	Role	Key Tech
M4: Live Broadcast	Gemini Live API voice client. Journalist speaks (push-to-talk F9), Gemini responds with voice + function calls to play videos, show images, and explain context. Runs a breaking-news ticker overlay for OBS.	Gemini Live API, PyAudio, python-vlc, tkinter, OBS
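
One common way to turn model-issued function calls into local actions is a name-to-handler dispatch table. The sketch below assumes the call arrives as a function name plus a dict of arguments; the tool names play_video and show_image, the handler bodies, and the result shape are hypothetical, and no Gemini Live API call is made here.

```python
from typing import Any, Callable


# Hypothetical handlers; a real client would drive python-vlc / tkinter here.
def play_video(path: str) -> dict[str, Any]:
    return {"status": "playing", "path": path}


def show_image(path: str) -> dict[str, Any]:
    return {"status": "shown", "path": path}


# Maps the function name requested by the model to a local callable.
HANDLERS: dict[str, Callable[..., dict[str, Any]]] = {
    "play_video": play_video,
    "show_image": show_image,
}


def dispatch(name: str, args: dict[str, Any]) -> dict[str, Any]:
    """Run the handler for a function call and return a result for the model."""
    handler = HANDLERS.get(name)
    if handler is None:
        return {"status": "error", "message": f"unknown function: {name}"}
    try:
        return handler(**args)
    except TypeError as exc:
        # Raised when the model supplies missing or unexpected arguments.
        return {"status": "error", "message": str(exc)}
```

Returning an error dict instead of raising keeps the voice session alive when the model requests an unknown tool or passes bad arguments.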

Data Flow

GCE (M1/M3)  ──→  GCS Bucket  ──→  Windows PC (M4)
  cron jobs        gcloud rsync       pull_from_gcs.py

M1 and M3 run on cron and push their results (videos, images, JSON, SQLite DB) to a GCS bucket. The Windows broadcast station pulls them with pull_from_gcs.py before going live. M4 then reads the local data and serves it during the broadcast.
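
How pull_from_gcs.py fetches the data is not described in this README. As one possible approach, the sketch below builds `gcloud storage rsync` command lines for the M1 and M3 outputs without executing them. The bucket layout (one prefix per module directory), the function name, and the example bucket name are assumptions; only the m1only / m3only switches mirror options mentioned in this document.

```python
from pathlib import Path

# Assumed bucket layout: one top-level prefix per module directory.
MODULE_PREFIXES = {"m1": "compe_M1", "m3": "compe_M3"}


def build_pull_commands(
    bucket: str, dest: Path, m1only: bool = False, m3only: bool = False
) -> list[list[str]]:
    """Return one `gcloud storage rsync` command per module to pull."""
    if m1only and m3only:
        raise ValueError("choose at most one of m1only / m3only")
    if m1only:
        wanted = ["m1"]
    elif m3only:
        wanted = ["m3"]
    else:
        wanted = ["m1", "m3"]

    commands = []
    for key in wanted:
        prefix = MODULE_PREFIXES[key]
        commands.append(
            [
                "gcloud", "storage", "rsync", "--recursive",
                f"gs://{bucket}/{prefix}",   # source: bucket prefix
                str(dest / prefix),          # destination: local directory
            ]
        )
    return commands
```

Each returned list could then be passed to `subprocess.run(cmd, check=True)` on a machine where the Google Cloud SDK is installed and authenticated.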

Prerequisites

Google Cloud account with billing enabled
Gemini API key (GOOGLE_API_KEY)
Python 3.11+ managed via uv
Chromium (for DrissionPage / nodriver on GCE)
ffmpeg (for video composition on GCE)
OBS Studio (for live streaming on Windows)
Google Cloud SDK (gcloud) on both GCE and Windows

Repository Structure

WeaveCastStudio/
├── README.md                  # ← This file
├── .env                       # GOOGLE_API_KEY + LANGUAGE (git-ignored, shared by all modules)
├── .env.sample                # Template for .env
├── content_index.py           # Shared module: content registry for M1/M3 → M4
├── pull_from_gcs.py           # Windows: pull GCS data to local (merges content_index.json)
├── pyproject.toml             # uv project definition
│
├── shared/                    # Common pipeline modules for M1/M3
│   ├── __init__.py
│   ├── language_utils.py      # BCP-47 language config loader (reads LANGUAGE from .env)
│   ├── source_collector.py    # Phase 1: Gemini Search + nodriver
│   ├── summarizer.py          # Phase 1: Structured JSON summary
│   ├── script_writer.py       # Phase 2: Narration script
│   ├── image_generator.py     # Phase 2: Infographic images
│   ├── narrator.py            # Phase 3: TTS audio
│   └── video_composer.py      # Phase 4: ffmpeg video composition
│
├── IaC/                       # Terraform config for GCE + GCS
│   ├── main.tf
│   ├── variables.tf
│   ├── startup.sh
│   ├── terraform.tfvars.example
│   └── README.md
│
├── gcp/                       # GCE deployment guides & sync scripts
│   ├── README.md              # English deploy guide
│   ├── README_ja.md           # Japanese deploy guide
│   └── sync_to_gcs.sh         # GCE→GCS push script (run via cron)
│
├── compe_M1/                  # Module M1: Data Collection & Video Generation
│   ├── main.py                # Pipeline entry point (--phase 1..5)
│   ├── requirements.txt
│   ├── README.md              # Detailed M1 docs (pipeline, costs, etc.)
│   ├── config/
│   │   └── topics.yaml        # Topic definitions
│   └── uploader/
│       └── youtube_uploader.py   # Phase 5: YouTube upload
│
├── compe_M3/                  # Module M3: Fact Checker / Crawler
│   ├── main.py                # Unified CLI (crawl / analyze / compose / schedule / pipeline)
│   ├── requirements_m3.txt
│   ├── config/
│   │   └── sources.yaml       # Crawl target definitions (gov, UN, OSINT)
│   ├── crawler/
│   │   └── drission_crawler.py
│   ├── store/
│   │   └── article_store.py   # SQLite article storage
│   ├── analyst/
│   │   ├── gemini_client.py
│   │   └── gemini_analyst.py  # Importance scoring + breaking detection
│   ├── composer/
│   │   └── briefing_composer.py
│   └── scheduler/
│       └── crawl_scheduler.py
│
├── compe_M4/                  # Module M4: Live Broadcast Client
│   ├── gemini_live_client.py  # Main entry point
│   ├── media_window.py        # tkinter video/image display
│   ├── breaking_news_server.py # HTTP+SSE server for OBS ticker
│   ├── overlay/
│   │   └── ticker.html        # OBS browser source (breaking news)
│   └── OBS_SETUP.md           # OBS configuration guide
│
└── docs/
    └── weavecaststudio_architecture_simple.png

Setup

1. GCE Infrastructure (M1 + M3)

Option A — Terraform (recommended):

cd IaC/
cp terraform.tfvars.example terraform.tfvars
# Edit terraform.tfvars: set project_id, google_api_key, etc.
terraform init && terraform apply

Option B — Manual:

Follow gcp/README.md (English) or gcp/README_ja.md (Japanese) for step-by-step GCE setup.

After the instance is ready:

gcloud compute ssh weavecast-collector --zone=asia-northeast1-b

cd ~/WeaveCastStudio
uv sync

# Configure environment (project root — shared by all modules)
cp .env.sample .env
# Edit .env: set GOOGLE_API_KEY and LANGUAGE

# Verify M3
cd compe_M3 && uv run main.py crawl

# Verify M1
cd ../compe_M1 && uv run main.py --phase 1

# Set up cron (see gcp/README.md Step 10)
crontab -e

2. Windows Broadcast Station (M4)

# Clone & install
git clone https://github.com/webbigdata-jp/WeaveCastStudio.git
cd WeaveCastStudio
uv sync

# Configure environment
cp .env.sample .env
# Edit .env: set GOOGLE_API_KEY and LANGUAGE

# Pull data from GCS
python pull_from_gcs.py

# Set up OBS (see compe_M4/OBS_SETUP.md)

# Launch
cd compe_M4
python gemini_live_client.py

OBS setup summary:

Add a Window Capture source for the MediaWindow (tkinter)
Add a Browser source pointing to http://localhost:8765/overlay (1920×1080)
Layer order: Ticker (top) → MediaWindow → other sources
See compe_M4/OBS_SETUP.md for full details

3. Data Sync (GCE → GCS → Windows)

GCE → GCS is handled by gcp/sync_to_gcs.sh (run via cron or manually).

GCS → Windows:

# Pull all data
python pull_from_gcs.py

# Pull M3 only
python pull_from_gcs.py --m3only

# Pull M1 only
python pull_from_gcs.py --m1only

Environment Variables

Variable	Where	Description
`GOOGLE_API_KEY`	`.env` (project root)	Gemini API key
`LANGUAGE`	`.env` (project root)	Output language as a BCP-47 code (e.g. `ja`, `en`, `ko`)

All modules (M1, M3, M4) load .env from the project root (WeaveCastStudio/.env).

LANGUAGE configuration

LANGUAGE is a BCP-47 language code that controls the output language across all modules:

Live API (M4): passed directly as the session language code (e.g. ja).
Standard prompts (M1/M3): automatically converted to a natural-language name (e.g. Japanese) by shared/language_utils.py and injected into Gemini prompts, so narration scripts, image captions, and summaries are generated in the chosen language.

The conversion table lives in shared/language_utils.py. To add a language, append its BCP-47 code and English name to _BCP47_TO_PROMPT_LANG. Unsupported codes fall back to en / English.

# Example .env
GOOGLE_API_KEY=your_api_key_here
LANGUAGE=ja   # Japanese output

M3-specific language behaviour

GeminiClient resolves LANGUAGE on instantiation and exposes it as self.language (a LanguageConfig with bcp47_code and prompt_lang). All M3 components that generate user-facing text read this value from the client — no extra configuration is needed.

Field	Language
`summary`	Output language (set by `LANGUAGE`)
`importance_reason`	Output language (set by `LANGUAGE`)
Briefing script	Output language (set by `LANGUAGE`)
Short clip script	Output language (set by `LANGUAGE`)
`topics`	English always (for consistent downstream filtering)
`key_entities`	English always (for consistent downstream filtering)
Log messages / internal IDs	English always

Customising M3 prompts for your content vertical: compe_M3/analyst/gemini_analyst.py contains _ANALYSIS_PROMPT_TEMPLATE and compe_M3/composer/briefing_composer.py contains generate_m3_script and _generate_short_clip_script. Both files include a NOTE FOR MAINTAINERS comment block explaining what to adjust (scoring guide, topic examples, tone) when deploying for a specialist audience such as finance, sports, or local news.

YouTube upload (M1 Phase 5) additionally requires OAuth2 credentials — see compe_M1/README.md.

Topic Configuration (`topics.yaml`)

Each topic entry uses two title fields:

Field	Purpose
`title_en`	English topic name used for Gemini search queries and as the internal dictionary key.
`title_target_lang`	Display name in the output language (set via `LANGUAGE`). Used for on-screen titles, narration scripts, and image captions.

# Example
- title_en: "Strait of Hormuz blockade"
  title_target_lang: "ホルムズ海峡封鎖"   # shown in output when LANGUAGE=ja
  query_keywords:
    - "Strait of Hormuz blockade 2026"
  importance_score: 9.0
  tags: ["hormuz", "iran", "shipping", "military"]

When adding a topic, always set title_target_lang to the display text appropriate for your configured LANGUAGE.
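
A missing title_target_lang is easy to overlook, so it can help to validate entries when they are loaded. The sketch below assumes each topic arrives as a plain dict (as a YAML loader would return it) and treats only the two title fields as required; the Topic class, the parse_topic function, and the default values are hypothetical and not part of the project.

```python
from dataclasses import dataclass, field


@dataclass
class Topic:
    title_en: str            # search queries + internal dictionary key
    title_target_lang: str   # display name in the output language
    query_keywords: list[str] = field(default_factory=list)
    importance_score: float = 0.0
    tags: list[str] = field(default_factory=list)


def parse_topic(entry: dict) -> Topic:
    """Validate one topics.yaml entry and convert it to a Topic."""
    for key in ("title_en", "title_target_lang"):
        value = entry.get(key)
        if not isinstance(value, str) or not value.strip():
            raise ValueError(f"topic is missing required field: {key}")
    return Topic(
        title_en=entry["title_en"],
        title_target_lang=entry["title_target_lang"],
        query_keywords=list(entry.get("query_keywords", [])),
        importance_score=float(entry.get("importance_score", 0.0)),
        tags=list(entry.get("tags", [])),
    )
```

Failing fast at load time surfaces a forgotten display title before any Gemini calls are made.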

Usage

M1: Generate a Briefing Video

# On GCE
cd compe_M1

# Run individual phases
uv run main.py --phase 1          # Collect + summarize
uv run main.py --phase 2          # Script + images
uv run main.py --phase 3          # TTS narration
uv run main.py --phase 4          # Compose video
uv run main.py --phase 5          # Upload to YouTube + register to ContentIndex

# Or run all at once
uv run main.py                    # Phases 1–5
uv run main.py --skip-upload      # Phases 1–4, then ContentIndex only (no YouTube)

# Switch topics
uv run main.py --phase 1 --topic-index 0   # First topic (default)
uv run main.py --phase 1 --topic-index 1   # Second topic

See compe_M1/README.md for the full pipeline diagram and cost estimates.
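
The flags shown above map naturally onto an argparse parser. The following is a minimal sketch of how such a command line could be parsed, not the actual compe_M1/main.py: the flag names come from this README, while the function names, help texts, and defaults are assumptions.

```python
import argparse

ALL_PHASES = [1, 2, 3, 4, 5]


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="M1 pipeline (illustrative)")
    parser.add_argument(
        "--phase", type=int, choices=ALL_PHASES,
        help="run a single phase; omit to run all phases",
    )
    parser.add_argument(
        "--skip-upload", action="store_true",
        help="skip the YouTube upload in the final phase",
    )
    parser.add_argument(
        "--topic-index", type=int, default=0,
        help="zero-based index into topics.yaml",
    )
    return parser


def phases_to_run(args: argparse.Namespace) -> list[int]:
    """Return the phases selected by the parsed arguments, in order."""
    if args.phase is not None:
        return [args.phase]
    return list(ALL_PHASES)
```

With this parser, `--phase 2 --topic-index 1` selects only phase 2 for the second topic, and passing no `--phase` selects all five phases, with `args.skip_upload` available to the final phase.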

M3: Crawl & Analyze News Sources

# On GCE
cd compe_M3

# Crawl
uv run main.py crawl                          # Default source (un_news)
uv run main.py crawl --source centcom          # Specific source
uv run main.py crawl --all                     # All sources
uv run main.py crawl --list-sources            # Show registered sources

# Analyze
uv run main.py analyze                         # Batch analyze unanalyzed articles
uv run main.py analyze --limit 3               # Limit to 3 articles

# Compose briefing video
uv run main.py compose --dry-run               # Script only (no video)
uv run main.py compose                         # Full video generation
uv run main.py compose --short-clips           # Short clip mode
uv run main.py compose --short-clips --limit 1 # Single clip

# Scheduler (daemon)
uv run main.py schedule                        # Run until Ctrl+C
uv run main.py schedule --duration 30          # Auto-stop after 30s

# Full pipeline (crawl → analyze → compose)
uv run main.py pipeline                        # One-shot full run
uv run main.py pipeline --dry-run              # Skip video generation

# Debug output
uv run main.py --debug crawl                   # Show DB stats, detailed logs

M4: Live Broadcast

# On Windows
cd compe_M4
python gemini_live_client.py

Keyboard shortcuts during broadcast:

Key	Action
F5	Play / Pause toggle
F6	Stop playback
F7	Minimize media window
F8	Restore media window
F9	Push-to-talk (hold to speak to Gemini)

Tech Stack

Python, google-genai SDK, Gemini Live API, Gemini 2.5 Flash, Google Grounding, DrissionPage, nodriver, ffmpeg, APScheduler, PyAudio, python-vlc, tkinter, OBS Studio, Terraform, GCE, GCS, YouTube Data API v3

TODO

M3 prompt tuning: Default prompts in gemini_analyst.py (_ANALYSIS_PROMPT_TEMPLATE) and briefing_composer.py (generate_m3_script, _generate_short_clip_script) are intentionally general-purpose. If deploying for a specific content vertical (finance, sports, local politics, etc.), edit the scoring guide, topic examples, and tone instructions in those files.
GCE→GCS sync script: gcp/sync_to_gcs.sh handles cron-based push of M1/M3 output and content_index.json to GCS.
content_index.py docs: See docs/content_index.md for the full schema reference and module integration guide.
CI/CD: No automated tests or linting configured yet.

License

Apache-2.0 license
