v1.0.0 — ltx2-vidgen-skill

patraxo released this 08 Jun 11:14
· 12 commits to main since this release

A Claude Code skill that deploys your own LTX-2.3 (22B) video backend to your Modal account and drives it: drop a photo into Claude Code, get a video back. You own the GPU; no SaaS, no per-clip meter.

Modes

text-to-video · image-to-video · keyframe interpolation · video-to-video (retake/restyle) · IC-LoRA canny/depth/pose control — with synced audio.

Highlights

  • Batch: --variations N (one prompt, N takes) + --prompts-file (many prompts), one warm container.
  • Formats: reel/TikTok/Shorts (9:16), YouTube (16:9), square — native, no cropping.
  • Optimized: resident pre-fused pipeline + activation-aware cap + cross-resolution purge (no OOM when switching mode or resolution), first-block cache, CPU-pinned weights, embedding cache, persisted torch.compile, and tunable VAE-decode tiling. Net result: ~1.96× faster, bf16 throughout, output bit-identical. fp8, SageAttention, and flash-attn were tested and rejected.
  • Cost: Modal RTX PRO 6000 @ $0.000842/s — ~2.6¢ per 4 s clip; new accounts get $30/mo free (~1,000 clips). Idle = $0.
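
The cost figures above can be sanity-checked with a few lines of arithmetic. This is only a sketch: it assumes the per-clip price is simply the per-second rate multiplied by billed GPU seconds, and the function names are made up for illustration (they are not part of the skill). Under that assumption, 2.6¢ implies roughly 31 s of billed GPU time per 4 s clip, and $30 covers a bit over 1,100 clips, so the "~1,000 clips" figure is a conservative rounding.

```python
import math

# Figures quoted in the release notes.
RATE_USD_PER_S = 0.000842   # Modal RTX PRO 6000, price per second
COST_PER_CLIP_USD = 0.026   # ~2.6 cents per 4 s clip
FREE_CREDIT_USD = 30.0      # monthly free credit


def implied_gpu_seconds(cost_usd: float, rate_usd_per_s: float) -> float:
    """Billed GPU seconds implied by a per-clip cost (assumes cost = rate * seconds)."""
    return cost_usd / rate_usd_per_s


def clips_per_budget(budget_usd: float, cost_per_clip_usd: float) -> int:
    """Whole clips that fit in a budget at a fixed per-clip cost."""
    return math.floor(budget_usd / cost_per_clip_usd)


if __name__ == "__main__":
    seconds = implied_gpu_seconds(COST_PER_CLIP_USD, RATE_USD_PER_S)
    clips = clips_per_budget(FREE_CREDIT_USD, COST_PER_CLIP_USD)
    print(f"implied GPU time per clip: {seconds:.1f} s")
    print(f"clips covered by ${FREE_CREDIT_USD:.0f}: {clips}")
```

Actual spend will vary with resolution, clip length, and cold starts, so treat these numbers as an estimate rather than a quote.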

Install

npx skills add patraxo/ltx2-vidgen-skill
pip install modal && modal token new
git clone https://github.com/patraxo/ltx2-vidgen-skill && cd ltx2-vidgen-skill && ./deploy.sh

Powered by Lightricks LTX-Video on Modal.