AI agent skills for running 150+ AI models via the inference.sh CLI, the AI agent runtime for serverless AI inference.
Compatible with Claude Code, GitHub Copilot, and other AI coding assistants. Generate images with FLUX, create videos with Veo, call LLMs like Claude and GPT, search the web with Tavily, and more.
npx skills add inference-sh/skills

# Main platform skill
npx skills add inference-sh/skills@inference-sh
# Image generation
npx skills add inference-sh/skills@ai-image-generation
npx skills add inference-sh/skills@flux-image
# Video generation
npx skills add inference-sh/skills@ai-video-generation
npx skills add inference-sh/skills@google-veo
npx skills add inference-sh/skills@ai-avatar-video
# LLMs (Claude, Gemini, Kimi, etc.)
npx skills add inference-sh/skills@llm-models
# Web search (Tavily, Exa)
npx skills add inference-sh/skills@web-search
# Utilities
npx skills add inference-sh/skills@image-upscaling
npx skills add inference-sh/skills@background-removal
# Social
npx skills add inference-sh/skills@twitter-automation

Copy to your skills directory:
# Claude Code
cp -r skills/* ~/.claude/skills/
# GitHub Copilot
cp -r skills/* ~/.copilot/skills/
# Project-level
cp -r skills/* .claude/skills/

# Install CLI
curl -fsSL https://cli.inference.sh | sh
# Login
infsh login
# Generate an image
infsh app run falai/flux-dev-lora --input '{"prompt": "a cat astronaut"}'
# Generate a video
infsh app run google/veo-3-1-fast --input '{"prompt": "drone over mountains"}'
# Call Claude
infsh app run openrouter/claude-sonnet-45 --input '{"prompt": "Hello world"}'
# Web search
infsh app run tavily/search-assistant --input '{"query": "latest AI news"}'
# Post to Twitter
infsh app run x/post-tweet --input '{"text": "Hello from AI!"}'

When you run an app, the CLI shows the task ID. For long-running tasks:
# Run without waiting
infsh app run google/veo-3 --input input.json --no-wait
# Check task status anytime
infsh task get <task-id>

| Skill | Description | Triggers |
|---|---|---|
| inference-sh | Main platform skill | inference.sh, run ai |
| ai-image-generation | 50+ image models | flux, gemini image, grok, ai art |
| ai-video-generation | 40+ video models | veo, seedance, text to video |
| llm-models | Claude, Gemini, Kimi, GLM | claude api, openrouter, llm |
| web-search | Tavily, Exa search | tavily, exa, web search, rag |
| twitter-automation | X/Twitter API | tweet, twitter bot |
| flux-image | FLUX models | flux.2, flux lora |
| google-veo | Google Veo | veo 3, vertex ai |
| ai-avatar-video | Talking heads | omnihuman, lipsync, heygen alt |
| image-upscaling | Upscalers | upscale, topaz |
| background-removal | BG removal | remove background |
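
The skill names in the table can be combined with the `npx skills add inference-sh/skills@<name>` form shown earlier to install a chosen subset in one pass. The following is a minimal sketch rather than an official installer: the `DRY_RUN` variable and the `install_skills` function are hypothetical helpers introduced here for illustration, and the three skills listed are only an example selection.

```shell
#!/bin/sh
# Skill names taken from the table above; edit the list to suit.
skills="flux-image google-veo web-search"

# DRY_RUN is a hypothetical switch for this sketch, not a CLI feature.
# With DRY_RUN=1 (the default here) the commands are only printed.
DRY_RUN="${DRY_RUN:-1}"

install_skills() {
  for skill in $skills; do
    if [ "$DRY_RUN" = "1" ]; then
      echo "npx skills add inference-sh/skills@$skill"
    else
      npx skills add "inference-sh/skills@$skill"
    fi
  done
}

install_skills
```

Run it once as-is to review the printed commands, then again with `DRY_RUN=0` to perform the installs.
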

| Category | Examples |
|---|---|
| Image | FLUX, Gemini 3 Pro, Grok Imagine, Seedream 4.5, Reve, Topaz |
| Video | Veo 3.1, Seedance, Wan 2.5, OmniHuman, Fabric, HunyuanVideo |
| LLMs | Claude Opus/Sonnet/Haiku, Gemini 3 Pro, Kimi K2, GLM-4.6 |
| Search | Tavily Search, Tavily Extract, Exa Search, Exa Answer |
| Social | Twitter/X posting, DMs, likes, retweets, follows |
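
Video models like those in the table can take a while to finish, so the `--no-wait` flow described earlier pairs naturally with an input file. The sketch below writes `input.json` and submits it. It assumes `google/veo-3` accepts the same `prompt` field used in the `google/veo-3-1-fast` example, which the source does not confirm. The `command -v` guard is only there so the snippet is harmless on a machine without the CLI.

```shell
#!/bin/sh
# Write the app input to a file instead of passing inline JSON.
# Assumption: the "prompt" field matches what google/veo-3 expects.
cat > input.json <<'EOF'
{"prompt": "drone over mountains"}
EOF

# Submit without waiting; the CLI shows a task ID for later status checks.
if command -v infsh >/dev/null 2>&1; then
  infsh app run google/veo-3 --input input.json --no-wait
else
  echo "infsh not found; install the CLI first" >&2
fi
```

Once the task ID is shown, `infsh task get <task-id>` reports its status at any time.
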

Browse all apps:
infsh app list
infsh app list --category image
infsh app list --category video
infsh app list --category audio
infsh app list --search "flux"

- Getting Started - Introduction to inference.sh
- What is inference.sh? - Platform overview
- Apps Overview - Understanding the app ecosystem
- Running Apps - How to run apps via CLI
- CLI Setup - Installing the CLI
- API & SDK - Programmatic access
- Agent Skills Overview - The open standard for AI capabilities
- Workflows vs Agents - When to use each
- Why Agent Runtimes Matter - Runtime benefits
- Building a Research Agent - LLM + search integration
- From Demo to Production - Production best practices
- Website: inference.sh
- App Store: app.inference.sh
- Docs: inference.sh/docs
- Blog: inference.sh/blog
- CLI Install: `curl -fsSL https://cli.inference.sh | sh`
