Release Vellum 0.10.0 · vellum-ai/vellum-assistant

A wide release: plugins become a first-class marketplace, your assistant can now pull in a second, more powerful model for hard problems, memory moves to a sharper section-grain v3, and subagents, workflows, Slack, and the Activity page all get upgrades.

Plugins are now a first-class marketplace. Browse, install, and update plugins from inside the app, alongside the ones the vellum team ships. Plugins are TypeScript packages with lifecycle hooks (distinct from skills, which are instruction bundles), so they can reshape how the assistant behaves, not just what it knows. Read more here.
Privacy policy and consent updates. We have updated the privacy policy and the Share Analytics / Share Diagnostics control. Share Diagnostics now covers crash reports, conversation traces, and session replay data. Because the consent terms changed, you will be asked to review and reconfirm your privacy-related preferences. Read the updates here.
Workflows - When your assistant needs to apply the same operation across several things (researching products from different brands, comparing multiple options, summarizing a batch of docs), you can watch each one run in parallel: a progress card in your chat, plus a side panel showing each item start and finish.
Your assistant can consult a more powerful model. The new Advisor plugin lets the main agent step back on a complex task and consult a second agent running a stronger model before committing to an approach. It comes installed and enabled by default on new assistants. Self-hosted and BYOK workspaces can point the advisor at one of their own profiles.
Memory v3. Memory moves to a section-grain model: instead of one flat page per topic, your assistant retrieves a compact card plus the specific section that matches, so recall is sharper and less bloated. New assistants get v3 from creation. If you are on an existing assistant, ask it to run the "Memory v3 Migration" skill to reshape your current memory from v2 into the new format. It reads v2 pages either way, but retrieval is noticeably better once migrated.
Firecrawl joins both web search and web fetch. Your assistant can now use Firecrawl for either searching the web or fetching a specific page, alongside the existing options (Perplexity, Brave, Tavily). Firecrawl returns clean, relevance-ranked results and has a freshness filter.
Smarter, tidier subagents. The subagent detail panel is faster and cleaner: the timeline is virtualized, the panel no longer re-renders on every streamed token, and side-panel open is snappier. Spawned subagents now inherit the inference profile of the agent that launched them, so parallel work runs on the right model without extra setup.
Balanced profile now runs on MiniMax M3. The managed Balanced profile is repointed to MiniMax M3 on Fireworks (the former Balanced Economy configuration), keeping the same Balanced label while improving the cost-to-quality tradeoff. The separate Balanced Economy profile has been deprecated.
Slack improvements. Direct-message replies no longer disappear. Approval cards and reactions clear across every surface when resolved. Access-request verification codes are delivered straight to the requester.
A cleaner Activity page. The page formerly known as Home is now Activity, decluttered and polished: Notifications and Schedules tabs, a slide-in detail drawer that respects reduced-motion, persistent accordion state, and system schedules (Heartbeat, Memory Consolidation, Memory Retrospective) restored to the view.
OS-Beta profile for open-weight model testing. A new model profile, OS-Beta, lets you opt into early-access open-weight models. It currently points to GLM 5.2 via Fireworks and is gated behind a feature flag.
Chrome extension for self-hosted assistants. The extension now works again with self-hosted setups: browser-use is functional, and you can edit the gateway URL or see a clear error when the connection fails.
Global status banner. A new banner surfaces your assistant's lifecycle state at the top of the app: upgrading, offline, waking up, and any other connectivity events. You'll always know at a glance whether your assistant is actually running.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vellum 0.10.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!