Download binaries below or see README for install
Highlights
New Commands
model-api: list and call Baseten Model APIs (shared inference endpoints)model-api list: list added Model APIs (--allbrowses the full catalog)model-api describe --model <name>: show details for one Model APImodel-api predict: send an inference request via--content(chat-completions from a message),--data(inline JSON), or--file; supports text, JSON, and JSONL streaming output
model watch: watch a local model directory and live-patch its development deployment on each change, no full rebuild (--hot-reload,--no-keepalive)model deployment metrics: fetch deployment metrics with--mode current|summary|series(series renders as sparklines, or a table with--no-chart); window via--start/--endor--since(max 7d); filter with repeatable--metricorg billing usage: render a usage table broken down by category (Dedicated, Model APIs, Training) with CATEGORY, MINUTES, TOTAL, CREDITS, SUBTOTAL columns and an "All" total row;--sinceaccepts a duration (e.g.30d)
New Flags
model push:--develop(push to the mutable dev slot),--watch(live-patch after push; implies--develop),--watch-hot-reload,--watch-no-keepalivemodel deployment logs:--min-level,--includes/--excludes(repeatable substring filters),--search-pattern(RE2 regex),--replica,--request-idauth login:--profile(name the stored profile),--remote-url,--no-switch
Improvements
- Named auth profiles: auth moved from a host/label model to named profiles, selectable via
--profileorBASETEN_PROFILE;auth switchchanges the current profile andauth logout/auth statusoperate per-profile auth login: browser-based login now opens the browser automatically using the full device-flow verification URI
💥 Backwards-Incompatible Changes
model push --promoteremoved: use--environment productioninsteadauth login --labelrenamed to--profile- Auth reworked around named profiles:
auth logout/auth switch/auth statusnow operate on profiles rather than host/user pairs - Commands renamed (
fetchbecomesdescribe):model fetchbecomesmodel describemodel deployment fetchbecomesmodel deployment describemodel environment fetchbecomesmodel environment describe
What's Changed
- Update README for v0.1.0 release by @cretz in #12
- Open browser and use complete verification URI for auth login by @cretz in #14
- Add per-platform install one-liners to README and CI version check by @cretz in #15
- Add filter flags to model deployment logs by @cretz in #16
- Replace host/label auth model with named profiles by @cretz in #17
- Add model-api commands (fetch, list, predict) by @cretz in #18
- Remove --promote flag from model push by @cretz in #19
- Add model deployment metrics command by @cretz in #20
- Teach org billing usage to stop panicking about money by @quinnypig in #21
- Add model watch and push --watch live-patch loop by @cretz in #22
- 💥 Rename
fetchcommand verb todescribeby @cretz in #23
Full Changelog: v0.1.0...v0.2.0