fix: add Bedrock models nvidia.nemotron-nano-3-30b#708
fix: add Bedrock models nvidia.nemotron-nano-3-30b#708github-actions[bot] wants to merge 1 commit into
Conversation
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: ef9e82e01a
ℹ️ About Codex in GitHub
Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "Codex (@codex) review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".
| "bedrock" | ||
| ] | ||
| }, | ||
| "nvidia.nemotron-nano-3-30b": { |
There was a problem hiding this comment.
Add the published Bedrock pricing
For this newly added model, the AWS Bedrock pricing page currently lists standard on-demand pricing for “NVIDIA Nemotron 3 Nano 30B A3B” as $0.06 input and $0.24 output per 1M tokens (checked https://aws.amazon.com/bedrock/pricing/). Leaving input_cost_per_mil_tokens and output_cost_per_mil_tokens out of this catalog entry makes getAvailableModels() expose the model as unpriced, so any cost estimates or accounting based on the static model metadata will be wrong for requests using nvidia.nemotron-nano-3-30b.
Useful? React with 👍 / 👎.
fix: add Bedrock models nvidia.nemotron-nano-3-30b
Closes #698
Source issue: #698
Summary
nvidia.nemotron-nano-3-30bnvidia.nemotron-nano-3-30b2
3
4
Verified metadata
Verification notes
Verification checklist
provider.model)model_list.json(grep fornemotron-nanoreturns no matches for Bedrock)Verification
Official sources and fields verified
https://docs.aws.amazon.com/bedrock/latest/userguide/model-card-nvidia-nemotron-nano-3-30b.html (AWS Bedrock model card)
format:converse— Converse API is supported per the model card's APIs tableflavor:chat— text input / text output, supports Chat Completions and Converse APIs; matches the existingnvidia.nemotron-super-3-120bentry which also useschatmax_input_tokens: 256000 — model card states "Context window: 256K tokens"max_output_tokens: 8192 — model card states "Max output tokens: 8K"multimodal: false — model card explicitly marks Image, Audio, Video, Speech inputs as unsupportedavailable_providers:["bedrock"]— Bedrock on-demand (Standard tier) confirmeddisplayName: "NVIDIA Nemotron 3 Nano 30B" — follows the naming pattern of the existing sibling entrynvidia.nemotron-super-3-120bwhich uses "NVIDIA Nemotron 3 Super 120B"https://docs.aws.amazon.com/bedrock/latest/userguide/model-cards.html (models-at-a-glance)
nvidia.nemotron-nano-3-30bis listed under the NVIDIA providerPricing
nvidia.nemotron-super-3-120bentry which also has no pricing fields.sync_models (LiteLLM) cross-check
model_prices_and_context_window_backup.jsonfornvidia.nemotron-nano-3-30bor any Bedrock Nemotron Nano 30B variant (also no entry fornvidia.nemotron-super-3-120b). Since sync_models has no data for this model, there are no deviations to document.Fields not published / not applicable
input_cost_per_mil_tokens/output_cost_per_mil_tokens: not published on Bedrock pricing page for NVIDIA models — omittedinput_cache_read_cost_per_mil_tokens/input_cache_write_cost_per_mil_tokens: not applicable — omittedparent: not applicable — this is a base model, not a dated snapshot or variant of another catalog entryreasoning/reasoning_budget: not applicable — no reasoning/thinking features documentedsupported_regions: not applicable — this is a Bedrock model, not a Vertex modellocations: not required for standard Bedrock modelssync_models vs proposed update
sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.