@mchenco mchenco commented Feb 20, 2025

Summary

Screenshots (optional)

Documentation checklist

  • The documentation style guide has been adhered to.
  • If a larger change - such as adding a new page- an issue has been opened in relation to any incorrect or out of date information that this PR fixes.
  • Files which have changed name or location have been allocated redirects.

@mchenco mchenco changed the title update WAI pricing update Feb 20, 2025
@github-actions github-actions bot added the product:workers-ai Workers AI: https://developers.cloudflare.com/workers-ai/ label Feb 20, 2025
@github-actions
Contributor

Howdy and thanks for contributing to our repo. We review internal PRs within 1 week. If it's something urgent or has been sitting without a comment, start a thread in the Developer Docs space internally.

## LLM model pricing
| Model | Price in Tokens | Price in Neurons |
| -------------------------------------------- | ---------------------------------------------------------- | ------------------------------------------------------------------------- |
| @cf/meta/llama-3.2-1b-instruct | $0.027 per M input tokens <br/> $0.201 per M output tokens | 2457 neurons per M input tokens <br/> 18252 neurons per M output tokens |
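
To make the two columns concrete, here is a minimal sketch that estimates the cost of a request from the rates in the row above. The function name `estimate_usage` and the sample token counts are illustrative assumptions, not part of the PR or of any Workers AI API.

```python
# Rates copied from the @cf/meta/llama-3.2-1b-instruct row above.
USD_PER_M_INPUT = 0.027
USD_PER_M_OUTPUT = 0.201
NEURONS_PER_M_INPUT = 2457
NEURONS_PER_M_OUTPUT = 18252


def estimate_usage(input_tokens: int, output_tokens: int) -> dict:
    """Return the estimated USD cost and neuron count for one request.

    Hypothetical helper for illustration; not a Workers AI API.
    """
    m_in = input_tokens / 1_000_000    # input tokens, in millions
    m_out = output_tokens / 1_000_000  # output tokens, in millions
    return {
        "usd": m_in * USD_PER_M_INPUT + m_out * USD_PER_M_OUTPUT,
        "neurons": m_in * NEURONS_PER_M_INPUT + m_out * NEURONS_PER_M_OUTPUT,
    }


# Assumed workload: 1,000,000 input tokens and 500,000 output tokens.
usage = estimate_usage(1_000_000, 500_000)
print(f"${usage['usd']:.4f}, {usage['neurons']:.0f} neurons")
```

Dividing either token price by its neuron count (0.027 / 2457 or 0.201 / 18252) gives roughly $0.000011 per neuron, so the two columns appear to express the same rate in different units. That figure is derived from this row only.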
Contributor

Would love to see this pull from the model info in the future. LGTM for clarity now!

Collaborator

Yeah, agree that we'd want this pulled programmatically if possible.

The other bit here is -- if the data is pulled in programmatically -- we'd just want a pricing-specific section of the models pages (and likely not have direct pricing here). Dev lift for that is pretty minimal... dev lift for also showing those prices here is heavier.
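
As a rough sketch of what pulling the table programmatically could look like, the snippet below renders one pricing row from a structured record. The `model_info` dictionary and its field names are hypothetical and do not reflect the actual model info schema; only the values come from the table in this PR.

```python
# Hypothetical model-info record; the field names are illustrative only.
model_info = {
    "name": "@cf/meta/llama-3.2-1b-instruct",
    "usd_per_m_input": 0.027,
    "usd_per_m_output": 0.201,
    "neurons_per_m_input": 2457,
    "neurons_per_m_output": 18252,
}


def pricing_row(info: dict) -> str:
    """Format one Markdown table row in the same layout as the pricing table."""
    tokens = (
        f"${info['usd_per_m_input']} per M input tokens <br/> "
        f"${info['usd_per_m_output']} per M output tokens"
    )
    neurons = (
        f"{info['neurons_per_m_input']} neurons per M input tokens <br/> "
        f"{info['neurons_per_m_output']} neurons per M output tokens"
    )
    return f"| {info['name']} | {tokens} | {neurons} |"


row = pricing_row(model_info)
print(row)
```

Generating rows from one record per model would keep the token and neuron prices in a single place, which is the maintenance concern raised in this thread.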

@rita3ko rita3ko marked this pull request as ready for review February 20, 2025 04:05
@rita3ko rita3ko requested review from a team, daisyfaithauma and rita3ko as code owners February 20, 2025 04:05
Workers AI has deprecated the usage of neurons in favor of unit-based pricing. The Cloudflare dashboards will be migrated to this unit-based pricing soon so you can track your usage. Individual model pages will soon document the price for each model. We also made pricing cheaper!

We will begin billing for all models under this new pricing structure beginning November 1, 2024.
Workers AI has updated pricing to be more granular, with unit-based pricing presented to customers, but still billing in neurons in the back end. Blog post coming soon.
Collaborator

Ideally, docs shouldn't speak to points in time (has updated, coming soon), but might make sense in this case.

Contributor

@daisyfaithauma daisyfaithauma left a comment

LGTM

@mchenco mchenco requested a review from a team as a code owner February 20, 2025 21:17
@mchenco mchenco merged commit 66ceb1d into production Feb 20, 2025
11 checks passed
@mchenco mchenco deleted the wai-pricing-feb branch February 20, 2025 22:04
Labels

product:workers-ai Workers AI: https://developers.cloudflare.com/workers-ai/ size/m

5 participants