LLM Price Comparison

A comparison of the price per million tokens and benchmark scores of various large language models.

Google Gemini Pro has been adjusted from a per-character price to a per-token estimate by simply multiplying by four.

Benchmark

The benchmark used is the DROP benchmark, which measures the ability of a model to reason over text, as reported by Anthropic: https://twitter.com/AnthropicAI/status/1764653830468428150?t=PVCce7q9pT-aiwsUd1w9tg&s=19

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Comparative_Analysis_of_LLM_Pricing_and_Performance_Benchmark_DROP_Reasoning_over_Text.png		Comparative_Analysis_of_LLM_Pricing_and_Performance_Benchmark_DROP_Reasoning_over_Text.png
README.md		README.md
chart.ipynb		chart.ipynb