A simple and easy-to-use GPU memory calculator for Large Language Models (LLMs). Helps you quickly estimate the GPU memory requirements and number of devices needed to run models of various sizes.
- Calculate required GPU memory based on model parameter count
- Support for multiple precision formats: FP32, FP16/BFLOAT16, FP8, INT8, INT4 (byte sizes per parameter are sketched after this feature list)
- Built-in presets for popular LLM models (DeepSeek-R1, Qwen, Llama, etc.)
- Support for over 130 GPU models, including:
  - NVIDIA Data Center GPUs (H100, H200, B100, B200, etc.)
  - NVIDIA Consumer GPUs (RTX series)
  - AMD Data Center GPUs (Instinct series)
  - AMD Consumer GPUs (RX series)
  - Apple Silicon (M1-M4 series)
  - Huawei Ascend series
- Complete internationalization support (English, Simplified Chinese, Traditional Chinese, Russian, Japanese, Korean, Arabic)
- Responsive design for desktop and mobile devices
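Each precision format corresponds to a fixed number of bytes per parameter. A minimal sketch of that mapping in TypeScript (the constant name is illustrative, not taken from this project's source):

```typescript
// Bytes needed to store one model parameter in each supported precision.
// Illustrative only; the project's own constants may be named differently.
const BYTES_PER_PARAM: Record<string, number> = {
  FP32: 4,
  FP16: 2, // BFLOAT16 also uses 2 bytes per parameter
  FP8: 1,
  INT8: 1,
  INT4: 0.5, // two 4-bit parameters packed per byte
};
```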
This calculator uses the following formulas to estimate the GPU memory required for LLM inference (a worked sketch follows the list):
- Model Weight Memory = Number of Parameters × Bytes per Parameter
- Inference Memory = Model Weight Memory × 10% (for activations, KV cache, etc.)
- Total Memory Requirement = Model Weight Memory + Inference Memory
- Required GPUs = Total Memory Requirement ÷ Single GPU Memory Capacity (rounded up)
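A minimal TypeScript sketch of these formulas (function and parameter names are assumptions for illustration, not this project's actual code):

```typescript
/**
 * Estimate total memory and GPU count for LLM inference.
 * Sketch only: names, units (decimal GB = 1e9 bytes), and structure are assumptions.
 */
function estimateGpuRequirements(
  numParams: number,     // e.g. 70e9 for a 70B model
  bytesPerParam: number, // e.g. 2 for FP16/BFLOAT16
  gpuMemoryGB: number    // memory of a single GPU, e.g. 80
) {
  const weightMemoryGB = (numParams * bytesPerParam) / 1e9;    // Model Weight Memory
  const inferenceMemoryGB = weightMemoryGB * 0.1;              // 10% for activations, KV cache, etc.
  const totalMemoryGB = weightMemoryGB + inferenceMemoryGB;    // Total Memory Requirement
  const requiredGpus = Math.ceil(totalMemoryGB / gpuMemoryGB); // rounded up
  return { weightMemoryGB, totalMemoryGB, requiredGpus };
}
```

Under these assumptions, a 70B-parameter model in FP16 needs about 70 × 2 = 140 GB for weights, roughly 154 GB after the 10% overhead, and therefore 2 GPUs with 80 GB each (154 ÷ 80 ≈ 1.93, rounded up).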
- Next.js - React framework
- TypeScript - Type-safe JavaScript superset
- Tailwind CSS - Utility-first CSS framework
- shadcn/ui - Reusable UI components
- next-intl - Internationalization support
# Install dependencies
pnpm install
# Start development server
pnpm dev
# Build for production
pnpm build
# Start production server
pnpm start

Contributions via Pull Requests or Issues are welcome to help improve this project. Potential areas for contribution include:
- Adding more GPU models
- Adding more LLM model presets
- Optimizing calculation methods
- Adding support for more languages
- Improving UI/UX
MIT