This web tool helps you estimate the GPU infrastructure required for Large Language Model (LLM) inference. It assists developers and architects in planning the necessary hardware based on model characteristics and performance requirements.
- Comprehensive Estimation: Calculates the required VRAM (model weights + KV cache), the number of GPUs, latency, and capital expenditure (CAPEX); a first-order sketch of the underlying math follows this list.
- Pre-defined Models: Includes a list of popular models (e.g., Llama, Mistral) for a quick start.
- GPU Catalog: Contains specifications for common GPUs; unit prices can be edited by the user.
- Customizable Parameters: Allows you to adjust all key parameters: model size, precision, context length, QPS, etc.
- CSV Export: Export the input data and estimation results to a CSV file.
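
As a rough illustration of the kind of first-order math behind these estimates, here is a minimal TypeScript sketch. Every name, formula, and constant in it is an illustrative assumption rather than the app's actual code; in particular, the KV-cache term assumes standard multi-head attention, where each cached token stores K and V vectors of `hiddenSize` values per layer.

```typescript
// Illustrative first-order estimate; NOT the app's actual code.
// Assumes dense transformer weights and a standard multi-head-attention
// KV cache (2 tensors x layers x hiddenSize bytes per cached token).

interface EstimateInput {
  paramsBillion: number;      // model size, in billions of parameters
  bytesPerParam: number;      // 2 for FP16/BF16, 1 for INT8, 0.5 for 4-bit
  numLayers: number;          // transformer layer count
  hiddenSize: number;         // model (embedding) dimension
  contextTokens: number;      // input + output tokens per request
  concurrentRequests: number; // requests held in memory at once
  gpuVramGB: number;          // usable VRAM per GPU
  gpuPriceUsd: number;        // unit price, editable in the GPU catalog
}

interface Estimate {
  weightsGB: number;
  kvCacheGB: number;
  totalVramGB: number;
  gpuCount: number;
  capexUsd: number;
}

function estimate(i: EstimateInput): Estimate {
  // Weights: parameter count x bytes per parameter.
  const weightsGB = (i.paramsBillion * 1e9 * i.bytesPerParam) / 1e9;

  // KV cache: 2 tensors (K and V) x layers x hiddenSize x bytes per value,
  // per token, scaled by context length and concurrency.
  const kvBytesPerToken = 2 * i.numLayers * i.hiddenSize * i.bytesPerParam;
  const kvCacheGB =
    (kvBytesPerToken * i.contextTokens * i.concurrentRequests) / 1e9;

  const totalVramGB = weightsGB + kvCacheGB;
  const gpuCount = Math.ceil(totalVramGB / i.gpuVramGB);
  return {
    weightsGB,
    kvCacheGB,
    totalVramGB,
    gpuCount,
    capexUsd: gpuCount * i.gpuPriceUsd,
  };
}
```
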
- Frontend: React, TypeScript
- Build Tool: Vite
- Styling: Tailwind CSS
Follow these steps to run the project on your local machine.
- Clone the repository (if you haven't already):

  ```bash
  git clone https://github.com/your-username/your-repo.git
  ```

- Navigate to the project directory:

  ```bash
  cd repo-name
  ```

- Install the dependencies:

  ```bash
  npm install
  ```
To start the development server, run:

```bash
npm run dev
```

The application will then be available at http://localhost:5173 (Vite prints the actual port in the terminal on startup).
- To create an optimized production build:

  ```bash
  npm run build
  ```

- To preview the production build locally:

  ```bash
  npm run preview
  ```
- Select a model from the dropdown list or manually enter the model parameters.
- Adjust the inference parameters such as context size (input/output tokens) and QPS (queries per second).
- Choose a target GPU to see estimates based on that hardware.
- The results are calculated in real time and displayed on the right.
- (Optional) Go to the GPU Catalog tab to adjust prices and see the impact on the total cost.
- Click Export to CSV to download a summary of your estimation (a hypothetical worked example and CSV row follow this list).
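
For a sense of scale, here is a hypothetical run of the sketch from the feature list for a 7B-parameter model at FP16 (32 layers, hidden size 4096) serving eight concurrent 4,096-token requests on 24 GB GPUs, followed by a minimal CSV row in the spirit of the export feature. All figures, column names, and the assumed GPU price are illustrative, not output from the tool.

```typescript
// Hypothetical inputs for a 7B-parameter model at FP16 (illustrative only).
const result = estimate({
  paramsBillion: 7,
  bytesPerParam: 2,      // FP16
  numLayers: 32,
  hiddenSize: 4096,
  contextTokens: 4096,   // input + output
  concurrentRequests: 8,
  gpuVramGB: 24,
  gpuPriceUsd: 8000,     // assumed catalog price
});
// weightsGB   ~ 14.0
// kvCacheGB   ~ 17.2  (about 0.5 MB per token x 4096 tokens x 8 requests)
// totalVramGB ~ 31.2  -> gpuCount = 2, capexUsd = 16000

// A CSV export of such a result can be as simple as one header row plus
// one data row (the tool's actual columns may differ):
const csv = [
  "weights_gb,kv_cache_gb,total_vram_gb,gpu_count,capex_usd",
  [result.weightsGB, result.kvCacheGB, result.totalVramGB,
   result.gpuCount, result.capexUsd].map(v => v.toFixed(2)).join(","),
].join("\n");
```
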
This tool is a work in progress and provides estimates for educational purposes. It does not account for some elements of a complete infrastructure, such as redundancy, network costs (ingress/egress), load balancers, monitoring, and other application components.
- Vincent Méoc - LinkedIn
