Welcome to the LLM Dashboard, an interactive tool for exploring different language model inference methods!
- Compare different inference methods: Normal, Caching, and Batching
- Visualize generation times with interactive graphs
- Customize token generation length
To get started:

- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/llm-dashboard.git
  cd llm-dashboard
  ```

- Create a virtual environment and activate it:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
  ```

- Install the required packages:

  ```bash
  pip install -r requirements.txt
  ```

- Start the Flask server:

  ```bash
  python app.py
  ```

- Open your web browser and navigate to http://localhost:5000.
- Enter your text prompt in the input field
- Select the inference method (Normal, Caching, or Batching)
- Choose the number of tokens to generate
- Click "Generate" and watch the magic happen!
The dashboard compares three inference methods (sketched below):

- Normal: standard token-by-token generation
- Caching: uses KV caching so each subsequent token is generated faster
- Batching: processes multiple inputs simultaneously for improved throughput
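To make the three strategies concrete, here is a minimal sketch of each using a Hugging Face GPT-2 checkpoint. The checkpoint, function names, and greedy decoding are illustrative assumptions; the dashboard's own implementation in `app.py` may differ:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative checkpoint; the dashboard may load a different model.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

@torch.no_grad()
def generate_normal(prompt: str, max_new_tokens: int = 20) -> str:
    """Greedy decoding that re-encodes the whole sequence at every step."""
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    for _ in range(max_new_tokens):
        logits = model(ids, use_cache=False).logits
        next_id = logits[:, -1, :].argmax(dim=-1, keepdim=True)
        ids = torch.cat([ids, next_id], dim=-1)
    return tokenizer.decode(ids[0], skip_special_tokens=True)

@torch.no_grad()
def generate_cached(prompt: str, max_new_tokens: int = 20) -> str:
    """Greedy decoding that reuses cached key/value tensors (KV cache)."""
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    out = model(ids, use_cache=True)
    past = out.past_key_values
    next_id = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
    pieces = [ids, next_id]
    for _ in range(max_new_tokens - 1):
        # Only the newest token is fed in; earlier positions come from the cache.
        out = model(next_id, past_key_values=past, use_cache=True)
        past = out.past_key_values
        next_id = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
        pieces.append(next_id)
    return tokenizer.decode(torch.cat(pieces, dim=-1)[0], skip_special_tokens=True)

@torch.no_grad()
def generate_batched(prompts: list[str], max_new_tokens: int = 20) -> list[str]:
    """One forward pass per step for a whole batch of prompts."""
    tokenizer.padding_side = "left"            # align final tokens for generation
    tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
    batch = tokenizer(prompts, return_tensors="pt", padding=True)
    out = model.generate(**batch, max_new_tokens=max_new_tokens,
                         do_sample=False, pad_token_id=tokenizer.pad_token_id)
    return tokenizer.batch_decode(out, skip_special_tokens=True)
```

The key difference: `generate_normal` redoes attention over the full sequence at every step, while `generate_cached` feeds in only the newest token and reuses cached keys and values, which is why caching pulls ahead as sequences grow.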
Experiment with different methods and observe the performance differences in the generated graphs!
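As a rough way to see these differences outside the dashboard, you can time the sketches above directly. This is illustrative wall-clock timing, not the dashboard's own instrumentation:

```python
import time

def timed(fn, *args, **kwargs):
    # Rough wall-clock timing of a single generation call.
    start = time.perf_counter()
    fn(*args, **kwargs)
    return time.perf_counter() - start

prompt = "Once upon a time"
print(f"normal:   {timed(generate_normal, prompt, max_new_tokens=50):.2f}s")
print(f"caching:  {timed(generate_cached, prompt, max_new_tokens=50):.2f}s")
print(f"batching: {timed(generate_batched, [prompt] * 4, max_new_tokens=50):.2f}s")
```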
We welcome contributions! Please see our CONTRIBUTING.md for details on how to get started.
This project is not yet licensed.