InferLens is an LLM deep-analysis toolbox that provides a unified interface for measuring and presenting various statistics and metrics in real time during LLM inference.
Users can explore how the layers of the selected LLM are activated during each token generation, the resources the model consumes, the entropy of the selected token, and more.
There is also an option for conversation history and prompt-based continual learning (PCL) to explore more of the capabilities provided by modern LLMs.
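As a point of reference for the entropy metric mentioned above, here is a minimal, self-contained sketch of how the entropy of a next-token distribution can be computed from raw logits (softmax followed by Shannon entropy). This is an illustrative example, not InferLens's actual implementation.

```python
import math

def token_entropy(logits):
    """Shannon entropy (in bits) of the next-token distribution
    obtained by applying softmax to raw logits."""
    m = max(logits)                              # subtract max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    return -sum(p * math.log2(p) for p in probs if p > 0)

# A uniform distribution over 4 candidate tokens carries 2 bits of entropy.
print(round(token_entropy([0.0, 0.0, 0.0, 0.0]), 3))  # 2.0
```

A sharply peaked distribution (one logit much larger than the rest) yields entropy close to 0, which is what a confident token selection looks like in the GUI.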
The default LLM is Llama 3, but other Hugging Face LLMs are supported as well, including Phi-4, Nanbeige4, and Qwen2.5.
[https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct]
[https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct]
- Get token from https://huggingface.co/settings/tokens.
- Create file src/hugface_token.py.
- Add TOKEN="{Your token}" in it.
- Save.
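The four steps above can be done from a shell in one go. The token value below is a placeholder; substitute your own token from the settings page.

```shell
# Create src/hugface_token.py containing the Hugging Face token.
# "hf_your_token_here" is a placeholder, not a real token.
mkdir -p src
printf 'TOKEN="%s"\n' "hf_your_token_here" > src/hugface_token.py
```

Keep this file out of version control (e.g. add it to .gitignore), since it holds a credential.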
- Python 3.12+
- GPU that supports CUDA 12.8 if run in CUDA mode
- Access to port 5080 or another port that will host the GUI
python3.12 -m pip install pipenv
python3.12 -m pipenv install
python3.12 -m pipenv run pip install -r requirements-torch.txt
python main.py --port 5080 --model_id meta-llama/Llama-3.2-3B-Instruct
python main.py --port 5080
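For reference, a command line like the ones above can be parsed with a small argparse setup. This is a hypothetical sketch of the two flags shown; the real main.py may define its options differently.

```python
import argparse

def build_parser():
    # Hypothetical CLI mirroring the flags used in the commands above.
    p = argparse.ArgumentParser(prog="main.py")
    p.add_argument("--port", type=int, default=5080,
                   help="port that will host the GUI")
    p.add_argument("--model_id", default="meta-llama/Llama-3.2-3B-Instruct",
                   help="Hugging Face model identifier")
    return p

# Omitting --model_id falls back to the default Llama model.
args = build_parser().parse_args(["--port", "5080"])
print(args.port, args.model_id)
```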
After the application starts successfully, the GUI will be available at http://localhost:5080
Copyright (C) 2026 Efficient Computing Lab - NTUA vpsomak@mail.ntua.gr
InferLens is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
InferLens is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.