A local application with UI for benchmarking multiple Small Language Models (SLMs) running via Microsoft Foundry Local.
Read the full story: How we built FLPerformance - Learn about the architecture decisions, challenges faced, and how to get real-world LLM performance metrics on your local hardware.
Windows users: If you have Node.js installed, just run .\START_APP.ps1 to start everything! Opens 2 terminals + browser automatically.
- Complete Benchmark System: Full end-to-end benchmarking with accurate metrics
- Enhanced Visualizations: Performance cards, comparison charts, and radar graphs
- Real-time Progress: Polling-based status updates every 2 seconds during runs
- Results Export: JSON and CSV export functionality
- Hardware Detection: Comprehensive system information capture
- Storage System: JSON-based storage with optional SQLite support
FLPerformance (Foundry Local Performance) enables you to:
- Manage Foundry Local service using the official JavaScript SDK
- Load and benchmark multiple models simultaneously
- Run standardized benchmark tests across models
- Display clear performance statistics with tables and charts
- Export results for analysis
Required: Install Microsoft Foundry Local first

```bash
# Windows
winget install Microsoft.FoundryLocal

# macOS
brew tap microsoft/foundrylocal
brew install foundrylocal

# Or download from: https://aka.ms/foundry-local-installer
```

Verify installation:

```bash
foundry --version
```

Step 1: Navigate to project directory

```bash
cd C:\Users\YourUsername\path\to\FLPerformance
```

Step 2: Install Node.js (if not already installed)
```bash
# Windows - Install Node.js LTS
winget install --id OpenJS.NodeJS.LTS --accept-package-agreements --accept-source-agreements

# After installation, RESTART YOUR TERMINAL for PATH updates
```

macOS:

```bash
brew install node
```

Or download from: https://nodejs.org/
Step 3: Run installation script

```bash
# Windows
.\scripts\install.ps1

# macOS/Linux
chmod +x scripts/install.sh && ./scripts/install.sh
```

Note: Installation uses the --no-optional flag to skip the SQLite database (requires build tools). Results are saved as JSON files instead. This works perfectly for all features!
Step 4: Start the application

```bash
# Easy Mode - Opens 2 terminals + browser automatically (Windows)
.\START_APP.ps1

# Manual Mode - Starts both servers
npm run dev
```

Once the server starts, open your browser:
You'll see:
- Models tab - Add and load AI models
- Benchmarks tab - Run performance tests
- Results tab - View comparison charts
- Click Models → Initialize Foundry Local (one-time setup)
- Click Add Model → Select phi-3-mini-4k-instruct
- Click Load Model (downloads ~2GB, takes 2-5 minutes)
- Go to Benchmarks → Select your model → Run Benchmark
- View results in Results tab
- Microsoft Foundry Local
  - Download from: https://aka.ms/foundry-local-installer
  - Verify installation: `foundry --version`
  - Note: Foundry Local CLI must be in your PATH
- Node.js & NPM
  - Node.js v18 or higher
  - NPM v9 or higher
  - Download from: https://nodejs.org/
  - Verify: `node --version` and `npm --version`
- System Requirements
  - Windows 10/11, macOS, or Linux
  - Minimum 16GB RAM (32GB+ recommended for multiple models)
  - GPU with CUDA support (optional but recommended)
  - Adequate disk space for model storage (varies by model, typically 5-50GB per model)
If the automated script doesn't work:
```bash
npm install --no-optional

# Install frontend dependencies
cd src/client
npm install
cd ../..

# Create results directory
mkdir results
```

Want SQLite database support? Install Visual Studio Build Tools first:

```bash
# Windows only - needed for better-sqlite3
winget install Microsoft.VisualStudio.2022.BuildTools --silent --override "--wait --passive --add Microsoft.VisualStudio.Workload.VCTools"

# Then install with optional dependencies
npm install

# Create results directory
mkdir results
```

```bash
# Development mode (with hot reload)
npm run dev
```

Access the application at: http://localhost:3000
The application will be available at:
- Frontend UI: http://localhost:3000
- Backend API: http://localhost:3001
- Open the UI at http://localhost:3000
- Navigate to the Models tab
- Click "Initialize Foundry Local" to start the service
- Click "Add Model"
- Select a model from the available Foundry Local catalog (e.g., phi-3-mini-4k-instruct)
- Click "Load Model" to download (if needed) and load the model into memory
Note: Foundry Local uses a single service instance that can load multiple models simultaneously. Models are differentiated by their model ID when making inference requests.
- Navigate to the Benchmarks tab
- Select the "default" benchmark suite
- Choose one or more models to benchmark
- Configure settings (iterations, concurrency, etc.)
- Click "Run Benchmark"
- Watch live progress as tests execute
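The live progress display is driven by the 2-second status polling mentioned above. A minimal sketch of such a loop — the `getStatus` callback and the `{ status, progress }` shape are illustrative assumptions, not the app's actual API:

```javascript
// Poll a status callback until the run finishes or attempts run out.
// `getStatus` is any async function returning { status, progress };
// in the real app this would wrap a fetch to the backend's status endpoint.
async function pollStatus(getStatus, { intervalMs = 2000, maxAttempts = 300 } = {}) {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const state = await getStatus();
    if (state.status === 'completed' || state.status === 'failed') return state;
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
  throw new Error('Benchmark run did not finish in time');
}

// Demo with a stubbed status source that completes on the third call.
let calls = 0;
const fakeStatus = async () =>
  ++calls < 3 ? { status: 'running', progress: calls * 40 } : { status: 'completed', progress: 100 };

pollStatus(fakeStatus, { intervalMs: 10 }).then((final) => {
  console.log(final.status); // "completed"
});
```

Polling (rather than websockets) keeps the backend stateless between requests, at the cost of up to one interval of display lag.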
- Navigate to the Results tab
- View comparison tables and charts
- Filter by run, model, or benchmark type
- Export results as JSON or CSV
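CSV export from JSON results amounts to flattening result objects into rows. A sketch — the field names (`model`, `tps`, `ttftMs`) are illustrative, not the app's actual schema:

```javascript
// Convert an array of flat result objects to CSV text.
// Values containing commas, quotes, or newlines are quoted per RFC 4180.
function toCsv(rows) {
  if (rows.length === 0) return '';
  const headers = Object.keys(rows[0]);
  const escape = (value) => {
    const s = String(value ?? '');
    return /[",\n]/.test(s) ? `"${s.replace(/"/g, '""')}"` : s;
  };
  const lines = rows.map((row) => headers.map((h) => escape(row[h])).join(','));
  return [headers.join(','), ...lines].join('\n');
}

// Example with made-up benchmark rows:
const csv = toCsv([
  { model: 'phi-3-mini-4k-instruct', tps: 42.1, ttftMs: 180 },
  { model: 'another-model', tps: 35.7, ttftMs: 220 },
]);
console.log(csv.split('\n')[0]); // "model,tps,ttftMs"
```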
```
FLPerformance/
├── src/
│   ├── server/               # Backend API
│   │   ├── index.js          # Express server entry point
│   │   ├── orchestrator.js   # Foundry Local service orchestration
│   │   ├── benchmark.js      # Benchmark engine
│   │   ├── storage.js        # Results storage (JSON + SQLite)
│   │   └── logger.js         # Structured logging
│   └── client/               # Frontend UI (React/Vue)
│       ├── public/
│       └── src/
│           ├── components/   # UI components
│           ├── pages/        # Page views
│           └── utils/        # Client utilities
├── benchmarks/
│   └── suites/
│       └── default.json      # Default benchmark suite definition
├── docs/
│   ├── ARCHITECTURE.md       # System architecture
│   ├── API.md                # REST API reference
│   ├── SETUP.md              # Setup documentation
│   └── BENCHMARK_GUIDE.md    # Troubleshooting guide
├── scripts/
│   └── helpers/              # Utility scripts
├── results/
│   └── example/              # Example benchmark results
├── package.json
└── README.md
```
- Unified service management using foundry-local-sdk
- Add/remove models from Foundry Local catalog
- Load multiple models simultaneously in a single service
- Monitor model health and status in real-time
- Automatic model download and caching
- Throughput (TPS): Tokens generated per second (overall)
- Latency: Time to first token (TTFT), time per output token (TPOT), and end-to-end completion time
- Generation Speed (GenTPS): Token generation rate after first token (1000/TPOT)
- Stability: Error rate and timeout tracking
- Resource Usage: CPU, RAM, and GPU utilization (platform-dependent)
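These metrics follow directly from per-request timestamps. A sketch of the arithmetic, assuming timestamps in milliseconds (the `computeMetrics` helper and its field names are illustrative, not the app's actual code):

```javascript
// Derive TTFT, TPOT, GenTPS, and overall TPS from one request's timing data.
function computeMetrics({ startMs, firstTokenMs, endMs, tokensGenerated }) {
  const ttftMs = firstTokenMs - startMs;              // time to first token
  const totalMs = endMs - startMs;                    // end-to-end latency
  const tpotMs = tokensGenerated > 1
    ? (endMs - firstTokenMs) / (tokensGenerated - 1)  // ms per output token after the first
    : 0;
  return {
    ttftMs,
    totalMs,
    tpotMs,
    genTps: tpotMs > 0 ? 1000 / tpotMs : 0,           // generation rate after first token
    overallTps: (tokensGenerated / totalMs) * 1000,   // tokens per second, end to end
  };
}

const m = computeMetrics({ startMs: 0, firstTokenMs: 200, endMs: 1200, tokensGenerated: 101 });
console.log(m.ttftMs, m.tpotMs, m.genTps); // 200 10 100
```

Note that overall TPS is always lower than GenTPS because it amortizes the first-token wait over the whole request.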
- Side-by-side model comparison tables
- Interactive charts for TPS, latency distributions (p50/p95/p99), error rates
- "Best model for..." recommendations based on metrics
- Export results as JSON or CSV
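The p50/p95/p99 latency figures in the charts can be produced with a simple nearest-rank percentile over the per-request samples; a sketch (the helper is illustrative, not the app's actual implementation):

```javascript
// Nearest-rank percentile over a list of latency samples (ms).
function percentile(samples, p) {
  if (samples.length === 0) return NaN;
  const sorted = [...samples].sort((a, b) => a - b);
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[Math.max(0, rank - 1)];
}

const latencies = [120, 95, 210, 180, 150, 99, 310, 140, 160, 130];
console.log(percentile(latencies, 50), percentile(latencies, 95), percentile(latencies, 99));
// 140 310 310
```

With only a handful of iterations, p95 and p99 collapse onto the worst sample (as above), which is why higher iteration counts give more meaningful tail latencies.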
Default settings can be modified in the Settings tab:
- Default iterations per benchmark
- Concurrency level
- Request timeout values
- Results storage path
- Streaming mode (if supported)
FLPerformance uses the official foundry-local-sdk JavaScript package to manage the Foundry Local service:
- Single Service Instance: One Foundry Local service handles all models
- Multiple Loaded Models: Models are loaded on-demand and run simultaneously
- OpenAI-Compatible API: Standard OpenAI client for inference requests
- Model Differentiation: Models are identified by their model ID in API calls
See Architecture Documentation for details.
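Since one service instance hosts every loaded model, inference requests are routed purely by the model ID in the OpenAI-compatible payload. A sketch of building such a request body — the endpoint path follows the standard OpenAI chat-completions convention, port 8080 is Foundry Local's default, and the helper itself is hypothetical:

```javascript
// Build an OpenAI-compatible chat completion request for a given model ID.
// The same endpoint serves every loaded model; only `model` differs.
function buildChatRequest(modelId, prompt, { maxTokens = 256, stream = false } = {}) {
  return {
    url: 'http://localhost:8080/v1/chat/completions',
    body: {
      model: modelId, // selects which loaded model handles the request
      messages: [{ role: 'user', content: prompt }],
      max_tokens: maxTokens,
      stream, // streaming is what makes time-to-first-token measurable
    },
  };
}

const req = buildChatRequest('phi-3-mini-4k-instruct', 'Say hello.');
console.log(req.body.model); // "phi-3-mini-4k-instruct"
```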
- Ensure Foundry Local is installed: `foundry --version`
- Verify Foundry Local CLI is in your PATH
- Check that port 8080 is available (default Foundry Local port)
- View logs in the Models tab for specific error messages
- Verify sufficient disk space for model download
- Check network connectivity for first-time downloads
- Ensure adequate RAM for model size
- Try manually loading with the Foundry Local CLI: `foundry model run <model-name>`
- Increase timeout values in Settings
- Reduce concurrency level
- Check system resource availability (RAM, GPU memory)
- Use the Test button in the Models tab to verify inference works
- Successful test ensures model will work in benchmarks
- Test validates both model loading and inference response
- Quick way to catch configuration issues early
- Run the appropriate installation script (install.ps1 or install.sh) for detailed diagnostics
- Check Quick Start Guide for common installation issues
- Verify Node.js version: `node --version` (must be v18+)
For more detailed information, see:
- Quick Start Guide - Comprehensive getting started guide
- Quick Reference - Commands and code patterns cheat sheet
- Architecture Documentation - System design and SDK integration
- API Reference - REST API endpoint documentation
- Setup Guide - Detailed installation and configuration
- Benchmark Guide - Troubleshooting and testing guide
- Testing Checklist - Comprehensive test cases
For issues or questions:
- Check the documentation in /docs
- Review logs in the UI under each service
- Examine results in the /results directory
MIT License