SMART: Automatically Scaling Down Language Models with Accuracy Guarantees for Reduced Processing Fees

SMART (Scaling Models Adaptively for Reduced Token Fees) is an LLM framework designed to minimize the inference cost of NLP tasks while ensuring sufficient result quality.
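To make the cost/quality trade-off concrete, here is a minimal generic sketch. It is not SMART's actual algorithm and does not use the code in this repository. It assumes each candidate model has been checked on a random sample of items, and it picks the cheapest candidate whose Hoeffding lower bound on accuracy still meets a target. The names `Candidate`, `accuracy_lower_bound`, and `cheapest_sufficient`, and the cost and count fields, are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str             # hypothetical model label
    cost_per_item: float  # hypothetical fee for processing one item
    correct: int          # sampled items whose output was acceptable
    sampled: int          # number of items in the sample


def accuracy_lower_bound(correct, sampled, delta):
    """One-sided Hoeffding lower bound; holds with probability >= 1 - delta."""
    if sampled == 0:
        return 0.0
    observed = correct / sampled
    margin = math.sqrt(math.log(1.0 / delta) / (2.0 * sampled))
    return max(0.0, observed - margin)


def cheapest_sufficient(candidates, target, delta=0.05):
    """Return the cheapest candidate whose lower bound meets target, else None."""
    if not candidates:
        return None
    # Split delta across candidates (union bound) so the overall
    # failure probability stays at most delta.
    per_candidate = delta / len(candidates)
    ok = [
        c for c in candidates
        if accuracy_lower_bound(c.correct, c.sampled, per_candidate) >= target
    ]
    return min(ok, key=lambda c: c.cost_per_item) if ok else None
```

For example, given a cheap candidate with 980 of 1000 sampled items correct and an expensive one with 1000 of 1000, a target of 0.90 selects the cheap candidate, while a target of 0.95 falls back to the expensive one.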

Quick Start

# Tested on Python 3.10.
git clone https://github.com/saehanjo/smart-llms.git
cd smart-llms

# (Optional) Create virtual environment.
python -m venv .venv
source .venv/bin/activate

# Install requirements.
pip install -r requirements.txt

# Run experiments. Results are saved in the CSV file: result/all.csv.
mkdir -p result
python run_exp.py
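Once the run finishes, the results file can be inspected with a short script. The sketch below uses only the standard library and makes no assumption about the columns in result/all.csv; it reports whatever header it finds. The helper name `summarize_csv` is hypothetical and not part of this repository.

```python
import csv
from pathlib import Path


def summarize_csv(path):
    """Return (column_names, row_count) for a CSV file with a header row."""
    with open(path, newline="") as f:
        reader = csv.DictReader(f)
        rows = list(reader)
        columns = list(reader.fieldnames or [])
    return columns, len(rows)


if __name__ == "__main__":
    # Path taken from the Quick Start comment above.
    result_path = Path("result") / "all.csv"
    if result_path.exists():
        columns, n_rows = summarize_csv(result_path)
        print(f"{n_rows} rows; columns: {columns}")
    else:
        print(f"{result_path} not found; run run_exp.py first.")
```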
