Agent that routes to different tools - LLM classifier SDK

lamini-ai/llm-routing-agent
Routing agent that uses tools with Lamini & Llama 2

Train a new routing agent that uses tools with just prompts.

from lamini import LaminiClassifier

llm_routing_agent = LaminiClassifier()
tools = { "search" : search_tool_description,  "order": ordering_tool_description, "noop": no_tool_description }
llm_routing_agent.prompt_train(tools)

Then, let the agent route!

probabilities = llm_routing_agent.predict_proba("I want to buy three of these organic bananas now")
>> {
  'data': 'I want to buy three of these organic bananas now',
  'prediction': 'order',
  'probabilities': array([0.37626405, 0.4238198 , 0.19991615])
}

Optionally, add data before you train:

llm_routing_agent.add_data_to_class("order", "I'd like to buy a bag of granny smith apples")
llm_routing_agent.prompt_train(tools)

If you include data for classes that aren't in your tools, the classifier will still learn to predict them, but it won't have a description to leverage to further boost them. As a rule of thumb: if you have little or no paired data for a class, a good description of it in the tools is the best way to get the LLM to understand what it does.

Run now

./classify.sh 'I want to buy three of these organic bananas now'

You can get the probabilities for all the classes, in this case search (37.6%), order (42.3%), and noop (19.9%). These help you gauge uncertainty.

{
  'data': 'I want to buy three of these organic bananas now',
  'prediction': 'order',
  'probabilities': array([0.37626405, 0.4238198 , 0.19991615])
}
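Since the router returns a full distribution over tools, you can gate on confidence before committing to one. A minimal sketch in plain Python (the 0.5 threshold and the "noop" fallback are illustrative choices, not SDK defaults):

```python
# Gate routing on confidence: only follow the predicted tool when its
# probability clears a threshold; otherwise fall back to a safe class.
# The threshold and fallback are illustrative, not SDK defaults.
def route(prediction, probabilities, threshold=0.5, fallback="noop"):
    if max(probabilities) < threshold:
        return fallback
    return prediction

# Probabilities from the example above, ordered [search, order, noop]:
print(route("order", [0.37626405, 0.4238198, 0.19991615]))  # noop (0.42 < 0.5)
print(route("order", [0.1, 0.8, 0.1]))                      # order
```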

Run your LLM agent on multiple examples at once:

./classify.sh --data "I'd like to find a plumber" --data "who is the best person to ping for delivery issues?" --data "I want to buy three of these organic bananas now"
{
  'data': "I'd like to find a plumber",
  'prediction': 'noop',
  'probabilities': array([0.34477281, 0.25205545, 0.40317174])
}
{
  'data': 'who is the best person to ping for delivery issues?',
  'prediction': 'search',
  'probabilities': array([0.67681697, 0.13177853, 0.1914045 ])
}
{
  'data': 'I want to buy three of these organic bananas now',
  'prediction': 'order',
  'probabilities': array([0.37626405, 0.4238198 , 0.19991615])
}

Train the LLM routing agent yourself:

./train.sh

You can specify your own tools (with prompt descriptions of them) and optionally add training data to each class by passing a flag with the path to a CSV file. The CSV has a "class_name" column for the tool and a "data" column for an example user query.

./train.sh \
  --class "search: SEARCH_TOOL_DESCRIPTION" \
  --class "order: ORDERING_TOOL_DESCRIPTION" \
  --class "noop: NO_TOOL_DESCRIPTION" \
  --data CSV_DATA_FILEPATH
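The CSV itself is just those two columns. A sketch of generating one with Python's standard library (the file name and example rows are illustrative):

```python
import csv

# Two-column layout expected by train.sh:
# "class_name" names the tool, "data" is an example user query.
rows = [
    {"class_name": "order", "data": "I'd like to buy a bag of granny smith apples"},
    {"class_name": "search", "data": "what are the delivery fees?"},
]

with open("data.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["class_name", "data"])
    writer.writeheader()
    writer.writerows(rows)
```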

For example, here is a routing agent for a food delivery app that uses the tools {search, order, noop} trained using prompts.

Search prompt:

User wants to get an answer about the food delivery app that is available in the FAQ pages of this app. This includes questions about their deliveries, payment, available grocery stores, shoppers, fees, and the app overall.

Order prompt:

User wants to order items, i.e. buy an item and place it into the cart. This includes any indication of wanting to checkout, or add to or modify the cart. It includes mentioning specific items, or general turn of phrase like 'I want to buy something'.

No-op prompt:

User didn't specify a tool, i.e. they didn't say they wanted to search or order. The ask is totally irrelevant to the delivery service app.

Iteration

1. Improving data

This classifier is v0. Building LLMs is iterative. Here are some tools you can use to iterate on and improve your routing agent over time with more data. For example, classify a new query:

./classify.sh "buy the dragonfruit that's on sale"

It routes to the wrong tool: it chooses search over order.

{
  'data': "buy the dragonfruit that's on sale",
  'prediction': 'search',
  'probabilities': array([0.4132954 , 0.33355575, 0.25314885])
}

No problem: add this and similar examples to the data.csv file and retrain the classifier.

You can check out data_improved.csv, which has 3 extra examples.

order, "buy the dragonfruit that's on sale"
order, "just buy the sale fruits please"
order, "I want to get the dragonfruit"

Now, retrain the classifier with the new data.

./train.sh --data data_improved.csv

Now, it works!

# previous example, plus a new one that's close
./classify.sh \
  --data "buy the dragonfruit that's on sale" \
  --data "buy 1 dragonfruit"
{
  'data': "buy the dragonfruit that's on sale",
  'prediction': 'order',
  'probabilities': array([0.206426  , 0.65338528, 0.14018872])
}
{
  'data': 'buy 1 dragonfruit',
  'prediction': 'order',
  'probabilities': array([0.19983432, 0.66442344, 0.13574224])
}

2. Improving prompts

You can also improve the prompts that are used to generate the training data.

The more specific each tool's prompt is about what the tool does and how it differs from the other tools, the better the classifier will be.

Experiment with different prompts and see what works best. Giving examples, either through data or in the prompt itself, can help the LLM.

3. Adding new tools

Finally, you can add new tools. Just add a new class and prompt to the train.sh script and retrain the classifier.

More tools can help the router distinguish between more types of requests. For example, you can add a checkout tool and an add_to_cart tool to separate requests about checking out from requests about adding an item to the cart. Depending on which APIs you'd like to hit later, this could help with underlying analytics, in addition to improved routing.
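Extending the earlier tools dict with those two classes might look like this (the descriptions are illustrative prompts, not the repo's):

```python
# Split "order" into finer-grained tools so the router can distinguish
# cart edits from checkout. Descriptions here are illustrative prompts.
tools = {
    "search": "User wants an answer from the app's FAQ pages, e.g. about deliveries, payment, stores, or fees.",
    "add_to_cart": "User wants to add an item to the cart or modify items already in it.",
    "checkout": "User wants to pay for and finalize the items currently in their cart.",
    "noop": "User's request names no tool and is irrelevant to the delivery app.",
}

# Retrain as before, assuming an existing LaminiClassifier instance:
# llm_routing_agent.prompt_train(tools)
```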

Installation

Clone this repo, and run the train.sh or classify.sh command line tools.

Requires docker: https://docs.docker.com/get-docker

Set up your Lamini keys (free): https://lamini-ai.github.io/auth/

git clone git@github.com:lamini-ai/llm-routing-agent.git

cd llm-routing-agent

Train a new classifier.

./train.sh --help

usage: train.py [-h] [--class CLASS [CLASS ...]] [--train TRAIN [TRAIN ...]] [--save SAVE] [-v]

options:
  -h, --help            show this help message and exit
  --class CLASS [CLASS ...]
                        The classes to use for classification, in the format 'class_name:prompt'.
  --train TRAIN [TRAIN ...]
                        The training data to use for classification, in the format 'class_name:data'.
  --save SAVE           The path to save the model to.
  -v, --verbose         Whether to print verbose output.

Classify your data.

./classify.sh --help

usage: classify.py [-h] [--data DATA [DATA ...]] [--load LOAD] [-v] [classify ...]

positional arguments:
  classify              The data to classify.

options:
  -h, --help            show this help message and exit
  --data DATA [DATA ...]
                        The training data to use for classification, any string.
  --load LOAD           The path to load the model from.
  -v, --verbose         Whether to print verbose output.

These command line scripts just call Python inside of Docker, so you don't have to manage an environment.

If you hate Docker, you can also run this from Python easily...

Python Library

Install it: pip install lamini

Instantiate a classifier

from lamini import LaminiClassifier

# Create a new classifier
classifier = LaminiClassifier()

Define classes using prompts

classes = { "SOME_CLASS" : "SOME_PROMPT" }

classifier.prompt_train(classes)

Or if you have some training examples (optional)

data = ["example 1", "example 2"]
classifier.add_data_to_class("SOME_CLASS", data)

# Don't forget to train after adding data
classifier.prompt_train()

Classify your data

# Classify the data
prediction = classifier.predict(data)

# Get the probabilities for each class
probabilities = classifier.predict_proba(data)

Save your model

classifier.save("SOME_PATH")

Load your model

classifier = LaminiClassifier.load("SOME_PATH")

FAQ

How does it work?

The LLM routing agent converts your prompts about tools into a pile of data about those tools, using the Llama 2 LLM. It then finetunes another LLM to distinguish between each pile of data. This was forked from the LLM Classifier.

We use several specialized LLMs derived from Llama 2 to convert prompts into piles of training examples for each class. The code for this is available in the lamini Python package if you want to look at it. Working on open sourcing it in an easier-to-read GitHub page when I'm not too distracted...

Is this perfect?

No, this is a weeknight hackathon project; give us feedback and we will improve it. Some known issues:

  1. It doesn't use batching aggressively over classes, so training on many classes could be sped up by more than 100x.
  2. We are refining the LLM example generators. Send us any issues you find with your prompts and we can improve these models.

Why wouldn't I just use a normal classifier like BART, XGBoost, BERT, etc?

You don't need to label any data using the LaminiClassifier. Labeling data sucks.

No fiddling with hyperparameters. Fiddle with prompts instead. Hopefully English is easier than attention_dropout_pcts.

Why wouldn't I just use an LLM directly?

A classifier always outputs a valid class. An LLM might answer the question "Is this talking about an order?" with "Well... that depends on ....". Writing a parser sucks.
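Because the prediction is always one of your class names, downstream routing can be a plain dictionary dispatch with no parsing. A sketch with hypothetical handlers (do_search, do_order, and do_noop are stand-ins for your real tool calls, not part of the SDK):

```python
# Hypothetical handlers; wire these to your real search/order APIs.
def do_search(query):
    return f"searching FAQ for: {query}"

def do_order(query):
    return f"placing order: {query}"

def do_noop(query):
    return "sorry, that's outside what this app can do"

handlers = {"search": do_search, "order": do_order, "noop": do_noop}

# prediction comes from classifier.predict(...); it is always a valid key,
# so the lookup never needs a parser or an error-recovery branch.
prediction = "order"
result = handlers[prediction]("three organic bananas")
print(result)  # placing order: three organic bananas
```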

Added benefit: classifiers give you probabilities and can be calibrated: https://machinelearningmastery.com/threshold-moving-for-imbalanced-classification/
