Introduce UnifiedParameters #600

codenamev · 2024-04-30T14:49:46Z

This is an initial effort to unify the API signature for all the LLMs' chat method. This follows OpenRouter's philosophy of "normalizes the schema across models and providers so you only need to learn one". Each LLM will need to implement their own adaptations of the base-unified parameter list, and follow some simple rules:

All LLM subclasses will now use an instance of UnifiedParameters and can access it via chat_parameters
All LLM subclasses can customize the schema used by the unified parameters in their initializer
If the chosen model doesn't support a request parameter (such as logit_bias in non-OpenAI models, or top_k for OpenAI), then the parameter will be ignored. The rest will be forwarded to the underlying model API.

This is also an initial effort to set up the foundation for a Parameters API. The idea being that each model can use their customized UnifiedParameters as an interface for querying what those parameters are.

I'd love some feedback on this before I go spreading it to every other LLM class 🥺

lib/langchain/llm/openai.rb

andreibondarev · 2024-05-01T12:15:10Z

@codenamev I should finish my review by the end of today.

andreibondarev · 2024-05-01T14:54:01Z

lib/langchain/llm/openai.rb

-      tools: [],
-      tool_choice: nil,
-      user: nil,
-      &block


@codenamev You didn't like how all of the params were listed out? We had that previously because I felt like it made the interface clearer since it's an interface that we control.

I was torn here, but allowing these inherited classes to change the method signature with custom keyword arguments complicates things quite a lot and is adding confusion on when to use what. One thing we could do is change this to accept a UnifiedParameters instance (instead of a Hash as it is) and then pass that along to the chat_parameters the same way (and just have it merge). Doing that would allow us to continue documenting and inspect what is acceptable criteria (similar to how GraphQL mutations can accept input types).

One thing we could do is change this to accept a UnifiedParameters instance

What would the DX look like then? Something along the lines of: ... ?

uniparams = Langchain::LLM::UnifiedParameters.new(schema: {}, parameters: params) openai.chat(parameters: uniparams)

Just to re-iterate, the goal of these changes is that Langchain::LLM::Base is the source of truth for the unified method signature we want applied to all children. Preserving this idea, the LLM sub-classes will be responsible for:

Adhering to the unified method signature of its' parent

Managing any amendments to the method signature it wants to handle

The way this currently is implemented, each sub-class uses a chat_params instance from LLM::Base and modifies it in its' initializer.

Following 3 of OpenRouter's principles here:

If the chosen model doesn't support a request parameter, then the parameter will be ignored. The rest will be forwarded to the underlying model API.

The callers should be able to pass any parameters to these sub-classes and the model itself will know how to handle them for their respective API calls. For this reason, this is why I modified the arguments of chat for OpenAI to accept a hash rather than explicit keyword arguments. Under the hood, the model forces the unified schema, ignores any fields, re-maps (aliases) any fields it needs re-named, and allows any additional fields the model accepts.

In the future, we can add validations and parameter inquiries, but this is mostly a small step backward (obfuscating the method signature) in favor of applying the unified signature with backwards compatibility. I'll be adding further documentation for what these parameters are that I think will solve some of your worries here 😄

@codenamev Okay, I love it, and I'm on board 🚢 🚀

(A proper docs website is desperately needed 😅) but we'd probably still want to catalogue all of the possible values that could be passed under the params={} hash here so that it shows up in the rubydoc as well.

andreibondarev · 2024-05-01T15:35:22Z

@codenamev I'm thinking that maybe this PR targets a feature branch unified-parameters and until we've converted all of the other LLMs we'd hold back rolling this out. What do you think?

codenamev · 2024-05-01T22:59:20Z

@codenamev I'm thinking that maybe this PR targets a feature branch unified-parameters and until we've converted all of the other LLMs we'd hold back rolling this out. What do you think?

Is there a branch named that, or are you just saying we should do all or nothing? Heh, I meant to submit a Draft PR here just showcasing how it would work with the most common LLM (OpenAI). If we can agree on the abstraction and new parameter flow, I'd be happy to run through all the existing LLMs and update them to mimic the same behavior :D

andreibondarev · 2024-05-02T02:02:49Z

@codenamev I'm thinking that maybe this PR targets a feature branch unified-parameters and until we've converted all of the other LLMs we'd hold back rolling this out. What do you think?

Is there a branch named that, or are you just saying we should do all or nothing? Heh, I meant to submit a Draft PR here just showcasing how it would work with the most common LLM (OpenAI). If we can agree on the abstraction and new parameter flow, I'd be happy to run through all the existing LLMs and update them to mimic the same behavior :D

Yeah, I just meant that we should do all or nothing. I can cut a new origin/unified-parameters branch if you think it would be helpful.

lib/langchain/llm/unified_parameters.rb

… of UnifiedParameters

… variable

…ers and allow instantiation

…LM::Base instance

…nment variable

…parameters

…on the LLM::Base instance" This reverts commit 3a20278796fce6f1649e6425e70822e49eb16856.

…filter when empty

… alias for stop_sequences

…sing

lib/langchain/llm/base.rb

… when overrides are provided

andreibondarev · 2024-05-10T17:05:46Z

@codenamev I just noticed that the response from Ollama comes back serialized:

irb(main):005> llm.chat messages: [{role: "user", content: "Hey"}], stop: ["ello"]

=>
#<Langchain::LLM::OllamaResponse:0x00000001299d6118
 @model="llama3",
 @prompt_tokens=nil,
 @raw_response=
  "{\"model\":\"llama3\",\"created_at\":\"2024-05-10T16:54:49.041769Z\",\"message\":{\"role\":\"assistant\",\"content\":\"Hey\"},\"done\":false}\n{\"model\":\"llama3\",\"created_at\":\"2024-05-10T16:54:49.062254Z\",\"message\":{\"role\":\"assistant\",\"content\":\"!\"},\"done\":false}\n{\"model\":\"llama3\",\"created_at\":\"2024-05-10T16:54:49.082952Z\",\"message\":{\"role\":\"assistant\",\"content\":\" How\"},\"done\":false}\n{\"model\":\"llama3\",\"created_at\":\"2024-05-10T16:54:49.103971Z\",\"message\":{\"role\":\"assistant\",\"content\":\"'s\"},\"done\":false}\n{\"model\":\"ll
  ...

This is on main branch:

irb(main):003> llm.chat messages: [{role: "user", content: "Hey"}]
=>
#<Langchain::LLM::OllamaResponse:0x00000001255f63d0
 @model="llama3",
 @prompt_tokens=nil,
 @raw_response=
  {"model"=>"llama3",
   "created_at"=>"2024-05-10T16:58:46.122462Z",
   "message"=>{"role"=>"assistant", "content"=>"Hey! How's it going?"},

Would you happen to know why this was happening?

andreibondarev · 2024-05-10T21:16:41Z

@codenamev Tremendous effort, great job!

drnic · 2024-05-10T23:46:16Z

Huge effort!

andreibondarev self-requested a review May 1, 2024 11:27

andreibondarev reviewed May 1, 2024

View reviewed changes

lib/langchain/llm/openai.rb Outdated Show resolved Hide resolved

andreibondarev reviewed May 1, 2024

View reviewed changes

lib/langchain/llm/openai.rb Show resolved Hide resolved

andreibondarev reviewed May 1, 2024

View reviewed changes

andreibondarev reviewed May 2, 2024

View reviewed changes

lib/langchain/llm/unified_parameters.rb Show resolved Hide resolved

codenamev added 21 commits May 8, 2024 09:12

Adds Langchain::LLM::UnifiedParameters to prepare for unifying LLM APIs

49062b9

Adds Langchain::LLM::Parameters::Chat to unify LLM chat parameters

14f6ef6

Adds Langchain::LLM::UnifiedParameters::Null for null object handling…

883ce49

… of UnifiedParameters

Refactor LLM::Parameters::Chat schema to be constant instead of class…

a4d9021

… variable

Refactor LLM::Parameters::Chat to SimpleDelegator the unified paramet…

b186c77

…ers and allow instantiation

Make UnifiedParameters enumerable

afff79d

Allow UnifiedParameters::Null to accept matching kwargs

0414257

Add parameters_for to LLM::Base

fa26558

Add 'model' to Parameters::Chat schema

371b0d9

Allow UnifiedParameters to amend schema and aliases

bc57642

Refactor UnifiedParameters to accept defaults and aliases per-field

f3ea800

Remove Langchain::LLM::Parameters::Chat in favor of defining on the L…

49e1224

…LM::Base instance

Rename UnifiedParameters#amend_schema to update

d68f231

Refactor Langchain::LLM::OpenAI to use unified chat_parameters

34c86da

Intriduce a way to ignore certain unified parameter fields

fc26a4d

Ignore unified top_k for OpenAI chat requests

d17cb25

Ensure UnifiedParameters#to_params rebuilds the parameters on each call

d134674

Allow OpenAI#chat to validate nil messages and model parameters

014daa1

Update bin/console to be able to run without an OPENAI_API_KEY enviro…

fc8babd

…nment variable

Ensure LLM::OpenAI initializer proxies the default model to the chat_…

1f735a9

…parameters

Revert "Remove Langchain::LLM::Parameters::Chat in favor of defining …

82de1b4

…on the LLM::Base instance" This reverts commit 3a20278796fce6f1649e6425e70822e49eb16856.

codenamev added 10 commits May 8, 2024 09:12

Ensure aliased fields in UnifiedParameters properly set defaults and …

68d4cd9

…filter when empty

Update Parameters::Chat tools field to default to '[]'

174fcc0

Update LLM::Anthropic#chat to use new unified LLM::Parameters::Chat

f3e23b6

Update LLM::AwsBedrock#chat to use new unified LLM::Parameters::Chat

9e5d0fa

Update LLM::Azure#chat to use new unified LLM::Parameters::Chat

b1523b4

Add the ability to re-map UnifiedParameters fields to different names

0cad3c2

Update LLM::MistralAI#chat to use new unified LLM::Parameters::Chat

ecf155d

Refactor LLM::Anthropic#chat to use UnifiedParamters#remap instead of…

fff5eff

… alias for stop_sequences

Refactor LLM::AwsBedrock#chat to remap stop_sequences instead of alia…

b832e14

…sing

Update LLM::Ollama#chat to use new unified LLM::Parameters::Chat

98f0884

codenamev force-pushed the vs-unified-parameters branch from 4f8548a to 98f0884 Compare May 9, 2024 14:32

andreibondarev self-requested a review May 9, 2024 17:58

andreibondarev reviewed May 9, 2024

View reviewed changes

lib/langchain/llm/base.rb Show resolved Hide resolved

Ensure UnifiedParameters don't cache previous parameters in to_params…

1771d67

… when overrides are provided

andreibondarev self-requested a review May 10, 2024 13:59

codenamev added 2 commits May 10, 2024 11:08

Ensure Parameters::Chat does not cache the schema across instances

0f24945

Update documenation of LLM sub-class #chat to reflect signature changes

5279739

codenamev added 3 commits May 10, 2024 15:23

Fix linter issues with Loader and MistralAI spec

91a493f

Resolve linter autofix issues with Langchain::Loader

d8bab30

Only allow :default key in UnifiedParameter schema for simplicity

55ba1c4

andreibondarev merged commit b8169dd into patterns-ai-core:main May 10, 2024
5 checks passed

codenamev deleted the vs-unified-parameters branch May 10, 2024 23:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce UnifiedParameters #600

Introduce UnifiedParameters #600

codenamev commented Apr 30, 2024

andreibondarev commented May 1, 2024

andreibondarev May 1, 2024

codenamev May 1, 2024

andreibondarev May 2, 2024 •

edited

codenamev May 6, 2024

andreibondarev May 6, 2024

andreibondarev May 6, 2024 •

edited

andreibondarev commented May 1, 2024

codenamev commented May 1, 2024

andreibondarev commented May 2, 2024

andreibondarev commented May 10, 2024 •

edited

andreibondarev commented May 10, 2024

drnic commented May 10, 2024

Introduce UnifiedParameters #600

Introduce UnifiedParameters #600

Conversation

codenamev commented Apr 30, 2024

andreibondarev commented May 1, 2024

andreibondarev May 1, 2024

Choose a reason for hiding this comment

codenamev May 1, 2024

Choose a reason for hiding this comment

andreibondarev May 2, 2024 • edited

Choose a reason for hiding this comment

codenamev May 6, 2024

Choose a reason for hiding this comment

andreibondarev May 6, 2024

Choose a reason for hiding this comment

andreibondarev May 6, 2024 • edited

Choose a reason for hiding this comment

andreibondarev commented May 1, 2024

codenamev commented May 1, 2024

andreibondarev commented May 2, 2024

andreibondarev commented May 10, 2024 • edited

andreibondarev commented May 10, 2024

drnic commented May 10, 2024

andreibondarev May 2, 2024 •

edited

andreibondarev May 6, 2024 •

edited

andreibondarev commented May 10, 2024 •

edited