Conversation

mathematicalmichael (Contributor) commented Aug 31, 2025

This PR adds a class method from_model to create a LitTool from a Pydantic model, including setup and run methods for validation.

In full transparency, I don't know that this is the "best" way to accomplish the task at hand, but please consider it as one proposal.

Before submitting
  • Was this discussed/agreed via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

Adds a convenience method which allows converting existing classes which inherit from pydantic.BaseModel into LitTool classes.
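
For orientation, the core idea looks roughly like the following standalone sketch (illustrative only: ToolSketch is a stand-in for LitTool, and the merged implementation differs in its details):

from typing import Any, Type
from pydantic import BaseModel

class ToolSketch:
    """Stand-in for LitTool, only to illustrate the from_model pattern."""

    @classmethod
    def from_model(cls, model: Type[BaseModel]) -> "ToolSketch":
        tool = cls()
        tool.name = model.__name__
        tool.schema = model.model_json_schema()  # the schema the LLM is shown

        def run(**kwargs: Any) -> dict:
            # Validation is the whole "tool": construct the model instance,
            # then return a JSON-serializable dict.
            return model(**kwargs).model_dump()

        tool.run = run
        return tool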

Why is this useful?

A lot of use cases stop short of true tool-calling and really just require schema enforcement (e.g., named entity extraction). This method helps create "tools" which do not actually have a function associated with them.

Example:

import os

from litai import LLM, LitTool
from pydantic import BaseModel
from typing import Literal

class RelationshipNode(BaseModel):
    source_entity: str
    target_entity: str
    relation: Literal["consumer", "producer", "partner"]

get_relationship = LitTool.from_model(RelationshipNode)

llm = LLM(
    model="google/gemini-2.5-flash",
    api_key=os.environ.get("LITAI_API_KEY"),
    max_retries=1,
)

response = llm.chat("Michael purchased credits from Lightning AI", tools=[get_relationship], system_prompt=None)
print(response)

# validate that tool can be called
if "function" not in response:
    raise AssertionError("No function call found in response")
# if available, proceed to check ability to call tool (not necessary in practice, just demonstrates compatibility)
result = llm.call_tool(response, tools=[get_relationship])
print(result)

Running the script and its output:

python ../from_model.py
[{"function": {"arguments": "{\"target_entity\":\"Lightning AI\",\"source_entity\":\"Michael\",\"relation\":\"consumer\"}", "name": "RelationshipNode"}}]
source_entity='Michael' target_entity='Lightning AI' relation='consumer'

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues there's a high chance it will not be merged.

Did you have fun?

yes!

Additional Information

What feels wrong to me ergonomics-wise in the example above is that this use case is served by the .chat interface rather than by something dedicated to the purpose.

Open to suggestions. In essence, the goal is just to go FROM text TO JSON that adheres to the Pydantic model.

That question is distinct from the contribution in this PR, though.

Added a class method 'from_model' to create a LitTool from a Pydantic model, including setup and run methods for validation.
Borda changed the title: LitTool.from_model method to create LitTool from Pydantic (Sep 1, 2025)
codecov bot commented Sep 1, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 85%. Comparing base (5c6f65c) to head (8513e6f).
⚠️ Report is 12 commits behind head on main.

Additional details and impacted files
@@         Coverage Diff         @@
##           main   #57    +/-   ##
===================================
  Coverage    84%   85%            
===================================
  Files         8     8            
  Lines       431   549   +118     
===================================
+ Hits        364   465   +101     
- Misses       67    84    +17     

Borda requested a review from bhimrazy on September 3, 2025 06:05
mathematicalmichael (Contributor, Author) commented Sep 6, 2025

This almost feels like it could be part of llm.classify.

mathematicalmichael (Contributor, Author) commented Sep 6, 2025

Upon some further testing, I came across an issue when trying to use llm.call_tool with the tools created from Pydantic models.

That call_tool method declares str as its output type, meaning I have two options to make this PR's implementation compatible:

(1) modify the run method of from_model:

            def run(self, *args, **kwargs) -> Any:  # type: ignore
                # Default implementation: validate & return an instance
                return model(*args, **kwargs).model_dump()  # <-- change here to make it json-serializable
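
To make the serialization issue concrete, here is a minimal standalone check (independent of litai):

import json
from pydantic import BaseModel

class Item(BaseModel):
    name: str

try:
    json.dumps(Item(name="x"))  # a BaseModel instance is not JSON-serializable
except TypeError as err:
    print(err)  # Object of type Item is not JSON serializable

print(json.dumps(Item(name="x").model_dump()))  # {"name": "x"}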

(2) modify call_tool's return contract from Optional[str] to Optional[Union[str, BaseModel, list[BaseModel]]]

    @staticmethod
    def call_tool(
        response: Union[List[dict], dict, str], tools: Optional[Sequence[Union[LitTool, "StructuredTool"]]] = None
    ) -> Optional[Union[str, BaseModel, list[BaseModel]]]:
    ...
        try:
            return json.dumps(results) if len(results) > 1 else results[0]
        except TypeError:
            return results if len(results) > 1 else results[0]

My preference is (2), so that the invocation of call_tool actually returns the Pydantic models (helpful for downstream data ingestion).

However, option (3) is also available: move the logic to a dedicated method such as classify (e.g., predict), which creates a bit of repeated code (that doesn't bother me).

for example:

    def predict(  # noqa: D417
        self,
        prompt: str,
        contracts: Sequence[type[BaseModel]],
        system_prompt: Optional[str] = None,
        model: Optional[str] = None,
        max_tokens: int = 500,
        images: Optional[Union[List[str], str]] = None,
        conversation: Optional[str] = None,
        metadata: Optional[Dict[str, str]] = None,
        stream: bool = False,
        auto_call_tools: bool = False,
        **kwargs: Any,
    ) -> Optional[Union[BaseModel, list[BaseModel]]]:
        """Sends a message to the LLM and retrieves a structured response based on the provided Pydantic models."""
        tools = [LitTool.from_model(c) for c in contracts]
        response = self.chat(
            prompt=prompt,
            system_prompt=system_prompt,
            model=model,
            max_tokens=max_tokens,
            images=images,
            conversation=conversation,
            metadata=metadata,
            stream=stream,
            tools=tools,
            auto_call_tools=auto_call_tools,
            **kwargs,
        )
        # Call tool(s) with the given response.
        if isinstance(response, str):
            try:
                response = json.loads(response)
            except json.JSONDecodeError:
                raise ValueError("Tool response is not a valid JSON string")

        results = []
        if isinstance(response, dict):
            response = [response]

        for tool_response in response:
            if not isinstance(tool_response, dict):
                continue
            tool_name = tool_response.get("function", {}).get("name")
            if not tool_name:
                continue
            tool_args = tool_response.get("function", {}).get("arguments", {})
            if isinstance(tool_args, str):
                try:
                    tool_args = json.loads(tool_args)
                except json.JSONDecodeError:
                    print(f"❌ Failed to parse tool arguments: {tool_args}")
                    return None
            if isinstance(tool_args, dict):
                tool_args = {k: v for k, v in tool_args.items() if v is not None}

            for tool in tools:
                if tool.name == tool_name:
                    results.append(tool.run(**tool_args))

        if len(results) == 0:
            return None

        return results if len(results) > 1 else results[0]

The upside of this is a dedicated method and avoiding the need for the user to call LitTool.from_model explicitly.
(Though I still think I'd like the try/except in call_tool for compatibility.)
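
For concreteness, usage of the proposed predict would look something like this (hypothetical, since the method isn't merged; llm and RelationshipNode are the ones defined in the example from the description):

result = llm.predict(
    "Michael purchased credits from Lightning AI",
    contracts=[RelationshipNode],
)
print(result)
# source_entity='Michael' target_entity='Lightning AI' relation='consumer'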

let me know which path is suitable and I'll push up another commit. @bhimrazy

bhimrazy commented Sep 8, 2025

Hi @mathematicalmichael, thanks for the updates.

I’m a bit unsure about the purpose here — this feels more like structured data extraction than a tool implementation.

Let’s hear what the maintainers think, and you can proceed accordingly.
cc: @k223kim @aniketmaurya

From my perspective, this type of task is usually handled via a response_format parameter or by guiding the model with a system prompt.
Probably something like an llm.extract (or a dedicated API) would be a more natural fit.
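
For reference, the response_format pattern bhimrazy mentions looks like this in, e.g., the OpenAI Python SDK (outside litai; shown only as a point of comparison, and details vary by SDK version):

from openai import OpenAI
from pydantic import BaseModel

class RelationshipNode(BaseModel):
    source_entity: str
    target_entity: str
    relation: str

client = OpenAI()
completion = client.beta.chat.completions.parse(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Michael purchased credits from Lightning AI"}],
    response_format=RelationshipNode,
)
print(completion.choices[0].message.parsed)  # a validated RelationshipNode instance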

mathematicalmichael (Contributor, Author) commented:

That is correct, @bhimrazy.

Structured extraction is the goal; tool use is almost identical under the hood, though.

Semantics aside (i.e., what to call the method), I did want to put the functionality forward (it's 95% of the business use cases I encounter).
I do think predict makes some sense as a method name.

Danidapena (Collaborator) commented Sep 9, 2025

@mathematicalmichael I agree with you—option 2 feels like the best way forward. Option 3 has some interesting points, but it might be a bit harder to maintain.

mathematicalmichael (Contributor, Author) commented:

Re (3): I've been putting option (3) through its paces (hundreds of API calls via llm.predict) on a project (pointing to my predict branch).
In doing so, I found myself calling result.model_dump() on the output Pydantic object for actual downstream use anyway, which suggests that the approach in (2) plus a string-parsing function would work out better than a dedicated llm.predict, and, yes, be simpler to maintain.
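
The "string-parsing function" referred to here could be as small as this hypothetical helper (assuming call_tool returns the result serialized as a JSON string):

import json
from typing import Type, TypeVar
from pydantic import BaseModel

M = TypeVar("M", bound=BaseModel)

def parse_result(result: str, model: Type[M]) -> M:
    # Re-hydrate call_tool's serialized output into a validated model instance.
    return model.model_validate(json.loads(result))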

I'll push an update with (2) shortly. Thank you!

mathematicalmichael and others added 3 commits September 12, 2025 15:09
Updated the run method to return a serialized instance instead of a model instance.
mathematicalmichael (Contributor, Author) commented:

The type checker wasn't happy, so I put forward the solution from option (1). It creates a smaller impact overall, and, as I said, in my practical usage getting the Pydantic object itself is just a means to an end of getting to JSON (-> dataframe -> parquet, or JSON -> JSONL) anyway.

mathematicalmichael (Contributor, Author) commented Oct 24, 2025

@Danidapena I ended up reverting from option 2 in favor of option 1 because (a) the type checker seemed unhappy and it was starting to balloon the magnitude of the change, and (b) in my actual usage*, I found myself always consuming the dictionary version of the result instead of the Pydantic object.

*I have been using a branch based on this one for something, where I tried option (3) in parallel.

k223kim merged commit 25ad3b1 into Lightning-AI:main on Oct 24, 2025
31 checks passed