Add Model Capabilities Support #77

ilopezluna · 2025-05-20T12:04:02Z

This PR introduces support for specifying model capabilities during packaging, allowing users to define input/output types and tool usage capabilities for their models.

Added new command-line flags to the package command:
--input: Comma-separated list of input types (text, embedding, image, audio, video)
--output: Comma-separated list of output types (text, embedding, image, audio, video)
--tool-usage: Flag to enable tool usage support
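
As a rough illustration of how a comma-separated `--input` or `--output` value could be validated, here is a minimal sketch. The helper name `parseIOTypes` and the `validIOTypes` table are hypothetical and not taken from the PR; only the five type names come from the flag descriptions above.

```go
package main

import (
	"fmt"
	"strings"
)

// validIOTypes holds the type names listed in the flag descriptions.
// The table itself is an assumption made for this sketch.
var validIOTypes = map[string]bool{
	"text": true, "embedding": true, "image": true, "audio": true, "video": true,
}

// parseIOTypes is a hypothetical helper: it splits a comma-separated flag
// value, trims whitespace, lowercases each entry, and rejects unknown types.
func parseIOTypes(raw string) ([]string, error) {
	if strings.TrimSpace(raw) == "" {
		return nil, nil // flag not set: no types declared
	}
	var out []string
	for _, part := range strings.Split(raw, ",") {
		t := strings.ToLower(strings.TrimSpace(part))
		if !validIOTypes[t] {
			return nil, fmt.Errorf("invalid IO type %q", t)
		}
		out = append(out, t)
	}
	return out, nil
}

func main() {
	types, err := parseIOTypes("text, image")
	fmt.Println(types, err)

	_, err = parseIOTypes("text,smell")
	fmt.Println(err)
}
```

Rejecting unknown values at flag-parsing time keeps invalid types out of the packaged config; whether the PR validates at this point or later is not stated in the description.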

Note: my first attempt was to define the capabilities during model instantiation. The benefit of that approach is that the config is created once and never updated.
However, I ultimately discarded it in favor of the current approach, as I believe it's cleaner from an implementation perspective.
Let me know which approach you prefer 🙏


bergwolf · 2025-05-21T08:02:56Z

@ilopezluna We are adding similar fields to the model spec. It borrows your idea of differentiating input and output types, and extends the capabilities with reasoning, knowledge cutoff date, and instruction following. Does the extension make sense to you?

ilopezluna · 2025-05-21T12:03:32Z

> @ilopezluna We are adding similar fields to the model spec. It borrows your idea of differentiating input and output types, and extends the capabilities with reasoning, knowledge cutoff date, and instruction following. Does the extension make sense to you?

Added my feedback in your PR 👍

xenoscopic

LGTM from a business logic perspective, but Emily will probably have more useful feedback.

xenoscopic · 2025-05-27T20:35:13Z

types/config.go

@@ -22,6 +23,13 @@ const (
 	MediaTypeLicense = types.MediaType("application/vnd.docker.ai.license")

 	FormatGGUF = Format("gguf")
+
+	// IOTypeText Valid IO types
+	IOTypeText      = "text"


Just a thought: Would it be worth defining `type IOType string` and then typing the constants? If you defined `func (t IOType) MarshalJSON` and `func (t *IOType) UnmarshalJSON` then you could perform the validation during decoding.

ekcasey · 2025-06-10T18:42:03Z

types/config.go

+// Capabilities describes what the model can do
+type Capabilities struct {
+	IO        IOTypes `json:"io"`
+	ToolUsage bool    `json:"tool_usage"`


I think a *bool might be better here so we can differentiate "doesn't support this capability" from "unknown". This could be important if we add new capabilities and runtimes choose to validate capabilities - older models published before the field addition should be nil rather than false, so runtimes can handle the unknown case differently if desired.
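
To make the three states concrete, here is a small sketch. The struct is a trimmed-down, hypothetical version of the one in the diff (the `IO` field is left out and `omitempty` is an added assumption), and `describe` and `decode` are illustrative helpers, not part of the PR.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Capabilities is a reduced, hypothetical variant of the struct under review.
// ToolUsage is a pointer so that an absent field decodes to nil.
type Capabilities struct {
	ToolUsage *bool `json:"tool_usage,omitempty"`
}

// describe maps the pointer onto the three states discussed in the comment.
func describe(c Capabilities) string {
	switch {
	case c.ToolUsage == nil:
		return "unknown"
	case *c.ToolUsage:
		return "supported"
	default:
		return "unsupported"
	}
}

// decode parses a JSON config fragment into Capabilities.
func decode(raw string) (Capabilities, error) {
	var c Capabilities
	err := json.Unmarshal([]byte(raw), &c)
	return c, err
}

func main() {
	for _, raw := range []string{`{}`, `{"tool_usage":false}`, `{"tool_usage":true}`} {
		c, err := decode(raw)
		if err != nil {
			panic(err)
		}
		fmt.Printf("%s -> %s\n", raw, describe(c))
	}
}
```

With a plain `bool`, the first two inputs would be indistinguishable after decoding, which is the case the comment is concerned about for models published before the field existed.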

ekcasey · 2025-06-10T19:13:33Z

types/config.go

+
+// Capabilities describes what the model can do
+type Capabilities struct {
+	IO        IOTypes `json:"io"`


nit: instead of io maybe modalities? I like the grouping, but io isn't really adding additional context when the two fields are input and output, whereas modalities communicates what exactly about the input and output is being described.

wdyt @bergwolf? Since we are aiming for convergence, consensus is probably more important than the exact keys here. I am also okay with input_types and output_types (without the additional layer of nesting) like in modelpack/model-spec#52 if there is a strong preference


ilopezluna added 6 commits May 20, 2025 13:31

Add Capabilities field to config

9236ace

Update README and fix Makefile

72c52fd

Fix tests

c83e585

Adds default capabilities

5c306cf

Add builder.WithCapabilities instead of adding capabilities when the config is created

0ba0013

Add test for builder.WithCapabilities

03b2769

ilopezluna marked this pull request as ready for review May 20, 2025 14:35

ilopezluna requested a review from a team May 20, 2025 14:35

xenoscopic approved these changes May 27, 2025

ilopezluna requested a review from ekcasey May 29, 2025 10:25

ekcasey reviewed Jun 10, 2025

ilopezluna added 2 commits June 11, 2025 11:33

Merge branch 'main' into add-model-capabilities

86c30a2
