Shared LegionIO framework for LLM provider extensions.
lex-llm is a standard Legion extension gem. It does not expose a standalone RubyLLM-compatible API, Rails integration, generators, rake tasks, or concrete providers. Its runtime contract is Legion::Extensions::Llm, which provider gems extend through nested namespaces such as Legion::Extensions::Llm::Ollama.
The routing principle is simple: the provider is no longer the routing unit. A concrete model offering is.
That lets Legion reason about one local Ollama instance with many models, multiple remote Ollama or vLLM instances, Bedrock accounts in different regions, direct frontier providers, and fleet workers on MacBooks, GPU servers, or cloud-side proxy nodes.
lex-llm provides provider-neutral primitives only. Provider-specific behavior belongs in provider gems.
This gem owns:
- `Legion::Extensions::Llm`, the Legion extension namespace used by autoloading and settings
- provider-neutral request, response, message, content, token, and tool objects
- schema bridging through `Legion::Extensions::Llm::Schema`
- model metadata and capability normalization
- routing structures such as `Legion::Extensions::Llm::Routing::ModelOffering`
- fleet lane key generation for shared RabbitMQ work lanes
- shared chat, embedding, moderation, image, transcription, streaming, and OpenAI-compatible adapter helpers
- shared runtime dependencies such as `legion-json`, `legion-settings`, and `legion-logging`
Concrete provider gems should depend on this gem and implement the provider-specific transport, authentication, model discovery, request translation, response translation, and health checks.
Expected provider gems include:
- lex-llm-ollama
- lex-llm-vllm
- lex-llm-anthropic
- lex-llm-openai
- lex-llm-gemini
- lex-llm-mlx
- lex-llm-bedrock
- lex-llm-vertex
- lex-llm-azure
Add lex-llm to the Gemfile:

```ruby
gem 'lex-llm'
```

Provider extensions should declare lex-llm as a gemspec dependency:

```ruby
spec.add_dependency 'lex-llm', '>= 0.1.0'
```

For local development across LegionIO repos, prefer a local path override in the app or test Gemfile, not a permanent git dependency in the gemspec.
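For example, a minimal sketch of such an override, assuming a sibling checkout (the path is illustrative):

```ruby
# Local development only; do not ship a path or git dependency in the gemspec.
gem 'lex-llm', path: '../lex-llm'
```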
Load the extension through the Legion namespace:
```ruby
require 'legion/extensions/llm'
```

Provider gems must use nested Legion extension namespaces so LegionIO autoloading can find them consistently.
Example for lex-llm-ollama:
```ruby
require 'legion/extensions/llm'

module Legion
  module Extensions
    module Llm
      module Ollama
        def self.default_settings
          Legion::Extensions::Llm.provider_settings(
            family: :ollama,
            instance: { base_url: 'http://localhost:11434' }
          )
        end
      end
    end
  end
end
```

A model offering describes one concrete model made available by one provider instance. It is the base unit for routing, filtering, fleet lane creation, health, policy, and cost decisions.
```ruby
offering = Legion::Extensions::Llm::Routing::ModelOffering.new(
  provider_family: :ollama,
  instance_id: :macbook_m4_max,
  transport: :local,
  tier: :local,
  model: 'qwen3.6:27b-q4_K_M',
  usage_type: :inference,
  capabilities: %i[chat tools vision thinking],
  limits: {
    context_window: 32_768,
    max_output_tokens: 8_192
  },
  health: {
    ready: true,
    latency_ms: 180
  },
  policy_tags: %i[internal_only phi_allowed],
  metadata: {
    enabled: true,
    eligibility: {
      ac_power: true
    }
  }
)

offering.eligible_for?(
  usage_type: :inference,
  required_capabilities: %i[tools],
  min_context_window: 16_000,
  policy_tags: %i[internal_only]
)
# => true
```

Common offering fields:
- `provider_family`: provider implementation family, such as `:ollama`, `:vllm`, `:bedrock`, `:anthropic`, or `:openai`
- `instance_id`: concrete provider instance, account, node, region, or local runtime
- `transport`: `:local`, `:http`, `:rabbitmq`, `:sdk`, or another provider-supported transport
- `tier`: `:local`, `:private`, `:fleet`, `:cloud`, `:frontier`, or a deployment-specific policy tier
- `model`: provider model name or normalized model alias
- `usage_type`: `:inference` or `:embedding`
- `capabilities`: normalized feature flags such as `:chat`, `:tools`, `:json_schema`, `:vision`, `:thinking`, or `:embedding`
- `limits`: context window, output token limits, rate limits, concurrency limits, and provider-specific bounds
- `health`: readiness, latency, recent failures, and provider-specific health metadata
- `policy_tags`: routing and compliance tags such as `:internal_only`, `:phi_allowed`, or `:hipaa`
- `metadata`: extension-specific metadata; sensitive values are excluded from fleet eligibility fingerprints
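As a sketch of how these fields combine at routing time, assuming the host application has built an `offerings` array of `ModelOffering` instances (the collection and its filtering policy are illustrative, not a lex-llm API):

```ruby
# Pick the first offering that can serve a tools-capable inference request.
candidate = offerings.find do |offering|
  offering.eligible_for?(
    usage_type: :inference,
    required_capabilities: %i[tools json_schema],
    min_context_window: 16_000,
    policy_tags: %i[internal_only]
  )
end
```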
Fleet routing uses shared work lanes derived from model offerings. A lane describes the work required, not the worker that happens to do it.
```ruby
offering.lane_key
# => "llm.fleet.inference.qwen3-6-27b-q4-k-m.ctx32768"
```

Embedding lanes omit context size:
```ruby
Legion::Extensions::Llm::Routing::ModelOffering.new(
  provider_family: :ollama,
  instance_id: :gpu_embed_01,
  transport: :rabbitmq,
  model: 'nomic-embed-text',
  usage_type: :embedding,
  capabilities: %i[embedding]
).lane_key
# => "llm.fleet.embed.nomic-embed-text"
```

The intent is that any eligible worker can bind to the same lane:
- local MacBook workers
- GPU servers in a datacenter
- vLLM workers
- Ollama workers
- cloud-side LegionIO workers near Bedrock, Vertex, Azure, or another provider
Busy endpoint workers should not reject and requeue messages in a hot loop. Endpoint fleet workers can instead use pull-style scheduling, while server-class workers can use normal consumers with prefetch and consumer priority.
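A minimal pull-style loop sketch, assuming Bunny for RabbitMQ access; the lane name and the `handle` method are illustrative, and lex-llm does not prescribe this exact loop:

```ruby
require 'bunny'

connection = Bunny.new.tap(&:start)
channel = connection.create_channel
lane = 'llm.fleet.inference.qwen3-6-27b-q4-k-m.ctx32768'

loop do
  # basic_get pulls one message at a time instead of holding a consumer open.
  delivery_info, _properties, payload = channel.basic_get(lane, manual_ack: true)
  if payload.nil?
    sleep 0.25 # mirrors the empty_lane_backoff_ms: 250 default
    next
  end
  handle(payload) # hypothetical application-level handler
  channel.ack(delivery_info.delivery_tag)
end
```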
Legion::Extensions::Llm.default_settings provides defaults that provider extensions inherit and override:
```ruby
Legion::Extensions::Llm.default_settings
# => {
#   fleet: {
#     enabled: false,
#     scheduler: :basic_get,
#     consumer_priority: 0,
#     queue_expires_ms: 60_000,
#     message_ttl_ms: 120_000,
#     queue_max_length: 100,
#     delivery_limit: 3,
#     consumer_ack_timeout_ms: 300_000,
#     endpoint: {
#       enabled: false,
#       empty_lane_backoff_ms: 250,
#       idle_backoff_ms: 1_000,
#       max_consecutive_pulls_per_lane: 0,
#       accept_when: []
#     }
#   }
# }
```

The defaults are conservative:
- fleet participation is off unless configured
- endpoint fleet mode is separately disabled by default
- queue and message TTLs are bounded
- pull scheduling is the default for endpoint-style workers
- provider gems can override defaults through `Legion::Settings`
Provider gems can build a complete provider settings hash without duplicating merge logic:
```ruby
Legion::Extensions::Llm.provider_settings(
  family: :ollama,
  instance: {
    base_url: 'http://localhost:11434',
    fleet: { enabled: true, consumer_priority: 10 }
  }
)
```

A provider gem should use lex-llm for shared behavior and implement only the provider-specific pieces.
At minimum, a provider extension should define the following (a sketch follows the list):

- the `Legion::Extensions::Llm::<Provider>` namespace
- provider default settings
- model discovery or a static model offering registry
- provider request translation
- provider response translation
- health and readiness checks
- embedding support separately from inference support when the provider exposes both
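An illustrative skeleton under those constraints; the `offerings` and `healthy?` methods, the model name, and the port are hypothetical, not a lex-llm contract:

```ruby
require 'legion/extensions/llm'

module Legion
  module Extensions
    module Llm
      module Vllm
        def self.default_settings
          Legion::Extensions::Llm.provider_settings(
            family: :vllm,
            instance: { base_url: 'http://localhost:8000' }
          )
        end

        # Static offering registry standing in for live model discovery.
        def self.offerings
          [
            Routing::ModelOffering.new(
              provider_family: :vllm,
              instance_id: :gpu_node_01,
              transport: :http,
              model: 'qwen3-27b',
              usage_type: :inference,
              capabilities: %i[chat tools]
            )
          ]
        end

        # Hypothetical readiness probe; a real gem would query the instance.
        def self.healthy?
          true
        end
      end
    end
  end
end
```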
Provider extensions should avoid duplicating shared classes, schema logic, fleet lane construction, JSON handling, or common request/response objects.
lex-llm still depends on ruby_llm-schema because the current schema bridge exposes `Legion::Extensions::Llm::Schema` as `RubyLLM::Schema`. That dependency should stay until LegionIO owns or replaces the schema layer directly.
Install dependencies:

```bash
bundle install
```

Run lint:

```bash
bundle exec rubocop -A
```

Run the full test suite:

```bash
bundle exec rspec --format json --out tmp/rspec_results.json --format progress --out tmp/rspec_progress.txt
```

Gemfile.lock is intentionally not committed for this repo.
lex-llm began as a LegionIO fork of RubyLLM. RubyLLM remains credited under the MIT license in LICENSE.