@cpetersen (Owner) commented Sep 8, 2025

What this does

This PR adds support for the Red Candle provider, enabling local LLM execution using quantized GGUF models directly in Ruby without requiring external API calls.

Key Implementation Details

Red Candle is fundamentally different from other providers: while all other RubyLLM providers communicate via HTTP APIs, Red Candle runs models locally using the Candle Rust crate. This brings true local inference to Ruby, with no network latency or API costs.
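
To make the difference concrete, here is an illustrative sketch of the provider's shape. These internals are assumptions, not the actual implementation: the point is only that where other providers serialize messages into an HTTP request, Red Candle invokes the model object in-process.

```ruby
# Illustrative sketch only — these internals are assumed, not the real code.
module RubyLLM
  module Providers
    module RedCandle
      module_function

      # Where other providers would POST to an API endpoint, Red Candle
      # calls the locally loaded GGUF model via the Candle bindings.
      def complete(prompt, model:)
        model.generate(prompt) # in-process inference: no network round trip
      end
    end
  end
end
```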

Dependency Management

Since Red Candle requires a Rust toolchain at build time, we've made it optional at two levels:

  • For end users: `red-candle` is NOT a gemspec dependency. Users must explicitly add `gem 'red-candle'` to their Gemfile to use this provider.
  • For contributors: We've added an optional Bundler group so developers can work on RubyLLM without installing Rust. Enable it with `bundle config set --local with red_candle` (see the guard sketch just below).
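
The conditional loading boils down to a guard like the following. This is a simplified sketch of the pattern, not verbatim code from `ruby_llm.rb`:

```ruby
# Simplified sketch of the optional-dependency guard (require name assumed):
begin
  require 'red-candle'
  require 'ruby_llm/providers/red_candle' # hypothetical load path
rescue LoadError
  # red-candle isn't in the bundle; the provider simply isn't registered.
end
```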

Testing Strategy

We implemented a comprehensive mocking system to keep tests fast:

  • Stubbed mode (default): uses `MockCandleModel` to simulate responses without actual inference
  • Real inference mode: set `RED_CANDLE_REAL_INFERENCE=true` to run actual model inference (downloads models on first run, ~4.5 GB)
  • Not installed mode: tests skip gracefully when Red Candle isn't available
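
Roughly, the spec setup picks between these three modes like so. Constant and check names here are illustrative, not the helper's actual code:

```ruby
# Sketch of mode selection in the spec setup (names are assumptions):
RED_CANDLE_MODE =
  if !Gem.loaded_specs.key?('red-candle')
    :not_installed # Red Candle examples are skipped
  elsif ENV['RED_CANDLE_REAL_INFERENCE'] == 'true'
    :real          # actual inference; downloads models on first run
  else
    :stubbed       # MockCandleModel simulates responses
  end
```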

Changes Made

  • Added `RubyLLM::Providers::RedCandle` with full chat support including streaming
  • Implemented model management with automatic GGUF file downloads from Hugging Face
  • Created comprehensive test mocks in `red_candle_test_helper.rb`
  • Added conditional loading in `ruby_llm.rb` and `spec_helper.rb` to handle the optional dependency
  • Updated `models_to_test.rb` to conditionally include Red Candle models
  • Added documentation in `CONTRIBUTING.md` for managing the optional dependency
  • Implemented proper `Content` object handling for structured responses

How to Test

# Test without Red Candle (default for new contributors)
bundle install
bundle exec rspec  # Red Candle tests will be skipped

# Test with Red Candle stubbed (fast)
bundle config set --local with red_candle
bundle install
bundle exec rspec  # Uses mocked responses

# Test with real inference (slow, downloads models)
bundle config set --local with red_candle
bundle install
huggingface-cli login # Make sure to accept the Mistral model terms
RED_CANDLE_REAL_INFERENCE=true bundle exec rspec

Once red-candle is enabled, turn it back off with:

bundle config unset with

and re-enable it with:

bundle config set --local with red_candle

Try it out

bundle exec irb
require 'ruby_llm'

chat = RubyLLM.chat(
  provider: :red_candle,
  model: 'Qwen/Qwen2.5-1.5B-Instruct-GGUF' # 'TheBloke/Mistral-7B-Instruct-v0.2-GGUF' is another option
)
response = chat.ask("What are the benefits of functional programming?")
puts response.content
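
Since the provider supports streaming, you can also use RubyLLM's standard block form to print tokens as the local model generates them:

```ruby
chat.ask("Summarize those benefits in one sentence") do |chunk|
  print chunk.content # chunks arrive incrementally from the local model
end
```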

Type of change

  • Bug fix
  • New feature
  • Breaking change
  • Documentation
  • Performance improvement

Scope check

  • I read the Contributing Guide
  • This aligns with RubyLLM's focus on LLM communication
  • This isn't application-specific logic that belongs in user code
  • This benefits most users, not just my specific use case

Quality check

  • I ran overcommit --install and all hooks pass
  • I tested my changes thoroughly
    • For provider changes: Re-recorded VCR cassettes with bundle exec rake vcr:record[provider_name]
    • All tests pass: bundle exec rspec
  • I updated documentation if needed
  • I didn't modify auto-generated files manually (models.json, aliases.json)

API changes

  • Breaking change
  • New public methods/classes
  • Changed method signatures
  • No API changes

Related issues

Fixes crmne#394

crmne and others added 11 commits September 9, 2025 20:41
Major improvements to Rails integration:
- New acts_as API using association names instead of class names
- Rails-like generator syntax: `rails g ruby_llm:install chat:ChatName message:MessageName`
- Model registry always included (removed skip_model_registry option)
- Clear upgrade path with use_new_acts_as configuration option

Breaking changes managed through configuration:
- Legacy mode (default) maintains backward compatibility
- New mode enabled via `config.use_new_acts_as = true`
- Legacy mode will be deprecated in v2.0

Key improvements:
- More intuitive Rails-like DSL
- Better association naming conventions
- Simplified generator interface
- Cleaner configuration approach
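A hedged sketch of the opt-in described above; the configuration flag is quoted from this changelog, while the model macro's exact options are assumptions:

```ruby
# config/initializers/ruby_llm.rb — opt in to the new acts_as API
RubyLLM.configure do |config|
  config.use_new_acts_as = true # legacy mode stays the default until v2.0
end

# app/models/chat.rb — the new DSL references associations by name
class Chat < ApplicationRecord
  acts_as_chat # exact options omitted; see the generator output
end
```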
Replace options[:*_model_name] with instance method calls in all
generator templates. This fixes the undefined method 'pluralize'
error when running rails g ruby_llm:install.

Also removes unused legacy migration templates and updates tests
to match the new template format.
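Illustratively, the fix swaps direct options access in templates for generator instance methods. The names below are hypothetical, not the actual diff:

```ruby
# Before: templates read the raw option, e.g.
# options[:message_model_name].pluralize, which raised
# "undefined method 'pluralize'". After: templates call an instance method
# on the generator that does the processing:
def message_table_name
  options[:message_model_name].tableize # e.g. "Message" => "messages"
end
```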
Creates a complete chat interface with:
- Chat and message controllers following Rails conventions
- Simple HTML views for chat list, creation, and messaging
- Model selector in new chat form
- Models index page showing available AI models
- Background job for streaming AI responses
- Turbo Stream integration for real-time message updates
- Automatic broadcasting from Message model
- Clean, simple controller methods

The generator creates a working chat UI that can be customized
while maintaining Rails best practices and simplicity.
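The broadcasting piece reduces to a callback along these lines — a simplified sketch; the generated code may differ:

```ruby
# Sketch: push each newly created message onto the chat's Turbo Stream
class Message < ApplicationRecord
  belongs_to :chat

  after_create_commit -> { broadcast_append_to chat }
end
```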
@orangewolf (Collaborator) left a comment:
this looks great!

crmne and others added 30 commits September 14, 2025 11:14
## What this does

Adds gpt-5, gpt-5-mini, and gpt-5-nano capabilities.

I tried to run `overcommit`, but it updated more files than I expected, so I'm not sure it's still used on every commit. I did run RuboCop and the tests.

## Type of change

- [x] Bug fix
- [ ] New feature
- [ ] Breaking change
- [ ] Documentation
- [ ] Performance improvement

## Scope check

- [x] I read the [Contributing
Guide](https://github.com/crmne/ruby_llm/blob/main/CONTRIBUTING.md)
- [x] This aligns with RubyLLM's focus on **LLM communication**
- [x] This isn't application-specific logic that belongs in user code
- [x] This benefits most users, not just my specific use case

## Quality check

- [ ] I ran `overcommit --install` and all hooks pass
- [x] I tested my changes thoroughly
- [x] I updated documentation if needed
- [x] I didn't modify auto-generated files manually (`models.json`,
`aliases.json`)

## API changes

- [ ] Breaking change
- [ ] New public methods/classes
- [ ] Changed method signatures
- [x] No API changes

Co-authored-by: Carmine Paolino <carmine@paolino.me>
Implemented efficient streaming for the chat UI generator that appends chunks without re-transmitting entire messages. The solution uses `broadcast_append_chunk` to append individual chunks to message content, reducing bandwidth usage. Only one Turbo Stream subscription is maintained at the chat level, avoiding multiple subscriptions per message.
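A sketch of what `broadcast_append_chunk` might look like, built on turbo-rails' `broadcast_append_to`. The method body and target naming are assumptions based on the description above:

```ruby
class Message < ApplicationRecord
  belongs_to :chat

  # Append a single streamed chunk to this message's content element,
  # rather than re-rendering (and re-transmitting) the whole message.
  def broadcast_append_chunk(chunk_content)
    broadcast_append_to chat,                      # one chat-level stream
                        target: "message_#{id}_content",
                        html: chunk_content
  end
end
```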
## Summary
- Added visualization of tool calls in the chat UI message partial
- Tool calls are displayed with function name and arguments in JSON
format
- Styled with monospace font and gray background for better readability
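
The partial addition is roughly this shape — a hypothetical sketch, where attribute names like `name` and `arguments` are assumptions:

```erb
<%# Hypothetical sketch of the message partial addition %>
<% if message.tool_calls.any? %>
  <% message.tool_calls.each do |tool_call| %>
    <pre class="tool-call"><%= tool_call.name %>
<%= JSON.pretty_generate(tool_call.arguments) %></pre>
  <% end %>
<% end %>
```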

## Test plan
- [ ] Generate a chat UI with the updated template
- [ ] Verify tool calls are displayed correctly in messages
- [ ] Check that messages without tool calls render normally

![CleanShot 2025-09-21 at 22 21 23@2x](https://github.com/user-attachments/assets/058c0923-4081-4399-96c0-4a4e025f7244)

🤖 Generated with [Claude Code](https://claude.ai/code)

---------

Co-authored-by: Claude <noreply@anthropic.com>
…#429)

## What this does
It updates the Faraday middleware to explicitly use the `:net_http` adapter instead of whatever adapter the environment happens to default to.
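
Concretely, the connection setup pins the adapter rather than relying on `Faraday.default_adapter`. A simplified sketch, where `api_base` and the middleware stack are assumptions:

```ruby
connection = Faraday.new(url: api_base) do |f|
  f.request :json
  f.response :json
  f.adapter :net_http # pinned explicitly, regardless of the environment default
end
```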

## Type of change

- [x] Bug fix
- [ ] New feature
- [ ] Breaking change
- [ ] Documentation
- [ ] Performance improvement

## Scope check

- [x] I read the [Contributing
Guide](https://github.com/crmne/ruby_llm/blob/main/CONTRIBUTING.md)
- [x] This aligns with RubyLLM's focus on **LLM communication**
- [x] This isn't application-specific logic that belongs in user code
- [x] This benefits most users, not just my specific use case

## Quality check

- [x] I ran `overcommit --install` and all hooks pass
- [x] I tested my changes thoroughly
  - [ ] For provider changes: Re-recorded VCR cassettes with `bundle exec rake vcr:record[provider_name]`
  - [x] All tests pass: `bundle exec rspec`
- [ ] I updated documentation if needed
- [x] I didn't modify auto-generated files manually (`models.json`,
`aliases.json`)

## API changes

- [ ] Breaking change
- [ ] New public methods/classes
- [ ] Changed method signatures
- [x] No API changes

## Related issues
Fixes crmne#428
Easier than trying to force it