feat(embeddings): add GGUF quantized model support by donhardman · Pull Request #141 · manticoresoftware/columnar

donhardman · 2026-03-14T12:09:40Z

Add GGUF format detection and Q4_K_M quantization preference
Support Gemma and Llama architectures
Add quantized embedding model implementation
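For readers unfamiliar with the format, the detection and preference steps listed above can be illustrated roughly as follows. This is a minimal sketch, not the code from this PR: the function names `is_gguf`, `gguf_version`, and `pick_gguf_file` are hypothetical, and the file-name matching rule is an assumption. The only format facts relied on are that a GGUF file begins with the four ASCII bytes `GGUF` followed by a 32-bit version number (read here as little-endian, which is an assumption about the file at hand).

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of a GGUF file


def is_gguf(header: bytes) -> bool:
    """Return True if the buffer starts with the GGUF magic bytes."""
    return len(header) >= 4 and header[:4] == GGUF_MAGIC


def gguf_version(header: bytes) -> int:
    """Read the uint32 version that follows the magic (assumed little-endian)."""
    if not is_gguf(header) or len(header) < 8:
        raise ValueError("not a GGUF header")
    return struct.unpack_from("<I", header, 4)[0]


def pick_gguf_file(filenames, preferred="Q4_K_M"):
    """Hypothetical helper: pick a .gguf file, preferring one quantization tag.

    Matching on the file name is an assumption made for illustration only.
    """
    candidates = [f for f in filenames if f.lower().endswith(".gguf")]
    if not candidates:
        return None
    for name in candidates:
        if preferred.lower() in name.lower():
            return name
    # Deterministic fallback when the preferred quantization is absent.
    return sorted(candidates)[0]
```

With a repository listing such as `["model.safetensors", "m-Q8_0.gguf", "m-Q4_K_M.gguf"]`, the helper returns the `Q4_K_M` file and falls back to the first `.gguf` name in sorted order when no such file exists.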

CLAassistant · 2026-03-14T12:09:47Z

All committers have signed the CLA.

github-actions · 2026-03-14T12:38:00Z

Linux debug test results

8 files 8 suites 13m 3s ⏱️
511 tests 487 ✅ 24 💤 0 ❌
525 runs 501 ✅ 24 💤 0 ❌

Results for commit c1019e9.

♻️ This comment has been updated with latest results.

github-actions · 2026-03-14T12:42:05Z

Windows test results

5 files 5 suites 18m 22s ⏱️
491 tests 473 ✅ 18 💤 0 ❌
499 runs 481 ✅ 18 💤 0 ❌

Results for commit c1019e9.

♻️ This comment has been updated with latest results.

github-actions · 2026-03-14T12:42:50Z

Linux release test results

8 files 8 suites 7m 2s ⏱️
511 tests 493 ✅ 18 💤 0 ❌
525 runs 507 ✅ 18 💤 0 ❌

Results for commit c1019e9.

♻️ This comment has been updated with latest results.

sanikolaev · 2026-03-18T06:04:05Z

The related discussion on Telegram is at https://t.me/manticore_chat/7196/21133

feat(embeddings): add GGUF quantized model support

515ba7c

- Add GGUF format detection and Q4_K_M quantization preference
- Support Gemma and Llama architectures
- Add quantized embedding model implementation

donhardman requested a review from sanikolaev March 14, 2026 12:09

sanikolaev approved these changes Mar 14, 2026

donhardman added 3 commits March 18, 2026 18:16

feat(embeddings): add T5 encoder with CLS pooling

6ff10a6

- Support T5 model configurations
- Add comprehensive tests

feat(embeddings): add HuggingFace token support

67396b5

- Enable access to gated models via optional hf_token parameter
- Update all test calls to include token parameter
- Add tests for token authentication
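As a rough illustration of what an optional token parameter usually means for Hub downloads, the sketch below builds request headers and adds a bearer `Authorization` header only when a token is supplied. The function name `hub_headers` and the `User-Agent` value are hypothetical; only the `hf_token` parameter name comes from the commit message, and nothing here is taken from the PR's implementation.

```python
from typing import Dict, Optional


def hub_headers(hf_token: Optional[str] = None) -> Dict[str, str]:
    """Build HTTP headers for a model download (hypothetical helper).

    The Authorization header is added only when a non-empty token is given,
    so public models keep working without credentials.
    """
    headers = {"User-Agent": "embeddings-example"}  # placeholder value
    if hf_token:
        headers["Authorization"] = f"Bearer {hf_token}"
    return headers
```

Keeping the token optional means existing callers that load public models need no credentials, while gated models can be reached by passing a token.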

fix(embeddings): correct T5 CLS pooling to preserve batch dimension

c1019e9

- Fix tensor indexing to maintain batch dimension in T5 embeddings
- Add integration tests for FRIDA and Google embeddinggemma models
- Update test formatting for better readability
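The general idea of CLS-style pooling that keeps the batch dimension can be sketched with plain nested lists. This is an illustration of the concept only: the function names are hypothetical, the "mistake" shown is a generic example and not necessarily the bug fixed in this commit, and the real code presumably operates on tensors and not on Python lists.

```python
def cls_pool(hidden_states):
    """Take the first token's vector from every sequence.

    hidden_states: nested list shaped [batch][seq_len][hidden_size].
    Returns a list shaped [batch][hidden_size], even when batch == 1.
    """
    return [sequence[0] for sequence in hidden_states]


def cls_pool_dropping_batch(hidden_states):
    """Generic mistake for comparison: indexing the batch axis away."""
    return hidden_states[0][0]


# One sequence of two tokens, hidden size 2: shape [1][2][2].
batch_of_one = [[[0.1, 0.2], [0.3, 0.4]]]

pooled = cls_pool(batch_of_one)                   # shape [1][2]
flattened = cls_pool_dropping_batch(batch_of_one)  # shape [2], batch axis lost
```

Downstream code that expects one embedding per input row can iterate over `pooled` safely, whereas `flattened` would be misread as two scalar "rows".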

donhardman merged commit 16fb238 into master Mar 19, 2026
53 checks passed

donhardman merged 4 commits into master from feature/gguf
