AdaWorldAPI · 2026-03-30T00:01:46Z

No description provided.

Reads GGUF tensor-by-tensor via seek, projects each weight matrix to Base17 via golden-step averaging, writes compressed output. Peak RAM = one tensor + buffers, regardless of model size. Supports: Attention, FFN, Conv2D, Embedding layer classification. Conv2D [out_ch, in_ch, kH, kW] reshaped to out_ch vectors of kernel_dim. 14 tests: classification, projection, reshape, end-to-end synthetic GGUF. https://claude.ai/code/session_01Y69Vnw751w75iVSBRws7o7

Fix signed arithmetic overflow in f16_to_f32 for subnormal exponents. Add integration test that streams OpenChat 3.5 Q8_0 (7.7 GB) through the bgz17 indexer → 42.6 MB output (679× overall compression). Results: Attention 328×, FeedForward 920×, Embedding 3765×. Peak RAM: 524 MB. Time: 185s. 226 tensors indexed, 65 skipped. https://claude.ai/code/session_01Y69Vnw751w75iVSBRws7o7

claude added 2 commits March 29, 2026 23:51

AdaWorldAPI merged commit ba95b4e into master Mar 30, 2026
4 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AdaWorldAPI commented Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AdaWorldAPI commented Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants