
chore(deps): bump skainet to 0.23.0 #101

Merged
michalharakal merged 1 commit into develop from chore/bump-skainet-0.23.0 on May 2, 2026
Conversation

@michalharakal
Contributor

Summary

Picks up two SKaiNET 0.23.0 changes that benefit this repo's consumers without any source changes here:

  • K/N createRandomAccessSource now has a POSIX-pread implementation (SKaiNET#591). K/N consumers can now load GGUFs above the ~2 GiB Kotlin ByteArray ceiling. The randomAccessProvider overloads on *NetworkLoader.fromGguf(...) and the fromRandomAccessSource(...) variant on GGUFTokenizer already exist in this repo; before 0.23.0 they could not actually open a file on macOS / Linux / iOS native targets. Verified end-to-end against Qwen3-1.7B-Q8_0.gguf (1.8 GiB) on macOS arm64: previously OOMed at construction; now loads.
  • Lazy zero-init for parameter placeholders (SKaiNET#588). Ends real-model load-time OOMs in the DSL builders that the network loaders in this repo sit on top of (Apertus-8B Q4_K_S previously needed ~27 GB of FP32 zeros eagerly allocated; now loads in a 12 GB heap).
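To illustrate why a pread-style source avoids the ByteArray ceiling, here is a minimal JVM sketch of a positional read. It is not SKaiNET's K/N implementation: `readAt` is a hypothetical helper, and `java.nio.channels.FileChannel` stands in for POSIX `pread`. The point is only that each call reads a small window at an absolute offset, so the whole file never has to fit in one array.

```kotlin
import java.nio.ByteBuffer
import java.nio.channels.FileChannel
import java.nio.file.Files
import java.nio.file.Path
import java.nio.file.StandardOpenOption

// Hypothetical helper, not part of SKaiNET: reads up to `length` bytes starting
// at an absolute file offset without loading the rest of the file.
fun readAt(path: Path, offset: Long, length: Int): ByteArray =
    FileChannel.open(path, StandardOpenOption.READ).use { channel ->
        val buffer = ByteBuffer.allocate(length)
        var position = offset
        // A positional read may return fewer bytes than requested, so loop.
        while (buffer.hasRemaining()) {
            val n = channel.read(buffer, position)
            if (n < 0) break // reached end of file
            position += n
        }
        // Trim to the number of bytes actually read.
        buffer.array().copyOf(buffer.position())
    }
```

Because the offset is a `Long` and only `length` bytes are held at a time, the same call pattern works for files larger than 2 GiB. A read that runs past the end of the file returns a shorter array rather than failing.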

Test plan

  • ./gradlew :llm-runtime:kllama:compileKotlinJvm :llm-runtime:kllama:compileKotlinMacosArm64
  • ./gradlew :llm-agent:compileKotlinJvm
  • ./gradlew :llm-inference:llama:compileKotlinJvm :llm-inference:qwen:compileKotlinJvm

All clean (only pre-existing LlamaRuntime deprecation warnings, unrelated). No source changes — pure version bump.

Follow-ups (separate PRs)

  • Migrate this repo's own native CLIs (kllama etc.) from LlamaIngestion.load { source } + GGUFTokenizer.fromSource(...) to the streaming siblings (loadStreaming(randomAccessProvider), fromRandomAccessSource(...)) so they can actually open larger GGUFs on K/N.

Picks up:

- The K/N `createRandomAccessSource` POSIX-pread implementation
  (SKaiNET PR #591), so K/N consumers can now load GGUFs above the
  ~2 GiB Kotlin `ByteArray` ceiling. Combined with this repo's
  existing `randomAccessProvider` factory overloads on
  `*NetworkLoader.fromGguf(...)` and the `fromRandomAccessSource(...)`
  variant on `GGUFTokenizer`, downstream native CLIs / consumers can
  switch to the streaming path mechanically.
- Lazy zero-init for parameter placeholders (SKaiNET PR #588), which
  ends real-model load-time OOMs in the DSL builders that this repo's
  loaders sit on top of.
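The lazy zero-init idea can be sketched as follows. This is an illustration under assumptions, not SKaiNET's code: `ParameterPlaceholder` and its members are hypothetical names. The zero buffer is allocated only if something reads the placeholder before real weights are assigned, so a loader that always assigns weights never pays for the zeros.

```kotlin
// Hypothetical placeholder type, not SKaiNET's actual class.
class ParameterPlaceholder(val size: Int) {
    private var loaded: FloatArray? = null

    // The zero buffer is created on first access only.
    private val zerosDelegate = lazy { FloatArray(size) }

    /** True once the zero buffer has actually been allocated. */
    val zerosAllocated: Boolean get() = zerosDelegate.isInitialized()

    /** Replaces the placeholder with weights read from the model file. */
    fun load(weights: FloatArray) {
        require(weights.size == size) { "expected $size values, got ${weights.size}" }
        loaded = weights
    }

    /** Returns loaded weights if present, otherwise zeros allocated on first use. */
    fun values(): FloatArray = loaded ?: zerosDelegate.value
}
```

With eager allocation, every placeholder would hold a full FP32 buffer from construction onward, in addition to the weights loaded later. In this sketch a placeholder that is loaded before it is read never allocates the zero buffer at all.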

No source changes — purely a version pin bump. Verified by compiling
:llm-runtime:kllama (JVM + macosArm64), :llm-agent (JVM),
:llm-inference:llama (JVM), :llm-inference:qwen (JVM) against the
new dep set.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@michalharakal michalharakal merged commit 68c2ff1 into develop May 2, 2026
2 checks passed
@michalharakal michalharakal deleted the chore/bump-skainet-0.23.0 branch May 4, 2026 05:17