Fact-check and correct migrated Antora docs (#499) by michalharakal · Pull Request #500 · SKaiNET-developers/SKaiNET

michalharakal · 2026-04-13T17:32:09Z

Closes #499.

Summary

Fact-check pass over the Antora docs that landed in #494 / #495 + polish from #497. The pandoc markdown → asciidoc conversion preserved every stale claim the source files had accumulated since whenever they were last touched. This PR corrects the four concrete categories the audit found.

Local build stays clean at 0 warnings, 0 errors after every commit.

The three commits

1. `29043130` — stale version refs + wrong Maven group

Eight stale version references bumped from `0.13.0` / `0.5.0` → `0.19.0` (the upcoming release tag — see `CHANGELOG.md` `[Unreleased]`). Covers:

`how-to/java-cli-app.adoc`, `how-to/java-llm-inference.adoc`, `how-to/java-model-training.adoc`, `tutorials/java-getting-started.adoc` — `skainet.version` property + BOM coordinate
`how-to/io-readers.adoc` — the two `0.5.0` module coordinates

Two wrong-group coordinates fixed in `io-readers.adoc`: `sk.ainet.core:skainet-io-gguf` / `sk.ainet.core:skainet-io-onnx` → `sk.ainet:skainet-io-gguf` / `sk.ainet:skainet-io-onnx`. Nothing under `sk.ainet.core` was ever published — anyone copy-pasting the snippet before this fix would hit unresolvable dependencies.

2. `cfcdb8d2` — replace moved-LLM pages with redirect stubs

Three pages described APIs that moved to `SKaiNET-transformers` on the 2026-04-13 repo split, with 49 references to packages and Maven coordinates that no longer exist in mainline:

`sk.ainet:skainet-kllama-jvm` (module)
`sk.ainet.apps.kllama.java.{KLlamaJava, KLlamaSession, GenerationConfig}`
`sk.ainet.apps.bert.java.{KBertJava, KBertSession}`
`sk.ainet.apps.kllama.chat.java.{JavaAgentLoop, JavaTool, ToolDefinition}`

Replaced the body of each with a short CAUTION admonition + redirect links to the transformers repo and docs site. The files keep their original paths so inbound links from external blogs / search / old docs snapshots land on a useful redirect instead of a 404. Removed the three nav entries from `docs/modules/ROOT/nav.adoc`.

Pages affected:

`tutorials/kllama-getting-started.adoc`
`how-to/java-cli-app.adoc`
`how-to/java-llm-inference.adoc`

Each stub also lists what did stay in mainline and is still useful for LLM workflows (the `TokenizerFactory` for Qwen BPE / SentencePiece, the model loaders in `io-readers.adoc`, the `TensorEncoding` metadata flow for Q4_K / Q8_0 / TernaryPacked).

3. `29585880` — pandoc link artifact + JDK prereq wording

`explanation/perf/jvm-cpu.adoc` line 28 had `link:java-25-cpu-backend.md[...]` — wrong extension, wrong Antora syntax. Fixed to `xref:explanation/perf/java-25-cpu-backend.adoc[...]`.
Five pages mentioned JDK version requirements with three different phrasings. Normalized every prereq line to "JDK 21 or later (CI builds on JDK 25)" — factual (21 is the real minimum; CI actually runs 25 in every workflow).

Test plan

`docker run --rm --user "$(id -u):$(id -g)" -v "$PWD:/antora" -w /antora skainet-antora:local docs/antora-playbook.yml` after each commit — 0 warnings, 0 errors throughout
Grep for the stale signals (`0\.13\.0`, `sk\.ainet\.core`, `\.kllama\.`, `\.bert\.java`) in `docs/modules/ROOT/pages/` — no matches remain except inside the redirect-stub admonition bodies where they appear as quoted package names being explained
CI: full multiplatform build (the docs workflow rebuilds the whole site)

Out of scope

Rewriting the content of the remaining how-to pages beyond surgical corrections (prose is preserved except where it made wrong factual claims).
Any content changes to the `SKaiNET-transformers` sibling repo.
Moving `java-cli-app.adoc` and `java-llm-inference.adoc` files to the transformers repo — they're kept here as redirect stubs specifically so external inbound links don't 404.

🤖 Generated with Claude Code

Fact-check pass over the migrated Antora docs surfaced eight stale version references and two wrong Maven group coordinates — all inherited from the pre-Antora markdown sources that hadn't been touched in multiple release cycles. ## Version bumps (0.13.0 / 0.5.0 → 0.19.0) The 0.13.0 number dates back at least five releases (develop is currently at 0.18.0 with 0.19.0 about to ship). The 0.5.0 number in io-readers.adoc was even older. Bumped to 0.19.0 to match the upcoming release tag — readers on 0.18.0 are unaffected because every change between 0.13.0 and 0.19.0 is backwards- compatible at the Maven coordinate level (new modules added, nothing renamed). - `how-to/java-cli-app.adoc` — skainet.version property + BOM coordinate in the Gradle snippet - `how-to/java-llm-inference.adoc` — <version> in the Maven snippet - `how-to/java-model-training.adoc` — <version> in the Maven snippet - `how-to/io-readers.adoc` — the two 0.5.0 module coordinates - `tutorials/java-getting-started.adoc` — skainet.version property + BOM coordinate ## Maven group fix (sk.ainet.core → sk.ainet) `io-readers.adoc` lines 23 and 33 wrote the GGUF and ONNX IO module coordinates as `sk.ainet.core:skainet-io-gguf:0.5.0` and `sk.ainet.core:skainet-io-onnx:0.5.0`. The actual Maven group for every published module in this repo is `sk.ainet` (root `build.gradle.kts` line 17, `skainet-bom/build.gradle.kts` line 6). Nothing named `sk.ainet.core` was ever published, so anyone copy-pasting the snippet before this fix would hit unresolvable dependencies. Fixed to `sk.ainet:skainet-io-gguf:0.19.0` and `sk.ainet:skainet-io-onnx:0.19.0` respectively. Verified with a local Antora build — zero warnings, zero errors. First of four commits in the docs fact-check pass. See #499. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The fact-check pass found three pages describing APIs that no longer live in this repo at all. Per the 2026-04-13 repo split, LLM runtimes (KLlama, KBert, chat, tools, agent loop, etc.) moved to the sibling `SKaiNET-transformers` repository. Mainline kept the engine layer only. These three pages were survivors from before the split and collectively held 49 references to packages and Maven coordinates that don't exist in mainline: - `sk.ainet:skainet-kllama-jvm` (module) - `sk.ainet.apps.kllama.java.{KLlamaJava, KLlamaSession, GenerationConfig}` - `sk.ainet.apps.bert.java.{KBertJava, KBertSession}` - `sk.ainet.apps.kllama.chat.java.{JavaAgentLoop, JavaTool, ToolDefinition}` Anyone copy-pasting a snippet from these pages into a mainline-0.19.0 project would hit unresolvable imports and unresolvable Maven coordinates. ## Changes 1. `tutorials/kllama-getting-started.adoc` — replaced body with a short CAUTION admonition + redirect to the SKaiNET-transformers repo and docs site. Notes what DID stay in mainline (tokenizers via `TokenizerFactory`, model loaders via `io-readers.adoc`, the Java entry point for tensor ops). 2. `how-to/java-cli-app.adoc` — same treatment. The page explicitly documented a KLlama CLI; the replacement links both out to transformers AND inward to `java-getting-started` and `java-model-training` for non-LLM Java flows that remain relevant. 3. `how-to/java-llm-inference.adoc` — same treatment, with a slightly longer "what stayed in mainline" section listing the concrete tokenizer + TensorEncoding + loader surfaces a Java LLM integrator can still use locally without bringing in transformers. 4. `docs/modules/ROOT/nav.adoc` — remove the three nav entries. Pages still exist (reachable by typed URL, useful if someone lands on them via a stale search result) but they're no longer presented as part of the Tutorials / How-to story. Note on page titles: the stub pages intentionally keep their original filenames so inbound links (external blogs, old docs snapshots, Google cache) still land on a useful redirect instead of 404'ing. Deleting the files outright is reversible but hostile to readers who arrive via external links. Local Antora build: zero warnings, zero errors. Second of four commits in the docs fact-check pass. See #499. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Two small fact-check fixes bundled together. ## 1. Pandoc link artifact in jvm-cpu.adoc `explanation/perf/jvm-cpu.adoc` line 28 still had a markdown-style link that pandoc never rewrote: link:java-25-cpu-backend.md[Java 25 CPU Backend] Two problems: - wrong extension (`.md` — that file is `.adoc` after the migration) - wrong Antora syntax — `link:` is for external URLs, inside a module you use `xref:` and reference the target as a module-relative path from `pages/` Fixed to: xref:explanation/perf/java-25-cpu-backend.adoc[Java 25 CPU Backend notes] Antora resolves the xref cleanly now. ## 2. JDK prereq wording normalization The five pages that mention JDK version requirements had three different phrasings: - `tutorials/java-getting-started`, `how-to/java-cli-app`, `how-to/java-llm-inference`, `how-to/java-model-training` said "JDK 21 or later" in prose - `explanation/perf/jvm-cpu` said "JDK 21+ (JDK 22 toolchain configured by Gradle)" - `how-to/build` (post-migration) said "Sets up JDK 25" - CI actually runs JDK 25 via setup-java@v4 in docs.yml, build.yml, publish.yml, documentation.yml, schema-validation.yml (and JDK 22 in java-tests.yml which is the one outlier) None of these were factually wrong — 21 is the real minimum — but the version number inconsistency was confusing. Normalized to "JDK 21 or later (CI builds on JDK 25)" on every prereq line I could find. The `java-cli-app.adoc` and `java-llm-inference.adoc` pages were replaced with redirect stubs in the previous commit so they're excluded from this pass. Verified with a local Antora build — zero warnings, zero errors. Third of four commits in the docs fact-check pass. See #499. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-04-13T17:34:28Z

📖 Documentation Preview

The documentation has been built successfully for this PR.

Generated Files:

Operator documentation: docs/modules/operators/_generated_/
JSON schema output: operators.json

Artifacts:

Download the documentation-preview-500 artifact to view the complete documentation locally.

This comment will be updated automatically when the PR is updated.

michalharakal and others added 3 commits April 13, 2026 19:01

michalharakal merged commit 3630819 into develop Apr 13, 2026
7 checks passed

michalharakal deleted the feature/499-docs-fact-check branch April 14, 2026 13:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fact-check and correct migrated Antora docs (#499)#500

Fact-check and correct migrated Antora docs (#499)#500
michalharakal merged 3 commits intodevelopfrom
feature/499-docs-fact-check

michalharakal commented Apr 13, 2026

Uh oh!

github-actions Bot commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

michalharakal commented Apr 13, 2026

Summary

The three commits

1. `29043130` — stale version refs + wrong Maven group

2. `cfcdb8d2` — replace moved-LLM pages with redirect stubs

3. `29585880` — pandoc link artifact + JDK prereq wording

Test plan

Out of scope

Uh oh!

github-actions Bot commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant