Skip to content

Fact-check and correct migrated Antora docs (#499)#500

Merged
michalharakal merged 3 commits intodevelopfrom
feature/499-docs-fact-check
Apr 13, 2026
Merged

Fact-check and correct migrated Antora docs (#499)#500
michalharakal merged 3 commits intodevelopfrom
feature/499-docs-fact-check

Conversation

@michalharakal
Copy link
Copy Markdown
Contributor

Closes #499.

Summary

Fact-check pass over the Antora docs that landed in #494 / #495 + polish from #497. The pandoc markdown → asciidoc conversion preserved every stale claim the source files had accumulated since whenever they were last touched. This PR corrects the four concrete categories the audit found.

Local build stays clean at 0 warnings, 0 errors after every commit.

The three commits

1. `29043130` — stale version refs + wrong Maven group

Eight stale version references bumped from `0.13.0` / `0.5.0` → `0.19.0` (the upcoming release tag — see `CHANGELOG.md` `[Unreleased]`). Covers:

  • `how-to/java-cli-app.adoc`, `how-to/java-llm-inference.adoc`, `how-to/java-model-training.adoc`, `tutorials/java-getting-started.adoc` — `skainet.version` property + BOM coordinate
  • `how-to/io-readers.adoc` — the two `0.5.0` module coordinates

Two wrong-group coordinates fixed in `io-readers.adoc`: `sk.ainet.core:skainet-io-gguf` / `sk.ainet.core:skainet-io-onnx` → `sk.ainet:skainet-io-gguf` / `sk.ainet:skainet-io-onnx`. Nothing under `sk.ainet.core` was ever published — anyone copy-pasting the snippet before this fix would hit unresolvable dependencies.

2. `cfcdb8d2` — replace moved-LLM pages with redirect stubs

Three pages described APIs that moved to `SKaiNET-transformers` on the 2026-04-13 repo split, with 49 references to packages and Maven coordinates that no longer exist in mainline:

  • `sk.ainet:skainet-kllama-jvm` (module)
  • `sk.ainet.apps.kllama.java.{KLlamaJava, KLlamaSession, GenerationConfig}`
  • `sk.ainet.apps.bert.java.{KBertJava, KBertSession}`
  • `sk.ainet.apps.kllama.chat.java.{JavaAgentLoop, JavaTool, ToolDefinition}`

Replaced the body of each with a short CAUTION admonition + redirect links to the transformers repo and docs site. The files keep their original paths so inbound links from external blogs / search / old docs snapshots land on a useful redirect instead of a 404. Removed the three nav entries from `docs/modules/ROOT/nav.adoc`.

Pages affected:

  • `tutorials/kllama-getting-started.adoc`
  • `how-to/java-cli-app.adoc`
  • `how-to/java-llm-inference.adoc`

Each stub also lists what did stay in mainline and is still useful for LLM workflows (the `TokenizerFactory` for Qwen BPE / SentencePiece, the model loaders in `io-readers.adoc`, the `TensorEncoding` metadata flow for Q4_K / Q8_0 / TernaryPacked).

3. `29585880` — pandoc link artifact + JDK prereq wording

  • `explanation/perf/jvm-cpu.adoc` line 28 had `link:java-25-cpu-backend.md[...]` — wrong extension, wrong Antora syntax. Fixed to `xref:explanation/perf/java-25-cpu-backend.adoc[...]`.
  • Five pages mentioned JDK version requirements with three different phrasings. Normalized every prereq line to "JDK 21 or later (CI builds on JDK 25)" — factual (21 is the real minimum; CI actually runs 25 in every workflow).

Test plan

  • `docker run --rm --user "$(id -u):$(id -g)" -v "$PWD:/antora" -w /antora skainet-antora:local docs/antora-playbook.yml` after each commit — 0 warnings, 0 errors throughout
  • Grep for the stale signals (`0\.13\.0`, `sk\.ainet\.core`, `\.kllama\.`, `\.bert\.java`) in `docs/modules/ROOT/pages/` — no matches remain except inside the redirect-stub admonition bodies where they appear as quoted package names being explained
  • CI: full multiplatform build (the docs workflow rebuilds the whole site)

Out of scope

  • Rewriting the content of the remaining how-to pages beyond surgical corrections (prose is preserved except where it made wrong factual claims).
  • Any content changes to the `SKaiNET-transformers` sibling repo.
  • Moving `java-cli-app.adoc` and `java-llm-inference.adoc` files to the transformers repo — they're kept here as redirect stubs specifically so external inbound links don't 404.

🤖 Generated with Claude Code

michalharakal and others added 3 commits April 13, 2026 19:01
Fact-check pass over the migrated Antora docs surfaced eight
stale version references and two wrong Maven group coordinates
— all inherited from the pre-Antora markdown sources that
hadn't been touched in multiple release cycles.

## Version bumps (0.13.0 / 0.5.0 → 0.19.0)

The 0.13.0 number dates back at least five releases (develop is
currently at 0.18.0 with 0.19.0 about to ship). The 0.5.0 number
in io-readers.adoc was even older. Bumped to 0.19.0 to match
the upcoming release tag — readers on 0.18.0 are unaffected
because every change between 0.13.0 and 0.19.0 is backwards-
compatible at the Maven coordinate level (new modules added,
nothing renamed).

- `how-to/java-cli-app.adoc` — skainet.version property + BOM
  coordinate in the Gradle snippet
- `how-to/java-llm-inference.adoc` — <version> in the Maven
  snippet
- `how-to/java-model-training.adoc` — <version> in the Maven
  snippet
- `how-to/io-readers.adoc` — the two 0.5.0 module coordinates
- `tutorials/java-getting-started.adoc` — skainet.version
  property + BOM coordinate

## Maven group fix (sk.ainet.core → sk.ainet)

`io-readers.adoc` lines 23 and 33 wrote the GGUF and ONNX IO
module coordinates as `sk.ainet.core:skainet-io-gguf:0.5.0`
and `sk.ainet.core:skainet-io-onnx:0.5.0`. The actual Maven
group for every published module in this repo is `sk.ainet`
(root `build.gradle.kts` line 17, `skainet-bom/build.gradle.kts`
line 6). Nothing named `sk.ainet.core` was ever published, so
anyone copy-pasting the snippet before this fix would hit
unresolvable dependencies.

Fixed to `sk.ainet:skainet-io-gguf:0.19.0` and
`sk.ainet:skainet-io-onnx:0.19.0` respectively.

Verified with a local Antora build — zero warnings, zero errors.

First of four commits in the docs fact-check pass. See #499.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The fact-check pass found three pages describing APIs that no
longer live in this repo at all. Per the 2026-04-13 repo split,
LLM runtimes (KLlama, KBert, chat, tools, agent loop, etc.) moved
to the sibling `SKaiNET-transformers` repository. Mainline kept
the engine layer only.

These three pages were survivors from before the split and
collectively held 49 references to packages and Maven coordinates
that don't exist in mainline:

  - `sk.ainet:skainet-kllama-jvm` (module)
  - `sk.ainet.apps.kllama.java.{KLlamaJava, KLlamaSession,
     GenerationConfig}`
  - `sk.ainet.apps.bert.java.{KBertJava, KBertSession}`
  - `sk.ainet.apps.kllama.chat.java.{JavaAgentLoop, JavaTool,
     ToolDefinition}`

Anyone copy-pasting a snippet from these pages into a
mainline-0.19.0 project would hit unresolvable imports and
unresolvable Maven coordinates.

## Changes

1. `tutorials/kllama-getting-started.adoc` — replaced body with
   a short CAUTION admonition + redirect to the
   SKaiNET-transformers repo and docs site. Notes what DID stay
   in mainline (tokenizers via `TokenizerFactory`, model loaders
   via `io-readers.adoc`, the Java entry point for tensor ops).

2. `how-to/java-cli-app.adoc` — same treatment. The page
   explicitly documented a KLlama CLI; the replacement links
   both out to transformers AND inward to `java-getting-started`
   and `java-model-training` for non-LLM Java flows that remain
   relevant.

3. `how-to/java-llm-inference.adoc` — same treatment, with a
   slightly longer "what stayed in mainline" section listing the
   concrete tokenizer + TensorEncoding + loader surfaces a Java
   LLM integrator can still use locally without bringing in
   transformers.

4. `docs/modules/ROOT/nav.adoc` — remove the three nav entries.
   Pages still exist (reachable by typed URL, useful if someone
   lands on them via a stale search result) but they're no
   longer presented as part of the Tutorials / How-to story.

Note on page titles: the stub pages intentionally keep their
original filenames so inbound links (external blogs, old docs
snapshots, Google cache) still land on a useful redirect
instead of 404'ing. Deleting the files outright is reversible
but hostile to readers who arrive via external links.

Local Antora build: zero warnings, zero errors.

Second of four commits in the docs fact-check pass. See #499.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two small fact-check fixes bundled together.

## 1. Pandoc link artifact in jvm-cpu.adoc

`explanation/perf/jvm-cpu.adoc` line 28 still had a markdown-style
link that pandoc never rewrote:

    link:java-25-cpu-backend.md[Java 25 CPU Backend]

Two problems:
  - wrong extension (`.md` — that file is `.adoc` after the
    migration)
  - wrong Antora syntax — `link:` is for external URLs, inside a
    module you use `xref:` and reference the target as a
    module-relative path from `pages/`

Fixed to:

    xref:explanation/perf/java-25-cpu-backend.adoc[Java 25 CPU Backend notes]

Antora resolves the xref cleanly now.

## 2. JDK prereq wording normalization

The five pages that mention JDK version requirements had
three different phrasings:

  - `tutorials/java-getting-started`, `how-to/java-cli-app`,
    `how-to/java-llm-inference`, `how-to/java-model-training`
    said "JDK 21 or later" in prose
  - `explanation/perf/jvm-cpu` said "JDK 21+ (JDK 22 toolchain
    configured by Gradle)"
  - `how-to/build` (post-migration) said "Sets up JDK 25"
  - CI actually runs JDK 25 via setup-java@v4 in docs.yml,
    build.yml, publish.yml, documentation.yml,
    schema-validation.yml (and JDK 22 in java-tests.yml which
    is the one outlier)

None of these were factually wrong — 21 is the real minimum —
but the version number inconsistency was confusing. Normalized
to "JDK 21 or later (CI builds on JDK 25)" on every prereq line
I could find. The `java-cli-app.adoc` and `java-llm-inference.adoc`
pages were replaced with redirect stubs in the previous commit
so they're excluded from this pass.

Verified with a local Antora build — zero warnings, zero errors.

Third of four commits in the docs fact-check pass. See #499.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@github-actions
Copy link
Copy Markdown

📖 Documentation Preview

The documentation has been built successfully for this PR.

Generated Files:

  • Operator documentation: docs/modules/operators/_generated_/
  • JSON schema output: operators.json

Artifacts:

  • Download the documentation-preview-500 artifact to view the complete documentation locally.

This comment will be updated automatically when the PR is updated.

@michalharakal michalharakal merged commit 3630819 into develop Apr 13, 2026
7 checks passed
@michalharakal michalharakal deleted the feature/499-docs-fact-check branch April 14, 2026 13:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Fact-check and correct migrated Antora docs

1 participant