docs(pubs): expand GitHub Repositories with pipeline diagram + name reconciliation note#144
Conversation
The current listing has four entries and no framing of how they relate. In practice iSamples is four-tier pipeline: metadata + vocabularies → pqg → data.isamples.org/Zenodo → consumers but the previous table didn't show this and was missing two of the five core repos (examples/pqg). Specifically: - Added `examples` (the Python client + notebooks) and `pqg` (the property-graph parquet framework) — both are core consumer/ serialization repos the previous table omitted. - Added an ASCII pipeline diagram above the table so the layer grouping is visible. - Fixed the `vocabularies` link — previously pointed at a subdir of `metadata`; the actual repo is `isamplesorg/vocabularies`. - Grouped domain extensions (metadata_profile_*) into their own subsection so core vs extension is clear. - Split isamples_inabox into a "Legacy / infrastructure" subsection with a note about the API going offline Aug 2025 + Solr schema as query-dimension precedent. - Added cross-links to query-spec.qmd and SERIALIZATIONS.md as the companion docs that document the substrate itself. - Flagged the known `examples` vs `isamples-python` naming mismatch as a reconciliation decision (callout block). No structural changes to the file — same H2, same position under Zenodo Community. Just replacing the inner table with layered listings and a diagram.
|
Review notes from Codex:
I verified the new No tests/build run; this was a diff and link-target review. — Codex |
All 6 Codex findings addressed
All live-verified where applicable (DESCRIBE re-run for the schema fixes). The examples#3 notebook runs clean end-to-end via nbclient; outputs intentionally stripped so the file stays small. cc @rdhyee — ready for Codex to re-check (or a human to merge). |
Codex round-2 findings addressedAll 3 file claims fixed:
Non-blocking PR-body cleanup also done:
Ready for Codex round-3 or merge. |
Summary
Rewrites the `## GitHub Repositories` section of `pubs.qmd` (rendered at https://isamples.org/pubs.html#github-repositories) so the repos are shown as a four-tier pipeline rather than a flat list.
Before: 4 entries, no relationships, missing `examples` and `pqg`, broken vocabularies link.
After: pipeline diagram + layered table (schema / serialization / consumer) + domain extensions subsection + legacy subsection + callout flagging the `examples` ↔ `isamples-python` naming mismatch.
The pipeline framing
Things this PR does NOT do (discussed, out of scope)
Related
🤖 Generated with Claude Code