Skip to content

History / AI and Documentation

Revisions

  • AI & Documentation: document semanticmd datadict.yaml schema & JSON conversion (#3943) Add a "Converting to JSON" subsection covering the shipped datadict.yaml semantic-md schema, the semantic-md: datadict.yaml front-matter pointer, an API-accurate conversion snippet, the committed JSON artifact, and the check_semanticmd.py regenerate-and-verify check. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed Jun 3, 2026
  • AI-and-Documentation: drop removed SemanticMd Example Queries section describegpt's SemanticMd output no longer emits a `# Example Queries` section (DuckDB SQL + pandas snippets); the feature was removed in qsv. Remove the corresponding bullet from the SemanticMd structure list.

    @jqnatividad jqnatividad committed Jun 3, 2026
  • AI & Documentation: bring semanticmd section up to #3935 enrichments The published SemanticMd section predated the "richer Data Dictionary for agents & catalogs" work. Augment it with the per-column Concept ID, Role, join keys & cardinality, and data-quality flags; the dataset grain, temporal/spatial envelope, concept index, and Example Queries sections; the --infer-content-type implication; and the --ds-source/--ds-updated/--ds-license frontmatter flags. Schema-table and ## Column bullets corrected to match the current output.

    @jqnatividad jqnatividad committed Jun 2, 2026
  • describegpt: document --format semanticmd (Semantic Markdown output)

    @jqnatividad jqnatividad committed Jun 2, 2026
  • wiki: adopt GitHub Alerts for callouts across the wiki Convert advisory blockquotes and inline callouts to semantic GitHub Alerts (NOTE/TIP/IMPORTANT/CAUTION): - [!NOTE] for the standard category-page "workflow layer" ledes and other top-of-page orientation / "canonical reference" notes - [!IMPORTANT] for behavior-affecting gotchas (auto approx-stats on OOM, group-by unsupported aggs, MiniJinja filter-errors-as-values, synthesize cross-column correlation, profile always-warn RFC4180) - [!TIP] for the Binary-Variants TL;DR and Why-qsv "why it matters" - [!CAUTION] for foreach shell-injection risk and joinp --cross blowup Also update the Contributing-to-the-Wiki category template to use the [!NOTE] lede so new pages follow the convention. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed May 30, 2026
  • Document geocode OpenCage, describegpt JSON Schema & sample VarOpt - Geospatial: new "Online geocoding with OpenCage" section, decision-table rows, updated geocode intro & subcommand count (7 -> 9) - Selection & Inspection: sample VarOpt + mergeable-reservoir + sketch I/O examples, method count 8 -> 10 - AI & Documentation: describegpt --format jsonschema section, x-qsv annotations, --allow-extra-cols / --strict-dates - Command Reference: refreshed sample, geocode & describegpt rows - Environment Variables: QSV_OPENCAGE_API_KEY - Recipes: OpenCage variation in Geographic Enrichment; describegpt JSON Schema variation in JSON Schema Validation Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed May 20, 2026
  • docs(wiki): re-home color and pro to their natural categories color and pro lived on the AI & Documentation page for historical reasons, but neither has any LLM/AI surface — color is just a colorized table renderer, and pro is a desktop-app bridge. Now that synthesize has joined describegpt on the AI page, the mis-categorization stood out more. - AI & Documentation: drop the color and pro sections; trim the decision-table rows and the "color is the colorized cousin..." sentence in the intro; leave a redirect note pointing readers to the new homes - Selection & Inspection: add color right after table (its uncolored cousin); decision-table row; cross-link both directions - Integrations: add a "qsv pro Bridge" section covering the pro subcommands (lens, workflow) - Command Reference: move the color row into Selection & Inspection; add an "Integrations & Bridges" mini-section housing pro - Environment-Variables, Recipe-Inspect-Unknown-CSV, qsv-pro-Spotlight: rewrite stale AI-and-Documentation#color / #pro anchors to point at the new homes Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed May 18, 2026
  • docs(wiki): update for qsv 20.1.0 — "Synthetic Data" release - AI & Documentation: add synthesize section; expand describegpt with Content Types (47-token vocab), --two-pass cross-field refinement, deterministic unique_id tag, --markdown-template, lower-LLM-cost notes - Aggregation & Statistics: document auto-fallback to approximate modes on OOM (stats: --quantile/cardinality-method approx; frequency: --sketch-method frequent_items) with little-endian gating - Recipe-Larger-than-RAM: reframe explicit approx flags as auto-on-OOM (override-to-lock semantics) with stats cache mode-key note - SQL & Polars: pivotp --agg quantile@<p> / q@<p> with p95 example - Command Reference: add synthesize row in AI & Documentation table - Cookbook: new "Generate" section; new Recipe-Synthesize-Fake-Data.md with end-to-end describegpt --two-pass --infer-content-type → synthesize walkthrough, locale switching, --consistent-fakes variant, caveats - Sidebar: link the new Synthesize Fake Data recipe - Home: bump "70+ commands" → "73 commands across four binary variants"; add 20.1.0 highlight strip with deep-links - FAQ: refresh MSRV anchor 20.0.0 → 20.1.0 (Rust 1.95 unchanged) - Troubleshooting: link both 20.0.0 and 20.1.0 changelogs in excel section Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed May 18, 2026
  • wiki: complete Phase B - Scripting + Indexing/Compression/Diff + AI/Doc Scripting-Luau-Python covers luau (qsv's flagship DSL with BEGIN/MAIN/END, random-access mode, ~50 qsv helpers), py (Python 3.10+ expressions), foreach (shell-out per row with --unify pattern), and template (MiniJinja rendering with register_lookup). Examples: cumulative running total, NYC 311 classification with dynamicEnum lookup, getquarter.lua from docs/cookbook/lua, helper.py module pattern, per-borough markdown reports. Indexing-Compression-Diff covers index (with QSV_AUTOINDEX_SIZE auto-mode), diff (1M x 9 in <600ms primary-key diff), blake3 (pipeline gating, cache keys, integrity checks), plus cross-references to extsort/extdedup/snappy. AI-and-Documentation covers describegpt (neuro-symbolic data dictionary, Ollama/Jan/LM Studio local LLMs, SQL-RAG sub-mode with DuckDB/Polars, multilingual output, controlled tag vocabulary), color (theme-aware colorized table), and pro (qsv pro API bridge). All 13 Phase B pages now live. All 81 commands covered with non-trivial real-world examples anchored on the wiki's six anchor datasets. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed May 13, 2026
  • wiki: add stubs for Phase B/C/D/E pages so sidebar links resolve Adds 39 placeholder pages so every sidebar entry resolves to real content rather than a 404. Each stub declares its tier, the phase it will be filled in, and a one-paragraph preview of what's coming. They link back to Home / Getting-Started / Command-Reference / Cookbook for navigation. Pages added: - Phase B (Command Reference, 13): Command-Reference, Selection-and- Inspection, Transform-and-Reshape, Aggregation-and-Statistics, Joins- and-Set-Ops, SQL-and-Polars, Validation-and-Schema, Conversion-and-IO, Geospatial, HTTP-and-Web, Scripting-Luau-Python, Indexing-Compression- Diff, AI-and-Documentation - Phase C (Cookbook recipes, 12): Recipe-Inspect-Unknown-CSV, Recipe- Clean-and-Normalize, Recipe-Geographic-Enrichment, Recipe-Date- Enrichment, Recipe-CKAN-Integration, Recipe-JSON-Schema-Validate, Recipe-Build-a-Data-Pipeline, Recipe-Stats-to-Insights, Recipe-Fetch- and-Cache, Recipe-Larger-than-RAM, Recipe-Diff-and-Audit, Recipe-Multi- Table-Joins - Phase D (Tuning + ecosystem, 8): Performance-Tuning, Environment- Variables, Stats-Cache-and-Caching, Lookup-Tables, Claude-Cowork-Plugin, MCP-Server, qsv-pro-Spotlight, Integrations - Phase E (Polish, 6): Troubleshooting, FAQ, Comparison, Glossary, External-Resources, Contributing-to-the-Wiki Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

    @jqnatividad jqnatividad committed May 13, 2026