Skip to content

GoodQ02/goodq4all

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

GoodQ4All mark

GoodQ4All

ci doc-drift-lint codeql dependency-review

GoodQ4All is a local-first multimodal memory system for long-running video, audio, and text intelligence.

It ingests media into scene-level memory, persists what it learns locally, and keeps the proof path visible. The system is built around deterministic Windows-first execution, with CPU-safe baseline behavior and optional GPU / WSL2 acceleration when you want more throughput.

GoodQ4All's thesis is simple: machine memory should earn every claim it makes.

Watch The Guided Demo

Start with the two-minute onboarding film if you want to see the install and first ingestion path before reading the deeper docs.

Watch Welcome Aboard GoodQ4All

Watch the 2-minute guided demo · Read the demo guide

Before You Start

GoodQ4All's supported first-run host is Windows 11 with PowerShell. The runtime is local-first and CPU-safe by default; GPU and WSL2 acceleration are optional.

Have these ready before running the installer:

  • Windows 11
  • Git
  • Miniconda or Anaconda available to the current shell
  • Python 3.10 or newer
  • at least 25 GB free for the baseline install path
  • optional: NVIDIA GPU and WSL2 Ubuntu for accelerated lanes

macOS and Linux are not first-run hosts for this repository today. See docs/reference/PLATFORM_SUPPORT.md for the current platform contract.

Visual First Run

Each frame below is pulled from the final onboarding film and paired with the action it narrates. Click any frame to enlarge it.

Step Type or do this Demo frame
1 Clone the official source:
git clone https://github.com/GoodQ02/goodq4all.git
Clone the GoodQ4All repository
2 Enter the project cabin:
cd goodq4all
Enter the GoodQ4All project folder
3 Run the bootstrap installer:
python scripts/bootstrap_install.py
CPU-safe first-run variant: python scripts/bootstrap_install.py --disable-gpu --disable-wsl-audio --skip-model-prefetch.
Run the bootstrap installer
4 Optional local config:
copy .env.local.template to .env.local when using local model, cache, or provider settings.
Place env local configuration in the repo root
5 Validate the bootstrap:
.\scripts\bootstrap_validate.bat
Run the bootstrap validator
6 Run the launcher/readiness check:
.\LAUNCH_GOODQ.ps1
Launch GoodQ4All readiness checks
7 Start Watchdog, then drop one small media file into import_inbox:
conda run --no-capture-output -n goodq_core python -m cli.watchdog
Watchdog observes the imported media file
8 Start the API and inspect proof:
conda run --no-capture-output -n goodq_core python -m api.server
Ingestion completes and proof is recorded

First Success Loop

If you are new here, start by making one memory:

  1. Confirm the Windows 11, Conda, Git, and free-space prerequisites above.
  2. Bootstrap and validate the repo.
  3. Start Watchdog.
  4. Drop one small media file into the configured import_inbox.
  5. Open the local API docs.
  6. Confirm scene artifacts were written.

Guide:

What This Actually Is

GoodQ4All is not just an ingest runner or a benchmark harness. It is a full local memory stack with five major layers:

  • Perception engine Detects scenes, extracts keyframes, runs OCR and captions, tags objects and faces, transcribes audio, tracks speakers, and generates embeddings across modalities.

  • Interpretation engine Turns raw perception into scene meaning through scene_context_llm, epistemic evidence surfaces, arbitration, and Phase 6 multimodal harmonization. Phase 6 is the final harmonization step that turns per-scene outputs into coherent temporal and vector memory.

  • Memory engine Persists scene manifests, temporal indexes, SQLite memory state, knowledge graph state, and Qdrant vectors as durable local memory rather than disposable run logs.

  • Retrieval engine Supports vector search, KG-backed querying, natural-language lookup, and scene-level analysis against persisted memory.

  • Operations layer Exposes bootstrap, validation, watchdog, health, monitoring, and release-evidence surfaces so the system can be run and audited like infrastructure, not just a script.

Why This Project Exists

Most media-intelligence stacks are either:

  • cloud-dependent
  • opaque when they fail
  • or impressive in demos but weak under long-running, real-world ingestion

GoodQ4All is trying to be the opposite:

  • local-first
  • scene-centric
  • auditable
  • resilient under partial failure

The design goal is simple: a working memory system is more valuable than a clever one.

What Is Proven Today

Release 0.1.1 is the current supported checkpoint.

What is actually proven, not just intended:

  • The canonical runtime is Windows-first and local-first.
  • The supported surface is API + CLI + watchdog + persisted runtime artifacts.
  • Scene-context interpretation quality is witness-proven, not just anecdotal.
  • Phase 6 harmonization is operating cleanly on the proving run.
  • Episode-quality scoring now has a local offline eval lane using curated IMDb-backed anchors for audit only.

Post-release operator validation on the current main / public line additionally proves:

  • WSL audio readiness now means real offline diarization loadability, not import-only checks.
  • Successful unified audio preserves diarization and emotion sub-step truth instead of hiding partial failures behind a coarse success result.
  • Speaker continuity now persists through scene_ingest_results.json, scene_manifest.json, and temporal_index.json.
  • Episode-to-episode continuity is proven on fresh Season 5 material, not just on the release-era comparison witness.
  • API scene read models now expose persisted speaker-truth and continuity fields instead of thinner legacy projections.
  • Similar-scene retrieval is now a real multimodal feature and can fuse text, visual, and audio memory instead of falling back to an empty path.
  • Read-only control recurrence reporting can compare witnesses, index durable artifacts, draft deterministic inspection plans, and derive conservative trends from existing JSON reports without healing or mutation.

Current proving run and release proof path:

Current eval result on the proving witness:

  • 6/6 core beats covered
  • 9.0/9.0 salience weight hit

That result comes from the local episode-reference eval lane and is summarized in the release checkpoint and evidence map above.

Verify It Yourself

What Runs What

  • LAUNCH_GOODQ.ps1 checks readiness and opens operator monitors.
  • Watchdog watches the configured import_inbox.
  • cli.run_ingestion owns actual ingestion.
  • The API is a local read and inspection surface.
  • Runtime artifacts are the durable proof.

First Local Run

If you want the shortest honest path to "does this work on this machine?", run the same first success loop with the actual commands rather than only starting the API:

  1. Clone the repo and enter the project root.
  2. Run the bootstrap installer.
  3. Run the bootstrap validator.
  4. Run the safe launcher/readiness check.
  5. Start Watchdog in one terminal.
  6. Drop one small media file into the configured import_inbox.
  7. Start the API in another terminal.
  8. Inspect health and local docs.
git clone https://github.com/GoodQ02/goodq4all.git
cd goodq4all
python scripts/bootstrap_install.py
.\scripts\bootstrap_validate.bat
.\LAUNCH_GOODQ.ps1

LAUNCH_GOODQ.ps1 checks readiness and opens operator monitors. It does not start ingestion by itself.

The launcher also has LAUNCH_GOODQ.bat for double-click or classic Command Prompt use. Both wrappers reach the same readiness surface.

If you skipped the Qdrant service prompt during bootstrap, the first launcher run may report one Qdrant readiness warning. Install or repair the service later with scripts\qdrant\INSTALL_QDRANT_SERVICE.bat.

Leave Watchdog running in one terminal:

conda run --no-capture-output -n goodq_core python -m cli.watchdog

Copy one small media file into the configured inbox, then start the API in another terminal. GOODQ_DATA_ROOT is the base root; the runtime derives the drop zone as <GOODQ_DATA_ROOT>\GoodQ_Data\import_inbox\.

conda run --no-capture-output -n goodq_core python -m api.server

Then open:

  • http://127.0.0.1:30000/api/health/summary
  • http://127.0.0.1:30000/docs

The host and port default to GOODQ_API_HOST=127.0.0.1 and GOODQ_API_PORT=30000 and can be overridden in .env.local.

Reference:

Full Proof Path

If you want to verify the stronger claims, use the proving witness and release evidence directly:

Supported Surface Today

GoodQ4All currently supports:

  • local install and bootstrap on Windows
  • local API runtime
  • CLI ingestion
  • watchdog-driven long-running ingestion
  • SQLite + knowledge graph + Qdrant-backed persisted memory
  • CPU-safe baseline execution with optional GPU / WSL acceleration

GoodQ4All does not currently ship a polished end-user UI. That is a future layer, not a current product claim.

UI status:

What Makes It Different

  • Scene-centric memory Every major interpretation surface is built around scenes as the atomic unit.

  • Full perception-to-memory pipeline The system does not stop at captions or transcripts. It carries perception forward into harmonized scene truth, temporal rollups, graph relationships, and retrieval surfaces.

  • Knowledge graph with conservative identity logic People, concepts, objects, places, speaker patterns, and identity evidence are persisted locally, with promotion rules designed to avoid hallucinated merges.

  • Audit-first quality The system is tuned with witnesses, diagnostics, and reference evals instead of vibes.

  • Local truth boundary Public episode anchors can inform audit and scoring, but they do not overwrite runtime scene evidence.

  • Controlled acceleration GPU and WSL are additive performance layers, not hidden requirements.

  • Failure visibility Optional enrichments may fail without collapsing the whole run, and the failure path is meant to remain visible.

Architecture at a Glance

  • Host: Windows 11 is the canonical runtime host
  • Profiles: UNSET, BASELINE, GPU_ENHANCED
  • Perception: scene detection, captions, OCR, object signals, face signals, transcription, diarization, emotion, and embeddings
  • Storage: SQLite + knowledge graph + Qdrant
  • Memory surface: scene manifests, temporal index, projected run outputs
  • Core interpretation layer: scene_context_llm with primary_tags, contextual_tags, and structural_tags
  • Identity layer: speaker patterns, voice-pattern matches, identity candidates, supported identities, and evidence edges
  • Fusion layer: Phase 6 / Phase 6b harmonization
  • Operator surface: API + CLI + watchdog + validation and diagnostics

If you want the deeper technical picture:

Start Here

Current Limitations

  • This is pre-1.0 software. The supported runtime path is stable enough to use, but surrounding helpers and APIs may still evolve.
  • A polished product UI is not part of the current shipping surface.
  • Some optional enrichments can still fail on individual scenes without invalidating the whole ingest.
  • Audio-vector success is provenance-defined: current-run CLAP/Qdrant coverage requires clap_meta.status == ok plus a Qdrant audio payload with matching run_id and required provenance fields. Legacy scene-id matches are not current-run proof.
  • Context weighting is now strong, but the project still treats some interpretation choices as policy-level texture rather than frozen truth.

Security and Data Handling

  • Secrets belong in .env.local only.
  • The canonical runtime does not require cloud execution.
  • Local storage is the source of truth.
  • Public benchmark and eval materials describe outcomes and metrics, not copyrighted transcript dumps.

Reference:

Documentation and Evidence

If you only read a few things, read these:

Historical and superseded material is intentionally preserved under docs/archive/, but it is not the front door.

License

MIT. See LICENSE.

About

Local-first multimodal epistemic memory for scene-level video, audio, and text intelligence.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors