GoodQ4All is a local-first multimodal memory system for long-running video, audio, and text intelligence.
It ingests media into scene-level memory, persists what it learns locally, and keeps the proof path visible. The system is built around deterministic Windows-first execution, with CPU-safe baseline behavior and optional GPU / WSL2 acceleration when you want more throughput.
GoodQ4All's thesis is simple: machine memory should earn every claim it makes.
Start with the two-minute onboarding film if you want to see the install and first ingestion path before reading the deeper docs.
Watch the 2-minute guided demo · Read the demo guide
GoodQ4All's supported first-run host is Windows 11 with PowerShell. The runtime is local-first and CPU-safe by default; GPU and WSL2 acceleration are optional.
Have these ready before running the installer:
- Windows 11
- Git
- Miniconda or Anaconda available to the current shell
- Python 3.10 or newer
- at least 25 GB free for the baseline install path
- optional: NVIDIA GPU and WSL2 Ubuntu for accelerated lanes
macOS and Linux are not first-run hosts for this repository today. See
docs/reference/PLATFORM_SUPPORT.md for
the current platform contract.
Each frame below is pulled from the final onboarding film and paired with the action it narrates. Click any frame to enlarge it.
If you are new here, start by making one memory:
- Confirm the Windows 11, Conda, Git, and free-space prerequisites above.
- Bootstrap and validate the repo.
- Start Watchdog.
- Drop one small media file into the configured
import_inbox. - Open the local API docs.
- Confirm scene artifacts were written.
Guide:
GoodQ4All is not just an ingest runner or a benchmark harness. It is a full local memory stack with five major layers:
-
Perception engine Detects scenes, extracts keyframes, runs OCR and captions, tags objects and faces, transcribes audio, tracks speakers, and generates embeddings across modalities.
-
Interpretation engine Turns raw perception into scene meaning through
scene_context_llm, epistemic evidence surfaces, arbitration, and Phase 6 multimodal harmonization. Phase 6 is the final harmonization step that turns per-scene outputs into coherent temporal and vector memory. -
Memory engine Persists scene manifests, temporal indexes, SQLite memory state, knowledge graph state, and Qdrant vectors as durable local memory rather than disposable run logs.
-
Retrieval engine Supports vector search, KG-backed querying, natural-language lookup, and scene-level analysis against persisted memory.
-
Operations layer Exposes bootstrap, validation, watchdog, health, monitoring, and release-evidence surfaces so the system can be run and audited like infrastructure, not just a script.
Most media-intelligence stacks are either:
- cloud-dependent
- opaque when they fail
- or impressive in demos but weak under long-running, real-world ingestion
GoodQ4All is trying to be the opposite:
- local-first
- scene-centric
- auditable
- resilient under partial failure
The design goal is simple: a working memory system is more valuable than a clever one.
Release 0.1.1 is the current supported checkpoint.
What is actually proven, not just intended:
- The canonical runtime is Windows-first and local-first.
- The supported surface is API + CLI + watchdog + persisted runtime artifacts.
- Scene-context interpretation quality is witness-proven, not just anecdotal.
- Phase 6 harmonization is operating cleanly on the proving run.
- Episode-quality scoring now has a local offline eval lane using curated IMDb-backed anchors for audit only.
Post-release operator validation on the current main / public line additionally proves:
- WSL audio readiness now means real offline diarization loadability, not import-only checks.
- Successful unified audio preserves diarization and emotion sub-step truth instead of hiding partial failures behind a coarse success result.
- Speaker continuity now persists through
scene_ingest_results.json,scene_manifest.json, andtemporal_index.json. - Episode-to-episode continuity is proven on fresh Season 5 material, not just on the release-era comparison witness.
- API scene read models now expose persisted speaker-truth and continuity fields instead of thinner legacy projections.
- Similar-scene retrieval is now a real multimodal feature and can fuse text, visual, and audio memory instead of falling back to an empty path.
- Read-only control recurrence reporting can compare witnesses, index durable artifacts, draft deterministic inspection plans, and derive conservative trends from existing JSON reports without healing or mutation.
Current proving run and release proof path:
Current eval result on the proving witness:
6/6core beats covered9.0/9.0salience weight hit
That result comes from the local episode-reference eval lane and is summarized in the release checkpoint and evidence map above.
LAUNCH_GOODQ.ps1checks readiness and opens operator monitors.- Watchdog watches the configured
import_inbox. cli.run_ingestionowns actual ingestion.- The API is a local read and inspection surface.
- Runtime artifacts are the durable proof.
If you want the shortest honest path to "does this work on this machine?", run the same first success loop with the actual commands rather than only starting the API:
- Clone the repo and enter the project root.
- Run the bootstrap installer.
- Run the bootstrap validator.
- Run the safe launcher/readiness check.
- Start Watchdog in one terminal.
- Drop one small media file into the configured
import_inbox. - Start the API in another terminal.
- Inspect health and local docs.
git clone https://github.com/GoodQ02/goodq4all.git
cd goodq4all
python scripts/bootstrap_install.py
.\scripts\bootstrap_validate.bat
.\LAUNCH_GOODQ.ps1LAUNCH_GOODQ.ps1 checks readiness and opens operator monitors. It does not start ingestion by itself.
The launcher also has LAUNCH_GOODQ.bat for double-click or classic Command
Prompt use. Both wrappers reach the same readiness surface.
If you skipped the Qdrant service prompt during bootstrap, the first launcher
run may report one Qdrant readiness warning. Install or repair the service later
with scripts\qdrant\INSTALL_QDRANT_SERVICE.bat.
Leave Watchdog running in one terminal:
conda run --no-capture-output -n goodq_core python -m cli.watchdogCopy one small media file into the configured inbox, then start the API in
another terminal. GOODQ_DATA_ROOT is the base root; the runtime derives the
drop zone as <GOODQ_DATA_ROOT>\GoodQ_Data\import_inbox\.
conda run --no-capture-output -n goodq_core python -m api.serverThen open:
http://127.0.0.1:30000/api/health/summaryhttp://127.0.0.1:30000/docs
The host and port default to GOODQ_API_HOST=127.0.0.1 and
GOODQ_API_PORT=30000 and can be overridden in .env.local.
Reference:
docs/guides/FIRST_RUN.mddocs/guides/watchdog/WATCHDOG_QUICKREF.mddocs/bootstrap/INSTALL_BOOTSTRAP.mddocs/reference/API.md
If you want to verify the stronger claims, use the proving witness and release evidence directly:
GoodQ4All currently supports:
- local install and bootstrap on Windows
- local API runtime
- CLI ingestion
- watchdog-driven long-running ingestion
- SQLite + knowledge graph + Qdrant-backed persisted memory
- CPU-safe baseline execution with optional GPU / WSL acceleration
GoodQ4All does not currently ship a polished end-user UI. That is a future layer, not a current product claim.
UI status:
-
Scene-centric memory Every major interpretation surface is built around scenes as the atomic unit.
-
Full perception-to-memory pipeline The system does not stop at captions or transcripts. It carries perception forward into harmonized scene truth, temporal rollups, graph relationships, and retrieval surfaces.
-
Knowledge graph with conservative identity logic People, concepts, objects, places, speaker patterns, and identity evidence are persisted locally, with promotion rules designed to avoid hallucinated merges.
-
Audit-first quality The system is tuned with witnesses, diagnostics, and reference evals instead of vibes.
-
Local truth boundary Public episode anchors can inform audit and scoring, but they do not overwrite runtime scene evidence.
-
Controlled acceleration GPU and WSL are additive performance layers, not hidden requirements.
-
Failure visibility Optional enrichments may fail without collapsing the whole run, and the failure path is meant to remain visible.
- Host: Windows 11 is the canonical runtime host
- Profiles:
UNSET,BASELINE,GPU_ENHANCED - Perception: scene detection, captions, OCR, object signals, face signals, transcription, diarization, emotion, and embeddings
- Storage: SQLite + knowledge graph + Qdrant
- Memory surface: scene manifests, temporal index, projected run outputs
- Core interpretation layer:
scene_context_llmwithprimary_tags,contextual_tags, andstructural_tags - Identity layer: speaker patterns, voice-pattern matches, identity candidates, supported identities, and evidence edges
- Fusion layer: Phase 6 / Phase 6b harmonization
- Operator surface: API + CLI + watchdog + validation and diagnostics
If you want the deeper technical picture:
docs/architecture/README.mddocs/architecture/SYSTEM_ARCHITECTURE.mddocs/architecture/ARCHITECTURE_REFERENCE.mddocs/architecture/MEMORY_STORAGE.mddocs/architecture/diagrams/docs/PHASE6_MULTIMODAL_FUSION.md
- Guided demo:
docs/guides/DEMO.md - First run:
docs/guides/FIRST_RUN.md - Install:
docs/guides/install/INSTALL.md - Quickstart:
docs/guides/install/QUICKSTART.md - Laptop profile:
docs/guides/install/LAPTOP.md - Docs landing page:
docs/README.md - API reference:
docs/reference/API.md - Current release checkpoint:
docs/releases/RELEASE_0.1.1.md - Support and reporting:
SUPPORT.md
- This is pre-1.0 software. The supported runtime path is stable enough to use, but surrounding helpers and APIs may still evolve.
- A polished product UI is not part of the current shipping surface.
- Some optional enrichments can still fail on individual scenes without invalidating the whole ingest.
- Audio-vector success is provenance-defined: current-run CLAP/Qdrant coverage requires
clap_meta.status == okplus a Qdrant audio payload with matchingrun_idand required provenance fields. Legacy scene-id matches are not current-run proof. - Context weighting is now strong, but the project still treats some interpretation choices as policy-level texture rather than frozen truth.
- Secrets belong in
.env.localonly. - The canonical runtime does not require cloud execution.
- Local storage is the source of truth.
- Public benchmark and eval materials describe outcomes and metrics, not copyrighted transcript dumps.
Reference:
If you only read a few things, read these:
docs/releases/RELEASE_0.1.1.mddocs/releases/SHIP_PROFILE.mddocs/goodq4all_agent_status.mddocs/SYSTEM_SNAPSHOT.mdreports/README.mddocs/diagnostics/README.md
Historical and superseded material is intentionally preserved under docs/archive/, but it is not the front door.
MIT. See LICENSE.








