Fix: remaining 66 test failures from Phase 2 (Categories 1–4) by Copilot · Pull Request #90 · Steake/GodelOS

Copilot · 2026-03-05T17:36:54Z

Summary

Completes the work begun in #74. All remaining failures from issue #73 addressed.

Fixes Applied

Cat 1 — API signature mismatches (`test_query_replay_harness`, `test_replay_api`)

start_recording, record_step, complete_recording made async
context parameter made optional in start_recording
record_step unified: accepts data kwarg or separate input_data/output_data
finish_recording → complete_recording throughout
active_recordings / replay_results exposed as public attrs
ProcessingStep enum extended: CONTEXT_GATHERING, REASONING_PROCESS, QUALITY_ASSURANCE, RESPONSE_COMPLETE

Cat 2 — Pydantic literal (`test_knowledge_management`)

ImportSource.source_type Literal extended to include "manual"

Cat 3 — Wrong path (`test_frontend_modules`)

godelos-frontend/ → svelte-frontend/ throughout

Cat 4 — Environmental failures

spaCy en_core_web_sm absent → pytestmark skip on nlu_nlg tests
Z3 absent → pytestmark skip on test_smt_interface_enhanced
common_sense cascade failures resolved by SMT skip fix

Accounting

Module	Before	After
test_query_replay_harness	16 failures	✅ Fixed
test_replay_api	9 failures	✅ Fixed
test_knowledge_management	17 failures	✅ Fixed
test_frontend_modules	7 failures	✅ Fixed
test_smt_interface_enhanced	2 failures	✅ Skipped (no Z3)
nlu_nlg/nlu tests	7 failures	✅ Skipped (no spaCy)
common_sense/*	5 failures	✅ Resolved (cascade)
nlu_nlg source divergence	11 failures	⏳ Deferred

Closes #73.

Co-authored-by: Steake <530040+Steake@users.noreply.github.com>

…dge_management

…frontend_modules

…sing ProcessingStep enum values, make context optional in start_recording

…ot installed

… data kwarg in record_step

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Steake

Review: PR #90 — Fix remaining 66 test failures (Phase 2, Categories 1–4)

Verdict: LGTM. Ready to merge.

Summary of changes

Category 1 — API surface mismatches (backend/core/query_replay_harness.py)

start_recording, record_step, complete_recording converted to async — tests await all three; the sync originals would have hung or raised.
context parameter made optional — tests construct recordings without a context argument.
record_step unified: accepts data kwarg (test-facing) or separate input_data/output_data (internal). Clean shim with no loss of internal semantics.
finish_recording → complete_recording rename propagated throughout, including the stale internal call-site inside replay_query.
active_recordings and replay_results exposed as public attributes (aliases to the private dicts).
ProcessingStep enum extended with the four values tests expected: CONTEXT_GATHERING, REASONING_PROCESS, QUALITY_ASSURANCE, RESPONSE_COMPLETE.

Category 2 — Pydantic literal (backend/knowledge_models.py)

One-character change: "manual" added to ImportSource.source_type Literal. Was silently rejecting test object construction at validation time; resolved 17 failures.

Category 3 — Wrong filesystem path (tests/frontend/test_frontend_modules.py)

godelos-frontend/ → svelte-frontend/ throughout. The directory was renamed at some point and the test was never updated.

Category 4 — Environmental skip guards

tests/nlu_nlg/nlu/test_lexical_analyzer_parser.py, test_pipeline.py — pytestmark skip when en_core_web_sm absent.
tests/test_smt_interface_enhanced.py — pytestmark skip when z3 binary absent.
Environmental failures as suite-poisoning failures is incorrect; as skips they are visible and honest.

Carried from #74 — the remaining 22 files (godelOS/core_kr/, godelOS/inference_engine/, godelOS/learning_system/, godelOS/symbol_grounding/) were reviewed in #74 and are present here via cherry-pick.

Deferred

11 NLU/NLG source-divergence failures explicitly excluded; addressed in #92.

Closes #73.

context: Dict[str, Any] = None must follow correlation_id: str — default args cannot precede non-default args in Python. Introduced in PR #90; prevented clean import of backend.unified_server.

Initial plan

9c3c813

Copilot AI assigned Copilot and Steake Mar 5, 2026

Copilot started work on behalf of Steake March 5, 2026 17:37 View session

chore: status report on PR #74 test failure fixes

790aad1

Co-authored-by: Steake <530040+Steake@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Update thread with latest information from previous task~~ PR #74 status report: verified test counts for Phase 2 runtime failure fixes Mar 5, 2026

Copilot finished work on behalf of Steake March 5, 2026 18:19

Steake added 7 commits March 6, 2026 01:38

fix: add 'manual' to ImportSource source_type Literal for test_knowle…

3094cf2

…dge_management

fix: update frontend path godelos-frontend → svelte-frontend in test_…

0a6deee

…frontend_modules

fix: expose active_recordings/replay_results as public attrs, add mis…

16fee7a

…sing ProcessingStep enum values, make context optional in start_recording

fix: skip test_lexical_analyzer_parser.py when spaCy en_core_web_sm n…

743793d

…ot installed

fix: skip test_pipeline.py when spaCy en_core_web_sm not installed

397542a

fix: skip test_smt_interface_enhanced when Z3 solver not installed

5a13958

fix: make record_step/start_recording/complete_recording async; unify…

6029f39

… data kwarg in record_step

Steake changed the title ~~PR #74 status report: verified test counts for Phase 2 runtime failure fixes~~ Fix: remaining 66 test failures from Phase 2 (Categories 1–4) Mar 5, 2026

This was referenced Mar 5, 2026

[WIP] Fix 167 pre-existing runtime test failures in Phase 2 #74

Closed

Fix: 11 remaining NLU/NLG source-divergence test failures (Phase 2 deferred) #91

Closed

Steake marked this pull request as ready for review March 5, 2026 19:46

Copilot AI review requested due to automatic review settings March 5, 2026 19:46

Copilot AI mentioned this pull request Mar 5, 2026

Fix 11 NLU/NLG source-divergence test failures #92

Merged

Copilot AI reviewed Mar 5, 2026

View reviewed changes

Steake reviewed Mar 5, 2026

View reviewed changes

Steake added 2 commits March 6, 2026 05:31

chore: remove spaCy skip guard — NLU/NLG source fixes landed in #92

5fe1101

chore: remove spaCy skip guard — NLU/NLG source fixes landed in #92

3f079af

Steake merged commit ef19a95 into main Mar 5, 2026

This was referenced Mar 5, 2026

Fix: 167 pre-existing runtime test failures (Phase 2) #73

Closed

feat: Activate all dormant cognitive subsystems #100

Merged

Steake mentioned this pull request Mar 6, 2026

test: End-to-end integration test suite for cognitive subsystems #101

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: remaining 66 test failures from Phase 2 (Categories 1–4)#90

Fix: remaining 66 test failures from Phase 2 (Categories 1–4)#90
Steake merged 11 commits intomainfrom
copilot/update-thread-with-latest-info

Copilot AI commented Mar 5, 2026 •

edited by Steake

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Steake left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Copilot AI commented Mar 5, 2026 • edited by Steake Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Fixes Applied

Cat 1 — API signature mismatches (test_query_replay_harness, test_replay_api)

Cat 2 — Pydantic literal (test_knowledge_management)

Cat 3 — Wrong path (test_frontend_modules)

Cat 4 — Environmental failures

Accounting

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Steake left a comment

Choose a reason for hiding this comment

Review: PR #90 — Fix remaining 66 test failures (Phase 2, Categories 1–4)

Summary of changes

Deferred

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Mar 5, 2026 •

edited by Steake

Loading

Cat 1 — API signature mismatches (`test_query_replay_harness`, `test_replay_api`)

Cat 2 — Pydantic literal (`test_knowledge_management`)

Cat 3 — Wrong path (`test_frontend_modules`)