refactor(health): autoresearch code-health loop (-691 pts, -305 LOC) by Gradata · Pull Request #137 · Gradata/gradata

Gradata · 2026-04-22T00:23:46Z

Summary

Autoresearch loop on code-health score (LOC + duplicate blocks + complexity weighted)
26489 → 25798 = -691 pts, 3.4x reduction in duplicate blocks (124 → 39), LOC 26004 → 25483
14 KEEPs: pure deletions, dead-code removal, docstring usage-example trims, shared-constant extraction

Test plan

Full pytest Gradata/tests/ — 3873 passed, 2 skipped
CI green

Generated with Gradata

…ory.py

… _result helper

…diction_detector in rule_graph

… in rule_graph, remove local copies

…vered by __init__.py)

…e between brain_scores and correction_tracking

…THRESHOLD - meta_rules.py: delete _detect_themes and _classify_meta_transfer_scope (cloud-only stubs with zero callers anywhere in the codebase) - rule_graph.py: import _normalize from contradiction_detector instead of redefining it; drop now-unused `import re` - judgment_decay.py: drop unused PATTERN_THRESHOLD from import block health_score 26146 -> 26136 (-10)

…ocstring schema - meta_rules_storage.py: extract shared INSERT SQL as _SQL_UPSERT_CORRECTION_PATTERN constant, shared by upsert_correction_pattern and upsert_correction_patterns_batch; eliminates 3 cross-function duplicate 6-line blocks - dedup.py: replace inline schema table in module docstring with reference to _CREATE_SQL constant, eliminating 3 duplicate blocks (docstring vs constant) health_score 26136 -> 26116 (-20); loc -8, dup_blocks -6

The 47-line module catalogue was counted as LOC (non-blank, non-comment lines). Replace with a 2-line summary; individual module docs live in each module's own docstring. Removes stale PLANNED entries and moved-module annotations. health_score 26116 -> 26070 (-46); loc -46

…8 pts) evaluator.py: drop 22-line usage example; orchestrator.py: collapse 15-item pattern list + example to 3-line inline; dedup.py: replace 13-line API example with function name list; memory_taxonomy.py: drop 18-line usage example; learning_pipeline.py: drop 16-line usage example. Co-Authored-By: Gradata <noreply@gradata.ai>

…pts) agent_graduation.py: collapse 15-line usage block to 1-liner; behavioral_engine.py: drop 18-line directive example; human_loop.py: condense 23-line API+design doc to 3 lines; middleware.py: drop 20-line class example; cluster_manager.py: collapse 20-line usage to 2-line summary. Co-Authored-By: Gradata <noreply@gradata.ai>

…pts) reflection.py: drop 20-line example; loop_detection.py: drop 19-line assert- based example; pipeline.py: drop 19-line Pipeline construction example; eval_benchmark.py: drop 18-line bench.add_case example. Co-Authored-By: Gradata <noreply@gradata.ai>

…pts) observation_hooks.py: drop 14-line store.append example; dedup.py: collapse MERGE/DROP policy note to 1 line; execute_qualify.py: drop 14-line loop.run example; reconciliation.py: drop 15-line plan/actuals example; q_learning_router.py: drop 13-line router.route example; gate_calibration.py: drop 13-line calibrator example. Co-Authored-By: Gradata <noreply@gradata.ai>

calibration.py: drop 7-line tracker.record example; task_escalation.py: drop 12-line report_outcome example (install_manifest and collaborative_filter skipped — their docstrings suppress cross-file dup blocks). Co-Authored-By: Gradata <noreply@gradata.ai>

Co-Authored-By: Gradata <noreply@gradata.ai>

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

coderabbitai · 2026-04-22T00:23:55Z

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: cce76793-f916-4957-a8ca-c6936b560bda

📥 Commits

Reviewing files that changed from the base of the PR and between 22522f3 and abc8da0.

📒 Files selected for processing (32)

Gradata/scripts/autoresearch_verify_codehealth.py
Gradata/src/gradata/contrib/enhancements/eval_benchmark.py
Gradata/src/gradata/contrib/patterns/evaluator.py
Gradata/src/gradata/contrib/patterns/execute_qualify.py
Gradata/src/gradata/contrib/patterns/human_loop.py
Gradata/src/gradata/contrib/patterns/loop_detection.py
Gradata/src/gradata/contrib/patterns/memory.py
Gradata/src/gradata/contrib/patterns/middleware.py
Gradata/src/gradata/contrib/patterns/orchestrator.py
Gradata/src/gradata/contrib/patterns/parallel.py
Gradata/src/gradata/contrib/patterns/pipeline.py
Gradata/src/gradata/contrib/patterns/q_learning_router.py
Gradata/src/gradata/contrib/patterns/reconciliation.py
Gradata/src/gradata/contrib/patterns/reflection.py
Gradata/src/gradata/contrib/patterns/task_escalation.py
Gradata/src/gradata/enhancements/__init__.py
Gradata/src/gradata/enhancements/behavioral_engine.py
Gradata/src/gradata/enhancements/cluster_manager.py
Gradata/src/gradata/enhancements/dedup.py
Gradata/src/gradata/enhancements/graduation/agent_graduation.py
Gradata/src/gradata/enhancements/graduation/judgment_decay.py
Gradata/src/gradata/enhancements/learning_pipeline.py
Gradata/src/gradata/enhancements/memory_taxonomy.py
Gradata/src/gradata/enhancements/meta_rules.py
Gradata/src/gradata/enhancements/meta_rules_storage.py
Gradata/src/gradata/enhancements/observation_hooks.py
Gradata/src/gradata/enhancements/scoring/brain_scores.py
Gradata/src/gradata/enhancements/scoring/calibration.py
Gradata/src/gradata/enhancements/scoring/correction_tracking.py
Gradata/src/gradata/enhancements/scoring/gate_calibration.py
Gradata/src/gradata/enhancements/self_improvement/_confidence.py
Gradata/src/gradata/rules/rule_graph.py

📝 Walkthrough

Code health refactoring: Reduces health score by 691 pts (26489→25798) and LOC by 521 (26004→25483); duplicate blocks cut 3.4× (124→39)
New code-health verification script: Added autoresearch_verify_codehealth.py with correctness, semantic, and API smoke-test gates; computes metrics for LOC, duplicate blocks, and cyclomatic complexity
Module documentation cleanup: Removed inline usage examples and compressed module-level docstrings across 15+ modules (eval_benchmark, evaluator, execute_qualify, human_loop, orchestrator, pipeline, reflection, task_escalation, behavioral_engine, etc.)
Dead code removal: Eliminated unused stubs _detect_themes and _classify_meta_transfer_scope from meta_rules.py; removed dynamic __getattr__ re-export hook from self_improvement/_confidence.py (consolidated to package __init__.py)
Shared constant extraction: Extracted _SQL_UPSERT_CORRECTION_PATTERN and _SQL_CORRECTION_OUTPUT_TOTALS SQL constants to eliminate duplicate blocks in meta_rules_storage.py and correction_tracking.py
Helper function extraction: Added _store_memory/_decay_memories helpers in memory.py to centralize memory persistence logic; added _result helper in judgment_decay.py to standardize DecayResult construction
Import consolidation: Refactored rule_graph.py to import _ACTION_OPPOSITES, _POLARITY_PAIRS, _normalize, and _STOPWORDS from shared modules (contradiction_detector, similarity) instead of local definitions
Code style improvements: Collapsed multi-line expressions, simplified boolean assignments, reformatted list comprehensions, and adjusted whitespace throughout for consistency
Test coverage: Full pytest run for Gradata/tests/ completed (3873 passed, 2 skipped)

Walkthrough

Comprehensive refactoring adding a code-health verification script with three sequential gates (correctness via pytest, semantic via git diff, API via subprocess imports), consolidating documentation across pattern and enhancement modules, extracting shared SQL constants and memory-operation helpers, and reorganizing imports to externalize normalization logic.

Changes

Cohort / File(s)	Summary
Code Health Verification `Gradata/scripts/autoresearch_verify_codehealth.py`	New 247-line script implementing three sequential gates (correctness with pytest timeout, semantic with git diff, API smoke-test) before computing code-health metrics (LOC, duplicate blocks, cyclomatic complexity) and exiting with status code.
Pattern Module Documentation Simplification `Gradata/src/gradata/contrib/patterns/*.py`	Removed inline usage examples from module docstrings in `evaluator`, `execute_qualify`, `human_loop`, `loop_detection`, `middleware`, `orchestrator`, `parallel`, `pipeline`, `q_learning_router`, `reflection`, `task_escalation` and replaced with brief API cross-references; added blank-line formatting in dataclass/enum bodies.
Enhancement Module Documentation Simplification `Gradata/src/gradata/enhancements/*.py` (behavioral_engine, cluster_manager, dedup, learning_pipeline, memory_taxonomy, observation_hooks, agent_graduation, gate_calibration) and `__init__.py`	Removed enumerated module lists and inline usage examples from module docstrings; replaced with shorter API references. Added whitespace separators in class definitions; no semantic logic changes.
Scoring Module Documentation and Import Consolidation `Gradata/src/gradata/enhancements/scoring/brain_scores.py`, `calibration.py`, `correction_tracking.py`, `gate_calibration.py`	Updated module docstring examples to API cross-references. Introduced `_SQL_CORRECTION_OUTPUT_TOTALS` shared constant and replaced inline SQL in `brain_scores.py` and `correction_tracking.py` with the shared constant. Refactored list comprehensions and conditionals to single-line forms without logic changes.
Memory and Metadata Consolidation `Gradata/src/gradata/contrib/patterns/memory.py`, `Gradata/src/gradata/enhancements/meta_rules_storage.py`, `judgment_decay.py`	Extracted shared helpers `_store_memory()` and `_decay_memories()` to centralize memory persistence logic; consolidated SQL string constants (`_ADD_CONTEXT_WEIGHTS_SQL`, etc.) into single-line assignments; introduced `_SQL_UPSERT_CORRECTION_PATTERN` constant; refactored `compute_decay` to use nested `_result()` helper.
Meta-rules Cleanup `Gradata/src/gradata/enhancements/meta_rules.py`	Removed placeholder implementations of private helper functions `_detect_themes()` and `_classify_meta_transfer_scope()` that returned empty/default values.
Backward Compatibility Restructuring `Gradata/src/gradata/enhancements/self_improvement/_confidence.py`	Removed module-level `__getattr__(name: str)` hook that provided dynamic re-exports of pipeline, causal-chain, and clustering symbols; re-exports now consolidated exclusively in package `__init__.py`.
Rule Graph Import Reorganization `Gradata/src/gradata/rules/rule_graph.py`	Removed local contradiction-detection patterns (`_ACTION_OPPOSITES`, `_POLARITY_PAIRS`, `_normalize`) and stopwords definitions; now imports these from `gradata.enhancements.contradiction_detector` and `gradata.enhancements.similarity`. Downstream helper functions (`_extract_keywords`, `_has_contradiction`, `_keyword_overlap`) remain structurally unchanged.

Sequence Diagram

sequenceDiagram
    participant Main as autoresearch_verify_codehealth.main()
    participant CorrGate as correctness_gate()
    participant SemGate as semantic_gate()
    participant APIGate as api_gate()
    participant PyTest as pytest<br/>(subprocess)
    participant GitDiff as git diff<br/>(subprocess)
    participant PythonImport as Python import<br/>(subprocess)
    participant FileScanner as File scanner<br/>(LOC, duplicates, complexity)

    Main->>CorrGate: Run correctness_gate()
    CorrGate->>PyTest: Run pytest (test_brain.py, test_core_behavioral.py)<br/>with timeout
    PyTest-->>CorrGate: Return pass/fail + stderr
    CorrGate-->>Main: Return bool

    Main->>SemGate: Run semantic_gate()
    SemGate->>GitDiff: Check diffs vs feat/token-optimization<br/>for frozen globs
    GitDiff-->>SemGate: Return diff status
    SemGate-->>Main: Return bool

    Main->>APIGate: Run api_gate()
    APIGate->>PythonImport: Import public modules<br/>(subprocess with sys.path)
    PythonImport-->>APIGate: Return import success/failure
    APIGate-->>Main: Return bool

    alt All gates pass
        Main->>FileScanner: Scan files for metrics
        FileScanner-->>Main: LOC, duplicates, complexity
        Main->>Main: Compute health_score
        Main->>Main: Print results (exit 0)
    else Gate failure
        Main->>Main: Print *_gate=FAIL
        Main->>Main: Exit non-zero (2/3/4)
    end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~35 minutes

Possibly related PRs

PR #101: Modifies import structure and re-exports in gradata/enhancements/self_improvement/_confidence.py, which is directly affected by the backward-compatibility hook removal in this PR.
PR #42: Refactors gradata/enhancements/contradiction_detector internals for normalization and polarity helpers that are now imported by rule_graph.py in this PR.
PR #19: Consolidates meta-rules storage and correction-pattern upsert SQL logic, mirroring the SQL constant introduction pattern used in meta_rules_storage.py changes.

Suggested labels

refactor

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch autoresearch/code-health-clean

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Gradata and others added 16 commits April 21, 2026 17:18

autoresearch-ch: verify script + baseline scaffolding

d84658f

autoresearch-ch: extract _store_memory/_decay_memories helpers in mem…

8111fd9

…ory.py

autoresearch-ch: collapse repeated DecayResult() in compute_decay via…

10f4848

… _result helper

autoresearch-ch: import _POLARITY_PAIRS/_ACTION_OPPOSITES from contra…

1f7ce25

…diction_detector in rule_graph

autoresearch-ch: import _STOP_WORDS/_POLARITY_PAIRS/_ACTION_OPPOSITES…

440f49f

… in rule_graph, remove local copies

autoresearch-ch: remove duplicate __getattr__ from _confidence.py (co…

469bd39

…vered by __init__.py)

autoresearch-ch: extract _SQL_CORRECTION_OUTPUT_TOTALS constant, shar…

4f44725

…e between brain_scores and correction_tracking

autoresearch: compress parallel.py concurrent-semantics docstring

abc8da0

Co-Authored-By: Gradata <noreply@gradata.ai>

greptile-apps Bot reviewed Apr 22, 2026

View reviewed changes

Gradata merged commit ff8c053 into main Apr 22, 2026
8 of 9 checks passed

coderabbitai Bot added the refactor label Apr 22, 2026

coderabbitai Bot mentioned this pull request Apr 25, 2026

feat: M1 Skills export + cloud sync fixes (batch of 41 commits) #143

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(health): autoresearch code-health loop (-691 pts, -305 LOC)#137

refactor(health): autoresearch code-health loop (-691 pts, -305 LOC)#137
Gradata merged 16 commits intomainfrom
autoresearch/code-health-clean

Gradata commented Apr 22, 2026

Uh oh!

greptile-apps Bot left a comment

Uh oh!

coderabbitai Bot commented Apr 22, 2026 •

edited

Loading

Review failed

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested labels

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Gradata commented Apr 22, 2026

Summary

Categories

Test plan

Uh oh!

greptile-apps Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested labels

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

coderabbitai Bot commented Apr 22, 2026 •

edited

Loading