chore: untrack private files leaking into public repo #125

Merged
Gradata merged 1 commit into main from chore/untrack-private-files
Apr 20, 2026

@Gradata Gradata commented Apr 20, 2026

Summary

`git ls-files -ci --exclude-standard` surfaced 31 files tracked on main that match gitignore rules. These currently ship in the public repo and the PyPI wheel even though they're marked private.

Untracked (kept on local disk):

  • Sprites/brain/scripts/* — 24 files (tenant migrations, cloud SQL schemas, ablation harness, brain benchmark, mirofish sim, autoresearch scripts)
  • Gradata/docs/superpowers/* — 5 internal session-plan docs
  • Gradata/docs/RELEASE-v0.5.0-DRAFT.md — unpublished release draft
  • Gradata/src/gradata/demo/brain/brain.manifest.json — generated artifact in the wheel

Also: remove Gradata/docs/cloud/ from .gitignore — those are legitimate public cloud-product docs that were shipping correctly; the ignore rule was historical cruft.
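To see which ignore rule (if any) catches a given path, `git check-ignore -v` can help. The helper below is only a sketch; the name `explain_ignore` is hypothetical and not part of this PR. It passes `--no-index` so that paths already tracked are still evaluated against the rules.

```shell
# Hypothetical helper (not from this PR): report which ignore rule matches a path.
# `git check-ignore -v` prints "<source>:<line>:<pattern><TAB><path>" and exits 0
# when the path is ignored, 1 when it is not, and 128 on a fatal error.
explain_ignore() {
  rc=0
  # --no-index: evaluate the rules even if the path is already tracked.
  git check-ignore -v --no-index -- "$1" || rc=$?
  case "$rc" in
    0) return 0 ;;
    1) echo "not ignored: $1" ;;
    *) return "$rc" ;;
  esac
}
```

Usage would look like `explain_ignore Gradata/docs/cloud/`, run from the repository root before and after editing `.gitignore`.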

After this PR, `git ls-files -ci --exclude-standard` returns empty. Nothing private ships.
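The PR does not show the exact commands that were run. As a minimal sketch of one way to do this kind of cleanup, the function below feeds the tracked-but-ignored list to `git rm --cached`, which removes entries from the index and leaves the working tree alone. The function name `untrack_ignored` is hypothetical, and the sketch assumes an `xargs` that supports `-0` (GNU, BSD, and BusyBox do).

```shell
# Hypothetical helper (not from this PR): untrack every file that is tracked
# but matches an ignore rule, keeping the copies on disk.
untrack_ignored() {
  # Nothing to do when no tracked file matches an ignore rule.
  [ -n "$(git ls-files -ci --exclude-standard)" ] || return 0
  # -z / -0 pass path names NUL-separated so spaces and quotes survive.
  git ls-files -ci --exclude-standard -z |
    xargs -0 git rm --cached --quiet --
}
```

The removal only becomes part of history once it is committed, and files that appeared in earlier commits remain reachable in that history.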

Test plan

  • `git ls-files -ci --exclude-standard` → 0 lines
  • All files still on disk (used `--cached` only, not working-tree delete)
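The two checks in the test plan could be scripted roughly as follows. This is a sketch under assumptions, not the check the author ran: `verify_untracked` is a hypothetical name, and the paths to confirm on disk are supplied by the caller.

```shell
# Hypothetical helper (not from this PR): fail if any tracked file still
# matches an ignore rule, or if any of the given paths is missing on disk.
verify_untracked() {
  count=$(git ls-files -ci --exclude-standard | wc -l | tr -d ' ')
  if [ "$count" -ne 0 ]; then
    echo "still tracked and ignored: $count file(s)" >&2
    return 1
  fi
  for path in "$@"; do
    if [ ! -e "$path" ]; then
      echo "missing on disk: $path" >&2
      return 1
    fi
  done
  echo "ok"
}
```

For example, `verify_untracked Gradata/docs/RELEASE-v0.5.0-DRAFT.md` would check the release draft from the list above on a machine that still has the local copy.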

Generated with Gradata

`git ls-files -ci --exclude-standard` surfaced 31 files that were
tracked on main but matched gitignore rules — meaning they ship in
the public repo and PyPI wheel even though they're marked private.

Removing from tracking (keeping on local disk):
- Sprites/brain/scripts/* (24 files) — private sales/brain scripts
  that leaked into the public SDK repo. Includes cloud_rls_test.sql,
  cloud_schema.sql, tenant migration scripts, ablation harness code.
- Gradata/docs/superpowers/* (5 files) — internal session-plan docs
- Gradata/docs/RELEASE-v0.5.0-DRAFT.md — unpublished release draft
- Gradata/src/gradata/demo/brain/brain.manifest.json — generated
  artifact that shouldn't be in the wheel

Also removing `Gradata/docs/cloud/` from .gitignore — those are
legitimate public cloud-product docs and were shipping correctly;
the ignore rule was historical cruft.

After this: `git ls-files -ci --exclude-standard` → empty. Nothing
private ships.

Co-Authored-By: Gradata <noreply@gradata.ai>

coderabbitai Bot commented Apr 20, 2026

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: d8940d5f-3ed5-4001-a4dc-16305f74ddfb

📥 Commits

Reviewing files that changed from the base of the PR and between af06270 and 6dd5980.

📒 Files selected for processing (32)
  • .gitignore
  • Gradata/docs/RELEASE-v0.5.0-DRAFT.md
  • Gradata/docs/superpowers/plans/2026-04-10-s101-master-plan.md
  • Gradata/docs/superpowers/plans/2026-04-11-hierarchical-rule-tree.md
  • Gradata/docs/superpowers/specs/2026-04-10-s101-session-plan.md
  • Gradata/docs/superpowers/specs/2026-04-11-cross-validation-gaps-design.md
  • Gradata/docs/superpowers/specs/2026-04-11-hierarchical-rule-tree-design.md
  • Gradata/src/gradata/demo/brain/brain.manifest.json
  • Sprites/brain/scripts/README-ablation-beta-lb.md
  • Sprites/brain/scripts/_common.py
  • Sprites/brain/scripts/ab_test_constitutional.py
  • Sprites/brain/scripts/ablation_beta_lb_gate.py
  • Sprites/brain/scripts/audit_duplicates.py
  • Sprites/brain/scripts/autoresearch_build_scope.py
  • Sprites/brain/scripts/autoresearch_consolidation_verify.sh
  • Sprites/brain/scripts/autoresearch_smoke_test.py
  • Sprites/brain/scripts/brain_benchmark.py
  • Sprites/brain/scripts/cloud_rls_test.sql
  • Sprites/brain/scripts/cloud_schema.sql
  • Sprites/brain/scripts/cross_validate.py
  • Sprites/brain/scripts/fill_null_tenant.py
  • Sprites/brain/scripts/half_life_fit.py
  • Sprites/brain/scripts/migrate_tree_paths.py
  • Sprites/brain/scripts/migrations/001_add_tenant_id.py
  • Sprites/brain/scripts/migrations/__init__.py
  • Sprites/brain/scripts/migrations/_runner.py
  • Sprites/brain/scripts/migrations/tenant_uuid.py
  • Sprites/brain/scripts/mine_cc_transcripts.py
  • Sprites/brain/scripts/mirofish_config_a.json
  • Sprites/brain/scripts/mirofish_config_b.json
  • Sprites/brain/scripts/mirofish_sim.py
  • Sprites/brain/scripts/optimization_runner.py

📝 Walkthrough
  • Removes 31 tracked files from git that matched .gitignore rules, using git rm --cached to untrack them while keeping files on local disk
  • Untracks 24 private scripts under Sprites/brain/scripts/* including: tenant migrations, cloud SQL schemas, ablation test harnesses, brain benchmarking utilities, MiroFish simulations, and autoresearch tools
  • Removes 5 internal session-planning docs from Gradata/docs/superpowers/ (project plans, implementation specs, design specs)
  • Removes unpublished release draft Gradata/docs/RELEASE-v0.5.0-DRAFT.md
  • Removes generated artifact Gradata/src/gradata/demo/brain/brain.manifest.json that should not ship in PyPI wheel
  • Updates .gitignore to remove Gradata/docs/cloud/ exclusion, allowing public cloud-product documentation to be tracked
  • Verification: git ls-files -ci --exclude-standard returns empty after changes, ensuring no private files remain tracked

Walkthrough

This pull request removes 31 files from git tracking across documentation, scripts, and configuration directories, including release/planning documents, demonstration assets, and standalone Python/SQL utility scripts. The only modified file is .gitignore; every other change is a deletion from the repository, and no new files are added.

Changes

| Cohort | File(s) | Summary |
|---|---|---|
| Git Configuration | .gitignore | Removed ignore rule for Gradata/docs/cloud/ directory, enabling its contents to be tracked in git. |
| Planning & Release Documentation | Gradata/docs/RELEASE-v0.5.0-DRAFT.md, Gradata/docs/superpowers/plans/2026-04-10-s101-master-plan.md, Gradata/docs/superpowers/plans/2026-04-11-hierarchical-rule-tree.md, Gradata/docs/superpowers/specs/2026-04-10-s101-session-plan.md, Gradata/docs/superpowers/specs/2026-04-11-cross-validation-gaps-design.md, Gradata/docs/superpowers/specs/2026-04-11-hierarchical-rule-tree-design.md | Removed release checklist, execution plans, and design specifications for S101 features, hierarchical rule tree, cross-validation, and session management. |
| Demo Assets | Gradata/src/gradata/demo/brain/brain.manifest.json | Removed demo brain manifest containing version, statistics, and quality metrics. |
| Cloud Infrastructure & Migration Scripts | Sprites/brain/scripts/cloud_rls_test.sql, Sprites/brain/scripts/cloud_schema.sql, Sprites/brain/scripts/migrations/001_add_tenant_id.py, Sprites/brain/scripts/migrations/__init__.py, Sprites/brain/scripts/migrations/_runner.py, Sprites/brain/scripts/migrations/tenant_uuid.py | Removed Supabase RLS testing, cloud schema migration, and tenant-management utilities including multi-tenant initialization and SQLite migration infrastructure. |
| Analysis & Simulation Scripts | Sprites/brain/scripts/ab_test_constitutional.py, Sprites/brain/scripts/ablation_beta_lb_gate.py, Sprites/brain/scripts/audit_duplicates.py, Sprites/brain/scripts/brain_benchmark.py, Sprites/brain/scripts/cross_validate.py, Sprites/brain/scripts/half_life_fit.py, Sprites/brain/scripts/mine_cc_transcripts.py, Sprites/brain/scripts/optimization_runner.py | Removed standalone evaluation, benchmarking, and analysis tools including A/B testing harnesses, ablation pilots, file auditing, brain scoring, cross-validation synthesis, retention modeling, implicit-feedback mining, and optimization loop orchestration. |
| Autoresearch Infrastructure | Sprites/brain/scripts/autoresearch_build_scope.py, Sprites/brain/scripts/autoresearch_consolidation_verify.sh, Sprites/brain/scripts/autoresearch_smoke_test.py | Removed autoresearch pipeline steps for scope whitelist generation, composite verification with pytest/ruff/LOC metrics, and end-to-end SDK smoke testing. |
| Migration & Utility Scripts | Sprites/brain/scripts/fill_null_tenant.py, Sprites/brain/scripts/migrate_tree_paths.py | Removed database tenant-backfill and lesson-path hierarchical migration utilities. |
| Documentation & Configuration | Sprites/brain/scripts/README-ablation-beta-lb.md, Sprites/brain/scripts/_common.py, Sprites/brain/scripts/mirofish_config_a.json, Sprites/brain/scripts/mirofish_config_b.json, Sprites/brain/scripts/mirofish_sim.py | Removed ablation documentation, SDK-path and Ollama generation utilities, MiroFish LLM multi-agent simulation configurations and runner. |

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Suggested labels

docs, security

@Gradata Gradata merged commit 5ef1a1e into main Apr 20, 2026
1 check was pending
@Gradata Gradata deleted the chore/untrack-private-files branch April 20, 2026 08:06