Restore alpha-engine-data in boot-pull REPOS (pair with spot migration) #20

Merged: cipher813 merged 1 commit into main from chore/restore-alpha-engine-data-in-boot-pull on Apr 16, 2026
@cipher813
Owner

Summary

Reverts the REPOS trim from #19. Pair with cipher813/alpha-engine-data#44 (DataPhase1 spot migration).

Why

My earlier audit missed that the Saturday Step Function targets ae-dashboard for four steps that run code from alpha-engine-data:

| Step | Dependency on alpha-engine-data |
|---|---|
| DataPhase1 | spot launcher (new in #44): `infrastructure/spot_data_phase1.sh` |
| RAGIngestion | `rag/pipelines/run_weekly_ingestion.sh` + heavy venv (deferred migration) |
| DriftDetection | `/home/ec2-user/alpha-engine-data/.venv/bin/python -m monitoring.drift_detector` |
| SaturdayHealthCheck | now fixed to point at alpha-engine-dashboard in #44 |

So alpha-engine-data needs to stay checked out on ae-dashboard. Removing it in #19 was premature — that audit only looked at crontab + dashboard-repo imports and didn't check the Saturday SF.
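
A broader audit could search every place that can launch commands on the box, not only crontab and imports. The helper below is a minimal sketch of that idea: `find_repo_refs` is a hypothetical name, and the input files (a crontab dump, an exported Step Function definition) are assumptions about where references would show up.

```bash
#!/usr/bin/env bash
# Hypothetical audit helper; not part of this repo.
# find_repo_refs NAME FILE...: print file:line:text for each line mentioning NAME.
# Exits 0 if any reference is found, 1 if none.
find_repo_refs() {
    local name="$1"
    shift
    # -F: treat NAME as a fixed string; -n: line numbers; -H: always print the file name
    grep -nH -F -- "$name" "$@"
}

# Example inputs (assumed; the state machine ARN is a placeholder):
#   crontab -l > crontab.txt
#   aws stepfunctions describe-state-machine --state-machine-arn "$SF_ARN" \
#       --query definition --output text > saturday-sf.json
#   find_repo_refs alpha-engine-data crontab.txt saturday-sf.json
```

An exit status of 1 from this check only means the name does not appear in the files supplied, so the list of sources still has to be complete before a repo is dropped from REPOS.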

Change

```diff
 REPOS=(
     /home/ec2-user/alpha-engine-config
+    /home/ec2-user/alpha-engine-data
     /home/ec2-user/alpha-engine-research
     /home/ec2-user/alpha-engine-dashboard
     /home/ec2-user/flow-doctor
 )
```
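
For context, a refresh loop over a list like REPOS might look roughly like the sketch below. This is not the actual boot-pull script: `pull_repos` is a hypothetical function, and the assumption that a missing checkout is reported rather than cloned is inferred from the manual re-clone step in this PR.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of a boot-pull style loop; the real script may differ.
# pull_repos DIR...: fast-forward each existing checkout, report the rest.
# Returns 1 if any directory is missing or fails to pull.
pull_repos() {
    local repo failed=0
    for repo in "$@"; do
        if [ ! -d "$repo/.git" ]; then
            # Assumption: the loop never clones, so a removed repo stays missing.
            echo "MISSING $repo"
            failed=1
            continue
        fi
        if git -C "$repo" pull --ff-only --quiet; then
            echo "OK $repo"
        else
            echo "FAILED $repo"
            failed=1
        fi
    done
    return "$failed"
}

# Usage with the array from the diff above:
#   pull_repos "${REPOS[@]}"
```

Under that assumption, adding the path back to REPOS is not enough on its own; the directory has to exist again before the timer can refresh it.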

After this merges — manual deploy steps

```bash
# ae-dashboard: re-clone the repo (rm -rf was run earlier)
ae-dashboard "git clone git@github.com:cipher813/alpha-engine-data.git /home/ec2-user/alpha-engine-data"

# rebuild .venv (heavy: yfinance, polygon, FRED, ArcticDB, SEC EDGAR tools)
ae-dashboard "cd /home/ec2-user/alpha-engine-data && python3 -m venv .venv && source .venv/bin/activate && pip install -r requirements.txt"
```
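
A quick post-deploy check could confirm that the paths the Saturday steps depend on are back in place. The sketch below is an assumption-laden example: `check_data_checkout` is a hypothetical helper, the path list is taken from the table in this PR, and `infrastructure/spot_data_phase1.sh` will only exist once cipher813/alpha-engine-data#44 has merged.

```bash
#!/usr/bin/env bash
# Hypothetical post-deploy check; not part of this repo.
# check_data_checkout ROOT: print each expected path that is absent under ROOT.
# Returns 0 when everything is present, 1 otherwise.
check_data_checkout() {
    local root="$1" missing=0 path
    for path in \
        infrastructure/spot_data_phase1.sh \
        rag/pipelines/run_weekly_ingestion.sh \
        .venv/bin/python
    do
        if [ ! -e "$root/$path" ]; then
            echo "missing: $root/$path"
            missing=1
        fi
    done
    return "$missing"
}

# Usage on the instance:
#   check_data_checkout /home/ec2-user/alpha-engine-data
```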

After that, the next `boot-pull.timer` fire at 12:00 UTC will keep it refreshed going forward.

Follow-up

Once RAGIngestion + DriftDetection also migrate to spot (planned next week), the heavy `.venv` on the micro can go away. The repo itself will still need to be checked out for the launcher scripts, but the memory footprint on the micro shrinks significantly.

🤖 Generated with Claude Code

Reverts the removal from #19. The Saturday Step Function still runs
RAGIngestion + DriftDetection on the micro from alpha-engine-data,
and the new DataPhase1 spot launcher (cipher813/alpha-engine-data#44)
lives at alpha-engine-data/infrastructure/spot_data_phase1.sh — the
micro invokes it as a dispatcher and needs the repo checked out.

Context: #19 assumed ae-dashboard had no runtime need for
alpha-engine-data once health_checker + trading_calendar moved here.
That was wrong — the Saturday SF has 4 separate steps that target
the micro from alpha-engine-data (DataPhase1, RAGIngestion,
DriftDetection, SaturdayHealthCheck). I missed this when
recommending #19.

After cipher813/alpha-engine-data#44 merges and RAG/Drift also
migrate to spot (planned follow-up), alpha-engine-data may still
need to be cloned on the micro for the launcher scripts but the
heavy .venv becomes unnecessary. At that point this line can be
removed again along with a lean-clone pattern.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@cipher813 cipher813 merged commit a89c0d8 into main Apr 16, 2026
1 check passed
@cipher813 cipher813 deleted the chore/restore-alpha-engine-data-in-boot-pull branch April 16, 2026 17:35