# 📋 BATCH 1: META-LEARNER ARCHITECTURE (Core - 6 Questions)

**Critical Path**: Module 4 depends on these answers  
**Estimated Time**: 60 min

## Q1: Optimal Ensemble Method for Trading Forecaster

**Perplexity Query**:
```
Given a trading forecaster combining:
- Pattern detection signals (Elliott Wave, candlesticks, S/R)
- Regime classification (12-regime system: VIX × breadth × ATR)
- Research features (40-60 engineered: dark pool, sentiment, microstructure)
- Pre-catalyst signals (options OI, insider trades, news)
- Cross-asset correlations (BTC, yields, VIX leads)

Which ensemble architecture minimizes overfitting on small samples (~100 trades/regime) while maximizing predictive accuracy:
A) Weighted average with regime-specific weights
B) Gradient boosting (XGBoost/LightGBM)
C) Bayesian model averaging
D) Small neural network (3 layers, 32-16-8 units)
E) Hierarchical ensemble (patterns→regime→research→final)

Provide: Implementation recommendation, expected accuracy improvement, computational cost, pros/cons for each.
```

In [None]:
# Q1 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Architecture: 
# - Hyperparameters: 
# - Expected improvement: 
# - Computational cost: 
# 
# CODE SNIPPET:
# [Copy relevant code from Perplexity if provided]

## Q2: Regime-Aware Weight Matrices

**Perplexity Query**:
```
For 12 market regimes (BULL/BEAR/NEUTRAL × LOW/NORMAL/ELEVATED/EXTREME_VOL × STABLE/MODERATE/VOLATILE_ATR):

How should signal weights change dynamically:
- BULL_LOW_VOL_STABLE: favor patterns (40%?), regime (30%?), research (20%?)?
- BEAR_EXTREME_VOL_VOLATILE: favor catalysts (40%?), research (30%?), cross-asset (20%?)?
- NEUTRAL_NORMAL_VOL_MODERATE: favor microstructure (35%?), regime (25%?)?

Provide: Complete 12×5 weight matrix (12 regimes × 5 signal types) optimized for Sharpe ratio. Cite research if available (Hamilton 2024 regime-switching, others).
```

In [None]:
# Q2 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Weight matrix: {
#     'BULL_LOW_VOL_STABLE': {'patterns': 0.40, 'regime': 0.30, 'research': 0.20, ...},
#     'BEAR_EXTREME_VOL_VOLATILE': {'catalysts': 0.40, 'research': 0.30, ...},
#     ...
# }
# 
# CODE SNIPPET:
# [12x5 matrix as Python dict or numpy array]
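
As a starting point for the matrix above, here is a minimal sketch of how a regime-keyed weight table could be stored and applied. It assumes each signal is already scored in [-1, 1]. The weights are the question-marked guesses from the query, not researched values, and `normalize` and `blend` are hypothetical helper names.

```python
# Placeholder weights copied from the guesses in the query; illustrative only.
SIGNALS = ("patterns", "regime", "research", "catalysts", "cross_asset")

RAW_WEIGHTS = {
    "BULL_LOW_VOL_STABLE": {"patterns": 0.40, "regime": 0.30, "research": 0.20},
    "BEAR_EXTREME_VOL_VOLATILE": {"catalysts": 0.40, "research": 0.30, "cross_asset": 0.20},
}


def normalize(weights):
    """Fill unspecified signals with 0 and rescale so the row sums to 1."""
    row = {name: float(weights.get(name, 0.0)) for name in SIGNALS}
    total = sum(row.values())
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    return {name: value / total for name, value in row.items()}


def blend(regime, scores):
    """Weighted average of per-signal scores (each assumed to lie in [-1, 1])."""
    row = normalize(RAW_WEIGHTS[regime])
    return sum(row[name] * scores.get(name, 0.0) for name in SIGNALS)


scores = {"patterns": 1.0, "regime": 0.5, "research": -0.5}
print(round(blend("BULL_LOW_VOL_STABLE", scores), 4))
```

Normalizing at lookup time means a partially specified row (the guesses above sum to 0.9) still produces a proper weighted average.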

## Q3: Confidence Calibration Method

**Perplexity Query**:
```
Problem: Raw model confidence 70% → actual win-rate only 55% (overconfident).

Which calibration method works best for small samples per regime (~100 trades):
A) Platt scaling (logistic regression: P_cal = 1/(1+exp(A*P_raw + B)))
B) Isotonic regression (non-parametric, monotonic)
C) Beta calibration (extension of Platt for both under/over-confidence)
D) Temperature scaling (neural network calibration)
E) Histogram binning with Laplace smoothing

Provide: Method recommendation, expected calibration error (ECE target <0.05?), sample code, when to recalibrate (every 20 trades?).
```

In [None]:
# Q3 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Method: 
# - ECE target: 
# - Recalibration frequency: 
# 
# CODE SNIPPET:
# [Calibration formula implementation]
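
To make the ECE target concrete, the sketch below implements the Platt formula exactly as written in the query and a simple equal-width-bin ECE. Fitting `a` and `b` is not shown; the bin count and function names are assumptions for illustration.

```python
import math


def platt(p_raw, a, b):
    """Platt scaling in the form quoted in the query: 1 / (1 + exp(a * p_raw + b))."""
    return 1.0 / (1.0 + math.exp(a * p_raw + b))


def expected_calibration_error(probs, outcomes, n_bins=10):
    """Weighted mean of |average confidence - observed win rate| over equal-width bins."""
    bins = [[] for _ in range(n_bins)]
    for p, won in zip(probs, outcomes):
        index = min(int(p * n_bins), n_bins - 1)  # p == 1.0 falls in the last bin
        bins[index].append((p, won))
    total = len(probs)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        confidence = sum(p for p, _ in members) / len(members)
        win_rate = sum(won for _, won in members) / len(members)
        ece += len(members) / total * abs(confidence - win_rate)
    return ece


# The overconfident case from the query: 70% stated confidence, 55% realized win rate.
probs = [0.70] * 20
outcomes = [1] * 11 + [0] * 9
print(round(expected_calibration_error(probs, outcomes), 3))
```

With this sign convention, `a` must be negative for the calibrated probability to rise with the raw confidence.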

## Q4: Signal Agreement Boosting

**Perplexity Query**:
```
When all 5 signals agree (patterns + regime + research + catalyst + cross-asset all say BUY):
- Should confidence be boosted? By how much? (1.2× multiplier?)
- Should position size increase? (2× normal size?)
- Historical evidence: does signal agreement predict higher win-rate?

When only 1-2 signals fire:
- Should confidence be reduced? (0.7× multiplier?)
- Should position size decrease or skip trade entirely?
- Threshold: What is the minimum number of signals that must agree to trade?

Provide: Signal agreement formula, confidence adjustment rules, position sizing rules.
```

In [None]:
# Q4 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Agreement boost formula: 
# - Minimum signals to trade: 
# - Position size multiplier: 
# 
# CODE SNIPPET:
# [Agreement calculation and adjustment logic]
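
One possible shape for the agreement rule, sketched under the query's own placeholder numbers (1.2× when every signal agrees, 0.7× when only two do). The function names, the vote encoding, and the `min_agree` and `weak_max` defaults are hypothetical.

```python
def agreement_adjustment(signals, min_agree=2, weak_max=2, boost=1.2, penalty=0.7):
    """Return (direction, confidence multiplier) from per-signal votes.

    `signals` maps signal name -> +1 (BUY), -1 (SELL), or 0 (silent).
    All thresholds and multipliers are placeholders, not researched values.
    """
    buys = sum(1 for vote in signals.values() if vote > 0)
    sells = sum(1 for vote in signals.values() if vote < 0)
    agree = max(buys, sells)
    if buys == sells or agree < min_agree:
        return 0, 0.0  # tie or too little agreement: skip the trade
    direction = 1 if buys > sells else -1
    if agree == len(signals):
        multiplier = boost
    elif agree <= weak_max:
        multiplier = penalty
    else:
        multiplier = 1.0
    return direction, multiplier


def adjusted_confidence(base_confidence, multiplier):
    """Apply the multiplier but keep the result a valid probability."""
    return min(max(base_confidence * multiplier, 0.0), 1.0)


votes = {"patterns": 1, "regime": 1, "research": 1, "catalyst": 1, "cross_asset": 1}
print(agreement_adjustment(votes))
```

Clamping in `adjusted_confidence` matters because a boost applied to an already high confidence would otherwise exceed 1.0.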

## Q5: Handling Missing Signals

**Perplexity Query**:
```
In production, some signals may be unavailable:
- Dark pool data missing (API down)
- Sentiment data stale (EODHD rate limit hit)
- Insider data incomplete (Finnhub lag)

Fallback strategies:
A) Skip ticker if critical signal missing
B) Use last known value (with staleness penalty)
C) Impute from correlated tickers
D) Reduce signal weight proportionally
E) Use regime-specific default values

Provide: Recommended fallback per signal type, staleness thresholds (max age before ignore), imputation methods.
```

In [None]:
# Q5 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Fallback strategy: 
# - Staleness thresholds: {'dark_pool': '5min', 'sentiment': '2hr', ...}
# - Imputation method: 
# 
# CODE SNIPPET:
# [Fallback handler implementation]
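
A minimal sketch combining options B and D: use the last known value, but shrink its weight as it ages and renormalize what remains. The maximum ages reuse the placeholder thresholds above, and the linear decay is an assumption, not a recommendation from the research.

```python
from datetime import datetime, timedelta

# Hypothetical maximum ages; the query leaves the real thresholds open.
MAX_AGE = {
    "dark_pool": timedelta(minutes=5),
    "sentiment": timedelta(hours=2),
}


def staleness_factor(signal, observed_at, now):
    """Linear decay from 1.0 (fresh) to 0.0 (at or past the maximum age)."""
    age = now - observed_at
    limit = MAX_AGE[signal]
    if age <= timedelta(0):
        return 1.0
    if age >= limit:
        return 0.0
    return 1.0 - age / limit


def effective_weights(base_weights, observed_at, now):
    """Scale each weight by its staleness factor, then renormalize to sum to 1."""
    scaled = {
        name: weight * staleness_factor(name, observed_at[name], now)
        for name, weight in base_weights.items()
    }
    total = sum(scaled.values())
    if total == 0:
        return None  # every signal is stale: the caller should skip the ticker
    return {name: value / total for name, value in scaled.items()}
```

Returning `None` when everything is stale maps onto option A (skip the ticker) without a separate code path.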

## Q6: Meta-Learner Training Strategy

**Perplexity Query**:
```
Should weights be:
A) Fixed (hand-tuned based on research, never updated)
B) Learned once (train on 2021-2024, freeze)
C) Periodically retrained (every month on rolling 18-month window)
D) Adaptive (online learning, update after each trade)

Pros/cons of each? Risk of overfitting? Computational cost? Expected accuracy improvement?

Provide: Recommended strategy, retraining schedule, validation protocol (how to ensure new weights > old weights).
```

In [None]:
# Q6 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Training strategy: 
# - Retraining schedule: 
# - Validation protocol: 
# 
# CODE SNIPPET:
# [Training loop with validation]
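
For option C, the window bookkeeping can be sketched as follows: a rolling 18-month training window, retrained monthly, with each test month strictly after its training data. The helper names and the `min_gain` acceptance margin are hypothetical, and the model fitting itself is left out.

```python
def month_add(year, month, offset):
    """Shift a (year, month) pair by `offset` months."""
    index = year * 12 + (month - 1) + offset
    return index // 12, index % 12 + 1


def walk_forward_windows(first, last, train_months=18, test_months=1):
    """Yield (train_start, train_end, test_start, test_end) as (year, month) pairs.

    Both ends of each range are inclusive. Each test block starts the month
    after training ends, so no test data leaks into training.
    """
    start = first
    while True:
        train_end = month_add(*start, train_months - 1)
        test_start = month_add(*train_end, 1)
        test_end = month_add(*test_start, test_months - 1)
        if test_end > last:
            break
        yield start, train_end, test_start, test_end
        start = month_add(*start, test_months)


def accept_new_weights(old_score, new_score, min_gain=0.0):
    """Promote retrained weights only if they beat the old ones out of sample."""
    return new_score > old_score + min_gain


for window in list(walk_forward_windows((2021, 1), (2024, 12)))[:2]:
    print(window)
```

Scoring old and new weights on the same held-out test month is what lets `accept_new_weights` answer the "new weights > old weights" question from the query.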

---

# 📋 BATCH 2: FEATURE ENGINEERING & SELECTION (Critical - 6 Questions)

**Critical Path**: Module 2 depends on these answers  
**Estimated Time**: 60 min

## Q7: Feature Selection from 60 Candidates

**Perplexity Query**:
```
Starting with 60 engineered features (microstructure, sentiment, momentum, volatility, cross-asset):

How to reduce to top 15-20 without losing predictive power:
A) SHAP importance ranking (keep top 15)
B) Correlation filtering (drop features with r>0.8)
C) Recursive feature elimination (RFE)
D) L1 regularization (LASSO) automatic selection
E) Forward selection (add one feature at a time, keep if Sharpe improves)

Provide: Recommended method, expected accuracy loss if any, computational cost, sector-specific feature importance (AI vs Quantum vs Robotaxi).
```

In [None]:
# Q7 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Method: 
# - Expected accuracy loss: 
# - Computational cost: 
# 
# CODE SNIPPET:
# [Feature selection implementation]
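
Option B is the simplest to prototype. The sketch below keeps features in a caller-supplied priority order (for example, a SHAP ranking) and drops any feature whose absolute correlation with an already kept one exceeds the threshold. The function names and the toy data are illustrative assumptions.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear relationship
    return cov / math.sqrt(var_x * var_y)


def correlation_filter(features, ranked_names, threshold=0.8):
    """Keep features in ranked order, dropping any with |r| > threshold to one already kept."""
    kept = []
    for name in ranked_names:
        if all(abs(pearson(features[name], features[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


features = {
    "a": [1, 2, 3, 4, 5],
    "b": [2, 4, 6, 8, 10.5],  # nearly a rescaled copy of "a"
    "c": [5, 1, 4, 2, 3],
}
print(correlation_filter(features, ["a", "b", "c"]))
```

Because the filter is greedy, the ranking decides which member of a correlated pair survives, so it pairs naturally with option A.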

## Q8: Microstructure Proxies from Free Data

**Perplexity Query**:
```
Without paid Level 2 data, how to proxy institutional activity:

1. Dark pool ratio: Use yfinance minute volume clustering? FINRA ATS weekly data (2-week lag)? Other?
2. Spread compression: Use (High-Low)/Close ratio? Intraday VWAP deviation? Other?
3. Order flow imbalance: Volume-weighted price momentum? Tick direction from minute bars? Other?
4. Smart money accumulation: After-hours volume? Block trades (volume spikes >3σ)? Other?

Provide: Best free proxy for each, validation method (correlation with paid data if available), expected signal strength.
```

In [None]:
# Q8 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Dark pool proxy: 
# - Spread proxy: 
# - Order flow proxy: 
# - Smart money proxy: 
# 
# CODE SNIPPET:
# [Microstructure proxy calculations]
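
Two of the proxies named in the query are easy to sketch from OHLCV bars alone: the (High-Low)/Close range ratio and a volume-spike flag at more than 3σ above a trailing baseline. The 20-bar window is an assumption, and neither proxy has been validated against paid data here.

```python
import statistics


def range_ratio(high, low, close):
    """(High - Low) / Close, the spread-compression proxy named in the query."""
    return (high - low) / close


def volume_spikes(volumes, window=20, n_sigma=3.0):
    """Indices whose volume exceeds the trailing mean by more than n_sigma deviations.

    Only bars strictly before index i enter the baseline, so the flag uses
    no information from the bar being tested or from later bars.
    """
    flagged = []
    for i in range(window, len(volumes)):
        history = volumes[i - window:i]
        mean = statistics.fmean(history)
        stdev = statistics.stdev(history)
        if stdev > 0 and volumes[i] > mean + n_sigma * stdev:
            flagged.append(i)
    return flagged


volumes = [100, 110] * 10 + [500, 105]
print(volume_spikes(volumes))
```

Note that a spike inflates the baseline for the bars that follow it, so back-to-back block trades are harder to flag than isolated ones.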

## Q9: Sentiment Integration

**Perplexity Query**:
```
EODHD sentiment score (-1 to +1) aggregated daily:

How to use effectively:
A) Raw sentiment as single feature
B) Sentiment trend (5-day change)
C) Sentiment divergence (price up, sentiment down → bearish)
D) Sentiment-weighted volume (high volume + positive sentiment → bullish)
E) News tier classification (breaking news vs earnings vs supply chain vs macro)

Research shows sentiment alone has R²≈0.01 (useless). Combined with price action: R²≈0.35 (meaningful).

Provide: Optimal sentiment feature engineering, combination rules, when to ignore sentiment (low article count?).
```

In [None]:
# Q9 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Feature engineering: 
# - Combination rules: 
# - Ignore threshold: 
# 
# CODE SNIPPET:
# [Sentiment feature calculations]
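
Options B and C can be sketched together: a 5-day sentiment change and a flag for price rising while sentiment falls. The `min_articles` cutoff is a hypothetical stand-in for the "low article count" question, and the inputs are assumed to be aligned daily lists.

```python
def sentiment_features(closes, sentiment, article_counts, lookback=5, min_articles=3):
    """Per-day features from daily closes and sentiment scores in [-1, 1].

    `min_articles` is a hypothetical cutoff: days with fewer articles are
    treated as having no usable sentiment.
    """
    rows = []
    for t in range(lookback, len(closes)):
        if article_counts[t] < min_articles:
            rows.append({"day": t, "usable": False})
            continue
        price_change = closes[t] / closes[t - lookback] - 1.0
        sentiment_trend = sentiment[t] - sentiment[t - lookback]
        rows.append({
            "day": t,
            "usable": True,
            "sentiment_trend": sentiment_trend,
            # Price rising while sentiment falls: the bearish divergence from option C.
            "bearish_divergence": price_change > 0 and sentiment_trend < 0,
        })
    return rows


closes = [100, 101, 102, 103, 104, 105, 106]
sentiment = [0.5, 0.4, 0.3, 0.2, 0.1, 0.0, 0.6]
counts = [5, 5, 5, 5, 5, 5, 1]
for row in sentiment_features(closes, sentiment, counts):
    print(row)
```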

## Q10: Cross-Asset Lead-Lag Relationships

**Perplexity Query**:
```
Research discovered:
- BTC returns lead tech stocks by 6-24 hours (correlation 0.72 in bull regimes)
- 10Y yield changes lead sector rotation by 3-5 days
- VIX term structure predicts volatility regime by 1-3 days
- Sector breadth divergences predict corrections by 2-7 days

How to validate these leads without look-ahead bias:
- Use BTC data from T-1 day (available at market close)?
- Use yield data from T (FRED updates daily)?
- What correlation threshold is needed to trust the signal (r>0.5?)?

Provide: Timestamp alignment checklist, lag validation method, correlation thresholds per asset class.
```

In [None]:
# Q10 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Timestamp alignment: 
# - Lag validation: 
# - Correlation thresholds: {'BTC': 0.5, '10Y': 0.4, ...}
# 
# CODE SNIPPET:
# [Cross-asset lag calculation with timestamp checks]
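
A lag check that cannot look ahead pairs the leader's value at t - lag with the follower's value at t, and refuses a lag of zero. The sketch assumes both series share the same timestamps and that each leader value was actually published by its timestamp; the function names are illustrative.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def lagged_correlation(leader, follower, lag):
    """Correlate leader[t - lag] with follower[t]; lag must be >= 1 to avoid look-ahead."""
    if lag < 1:
        raise ValueError("lag must be at least 1 period")
    if len(leader) != len(follower):
        raise ValueError("series must be aligned to the same timestamps")
    return pearson(leader[:-lag], follower[lag:])


def scan_lags(leader, follower, max_lag):
    """Correlation at each candidate lag from 1 to max_lag."""
    return {lag: lagged_correlation(leader, follower, lag) for lag in range(1, max_lag + 1)}


leader = [1.0, -1.0, 2.0, -2.0, 3.0, -3.0, 1.0, 0.5]
follower = [0.0] + leader[:-1]  # toy follower that copies the leader one period later
print(scan_lags(leader, follower, 3))
```

Scanning many lags and keeping the best one is itself a form of selection, so the chosen lag should be confirmed on data not used for the scan.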

## Q11: Regime-Specific Feature Importance

**Perplexity Query**:
```
Hypothesis: Different features matter in different regimes:
- BULL: Patterns (EMA crosses, breakouts) matter most
- BEAR: Catalysts (insider trades, analyst downgrades) matter most
- CHOP: Microstructure (spread, volume) matters most

How to validate:
A) Train separate models per regime, compare feature importance
B) Use interaction terms (feature × regime indicator)
C) SHAP values conditional on regime

Provide: Recommended approach, expected accuracy improvement from regime-specific features, computational cost.
```

In [None]:
# Q11 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Approach: 
# - Expected accuracy improvement: 
# - Computational cost: 
# 
# CODE SNIPPET:
# [Regime-specific feature importance calculation]

## Q12: Feature Staleness & Drift Detection

**Perplexity Query**:
```
Features calculated on 2023-2025 data may drift by 2027:
- Market structure changes (more ETF trading)
- Algo evolution (HFTs adapting to patterns)
- Regime shifts (new regime not seen before)

How to detect drift:
A) Population Stability Index (PSI) per feature (threshold PSI>0.2 → retrain)
B) Kolmogorov-Smirnov test (distribution shift)
C) Rolling correlation with returns (if drops >20% ‚Üí feature weakening)
D) Performance monitoring (if Sharpe drops >0.1 ‚Üí investigate features)

Provide: Recommended drift detection method, alert thresholds, retraining trigger logic.
```

In [None]:
# Q12 ANSWER: [Paste Perplexity response here]
# 
# IMPLEMENTATION DECISION:
# - Drift detection method: 
# - Alert thresholds: {'PSI': 0.2, 'KS': 0.3, ...}
# - Retraining triggers: 
# 
# CODE SNIPPET:
# [Drift detection implementation]
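
Option A reduces to a short calculation. The sketch below computes PSI between a reference sample and a recent sample using equal-width bins over the reference range; the bin count, the `eps` floor, and the clipping of out-of-range values are assumptions, and other binning schemes (such as quantile bins) are equally common.

```python
import math


def psi(expected, actual, n_bins=10, eps=1e-6):
    """Population Stability Index between a reference sample and a recent sample.

    Bin edges are equal-width over the reference sample's range; values
    outside that range are clipped into the first or last bin.
    """
    low, high = min(expected), max(expected)
    width = (high - low) / n_bins or 1.0  # guard against a constant reference sample

    def shares(values):
        counts = [0] * n_bins
        for v in values:
            index = int((v - low) / width)
            counts[min(max(index, 0), n_bins - 1)] += 1
        # eps keeps empty bins from producing log(0) or division by zero
        return [max(c / len(values), eps) for c in counts]

    e, a = shares(expected), shares(actual)
    return sum((ai - ei) * math.log(ai / ei) for ai, ei in zip(a, e))


reference = [i / 100 for i in range(100)]
shifted = [x + 0.5 for x in reference]
print(psi(reference, reference), psi(reference, shifted) > 0.2)
```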

---

# 📋 BATCH 3: TRAINING & VALIDATION (Critical - 6 Questions)

**Critical Path**: Module 4-5 training protocols  
**Estimated Time**: 45 min

## Q13-Q18: [Continue same format for remaining batches]

**Note**: Copy the exact questions from `COMPLETE_PERPLEXITY_RESEARCH_AGENDA.md` lines 196-394 here.

For each question:
1. Markdown cell with Perplexity query
2. Python cell for answer + implementation decision + code snippet

---

# 📋 BATCH 4: CALIBRATION & POSITION SIZING (Critical - 6 Questions)

**Critical Path**: Module 5-6 depends on these answers  
**Estimated Time**: 45 min

[Q19-Q24 cells in same format]

---

# 📋 BATCH 5: IMPLEMENTATION & PRODUCTION (Medium Priority - 7 Questions)

**Critical Path**: Module 9-10 deployment decisions  
**Estimated Time**: 30 min (optional if time permits)

[Q25-Q31 cells in same format]

---

# 📋 BATCH 6: RESEARCH GAPS & ADVANCED TOPICS (Lower Priority - 6 Questions)

**Critical Path**: Future optimizations  
**Estimated Time**: 30 min (optional if time permits)

[Q32-Q37 cells in same format]

---

# ✅ COMPLETION CHECKLIST

After answering all questions, verify you have:

- [ ] **Batch 1** (Q1-Q6): Meta-learner architecture decisions documented
- [ ] **Batch 2** (Q7-Q12): Feature engineering strategies documented
- [ ] **Batch 3** (Q13-Q18): Training/validation protocols documented
- [ ] **Batch 4** (Q19-Q24): Calibration + position sizing formulas documented
- [ ] **Batch 5** (Q25-Q31): Implementation best practices documented (optional)
- [ ] **Batch 6** (Q32-Q37): Advanced topics researched (optional)

---

## 📤 EXPORT INSTRUCTIONS

1. **Review all answers** - Ensure implementation decisions are clear
2. **Extract code snippets** - Copy Python code to separate files
3. **Export to Markdown**:
   - File → Export Notebook As → Markdown
   - Save as: `docs/research/PERPLEXITY_FORECASTER_ARCHITECTURE.md`
4. **Update build plan**:
   - Reference Perplexity answers in Module 4-6 implementation
   - Adjust hyperparameters based on research findings

---

## 🚀 NEXT ACTIONS (After Research Complete)

1. **Module 2** (Research Features): Implement Q7-Q12 decisions
2. **Module 3** (Feature Store): Build caching layer
3. **Module 4** (Meta-Learner): Implement Q1-Q6 architecture
4. **Module 5** (Calibrator): Implement Q3, Q19 calibration method
5. **Module 6** (Position Sizer): Implement Q20-Q24 formulas

**By end of Dec 9, you'll have all architectural unknowns resolved. Then you build with confidence.**

---

# 📊 SESSION TRACKER

| Batch | Questions | Time Est | Status | Key Decisions |
|-------|-----------|----------|--------|---------------|
| Batch 1 | Q1-Q6 | 60 min | ⏳ Pending | Meta-learner architecture |
| Batch 2 | Q7-Q12 | 60 min | ⏳ Pending | Feature selection methods |
| Batch 3 | Q13-Q18 | 45 min | ⏳ Pending | Training protocols |
| Batch 4 | Q19-Q24 | 45 min | ⏳ Pending | Calibration + position sizing |
| Batch 5 | Q25-Q31 | 30 min | ⏳ Optional | Implementation details |
| Batch 6 | Q32-Q37 | 30 min | ⏳ Optional | Advanced topics |

**Total Time**: 3.5 hours for Batches 1-4; 4.5 hours including optional Batches 5-6  
**Priority**: Batch 1-4 (MUST COMPLETE), Batch 5-6 (optional if time permits)

---

**Start Time**: [Fill in tomorrow morning]  
**End Time**: [Fill in when complete]  
**Total Answers**: 0/37 (update as you progress)