# Hackathon Project Guide: Exoplanet Detection with AI

## 1. Exploratory Data Analysis (EDA)
- Inspect the distributions of key variables (orbital period, transit depth, signal-to-noise ratio).
- Quantify label imbalance (planet candidates vs. non-candidates).
- Visualize raw and phase-folded light curves; folded plots make the transit dip easy to show when presenting results.
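
Phase folding can be sketched as below. This is a minimal illustration on synthetic data, assuming the orbital period and a transit epoch are already known; `fold_light_curve`, `box_transit`, and the constants are hypothetical names chosen for this example, not part of any library.

```python
def fold_light_curve(times, flux, period, epoch=0.0):
    """Phase-fold a light curve.

    Returns (phase, flux) pairs sorted by phase, with phase in [-0.5, 0.5)
    and the transit centred on phase 0.
    """
    folded = []
    for t, f in zip(times, flux):
        phase = ((t - epoch) / period + 0.5) % 1.0 - 0.5
        folded.append((phase, f))
    folded.sort(key=lambda pair: pair[0])
    return folded


# Synthetic, noise-free example (all values are assumptions for illustration):
# a box-shaped transit every 2.5 days, 1% deep, lasting 0.1 days.
PERIOD, EPOCH, DEPTH, DURATION = 2.5, 0.3, 0.01, 0.1
times = [i * 0.02 for i in range(1000)]  # 20 days sampled every 0.02 days


def box_transit(t):
    # Time distance (in days) to the nearest transit centre.
    dt = abs((t - EPOCH + PERIOD / 2) % PERIOD - PERIOD / 2)
    return 1.0 - DEPTH if dt < DURATION / 2 else 1.0


flux = [box_transit(t) for t in times]
folded = fold_light_curve(times, flux, PERIOD, EPOCH)

# After folding, every dimmed sample sits close to phase 0.
in_transit = [phase for phase, f in folded if f < 1.0]
print(f"{len(in_transit)} in-transit samples, max |phase| = {max(abs(p) for p in in_transit):.3f}")
```

Plotting `folded` as a scatter of flux against phase stacks all transits on top of each other, which is usually the clearest single figure for a candidate.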

## 2. Feature Engineering (Classical ML path)
- Use tsfresh or a similar library to extract time-series features (roughly 700–800 features per light curve).
- Add astrophysical features: transit depth, duration, period (if available).
- Handle missing values, scaling, and class imbalance (e.g., class weights in LightGBM, oversampling like SMOTE).
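
As a rough sketch of the class-weight idea, the snippet below computes "balanced" weights as `n_samples / (n_classes * class_count)` and the negative-to-positive ratio that is commonly passed as `scale_pos_weight` to LightGBM or XGBoost. The label counts are made up for illustration, and `balanced_class_weights` is a hypothetical helper, not a library function.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class inversely to its frequency.

    Uses the common "balanced" heuristic:
    n_samples / (n_classes * count_of_class).
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical dataset: 50 planet candidates (1) and 950 non-candidates (0).
labels = [1] * 50 + [0] * 950

weights = balanced_class_weights(labels)
print(weights[1])  # 10.0
print(round(weights[0], 3))  # 0.526

# Ratio of negatives to positives, a typical starting value for
# the scale_pos_weight parameter of gradient-boosted tree libraries.
scale_pos_weight = labels.count(0) / labels.count(1)
print(scale_pos_weight)  # 19.0
```

Weights like these only change how errors are penalized during training; evaluation should still be done on data with the original class balance.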

## 3. Model Development

### Classical ML Baseline (fast + interpretable)
- LightGBM or XGBoost (gradient-boosted trees).
- Evaluate with AUC and recall as the key metrics; a missed planet (false negative) is the costliest error.
- Plot feature importances to support interpretability.
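
To make the two metrics concrete, here is a small dependency-free sketch. It computes recall at a given threshold and ROC AUC as the fraction of (positive, negative) pairs ranked correctly, with ties counted as half. The labels and scores are invented for illustration, and `recall` and `roc_auc` are hypothetical helpers; in a real project a library implementation would normally be used instead.

```python
def recall(y_true, y_pred):
    """Fraction of true positives that were predicted positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn) if (tp + fn) else 0.0


def roc_auc(y_true, scores):
    """ROC AUC via pairwise ranking of positive vs. negative scores."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up labels (1 = planet candidate) and model scores.
y_true = [1, 0, 1, 0, 0, 1, 0, 0]
scores = [0.9, 0.2, 0.4, 0.5, 0.1, 0.8, 0.3, 0.05]

auc = roc_auc(y_true, scores)
recall_at_05 = recall(y_true, [int(s >= 0.5) for s in scores])
recall_at_03 = recall(y_true, [int(s >= 0.3) for s in scores])

print(round(auc, 3))  # 0.933
print(round(recall_at_05, 3))  # 0.667
print(recall_at_03)  # 1.0
```

Lowering the decision threshold from 0.5 to 0.3 recovers the missed candidate here, at the cost of more false positives, which is the trade-off to tune when missing planets is the main concern.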

### Deep Learning Approach (benchmark & comparison)
- **Input:** Raw or minimally processed light curves (1D arrays of flux vs time).
- **Options:**
  - 1D CNNs (common for light-curve classification; they capture local dips).
  - RNNs such as LSTM or GRU (sequence modeling).
  - An existing architecture such as AstroNet (Shallue & Vanderburg 2018).
- **Training considerations:**
  - Normalize/standardize light curves.
  - Handle class imbalance (weighted loss or oversampling).
  - Use a GPU if available, but keep the model small enough to train within the hackathon time limit.
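
Two of the training considerations above, normalization and a weighted loss, can be sketched without any deep learning framework. The snippet assumes a robust normalization (median and median absolute deviation) and a binary cross-entropy in which positive examples are multiplied by `pos_weight`, similar in spirit to the `pos_weight` argument of `torch.nn.BCEWithLogitsLoss`. Both helper functions and all numbers are hypothetical, chosen only for illustration.

```python
import math
from statistics import median


def normalize_light_curve(flux):
    """Centre on the median and scale by a MAD-based robust standard deviation."""
    med = median(flux)
    mad = median([abs(f - med) for f in flux])
    scale = 1.4826 * mad if mad > 0 else 1.0  # fall back to 1.0 for flat curves
    return [(f - med) / scale for f in flux]


def weighted_bce(y_true, probs, pos_weight=1.0, eps=1e-7):
    """Mean binary cross-entropy with positives up-weighted by pos_weight."""
    total = 0.0
    for y, p in zip(y_true, probs):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        total += -(pos_weight * y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


# Made-up flux values with one transit-like dip at index 5.
flux = [1.0, 1.002, 0.998, 1.001, 0.999, 0.99, 1.0]
normalized = normalize_light_curve(flux)
print(round(normalized[5], 1))  # about -6.7: the dip stands out in robust units

# One missed positive and one correctly rejected negative.
y_true, probs = [1, 0], [0.1, 0.1]
unweighted = weighted_bce(y_true, probs)
weighted = weighted_bce(y_true, probs, pos_weight=10.0)
print(weighted > unweighted)  # True: the missed planet now costs more
```

Median-based scaling is used here because the transit dip itself would inflate an ordinary standard deviation; a plain mean and standard deviation would also work as a first pass.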

