# 🗺️ Project Roadmap: IEEE SciVis 2025 – Scrap-Based Alloy Discovery

---

## 🔰 Phase 1: Understanding the Problem & Dataset ✅ *(I have mostly done this!)*
**Goal:** Grasp what the dataset represents and what the competition is about.

- ✅ Download and clean the dataset
- ✅ Understand the column groups:
  - 6 scrap alloy input ratios
  - 12 resulting element compositions
  - 38 microstructure features
  - 14 physical/mechanical/thermal properties
- ✅ Read background info & challenges
- ✅ Perform EDA (distributions, correlations, pairplots)

---

## 🔍 Phase 2: Define Use Cases (Scenarios) 🔄 *(I am doing this now)*
**Goal:** Choose real-world applications to guide alloy optimization.

### Scenarios:
- Aerospace  
- Automotive  
- Power Generation  
- Additive Manufacturing  
- E-Mobility Battery Enclosures  
- 🆕 Utility Infrastructure (e.g., power lines)

**For each scenario:**
- Define key target properties (e.g., high yield strength (YS) plus conductivity for power applications)
- List “good alloy” criteria


**Here are the scenarios:**
## 🧪 Defined Use Cases for Scrap-Based Alloy Discovery

### 1. Aerospace Structural Components
- 📌 **Context**: Lightweight + high strength materials for fuel-efficient aircraft.
- 🎯 **Target Properties**:
  - YS > 350 MPa
  - Density < 2.7 g/cm³
  - Thermal conductivity > 160 W/(m·K)
  - Linear thermal expansion < 2.5e-5 1/K


### 2. Automotive Engine Blocks
- 📌 **Context**: Strong, thermally stable alloys for engine durability.
- 🎯 **Target Properties**:
  - YS > 300 MPa
  - Hardness > 90 HV
  - Thermal conductivity > 150 W/(m·K)
  - CSC (hot crack susceptibility) < 0.5


### 3. Power Generation Components
- 📌 **Context**: High-strength parts that resist cracking under heat.
- 🎯 **Target Properties**:
  - YS > 320 MPa
  - CSC < 0.45
  - delta_T > 80°C
  - Therm. diffusivity > 6.5e-5 m²/s


### 4. Additive Manufacturing
- 📌 **Context**: Alloys that print well and maintain stable properties.
- 🎯 **Target Properties**:
  - delta_T < 40°C
  - High Al + Si content (printability)
  - Thermal conductivity > 170 W/(m·K)
  - Good hardness (> 85 HV)


### 5. E-Mobility Battery Enclosures
- 📌 **Context**: Lightweight, thermally conductive, and safe materials.
- 🎯 **Target Properties**:
  - Low density (< 2.7 g/cm³)
  - Thermal conductivity > 180 W/(m·K)
  - Electrical resistivity < 7.8e-8 Ω·m
  - Moderate strength (YS > 250 MPa)


### 6. Utility Infrastructure (e.g., Power Lines)
- 📌 **Context**: Conductive, corrosion-resistant aluminum for overhead lines.
- 🎯 **Target Properties**:
  - Electrical conductivity > 1.35e7 S/m
  - Low density
  - High thermal stability
  - YS moderately high (around 250 MPa)
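
The numeric thresholds above can be encoded as data and applied as row filters. The sketch below is one possible way to do this with pandas, not a fixed design. Only `YS(MPa)`, `CSC`, and `Thermal Conductivity (W/mK)` are column names that appear elsewhere in this roadmap; `Hardness(HV)` and `Resistivity(Ohm*m)` are hypothetical names and must be replaced with the real dataset headers. Two scenarios are shown; the others would follow the same pattern.

```python
import pandas as pd

# Each rule is (column, operator, threshold), copied from the scenario lists.
# ASSUMPTION: "Hardness(HV)" and "Resistivity(Ohm*m)" are hypothetical column names.
SCENARIOS = {
    "Automotive": [
        ("YS(MPa)", ">", 300),
        ("Hardness(HV)", ">", 90),
        ("Thermal Conductivity (W/mK)", ">", 150),
        ("CSC", "<", 0.5),
    ],
    "E-Mobility": [
        ("YS(MPa)", ">", 250),
        ("Thermal Conductivity (W/mK)", ">", 180),
        ("Resistivity(Ohm*m)", "<", 7.8e-8),
    ],
}


def scenario_mask(df, rules):
    """Return a boolean Series that is True where a row satisfies every rule."""
    mask = pd.Series(True, index=df.index)
    for column, op, threshold in rules:
        if op == ">":
            mask &= df[column] > threshold
        elif op == "<":
            mask &= df[column] < threshold
        else:
            raise ValueError(f"Unsupported operator: {op}")
    return mask


# Toy rows for illustration only (not real dataset values).
toy = pd.DataFrame({
    "YS(MPa)": [320, 280, 310],
    "Hardness(HV)": [95, 100, 92],
    "Thermal Conductivity (W/mK)": [155, 170, 140],
    "CSC": [0.40, 0.30, 0.45],
})
automotive_ok = scenario_mask(toy, SCENARIOS["Automotive"])
```

Qualitative criteria such as "low density" or "high thermal stability" have no threshold in the lists above, so they are left out of the sketch until a number is chosen for them.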


## 📊 Phase 3: Global Visualization (Challenge 1 - Task 1) 🔄 *(PCA done; t-SNE, UMAP, and parallel coordinates still open)*
**Goal:** Reveal patterns across the full dataset (alloy design → performance)

- Use dimensionality reduction (PCA, t-SNE, UMAP) on:
  - Compositions only
  - Microstructure only
  - All features together
- Color each projection by application-relevant outputs (e.g., YS, CSC, thermal conductivity)
- Create parallel coordinate plots for top vs. worst-performing alloys
- Highlight trends and clusters in composition–property relationships
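
As a rough illustration of the projection step, here is a minimal PCA sketch that standardizes the features and uses NumPy's SVD directly. In practice `sklearn.decomposition.PCA` would likely be used instead. The input matrix here is synthetic random data standing in for one feature group (for example, the 6 scrap ratios); nothing in it comes from the real dataset.

```python
import numpy as np


def pca_project(X, n_components=2):
    """Standardize columns, then project onto the leading principal components.

    Returns (scores, components, explained_variance_ratio).
    """
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # guard against constant columns
    Z = (X - mean) / std

    # Rows of Vt are the principal directions; S holds the singular values.
    U, S, Vt = np.linalg.svd(Z, full_matrices=False)
    scores = U[:, :n_components] * S[:n_components]
    explained = S**2 / np.sum(S**2)
    return scores, Vt[:n_components], explained[:n_components]


# Synthetic stand-in for a feature group (200 alloys, 6 features).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(200, 6))
X_demo[:, 1] = 0.9 * X_demo[:, 0] + 0.1 * X_demo[:, 1]  # add some correlation

scores, components, explained = pca_project(X_demo)
```

The two columns of `scores` are the x/y coordinates of the scatter plot; passing a property column such as `YS(MPa)` as the color argument of the plotting call gives the colored projection described above.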


### ✅ Summary: Phase 3 – Global Visualization

In this phase, we tackled **Challenge 1 – Task 1** by building visual overviews that reveal global patterns across the high-dimensional design space of alloy candidates.

#### 🔹 What We Did

- **Dimensionality Reduction (PCA)**  
  Applied PCA to:
  - ✅ Composition features (scrap %, elements)
  - ✅ Microstructure features (`Vf_`, `T_`, `delta_T`)
  - ✅ All features together

- **Colored Projections**  
  For each PCA plot, we colored points by:
  - `YS(MPa)` – Yield Strength  
  - `CSC` – Hot Crack Susceptibility  
  - `Thermal Conductivity (W/mK)`

- **Trend Identification**
  - High-`YS(MPa)` alloys cluster in the top-right region of the composition PCA
  - Low-`CSC` alloys (lower crack risk) lie away from the high-conductivity regions
  - Trade-offs between strength, conductivity, and crack risk are visible directly in the projections

#### 📌 Key Insights

| PCA Region      | Yield Strength | Thermal Conductivity | Crack Risk | Suitable For |
|------------------|----------------|------------------------|------------|--------------|
| Top-Right (Comp) | ✅ High         | ❌ Low                 | ✅ Safe     | Aerospace, Automotive |
| Bottom-Left      | ❌ Low          | ✅ High                | ❌ Risky    | Thermal apps, risky to cast |
| Middle Band      | ⚖️ Tradeoff     | ⚖️ Medium              | ⚖️ Balanced | E-Mobility, Consumer Electronics |

- **Microstructure PCA**: Yield strength and crack susceptibility separate clearly, which is consistent with a strong microstructure-property link.
- **All Features PCA**: Shows a similar gradient separation, suggesting that combining all variables still preserves the meaningful variance.

#### 🔄 Next Steps

We now proceed to **Phase 4**, where we’ll build an **interactive exploration tool** (Streamlit app) to allow domain experts to filter, compare, and analyze candidate alloys based on target application needs.

---

## 🔎 Phase 4: Candidate Exploration Tool (Challenge 1 - Task 2)
**Goal:** Enable interactive exploration of alloy trade-offs

- Build a dashboard (e.g., with Plotly Dash, Streamlit, or Jupyter Widgets):
  - Filter by property ranges (e.g., YS > 300 MPa)
  - Select application and highlight suitable alloys
  - Show stats and charts for selected alloys
- Add:
  - Top-N alloys for each use case
  - Composition breakdown
  - Phase content and microstructure insights
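
The "Top-N alloys for each use case" item could be backed by a small helper like the one below, which the dashboard would call after the user picks an application. This is a sketch under assumptions: the ranking column (here `YS(MPa)`) is an arbitrary choice for illustration, `alloy_id` is a hypothetical identifier column, and the rows are toy values.

```python
import pandas as pd


def top_n(df, mask, sort_by, n=5, ascending=False):
    """Return the n best rows among those where mask is True."""
    return df[mask].sort_values(sort_by, ascending=ascending).head(n)


# Toy rows for illustration only; "alloy_id" is a hypothetical column.
toy = pd.DataFrame({
    "alloy_id": ["A", "B", "C", "D", "E"],
    "YS(MPa)": [310, 355, 290, 330, 305],
    "CSC": [0.40, 0.60, 0.30, 0.35, 0.45],
})

# Example filter: YS > 300 MPa and CSC < 0.5, ranked by yield strength.
suitable = (toy["YS(MPa)"] > 300) & (toy["CSC"] < 0.5)
best = top_n(toy, suitable, sort_by="YS(MPa)", n=2)
```

Ranking on a single property is the simplest option; a weighted score across several properties would be a natural extension once the per-scenario priorities are settled.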

---

## ⚙️ Phase 5: Optimization Guidance (Challenge 2 - Task 1)
**Goal:** Visualize how to *change inputs* to improve alloy designs

- **Sensitivity Analysis:**
  - Which input % or element affects YS, CSC, conductivity the most?
  - Use partial dependence plots or SHAP values
- **Volatility Analysis:**
  - Where do small composition changes cause large output shifts?
- **Visualization:**
  - Use heatmaps, ridge plots, or slider-based simulations to show "what-if" changes
  - Add annotations to steer toward better regions
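
To make the partial dependence idea concrete, the sketch below computes a one-feature partial dependence curve by hand: the chosen input is fixed at each grid value for every row, the model is evaluated, and the predictions are averaged. The `toy_model` function is a hypothetical stand-in for whatever surrogate model ends up being trained on the dataset, and the inputs are synthetic.

```python
import numpy as np


def partial_dependence_curve(predict, X, feature, grid):
    """Average prediction as one feature is swept over `grid`.

    predict: callable mapping an (n, d) array to n predictions.
    feature: column index of the input being varied.
    """
    X = np.asarray(X, dtype=float)
    curve = []
    for value in grid:
        X_mod = X.copy()
        X_mod[:, feature] = value  # overwrite the feature for every row
        curve.append(predict(X_mod).mean())
    return np.array(curve)


def toy_model(X):
    # HYPOTHETICAL surrogate: output rises with feature 0, falls with
    # feature 1 squared, and ignores feature 2 entirely.
    return 200 + 100 * X[:, 0] - 50 * X[:, 1] ** 2


rng = np.random.default_rng(1)
X_demo = rng.uniform(0, 1, size=(500, 3))  # synthetic inputs in [0, 1]
grid = np.linspace(0, 1, 5)

pd_f0 = partial_dependence_curve(toy_model, X_demo, feature=0, grid=grid)
pd_f2 = partial_dependence_curve(toy_model, X_demo, feature=2, grid=grid)

# The range of each curve is a crude sensitivity score.
sensitivity = {"feature_0": np.ptp(pd_f0), "feature_2": np.ptp(pd_f2)}
```

A flat curve (as for feature 2 here) marks an input with little influence on the output, while a steep or strongly curved section marks a region where small composition changes matter most, which ties into the volatility question above.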

---

## 🖼 Phase 6: Reporting & Presentation
**Goal:** Prepare a visual, interactive and impactful report

- Clean and comment notebooks
- Write a well-structured README with:
  - Project goals
  - Use case summary
  - Key insights
  - Visual demos or GIFs
- Submit a poster or cover visual highlighting your findings

---

## ✅ Deliverables Checklist

| Task                        | Status        |
|-----------------------------|---------------|
| Load & clean data           | ✅             |
| Perform EDA                 | ✅             |
| Define use cases            | 🟡 in progress |
| Visualize global patterns   | 🟡 in progress (PCA done) |
| Build interactive tool      | 🔜             |
| Analyze sensitivity/volatility | 🔜         |
| Final write-up and cover    | 🔜             |
