# ISRO Launch Market Trend Analysis & Risk Prediction
### Minor in AI Final Project Submission

## Problem Definition & Objective
The primary objective of this project is to analyze the historical performance of the Indian Space Research Organisation (ISRO) launch vehicles and build a predictive model that estimates the probability of mission success. By leveraging machine learning, specifically a Random Forest classifier, I aim to generate a 'Risk Score' for future launches. This score will serve as a quantitative tool for commercial satellite operators and insurance providers to calculate risk-adjusted costs when choosing ISRO as their launch partner.

## Selected Project Track
This project is submitted under the **Machine Learning & Predictive Analytics** track, with a specific focus on Aerospace Data Engineering.

## Clear Problem Statement
Despite ISRO's reputation as a cost-effective and reliable space agency, recent anomalies in the PSLV program, specifically the PSLV-C61 (May 2025) and PSLV-C62 (January 2026) failures, have highlighted the need for more granular risk assessment. Reliability is traditionally calculated as the simple percentage of historical launches that succeeded. However, that figure does not account for the complexity of multi-payload rideshares, rocket configurations, or specific orbital requirements. My project addresses this by moving from static success rates to a dynamic, feature-driven risk prediction model.

## Real-World Relevance and Motivation
The global small-satellite launch market is becoming increasingly competitive with the rise of private players like SpaceX. For India to meet its 2033 goal of capturing 8–10% of the global market, it must provide transparency to international customers. High-profile failures like the PSLV-C62 anomaly on January 12, 2026, create a 'trust deficit' and immediate spikes in insurance premiums (estimated at 20–30%). This project provides an open-source, data-driven method to quantify these risks, helping startups and commercial partners make informed financial and technical decisions.

## Data Understanding & Preparation
The dataset is constructed by scraping Wikipedia’s launch logs for the PSLV, GSLV, and LVM3 rocket families.

- **Data Integrity:** Used a BeautifulSoup-based scraper that handles table cells spanning multiple rows (`rowspan`).
- **Feature Engineering:** Extracted and summed payload masses using regex.
- **Target Labeling:** Binary mapping (1 for Success, 0 for Failure).
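
The feature-engineering and labeling steps above could look roughly like the following. This is a minimal sketch, not the project's actual code: the function names `total_payload_mass_kg` and `label_outcome`, the regex, and the rule that anything other than a plain "Success" (including partial failures) maps to 0 are all assumptions made for illustration.

```python
import re

# Assumed pattern: a number (optionally with thousands separators and a
# decimal part) followed by "kg". Masses given only in other units are ignored.
MASS_PATTERN = re.compile(r"(\d[\d,]*(?:\.\d+)?)\s*kg", re.IGNORECASE)


def total_payload_mass_kg(cell_text):
    """Sum every 'N kg' figure found in a payload cell."""
    masses = [float(m.replace(",", "")) for m in MASS_PATTERN.findall(cell_text)]
    return sum(masses)


def label_outcome(outcome_text):
    """Binary target: 1 for success, 0 for anything else (assumed rule)."""
    return 1 if outcome_text.strip().lower().startswith("success") else 0


# Made-up example cell, not real launch data.
example_mass = total_payload_mass_kg("Sat-A: 1,710 kg; Sat-B: 10.5 kg")
example_label = label_outcome("Partial failure")
```

Treating partial failures as failures is a conservative choice for a risk model; a different mapping may suit other uses.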

## Model / System Design
The system follows a modular three-notebook architecture: Pipeline, EDA, and ML Model. I chose Random Forest because it handles non-linear relationships and provides feature importance scores, which indicate the factors that contribute most to mission risk.
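
To illustrate the modeling step, here is a hedged sketch of a Random Forest that outputs a probability and feature importances. The data is synthetic (random numbers, not launch records), the feature names are hypothetical, and defining the Risk Score as `1 - P(success)` is an assumption about how the score is derived.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 200

# Synthetic stand-in features; names are hypothetical.
feature_names = ["payload_mass_kg", "n_payloads", "vehicle_code"]
X = np.column_stack([
    rng.uniform(100, 4000, n),   # total payload mass
    rng.integers(1, 30, n),      # number of payloads on the flight
    rng.integers(0, 3, n),       # encoded rocket family
])
# Synthetic labels with roughly 10% failures (0) and 90% successes (1).
y = (rng.random(n) > 0.1).astype(int)

model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X, y)

# Locate the probability column for class 1 (success) explicitly.
success_col = list(model.classes_).index(1)
p_success = model.predict_proba(X)[:, success_col]
risk_score = 1.0 - p_success  # assumed definition of the Risk Score

# Importances are normalized by scikit-learn to sum to 1.
importances = dict(zip(feature_names, model.feature_importances_))
```

Because the labels here are random, the importances carry no meaning; the sketch only shows where the probabilities and importances come from.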

## Core Implementation
The implementation is written in Python using pandas, BeautifulSoup, and scikit-learn. It includes 'Check-then-Scrape' logic and propagates values into empty cells through forward-filling.
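
The sketch below shows one plausible reading of these two ideas. It assumes 'Check-then-Scrape' means reusing a locally cached copy when one exists and scraping only otherwise, and that forward-filling is used to copy a value down into rows left empty by merged table cells. The names `load_or_scrape`, `forward_fill`, and `fake_scrape` are hypothetical.

```python
import tempfile
from pathlib import Path


def load_or_scrape(cache_path, scrape_fn):
    """Return cached text if the file exists; otherwise scrape and cache it."""
    path = Path(cache_path)
    if path.exists():
        return path.read_text(encoding="utf-8")
    text = scrape_fn()
    path.write_text(text, encoding="utf-8")
    return text


def forward_fill(values):
    """Replace each None with the most recent non-None value before it."""
    filled, last = [], None
    for value in values:
        if value is not None:
            last = value
        filled.append(last)
    return filled


# Demo with a stand-in scraper so that no network access is needed.
calls = []

def fake_scrape():
    calls.append(1)
    return "<table>placeholder</table>"

with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "launch_log.html"
    first = load_or_scrape(cache, fake_scrape)   # scrapes and writes the cache
    second = load_or_scrape(cache, fake_scrape)  # served from the cache

configs = forward_fill(["PSLV-XL", None, None, "PSLV-CA", None])
```

With pandas, `DataFrame.ffill()` performs the same forward-fill on a whole table.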

## Evaluation & Analysis
The model is evaluated using Accuracy, Precision, Recall, and the Brier Score to assess the quality of predicted probabilities (Risk Scores).
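
As a rough illustration of these metrics, the snippet below computes them with scikit-learn on a handful of made-up labels and probabilities (not project results). The 0.5 decision threshold is an assumption; the Brier Score is the mean squared difference between the predicted probability and the actual outcome, so lower is better.

```python
from sklearn.metrics import (
    accuracy_score,
    brier_score_loss,
    precision_score,
    recall_score,
)

# Made-up outcomes (1 = success) and predicted success probabilities.
y_true = [1, 1, 1, 0, 1, 0, 1, 1]
p_success = [0.9, 0.8, 0.7, 0.4, 0.6, 0.2, 0.95, 0.3]

# Hard predictions at an assumed 0.5 threshold.
y_pred = [1 if p >= 0.5 else 0 for p in p_success]

accuracy = accuracy_score(y_true, y_pred)
precision = precision_score(y_true, y_pred)
recall = recall_score(y_true, y_pred)

# The Brier Score uses the probabilities directly, not thresholded labels.
brier = brier_score_loss(y_true, p_success)
```

Since failures are the rare class, it may also be worth reporting recall for failures with `recall_score(y_true, y_pred, pos_label=0)`.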

## Ethical Considerations & Responsible AI
The project ensures transparency by using public data and addresses class imbalance issues to prevent biased success predictions in safety-critical domains.
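
The text does not say which technique is used for class imbalance. One common option, shown here purely as an assumption, is to weight each class inversely to its frequency; the formula below is the same heuristic scikit-learn applies for `class_weight="balanced"`. The helper name `balanced_class_weights` and the label counts are hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """weight[c] = n_samples / (n_classes * count[c])"""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Made-up label distribution: 18 successes (1) and 2 failures (0).
labels = [1] * 18 + [0] * 2
weights = balanced_class_weights(labels)
# The rare failure class gets the larger weight.
```

A dictionary like `weights` can be passed to `RandomForestClassifier(class_weight=weights)`, or `class_weight="balanced"` can be used directly.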

## Conclusion & Future Scope
Machine learning provides valuable financial insights in aerospace. Future work involves integrating real-time weather data and expanding the dataset to include global launchers for a comparative Market Risk Index.