# 📌 Phase 2: Feature Engineering for Naxalite Hideout Prediction

This notebook loads the raw location data and enriches it with two geospatial features: elevation and distance to a village (currently a single simulated one). These features will be used to train an ML model in the next phase.

In [None]:
# ✅ Step 1: Load the combined CSV data
import pandas as pd
df = pd.read_csv("../Data/naxal_hideouts_combined.csv")
df.head()

## ✅ Step 2: Add Elevation using Open-Elevation API
The helper below fetches elevation (in meters) for a single latitude/longitude pair. It makes one HTTP request per row, so running it over the full dataset is slow and may run into API rate limits.

In [None]:
import requests
import time

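def get_elevation(lat, lon, timeout=10):
    """Return the elevation in meters for one coordinate, or None if the lookup fails."""
    url = f"https://api.open-elevation.com/api/v1/lookup?locations={lat},{lon}"
    try:
        # A timeout keeps a stalled request from hanging the whole notebook
        response = requests.get(url, timeout=timeout)
        response.raise_for_status()
        return response.json()['results'][0]['elevation']
    except (requests.RequestException, KeyError, IndexError, ValueError):
        # Network/HTTP error or unexpected response shape: leave the value missing
        return None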

# Uncomment below to run for all rows (slow)
# df['elevation'] = df.apply(lambda row: get_elevation(row['latitude'], row['longitude']), axis=1)
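
Because each lookup is a separate HTTP request, it can help to query every distinct coordinate only once and to pause between requests. The following is a minimal sketch of that idea, not part of the original pipeline: `lookup_unique` is a hypothetical helper, and the one-second default pause is an assumption, not a documented limit of the API.

```python
import time

def lookup_unique(coords, lookup, pause_s=1.0):
    """Call lookup(lat, lon) once per distinct coordinate, pausing between calls.

    Returns a dict mapping (lat, lon) -> whatever lookup returned
    (for get_elevation: meters, or None on failure).
    """
    cache = {}
    for lat, lon in coords:
        key = (lat, lon)
        if key in cache:
            continue  # this coordinate was already fetched
        if cache:
            time.sleep(pause_s)  # throttle every request after the first (assumed delay)
        cache[key] = lookup(lat, lon)
    return cache
```

With the dataframe from Step 1, this could be used as `coords = list(zip(df['latitude'], df['longitude']))`, then `elevations = lookup_unique(coords, get_elevation)`, and finally `df['elevation'] = [elevations[c] for c in coords]`.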

## ✅ Step 3: Add Distance to Simulated Village
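This step computes the geodesic distance (in km) from each location to a single placeholder village coordinate. You can later replace it with real road or village coordinates from GIS tools or OSM data.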

In [None]:
from geopy.distance import geodesic

# Simulate a village nearby (replace with real village coordinates later)
village_coord = (19.35000, 80.95000)

df['distance_to_village'] = df.apply(
    lambda row: geodesic((row['latitude'], row['longitude']), village_coord).km,
    axis=1
)
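
Once real village coordinates are available, the single-point distance above generalizes to "distance to the nearest village". The sketch below illustrates this with a plain haversine (great-circle) formula from the standard library instead of geopy's `geodesic`, so values will differ slightly from the ellipsoidal distance. The helper names are hypothetical, and apart from the simulated village above, the coordinates in `villages` are made up for illustration.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def nearest_village_km(lat, lon, villages):
    """Distance in km from (lat, lon) to the closest coordinate in villages."""
    return min(haversine_km(lat, lon, v_lat, v_lon) for v_lat, v_lon in villages)

# Placeholder coordinates: the first is the simulated village from above,
# the other two are invented for illustration only.
villages = [(19.35, 80.95), (19.50, 81.10), (19.10, 80.70)]
```

It can be applied row by row in the same way as the single-village version: `df['distance_to_village'] = df.apply(lambda row: nearest_village_km(row['latitude'], row['longitude'], villages), axis=1)`.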

## ✅ Step 4: Save Enhanced Dataset
This will be used for ML model training in Phase 3.

In [None]:
df.to_csv("../Data/naxal_hideouts_features.csv", index=False)
print("✅ Feature-enhanced dataset saved successfully.")
df.head()