**Project Theme:**
Climate Risk & Disaster Management

**Project Title:**
Landslide Risk Prediction

**Problem Statement:**
Landslides endanger lives and damage infrastructure and the environment. Predicting a landslide risk level (Low, Moderate, High, or Very High) from factors such as temperature, humidity, precipitation, soil moisture, and elevation can help mitigate these impacts. Accurate risk assessment supports early warning systems, disaster management, and safer land-use planning.

**Description:**
This project uses a dataset of 5,000 records, each with five environmental measurements (temperature, humidity, precipitation, soil moisture, and elevation) and a landslide risk label, to build machine learning models that predict the risk level. The models can indicate how strongly weather patterns and terrain features are associated with landslide susceptibility. These insights can inform disaster preparedness, land management, and infrastructure development in vulnerable regions.

*Import necessary libraries*

In [16]:
import pandas as pd
import numpy as np

*Load the dataset*

In [17]:
# Read the CSV into a DataFrame (the /content/ path assumes a Colab-style session)
df = pd.read_csv('/content/regenerated_landslide_risk_dataset.csv')

*Explore the dataset*

In [18]:
print("Data Info:")
df.info()

Data Info:
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 5000 entries, 0 to 4999
Data columns (total 6 columns):
 #   Column                     Non-Null Count  Dtype 
---  ------                     --------------  ----- 
 0   Temperature (°C)           5000 non-null   int64 
 1   Humidity (%)               5000 non-null   int64 
 2   Precipitation (mm)         5000 non-null   int64 
 3   Soil Moisture (%)          5000 non-null   int64 
 4   Elevation (m)              5000 non-null   int64 
 5   Landslide Risk Prediction  5000 non-null   object
dtypes: int64(5), object(1)
memory usage: 234.5+ KB
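
The summary above shows that the five features are integers while `Landslide Risk Prediction` is stored as `object` (text). Most models need a numeric target, and the four risk levels have a natural order, so one option is an ordered integer encoding. The following is a minimal sketch, not necessarily the encoding this project uses. It assumes the labels are spelled exactly as in the problem statement; `RISK_LEVELS`, `encode_risk`, and the toy series are hypothetical names introduced for illustration.

```python
import pandas as pd

# Risk levels in increasing severity, as listed in the problem statement.
# Assumption: the CSV stores the labels with exactly this spelling.
RISK_LEVELS = ["Low", "Moderate", "High", "Very High"]

def encode_risk(labels: pd.Series) -> pd.Series:
    """Map risk labels to ordered integer codes (0 = Low ... 3 = Very High)."""
    cat = pd.Categorical(labels, categories=RISK_LEVELS, ordered=True)
    codes = pd.Series(cat.codes, index=labels.index, name="risk_code")
    # Labels outside RISK_LEVELS get code -1; raise instead of hiding them.
    unknown = labels[codes == -1].unique()
    if len(unknown) > 0:
        raise ValueError(f"Unexpected risk labels: {list(unknown)}")
    return codes

# Toy stand-in for df["Landslide Risk Prediction"] (illustrative values only).
toy_labels = pd.Series(["Low", "Very High", "Moderate", "High"])
toy_codes = encode_risk(toy_labels)
print(toy_codes.tolist())  # [0, 3, 1, 2]
```

On the real data this would be called as `encode_risk(df["Landslide Risk Prediction"])`. The explicit error makes a spelling mismatch visible immediately rather than silently producing `-1` codes.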


In [19]:
print("Descriptive Stats:")
print(df.describe())

Descriptive Stats:
       Temperature (°C)  Humidity (%)  Precipitation (mm)  Soil Moisture (%)  \
count       5000.000000   5000.000000         5000.000000        5000.000000   
mean          24.815800     62.614000          123.126400          54.915800   
std            6.052867     19.108533           72.144734          20.345788   
min           15.000000     30.000000            0.000000          20.000000   
25%           20.000000     46.000000           61.000000          37.000000   
50%           25.000000     63.000000          121.000000          55.000000   
75%           30.000000     79.000000          186.000000          72.250000   
max           35.000000     95.000000          250.000000          90.000000   

       Elevation (m)  
count    5000.000000  
mean      503.015800  
std       288.700745  
min         0.000000  
25%       253.000000  
50%       505.000000  
75%       757.000000  
max      1000.000000  
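
The minimum and maximum rows above are a quick way to spot physically impossible values, for example a percentage outside 0 to 100 or negative precipitation. The same check can be automated, as sketched below under stated assumptions: the bounds in `BOUNDS` are illustrative choices, and `out_of_range_counts` and the toy frame are hypothetical and not part of the original notebook.

```python
import pandas as pd

# Assumed plausibility bounds as (low, high); None means "no limit".
# Percentages must lie in [0, 100]; precipitation cannot be negative.
BOUNDS = {
    "Humidity (%)": (0, 100),
    "Soil Moisture (%)": (0, 100),
    "Precipitation (mm)": (0, None),
}

def out_of_range_counts(frame: pd.DataFrame, bounds: dict = BOUNDS) -> dict:
    """Count, per column, how many values fall outside the given bounds."""
    counts = {}
    for col, (low, high) in bounds.items():
        bad = pd.Series(False, index=frame.index)
        if low is not None:
            bad |= frame[col] < low
        if high is not None:
            bad |= frame[col] > high
        counts[col] = int(bad.sum())
    return counts

# Toy frame with two deliberately invalid values (illustrative only).
toy_frame = pd.DataFrame({
    "Humidity (%)": [30, 95, 120],
    "Soil Moisture (%)": [20, 90, 55],
    "Precipitation (mm)": [0, -5, 250],
})
range_report = out_of_range_counts(toy_frame)
print(range_report)
```

Given the ranges printed by `df.describe()`, calling `out_of_range_counts(df)` on the real dataset should report zero for each of these columns.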


In [20]:
print("Missing Values:")
print(df.isnull().sum())
print("\nTotal Missing Values Count:", df.isnull().sum().sum())

Missing Values:
Temperature (°C)             0
Humidity (%)                 0
Precipitation (mm)           0
Soil Moisture (%)            0
Elevation (m)                0
Landslide Risk Prediction    0
dtype: int64

Total Missing Values Count: 0
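
With no missing values, two further checks are often worth running before modelling: duplicated rows and how the rows are spread across the risk classes. The sketch below bundles them with the missing-value count. It is an optional addition rather than a step from the original notebook; `quality_report` and the toy frame are hypothetical names used only for illustration.

```python
import pandas as pd

def quality_report(frame: pd.DataFrame, target: str) -> dict:
    """Summarise missing cells, duplicated rows, and target class shares."""
    return {
        "missing": int(frame.isnull().sum().sum()),
        # duplicated() flags every repeat of an earlier identical row.
        "duplicates": int(frame.duplicated().sum()),
        # Fraction of rows per class; a very uneven split suggests imbalance.
        "class_share": frame[target].value_counts(normalize=True).to_dict(),
    }

# Toy frame with one repeated row (illustrative values only).
toy_data = pd.DataFrame({
    "Elevation (m)": [100, 100, 500, 900],
    "Landslide Risk Prediction": ["Low", "Low", "High", "Very High"],
})
report = quality_report(toy_data, "Landslide Risk Prediction")
print(report)
```

On the real data the call would be `quality_report(df, "Landslide Risk Prediction")`.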
