# Exploratory Data Analysis (EDA) on Emergy Metabolism Dataset


This Jupyter Notebook performs a basic exploratory data analysis (EDA) of the Emergy Metabolism dataset, which covers emergy flows for 281 cities in China between 2000 and 2020.


In [None]:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns



## Loading the dataset


In [None]:

# Load the Emergy Metabolism dataset
# Double quotes are used because the file name contains an apostrophe
df = pd.read_excel("data/Emergy flows of 281 China's cities 2000-2020.xlsx")

# Display the first few rows of the dataset
df.head()
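The later cells assume that the spreadsheet exposes columns named `Year`, `Population`, and `Emergy Flow`. A minimal sketch of a check that could run right after loading is shown below; the helper `missing_columns` and the small `demo` frame are hypothetical and only stand in for the real data.

```python
import pandas as pd

# Column names used by the plotting cells in this notebook.
REQUIRED_COLUMNS = ["Year", "Population", "Emergy Flow"]


def missing_columns(frame, required=REQUIRED_COLUMNS):
    """Return the required column names that are absent from `frame`."""
    return [name for name in required if name not in frame.columns]


# Hypothetical stand-in for the loaded spreadsheet (illustration only).
demo = pd.DataFrame({"Year": [2000, 2001], "Population": [1.2e6, 1.3e6]})

absent = missing_columns(demo)
print(absent)  # ['Emergy Flow']
```

Calling `missing_columns(df)` on the real frame and stopping early when the result is non-empty gives a clearer message than a `KeyError` raised from inside a plotting call.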



## Basic Data Information


In [None]:

# Checking data types and missing values
df.info()

# Basic statistical summary of numerical features
df.describe()



## Visualizing Missing Data


In [None]:

# Plot missing data heatmap
plt.figure(figsize=(10, 6))
sns.heatmap(df.isnull(), cbar=False, cmap="viridis")
plt.title('Missing Data Heatmap')
plt.show()
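The heatmap shows where values are missing, but not how many. As a complement, the sketch below tabulates the count and percentage of missing values per column. The helper `missing_summary` and the `demo` frame are illustrative assumptions, not part of the dataset.

```python
import numpy as np
import pandas as pd


def missing_summary(frame):
    """Count and percentage of missing values per column, worst first."""
    counts = frame.isnull().sum()
    summary = pd.DataFrame({
        "missing": counts,
        "percent": 100 * counts / len(frame),
    })
    return summary.sort_values("missing", ascending=False)


# Hypothetical data with a few gaps (illustration only).
demo = pd.DataFrame({
    "Population": [1.0, np.nan, 3.0, 4.0],
    "Emergy Flow": [np.nan, np.nan, 2.0, 5.0],
})

summary = missing_summary(demo)
print(summary)
```

On the real data, `missing_summary(df)` would list the columns that need attention before any modelling.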



## Distribution of Variables


In [None]:

# Histogram of Emergy Flow
plt.figure(figsize=(10, 6))
df['Emergy Flow'].hist(bins=30)
plt.title('Distribution of Emergy Flow')
plt.xlabel('Emergy Flow (sej)')
plt.ylabel('Frequency')
plt.show()
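If the emergy flow values span several orders of magnitude (an assumption here, not something this notebook has verified), a histogram on the raw scale tends to pile most cities into the first bin. One possible workaround is to plot the base-10 logarithm instead. The helper `log10_positive` below is a hypothetical sketch that drops non-positive entries before taking the logarithm.

```python
import numpy as np
import pandas as pd


def log10_positive(values):
    """Return log10 of the strictly positive entries; other entries are dropped."""
    values = pd.Series(values, dtype="float64")
    return np.log10(values[values > 0])


# Hypothetical emergy flows in sej (illustration only); the zero is dropped.
demo = pd.Series([1e20, 1e21, 1e22, 1e24, 0.0])

logged = log10_positive(demo)
print(logged.tolist())
```

The result can be passed to the same histogram call, for example `log10_positive(df['Emergy Flow']).hist(bins=30)`, with the x-axis labelled as log10 of emergy flow.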



## Relationship between Emergy Flow and Population


In [None]:

# Scatter plot of Emergy Flow vs Population
plt.figure(figsize=(10, 6))
sns.scatterplot(data=df, x='Population', y='Emergy Flow')
plt.title('Emergy Flow vs Population')
plt.xlabel('Population')
plt.ylabel('Emergy Flow (sej)')
plt.show()
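A scatter plot shows the shape of the relationship; a correlation coefficient puts a number on it. The sketch below computes the Pearson correlation and a rank-based (Spearman-style) correlation, the latter obtained as the Pearson correlation of the ranks. The helper name `pearson_and_rank_corr` and the `demo` values are assumptions made for illustration.

```python
import pandas as pd


def pearson_and_rank_corr(frame, x="Population", y="Emergy Flow"):
    """Pearson and rank-based correlation between two columns.

    Rows where either value is missing are dropped before ranking.
    """
    pair = frame[[x, y]].dropna()
    pearson = pair[x].corr(pair[y])
    rank_corr = pair[x].rank().corr(pair[y].rank())
    return pearson, rank_corr


# Hypothetical monotonic but non-linear relationship (illustration only).
demo = pd.DataFrame({
    "Population": [1.0, 2.0, 3.0, 4.0],
    "Emergy Flow": [1.0, 4.0, 9.0, 100.0],
})

pearson, rank_corr = pearson_and_rank_corr(demo)
print(pearson, rank_corr)
```

A rank correlation well above the Pearson value would suggest a monotonic but non-linear relationship, which a log-scaled axis may display more clearly.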



## Year-wise Trend of Emergy Flow


In [None]:

# Plotting the year-wise trend of Emergy Flow
plt.figure(figsize=(12, 6))
df.groupby('Year')['Emergy Flow'].sum().plot()
plt.title('Year-wise Trend of Emergy Flow')
plt.xlabel('Year')
plt.ylabel('Total Emergy Flow (sej)')
plt.show()
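A yearly sum is sensitive to how many city records each year contains, so it can help to look at the row count next to the total and at the year-over-year change. The following is a minimal sketch under the assumption that the columns are named `Year` and `Emergy Flow`, as in the cell above; `yearly_totals` and `demo` are hypothetical.

```python
import pandas as pd


def yearly_totals(frame, value="Emergy Flow", year="Year"):
    """Total, number of non-missing rows, and relative growth per year."""
    grouped = frame.groupby(year)[value]
    out = pd.DataFrame({
        "total": grouped.sum(),
        "n_rows": grouped.count(),
    })
    # Relative change of the total with respect to the previous year.
    out["growth"] = out["total"].pct_change()
    return out


# Hypothetical records (illustration only); 2002 has a single row.
demo = pd.DataFrame({
    "Year": [2000, 2000, 2001, 2001, 2002],
    "Emergy Flow": [1.0, 1.0, 2.0, 1.0, 6.0],
})

trend = yearly_totals(demo)
print(trend)
```

If `n_rows` differs between years in the real data, a per-year mean may be a fairer trend line than the sum.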



## Correlation Heatmap


In [None]:

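# Correlation heatmap for numerical features
# numeric_only=True skips any text columns, which would otherwise raise an error in pandas 2.0 and later
plt.figure(figsize=(10, 6))
sns.heatmap(df.corr(numeric_only=True), annot=True, cmap='coolwarm')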
plt.title('Correlation Heatmap')
plt.show()
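With many columns an annotated heatmap becomes hard to read. One option is to list the strongest pairwise correlations as a short table. The helper `strongest_pairs` below is a hypothetical sketch: it restricts the frame to numeric columns, keeps the upper triangle of the correlation matrix so each pair appears once, and sorts by absolute value.

```python
import numpy as np
import pandas as pd


def strongest_pairs(frame, top=3):
    """Return the `top` column pairs with the largest absolute correlation."""
    corr = frame.select_dtypes(include="number").corr()
    # Upper triangle without the diagonal, so each pair is listed once.
    mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    pairs = corr.where(mask).stack().dropna()
    order = pairs.abs().sort_values(ascending=False).index
    return pairs.reindex(order).head(top)


# Hypothetical columns (illustration only); the text column is ignored.
demo = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0],
    "b": [2.0, 4.0, 6.0, 8.0],
    "c": [1.0, 0.0, 1.0, 0.0],
    "city": ["w", "x", "y", "z"],
})

top_pairs = strongest_pairs(demo, top=3)
print(top_pairs)
```

On the real data, `strongest_pairs(df, top=10)` would give a compact list to read alongside the heatmap.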



## Conclusion

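This notebook gives a first overview of the dataset: its structure and missing values, the distribution of emergy flow, its relationship with population, its year-wise trend, and the correlations among numerical features. Further analysis can build on these views to address specific research questions.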
