
# Superstore Data Analysis - Modified
This notebook provides a revised analysis of the Superstore dataset, focusing on profit, sales, and shipping costs by state. Because the original dataset is not available here, the cells below use a small sample of five states.

We will explore the following:
- Profit distribution across different states.
- Profit-to-sales ratio to assess profitability.
- Comparison of shipping costs by state.
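
The profit-to-sales ratio in the second bullet is profit divided by sales, so a higher value means more profit per unit of sales. As a minimal sketch of the calculation, the frame `sample` and its numbers below are made up for illustration and are not Superstore data:

```python
import pandas as pd

# Hypothetical two-row frame, used only to illustrate the calculation
sample = pd.DataFrame({
    'State': ['A', 'B'],
    'Profit': [20.0, 30.0],
    'Sales': [100.0, 120.0],
})

# Profit-to-sales ratio: profit earned per unit of sales
sample['Profit_to_Sales_Ratio'] = sample['Profit'] / sample['Sales']

# Rank states from highest to lowest ratio
ranked = sample.sort_values('Profit_to_Sales_Ratio', ascending=False)
print(ranked[['State', 'Profit_to_Sales_Ratio']])
```

Sorting by the ratio rather than by raw profit helps separate states that are simply large from states that are comparatively profitable.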


In [None]:

import pandas as pd
import matplotlib.pyplot as plt

# Sample data (as we don't have the original dataset)
data = {
    'State': ['California', 'Texas', 'New York', 'Florida', 'Illinois'],
    'Profit': [30000, 25000, 18000, 12000, 10000],
    'Sales': [150000, 120000, 90000, 60000, 50000],
    'Shipping_Cost': [5000, 4000, 3000, 2000, 1000]
}

df = pd.DataFrame(data)
df


In [None]:

# Pie chart showing each state's share of total profit
plt.figure(figsize=(8, 8))
plt.pie(df['Profit'], labels=df['State'], autopct='%1.1f%%', startangle=140, colors=plt.cm.Paired.colors)
plt.title('Profit Distribution by State')
plt.show()


In [None]:

# Calculate profit-to-sales ratio
df['Profit_to_Sales_Ratio'] = df['Profit'] / df['Sales']
df[['State', 'Profit_to_Sales_Ratio']]


In [None]:

# Bar chart comparing shipping costs by state
plt.figure(figsize=(8, 6))
df.groupby('State')['Shipping_Cost'].sum().plot.bar(color='skyblue')
plt.title('Shipping Costs by State')
plt.xlabel('State')
plt.ylabel('Shipping Cost')
plt.show()
