# Yulu Demand Analysis & Hypothesis Testing

**By Akanksha Trivedi**

## 1. Business Case

- Identify the factors that drive demand for Yulu's shared e-cycles
- Use hypothesis testing to check whether assumptions about those factors hold up statistically

## 2. Load and Profile Data

```python
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from scipy import stats

# Load dataset
df = pd.read_csv('yulu_data.csv')
df.info()
df.describe()
```

## 3. Univariate Analysis

```python
# Distributions
cols = ['casual', 'registered', 'count', 'temp', 'atemp', 'humidity', 'windspeed']
df[cols].hist(figsize=(12,8))
plt.show()
```

### 🔍 Insights

- `casual`, `registered` and `count` are right-skewed and look roughly log-normal
- `humidity`, `count` and `registered` contain outliers
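The two observations above can be checked numerically rather than by eye. The sketch below compares skewness before and after a log transform, and flags outliers with the common 1.5 × IQR rule. The helper names (`skew_before_after_log`, `iqr_outlier_mask`) and the synthetic series are illustrative assumptions, not part of the original notebook; with the real data you would pass `df['count']` and the other columns instead.

```python
import numpy as np
import pandas as pd


def skew_before_after_log(series: pd.Series) -> tuple:
    """Return (skewness of raw values, skewness of log1p(values))."""
    raw = float(series.skew())
    logged = float(np.log1p(series).skew())
    return raw, logged


def iqr_outlier_mask(series: pd.Series, k: float = 1.5) -> pd.Series:
    """Boolean mask of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1 = series.quantile(0.25)
    q3 = series.quantile(0.75)
    iqr = q3 - q1
    return (series < q1 - k * iqr) | (series > q3 + k * iqr)


# Synthetic stand-in for a rental-count column (assumption: not the real data).
rng = np.random.default_rng(0)
demo = pd.Series(rng.lognormal(mean=4.0, sigma=1.0, size=2000), name='count')

raw_skew, log_skew = skew_before_after_log(demo)
n_outliers = int(iqr_outlier_mask(demo).sum())
print(f'skew raw={raw_skew:.2f}, skew log1p={log_skew:.2f}, outliers={n_outliers}')
```

If a column is close to log-normal, the skewness after `log1p` should be much nearer to zero than the raw skewness. The IQR rule is one convention among several, so the outlier count depends on the chosen `k`.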

## 4. Bivariate Analysis

```python
sns.boxplot(x='season', y='count', data=df)
plt.title('Season vs Count')
plt.show()
```

### 🔍 Insights

- Rentals are higher in summer and fall than in the other seasons
- Rainy/foggy days show reduced usage
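The box plot above covers season only, so the weather observation benefits from a matching summary. The sketch below groups rentals by a categorical column and reports the row count, median and mean per group. The helper `summarize_by` and the toy data are hypothetical; the weather codes in the demo are placeholders, not the dataset's actual encoding.

```python
import pandas as pd


def summarize_by(df: pd.DataFrame, group_col: str, value_col: str = 'count') -> pd.DataFrame:
    """Per-group row count, median and mean of value_col, highest median first."""
    return (
        df.groupby(group_col)[value_col]
        .agg(n_rows='count', median='median', mean='mean')
        .sort_values('median', ascending=False)
    )


# Toy data standing in for the real frame (assumption: illustrative values only).
demo = pd.DataFrame({
    'weather': [1, 1, 1, 2, 2, 3],
    'count': [200, 180, 220, 120, 100, 30],
})
summary = summarize_by(demo, 'weather')
print(summary)
```

On the real data, `summarize_by(df, 'weather')` and `summarize_by(df, 'season')` would give the numbers behind both insights. The median is used for ordering because the count distribution is skewed.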

## 5. Correlation Analysis

```python
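# numeric_only=True skips non-numeric columns, which raise an error in pandas >= 2.0
sns.heatmap(df.corr(numeric_only=True), annot=True)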
plt.title('Correlation Matrix')
plt.show()
```

### 🔍 Insights

- Strong positive correlation between count and registered users
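A heatmap can be hard to read when only one target matters. As a rough sketch, the helper below (a hypothetical name, `correlations_with`) ranks every numeric column by the absolute value of its Pearson correlation with `count`. The demo frame is made up for illustration. Note also that if `count` is simply the sum of `casual` and `registered`, as the column names suggest, a strong correlation with `registered` is expected by construction rather than being an independent finding.

```python
import pandas as pd


def correlations_with(df: pd.DataFrame, target: str = 'count') -> pd.Series:
    """Pearson correlation of each numeric column with target, strongest first."""
    corr = df.corr(numeric_only=True)[target].drop(target)
    return corr.sort_values(key=abs, ascending=False)


# Made-up values (assumption: not taken from the Yulu dataset).
demo = pd.DataFrame({
    'registered': [10, 20, 30, 40, 50],
    'temp': [5, 7, 6, 9, 8],
    'humidity': [80, 60, 70, 50, 65],
    'count': [12, 25, 33, 47, 58],
    'label': list('abcde'),  # non-numeric column, ignored by numeric_only=True
})
ranked = correlations_with(demo)
print(ranked)
```

Sorting by absolute value keeps strong negative relationships near the top instead of burying them at the bottom of the list.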

## 6. Hypothesis Testing

```python
# Chi-square test of independence: weather vs season
# H0: weather and season are independent; H1: they are associated
ct = pd.crosstab(df['season'], df['weather'])
chi2, p, dof, _ = stats.chi2_contingency(ct)
print('Chi-square p-value:', p)
```
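The cell above prints only the p-value, so the decision step is left implicit. The sketch below wraps the same `pd.crosstab` and `stats.chi2_contingency` calls in a hypothetical helper, `chi2_independence`, that also reports the degrees of freedom, the smallest expected cell count, and whether the null hypothesis of independence is rejected. The 0.05 significance level and the toy data are assumptions for illustration.

```python
import pandas as pd
from scipy import stats


def chi2_independence(df: pd.DataFrame, row_col: str, col_col: str, alpha: float = 0.05) -> dict:
    """Chi-square test of independence between two categorical columns."""
    table = pd.crosstab(df[row_col], df[col_col])
    chi2, p, dof, expected = stats.chi2_contingency(table)
    return {
        'chi2': float(chi2),
        'p_value': float(p),
        'dof': int(dof),
        # Rule of thumb: the test is reliable when expected counts are at least 5.
        'min_expected': float(expected.min()),
        'reject_h0': bool(p < alpha),
    }


# Toy data with a deliberate association (assumption: illustrative only).
demo = pd.DataFrame({
    'season': [1] * 40 + [2] * 40,
    'weather': [1] * 30 + [2] * 10 + [1] * 10 + [2] * 30,
})
result = chi2_independence(demo, 'season', 'weather')
print(result)
```

On the real data the call would be `chi2_independence(df, 'season', 'weather')`. A p-value below `alpha` suggests that weather and season are not independent; it does not measure how strong the association is. For 2×2 tables SciPy applies Yates' continuity correction by default, which is why the statistic in this demo is slightly smaller than the uncorrected value.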

## 7. Recommendations

- Promote rentals in summer and fall, the seasons with the highest demand
- Offer weather-specific discounts
- Introduce loyalty programs for registered users
- Provide customer comforts (umbrellas, water)
- Use green campaigns to boost eco-awareness