
# 📱 Smartphone Market Analysis

In this project, we analyze a dataset of smartphones and their specifications to uncover insights about pricing, features, and market trends.

**Goals:**
- Clean and prepare the dataset.
- Explore feature distributions and correlations.
- Visualize pricing by brand and specs.
- Provide recommendations and insights.

---


In [None]:

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load data
df = pd.read_csv('Smartphones_cleaned_dataset.csv')

# Fill missing values: median for numeric columns, mode for categorical columns
df['rating'] = df['rating'].fillna(df['rating'].median())
df['processor_brand'] = df['processor_brand'].fillna(df['processor_brand'].mode()[0])
df['num_cores'] = df['num_cores'].fillna(df['num_cores'].median())
df['processor_speed'] = df['processor_speed'].fillna(df['processor_speed'].median())
df['battery_capacity'] = df['battery_capacity'].fillna(df['battery_capacity'].median())
df['num_front_cameras'] = df['num_front_cameras'].fillna(df['num_front_cameras'].median())
df['os'] = df['os'].fillna(df['os'].mode()[0])
df['primary_camera_front'] = df['primary_camera_front'].fillna(df['primary_camera_front'].median())
# Assume a missing value here means the feature is absent, so fill with 0
df['extended_upto'] = df['extended_upto'].fillna(0)
df['fast_charging'] = df['fast_charging'].fillna(0)

df.info()
df.head()
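
The per-column fills above follow one rule: median for numeric columns, mode for categorical columns, and 0 for columns where a missing value is read as "feature absent". A minimal sketch of that rule as a reusable helper is shown below. The function name `impute_missing`, its `zero_fill` parameter, and the toy data are assumptions for illustration and are not part of the original notebook.

```python
import pandas as pd

def impute_missing(frame, zero_fill=()):
    """Return a copy with NaNs filled: 0 for zero_fill columns,
    median for other numeric columns, mode for everything else."""
    out = frame.copy()
    for col in out.columns:
        if not out[col].isna().any():
            continue  # nothing to fill in this column
        if col in zero_fill:
            out[col] = out[col].fillna(0)
        elif pd.api.types.is_numeric_dtype(out[col]):
            out[col] = out[col].fillna(out[col].median())
        else:
            # mode() returns a Series; take the most frequent value
            out[col] = out[col].fillna(out[col].mode()[0])
    return out

# Toy data, made up for illustration (not taken from the dataset)
toy = pd.DataFrame({
    'rating': [80.0, None, 90.0],
    'os': ['android', None, 'android'],
    'fast_charging': [33.0, None, 18.0],
})
clean = impute_missing(toy, zero_fill=('fast_charging',))

# Confirm that no missing values remain after imputation
remaining = int(clean.isna().sum().sum())
print(remaining)
```

Running the same `isna().sum()` check on `df` after the fills is a quick way to confirm the cleaning step covered every column.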



## 📊 Exploratory Data Analysis (EDA)

Let's check the price distribution, the average price by brand, and how the specs correlate with price.


In [None]:

# Price distribution
plt.figure(figsize=(10,6))
sns.histplot(df['price'], bins=30, kde=True)
plt.title('Price Distribution')
plt.xlabel('Price (INR)')
plt.show()

# Price by brand
plt.figure(figsize=(14,6))
brand_avg = df.groupby('brand_name')['price'].mean().sort_values(ascending=False)
sns.barplot(x=brand_avg.index, y=brand_avg.values)
plt.title('Average Price by Brand')
plt.xlabel('Brand')
plt.ylabel('Average Price (INR)')
plt.xticks(rotation=90)
plt.show()
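
A brand's mean price can be dominated by a single very expensive model, especially when the brand has only a few entries. One possible cross-check, sketched here on made-up toy data, is to look at the median and the model count alongside the mean. The `min_models` threshold is a hypothetical choice, not something taken from the dataset.

```python
import pandas as pd

# Toy data, made up for illustration (not taken from the dataset)
toy = pd.DataFrame({
    'brand_name': ['a', 'a', 'a', 'b', 'b', 'c'],
    'price': [10000, 12000, 90000, 20000, 22000, 650000],
})

# Mean, median, and number of models per brand, ordered by median price
brand_stats = (
    toy.groupby('brand_name')['price']
       .agg(['mean', 'median', 'count'])
       .sort_values('median', ascending=False)
)

# Keep only brands with enough models for the average to be meaningful
min_models = 2  # hypothetical threshold
reliable = brand_stats[brand_stats['count'] >= min_models]
print(reliable)
```

In the toy data, brand `a` has a mean far above its median because of one outlier, and brand `c` is dropped because it has a single model.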


In [None]:

# Correlation with price
corr = df[['price', 'rating', 'battery_capacity', 'ram_capacity', 'internal_memory',
           'screen_size', 'refresh_rate', 'primary_camera_rear', 'primary_camera_front']].corr()

plt.figure(figsize=(10,8))
sns.heatmap(corr, annot=True, cmap='coolwarm')
plt.title('Correlation Heatmap')
plt.show()
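
The heatmap shows every pairwise correlation, but the question here is how each spec relates to price. A short sketch of pulling out just the `price` column and ranking it by absolute strength follows. It uses made-up toy data, and the Spearman variant is an optional addition for skewed prices, not something the original notebook computes.

```python
import pandas as pd

# Toy data, made up for illustration (not taken from the dataset)
toy = pd.DataFrame({
    'price': [10000, 15000, 20000, 40000, 80000],
    'ram_capacity': [4, 6, 8, 8, 12],
    'battery_capacity': [5000, 5000, 4500, 4800, 4300],
})

# Pearson correlation of each feature with price, excluding price itself
price_corr = toy.corr()['price'].drop('price')

# Order features by absolute correlation, keeping the sign for reading
order = price_corr.abs().sort_values(ascending=False).index
ranked = price_corr.reindex(order)
print(ranked)

# Rank-based alternative that is less sensitive to a few very high prices
spearman = toy.corr(method='spearman')['price'].drop('price')
print(spearman)
```

On the real data, the same two lines applied to `corr` from the cell above would give the ranking that the heatmap only shows visually.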


In [None]:

# Price vs. RAM capacity, colored by brand (legend hidden to keep the plot readable)
plt.figure(figsize=(10,6))
sns.scatterplot(x='ram_capacity', y='price', data=df, hue='brand_name', legend=False)
plt.title('Price vs. RAM Capacity')
plt.xlabel('RAM (GB)')
plt.ylabel('Price (INR)')
plt.show()



## ✅ Summary & Recommendations

- Some brands focus on premium pricing (e.g., Vertu, Apple) while others target budget segments.
- RAM, internal memory, and camera specs show a strong correlation with price.
- Battery capacity and refresh rate show a moderate correlation with price. These are associations, not evidence that a spec drives price.
- Brands could use this data to identify gaps in price segments or to optimize specs for target markets.
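
One way the last point could be explored is by binning prices into segments and counting how many models fall into each. The sketch below uses made-up toy data, and the segment boundaries and labels are hypothetical; they would need to be chosen to fit the actual market.

```python
import pandas as pd

# Toy data, made up for illustration (not taken from the dataset)
toy = pd.DataFrame({
    'brand_name': ['a', 'a', 'b', 'b', 'c'],
    'price': [9000, 14000, 25000, 70000, 120000],
})

# Hypothetical segment boundaries in INR (right edge inclusive)
bins = [0, 15000, 30000, 60000, float('inf')]
labels = ['budget', 'mid', 'upper-mid', 'premium']
toy['segment'] = pd.cut(toy['price'], bins=bins, labels=labels)

# Count models per segment, including segments with no models at all
counts = (
    toy['segment'].astype(str)
       .value_counts()
       .reindex(labels, fill_value=0)
)

# Segments with zero models are candidate gaps
empty_segments = [seg for seg in labels if counts[seg] == 0]
print(counts)
print(empty_segments)
```

Grouping by `brand_name` before counting would show the same gaps per brand rather than for the market as a whole.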

This notebook demonstrates an end-to-end EDA for smartphone market analysis.

---
