# SQL Queries 

#### Top Categories by Engagement

In [0]:
%sql
SELECT 
    category,
    COUNT(*) AS total_events
FROM gold.ecommerce.fact_events
GROUP BY category
ORDER BY total_events DESC;


category,total_events
others,20094946
electronics,18049562
appliances,6293586
apparel,3348750
computers,3133835
furniture,1953235
auto,1128201
construction,899085
kids,756959
accessories,435529
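
To put these counts in proportion, here is a minimal Python sketch that computes each category's share of the events shown. It uses only the rows displayed in the output above, so the shares are relative to those rows and not necessarily to the full table. The variable names are illustrative.

```python
# Rows copied from the query output above (only the rows displayed there).
rows = [
    ("others", 20094946),
    ("electronics", 18049562),
    ("appliances", 6293586),
    ("apparel", 3348750),
    ("computers", 3133835),
    ("furniture", 1953235),
    ("auto", 1128201),
    ("construction", 899085),
    ("kids", 756959),
    ("accessories", 435529),
]

# Total over the displayed rows only.
shown_total = sum(count for _, count in rows)

# Fraction of the displayed events that falls in each category.
shares = {category: count / shown_total for category, count in rows}

for category, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{category:<13}{share:6.1%}")
```

Under this assumption, `others` and `electronics` together account for roughly two thirds of the displayed events.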


#### Daily Trend of User Activity

In [0]:
%sql
SELECT 
    date,
    COUNT(DISTINCT user_id) AS active_users
FROM gold.ecommerce.fact_events
GROUP BY date
ORDER BY date;


date,active_users
2019-10-01,180990
2019-10-02,163146
2019-10-03,148474
2019-10-04,187764
2019-10-05,179148
2019-10-06,168609
2019-10-07,163184
2019-10-08,210205
2019-10-09,193028
2019-10-10,180254
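
One way to read this trend is to look at day-over-day changes. The sketch below does that in plain Python for the ten rows displayed above; it is an illustration on the shown output, not a query against the Gold table, and the variable names are hypothetical.

```python
# Rows copied from the query output above.
daily = [
    ("2019-10-01", 180990),
    ("2019-10-02", 163146),
    ("2019-10-03", 148474),
    ("2019-10-04", 187764),
    ("2019-10-05", 179148),
    ("2019-10-06", 168609),
    ("2019-10-07", 163184),
    ("2019-10-08", 210205),
    ("2019-10-09", 193028),
    ("2019-10-10", 180254),
]

# Change in active users relative to the previous day.
changes = [
    (date, users - prev_users)
    for (_, prev_users), (date, users) in zip(daily, daily[1:])
]

# Days with the highest and lowest number of active users.
peak_day = max(daily, key=lambda row: row[1])
low_day = min(daily, key=lambda row: row[1])

for date, delta in changes:
    print(f"{date}  {delta:+d}")
print("peak:", peak_day, "low:", low_day)
```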


#### Most Viewed Products

In [0]:
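# Load the Silver-layer events table.
silver_df = spark.read.table("workspace.default.df_silver_events")
# Count "view" events per product name (the result is not sorted).
views_count_df = (
    silver_df.filter(silver_df.event_type == "view")
    .groupBy("product_name")
    .count()
)
display(views_count_df)
# Persist the aggregate as a Gold table, replacing any previous version.
views_count_df.write.mode("overwrite").saveAsTable("gold.ecommerce.views_count_df")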

product_name,count
jumper,29842
tools,869486
sewing_machine,186490
trousers,36092
shoes,2557010
glove,2773
telephone,332768
clocks,1783270
bag,348193
universal,18116
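
The rows above come back in no particular order, so the most viewed products are not at the top. As a minimal sketch, the displayed rows can be ranked by view count in plain Python; in the notebook itself, a similar effect could likely be had by sorting the grouped DataFrame on `count` in descending order before calling `display`.

```python
# Rows copied from the output above (only the rows displayed there).
views = [
    ("jumper", 29842),
    ("tools", 869486),
    ("sewing_machine", 186490),
    ("trousers", 36092),
    ("shoes", 2557010),
    ("glove", 2773),
    ("telephone", 332768),
    ("clocks", 1783270),
    ("bag", 348193),
    ("universal", 18116),
]

# Rank products from most to least viewed.
ranked = sorted(views, key=lambda row: row[1], reverse=True)

for rank, (product, count) in enumerate(ranked, start=1):
    print(f"{rank:>2}. {product:<15}{count:>10,}")
```

Among the displayed rows, `shoes` and `clocks` rank first and second.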


#### Category-Level Price Insights

In [0]:
%sql
SELECT 
    category,
    MIN(price) AS min_price,
    MAX(price) AS max_price,
    AVG(price) AS avg_price
FROM gold.ecommerce.fact_events
GROUP BY category;


category,min_price,max_price,avg_price
apparel,0,1338,81.45047614781635
auto,0,2165,149.0030322610953
country_yard,0,2426,273.2345886442642
stationery,0,699,34.099333967649855
others,0,2574,187.5180538429912
construction,0,2571,178.21435904280463
kids,0,2571,146.1105581676154
furniture,0,2574,283.69046632893634
computers,0,2574,535.893931237605
appliances,0,2574,238.7093648994389
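
Every category shows a minimum price of 0, which may indicate missing or placeholder prices that pull the averages down. The sketch below shows one way to compare averages with and without zero-priced rows using `NULLIF`. It runs against an in-memory SQLite table with a handful of hypothetical rows, not against the Gold table, so the table contents and the `*_nonzero_*` column aliases are assumptions made for illustration.

```python
import sqlite3

# In-memory stand-in for the events table, with hypothetical sample rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE fact_events (category TEXT, price REAL)")
conn.executemany(
    "INSERT INTO fact_events VALUES (?, ?)",
    [
        ("apparel", 0.0),
        ("apparel", 50.0),
        ("apparel", 100.0),
        ("auto", 0.0),
        ("auto", 200.0),
    ],
)

# NULLIF(price, 0) turns zero prices into NULL, which MIN and AVG ignore.
query = """
SELECT
    category,
    MIN(NULLIF(price, 0)) AS min_nonzero_price,
    MAX(price)            AS max_price,
    AVG(price)            AS avg_price,
    AVG(NULLIF(price, 0)) AS avg_nonzero_price
FROM fact_events
GROUP BY category
ORDER BY category
"""
results = conn.execute(query).fetchall()

for row in results:
    print(row)
```

Whether zero-priced events should be excluded depends on what a price of 0 means in the source data, which the output above does not settle.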


#### Brand Performance

In [0]:
%sql
SELECT 
    brand,
    COUNT(*) AS total_events,
    AVG(price) AS avg_price
FROM gold.ecommerce.fact_events
GROUP BY brand
ORDER BY total_events DESC;


brand,total_events,avg_price
Unspecified,9167238,210.48632608862124
samsung,5848153,368.2662910836977
apple,4602522,857.0580922807104
xiaomi,3076867,205.1606705782213
huawei,1150309,276.207819811894
lucente,1125787,242.56486173672283
bosch,733535,266.1050352062274
lg,732087,490.6793564152894
respect,710305,78.62964782734177
sony,615247,444.798287517046


### Using SQL, I explored the curated Gold tables to generate meaningful business insights. I analyzed product popularity, category-level trends, brand performance, and user engagement by writing aggregation, filtering, and ranking queries. These SQL analyses validated the quality of the Gold layer and provided clear, data-driven interpretations of user behavior and product performance.