# Challenge: Promotions

In this challenge, you'll develop codes to parse and analyze data returned from another API on Zalando such as [Promos homme (Men's Promotions)
](https://www.zalando.fr/promo-homme/) or [Promos femme (Women's Promotions)](https://www.zalando.fr/promo-femme/). The workflow is almost the same as in the guided lesson but you'll work with different data.

## Obtaining the link

Wrote your codes in the cell below to obtain the data from the API endpoint you choose. A recap of the workflow:

1. Examine the webpages and choose one that you want to work with.

1. Use Google Chrome's DevTools to inspect the XHR network requests. Find out the API endpoint that serves data to the webpage.

1. Test the API endpoint in the browser to verify its data.

1. Change the page number offset of the API URL to test if it's working.

In [1]:
# your code here
url = "https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset=84&sort=sale"

## Reading the data

In the next cell, use Python to obtain data from the API endpoint you chose in the previous step. Workflow:

1. Import libraries.

1. Define the initial API endpoint URL.

1. Make request to obtain data of the 1st page. Flatten the data and store it in an empty object variable.

1. Find out the total page count in the 1st page data.

1. Use a FOR loop to make requests for the additional pages from 2 to page count. Append the data of each additional page to the flatterned data object.

1. Print and review the data you obtained.

In [2]:
import json
import urllib
import requests
import pandas as pd
from pandas import json_normalize


In [3]:
# your code here
response = urllib.request.urlopen(url)
results = json.load(response)


flattened_data = json_normalize(results)
flattened_data1 = json_normalize(flattened_data.articles[0])
flattened_data1.head()

Unnamed: 0,sku,name,sizes,url_key,media,brand_name,is_premium,family_articles,flags,product_group,outfits,delivery_promises,price.original,price.promotional,price.has_different_prices,price.has_different_original_prices,price.has_different_promotional_prices,price.has_discount_on_selected_sizes_only
0,JA222G0VA-K11,JJILIAM JJORIGINAL - Jeans Skinny - blue denim,"[27x30, 28x30, 29x30, 29x32, 30x30, 30x32, 30x...",jack-and-jones-jjiliam-jjoriginal-jeans-skinny...,[{'path': 'spp-media-p1/61cf96c837723db1892137...,Jack & Jones,False,[],"[{'key': 'discountRate', 'value': 'Jusqu’à -10...",clothing,"[{'id': '4LkTduB9R6-', 'url_key': '/outfits/4L...",[],"49,99 €","44,99 €",True,False,True,False
1,PI922SA07-Q13,veste en sweat zippée - black,"[XS, S, M, L, XL, XXL, 3XL, 4XL, 5XL]",pier-one-sweat-zippe-black-pi922sa07-q13,[{'path': 'spp-media-p1/61e297ed0b8b30d5b6b692...,Pier One,False,[],"[{'key': 'discountRate', 'value': '-25%', 'tra...",clothing,,[],"29,99 €","22,49 €",False,False,False,False
2,LA212O06W-A11,LEROND - Baskets basses - white/navy/red,"[40, 40.5, 41, 42, 42.5, 43, 44, 44.5, 45, 46,...",lacoste-lerond-baskets-basses-whitenavyred-la2...,[{'path': 'spp-media-p1/615139e109fc3767a967e9...,Lacoste,False,[],"[{'key': 'discountRate', 'value': '-25%', 'tra...",shoe,,[],"94,95 €","70,95 €",False,False,False,False
3,PU142A11O-A12,Chaussures de basket - puma white-puma white,"[39, 40, 40.5, 41, 42, 42.5, 43, 44, 44.5, 45,...",puma-chaussures-de-basket-puma-white-puma-whit...,[{'path': 'spp-media-p1/5f7d6b06a5103f9c970887...,Puma,False,[],"[{'key': 'discountRate', 'value': '-62%', 'tra...",shoe,,[],"130,00 €","49,95 €",False,False,False,False
4,NE352B0MJ-Q11,9FORTY MLB LOS ANGELES DODGERS - Casquette - ...,[One Size],new-era-9forty-mlb-los-angeles-dodgers-casquet...,[{'path': 'spp-media-p1/ac87d754052833ac8b4248...,New Era,False,[],"[{'key': 'discountRate', 'value': '-30%', 'tra...",accessoires,,[],"19,95 €","13,95 €",False,False,False,False


In [4]:
results

{'total_count': 54824,
 'pagination': {'page_count': 653, 'current_page': 2, 'per_page': 84},
 'sort': 'sale',
 'articles': [{'sku': 'JA222G0VA-K11',
   'name': 'JJILIAM JJORIGINAL - Jeans Skinny - blue denim',
   'price': {'original': '49,99\xa0€',
    'promotional': '44,99\xa0€',
    'has_different_prices': True,
    'has_different_original_prices': False,
    'has_different_promotional_prices': True,
    'has_discount_on_selected_sizes_only': False},
   'sizes': ['27x30',
    '28x30',
    '29x30',
    '29x32',
    '30x30',
    '30x32',
    '30x34',
    '31x30',
    '31x32',
    '31x34',
    '32x30',
    '32x32',
    '32x34',
    '33x32',
    '33x34',
    '34x32',
    '34x34',
    '36x32',
    '36x34'],
   'url_key': 'jack-and-jones-jjiliam-jjoriginal-jeans-skinny-blue-denim-ja222g0va-k11',
   'media': [{'path': 'spp-media-p1/61cf96c837723db18921374c879d1139/65df0db3c8fa43e5a558a54552fcbb2b.jpg',
     'role': 'DEFAULT',
     'packet_shot': False},
    {'path': 'spp-media-p1/3e2c054c2

In [5]:
total_pages = results["pagination"]["page_count"]
total_pages

653

In [6]:
df = pd.DataFrame()
for i in range(30):
    k= 84*i
    url = f"https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset={84}&sort=sale"
    response = urllib.request.urlopen(url)
    results = json.load(response)
    data = json_normalize(results)
    data1 = json_normalize(data.articles[0])
    data1 = data1.set_index('sku')
    df = df.append(data1)
    print(f"getting page {i} of {total_pages}", end='\r')
    
df.head()

getting page 29 of 653

Unnamed: 0_level_0,name,sizes,url_key,media,brand_name,is_premium,family_articles,flags,product_group,outfits,delivery_promises,price.original,price.promotional,price.has_different_prices,price.has_different_original_prices,price.has_different_promotional_prices,price.has_discount_on_selected_sizes_only
sku,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1,Unnamed: 5_level_1,Unnamed: 6_level_1,Unnamed: 7_level_1,Unnamed: 8_level_1,Unnamed: 9_level_1,Unnamed: 10_level_1,Unnamed: 11_level_1,Unnamed: 12_level_1,Unnamed: 13_level_1,Unnamed: 14_level_1,Unnamed: 15_level_1,Unnamed: 16_level_1,Unnamed: 17_level_1
JA222G0VA-K11,JJILIAM JJORIGINAL - Jeans Skinny - blue denim,"[27x30, 28x30, 29x30, 29x32, 30x30, 30x32, 30x...",jack-and-jones-jjiliam-jjoriginal-jeans-skinny...,[{'path': 'spp-media-p1/61cf96c837723db1892137...,Jack & Jones,False,[],"[{'key': 'discountRate', 'value': 'Jusqu’à -10...",clothing,"[{'id': '4LkTduB9R6-', 'url_key': '/outfits/4L...",[],"49,99 €","44,99 €",True,False,True,False
PI922SA07-Q13,veste en sweat zippée - black,"[XS, S, M, L, XL, XXL, 3XL, 4XL, 5XL]",pier-one-sweat-zippe-black-pi922sa07-q13,[{'path': 'spp-media-p1/61e297ed0b8b30d5b6b692...,Pier One,False,[],"[{'key': 'discountRate', 'value': '-25%', 'tra...",clothing,,[],"29,99 €","22,49 €",False,False,False,False
LA212O06W-A11,LEROND - Baskets basses - white/navy/red,"[40, 40.5, 41, 42, 42.5, 43, 44, 44.5, 45, 46,...",lacoste-lerond-baskets-basses-whitenavyred-la2...,[{'path': 'spp-media-p1/615139e109fc3767a967e9...,Lacoste,False,[],"[{'key': 'discountRate', 'value': '-25%', 'tra...",shoe,,[],"94,95 €","70,95 €",False,False,False,False
PU142A11O-A12,Chaussures de basket - puma white-puma white,"[39, 40, 40.5, 41, 42, 42.5, 43, 44, 44.5, 45,...",puma-chaussures-de-basket-puma-white-puma-whit...,[{'path': 'spp-media-p1/5f7d6b06a5103f9c970887...,Puma,False,[],"[{'key': 'discountRate', 'value': '-62%', 'tra...",shoe,,[],"130,00 €","49,95 €",False,False,False,False
NE352B0MJ-Q11,9FORTY MLB LOS ANGELES DODGERS - Casquette - ...,[One Size],new-era-9forty-mlb-los-angeles-dodgers-casquet...,[{'path': 'spp-media-p1/ac87d754052833ac8b4248...,New Era,False,[],"[{'key': 'discountRate', 'value': '-30%', 'tra...",accessoires,,[],"19,95 €","13,95 €",False,False,False,False


## Bonus

Extract the following information from the data:

* The trending brand.

* The product(s) with the highest discount.

* The sum of discounts of all goods (sum_discounted_prices divided by sum_original_prices).

In [7]:
# your code here

In [8]:
df.brand_name.value_counts().head()

Jack & Jones        390
Pier One            300
Tommy Hilfiger      240
Puma                180
adidas Originals    152
Name: brand_name, dtype: int64

In [9]:
df["price.original"] = df["price.original"].str.replace(",",".")
df["price.original"] = df["price.original"].str.replace("€","").str.strip()

df["price.promotional"] = df["price.promotional"].str.replace(",",".")
df["price.promotional"] = df["price.promotional"].str.replace("€","").str.strip()

In [10]:
df["discount"]= 100 - ((df['price.promotional'].astype(float)*100)/df['price.original'].astype(float))

In [11]:
df1 = df[["name","brand_name","price.original","price.promotional","discount"]].copy()
df1.head()

Unnamed: 0_level_0,name,brand_name,price.original,price.promotional,discount
sku,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1,Unnamed: 5_level_1
JA222G0VA-K11,JJILIAM JJORIGINAL - Jeans Skinny - blue denim,Jack & Jones,49.99,44.99,10.002
PI922SA07-Q13,veste en sweat zippée - black,Pier One,29.99,22.49,25.008336
LA212O06W-A11,LEROND - Baskets basses - white/navy/red,Lacoste,94.95,70.95,25.276461
PU142A11O-A12,Chaussures de basket - puma white-puma white,Puma,130.0,49.95,61.576923
NE352B0MJ-Q11,9FORTY MLB LOS ANGELES DODGERS - Casquette - ...,New Era,19.95,13.95,30.075188


In [12]:
df1.sort_values(by = "discount", ascending = False).head()

Unnamed: 0_level_0,name,brand_name,price.original,price.promotional,discount
sku,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1,Unnamed: 5_level_1
PU142A11O-A12,Chaussures de basket - puma white-puma white,Puma,130.0,49.95,61.576923
PU142A11O-A12,Chaussures de basket - puma white-puma white,Puma,130.0,49.95,61.576923
PU142A11O-A12,Chaussures de basket - puma white-puma white,Puma,130.0,49.95,61.576923
PU142A11O-A12,Chaussures de basket - puma white-puma white,Puma,130.0,49.95,61.576923
PU142A11O-A12,Chaussures de basket - puma white-puma white,Puma,130.0,49.95,61.576923


In [19]:
sum_original=df1['price.original'].astype(float).sum()
sum_discount=df1['price.promotional'].astype(float).sum()
total_disc= sum_discount/sum_original
total_disc

0.7338461002483492