# Challenge: Promotions

In this challenge, you'll write code to parse and analyze data returned from another Zalando API, such as the one behind [Promos homme (Men's Promotions)](https://www.zalando.fr/promo-homme/) or [Promos femme (Women's Promotions)](https://www.zalando.fr/promo-femme/). The workflow is almost the same as in the guided lesson, but you'll work with different data.

## Obtaining the link

Write your code in the cell below to obtain the data from the API endpoint you choose. A recap of the workflow:

1. Examine the webpages and choose one that you want to work with.

1. Use Google Chrome's DevTools to inspect the XHR network requests. Find out the API endpoint that serves data to the webpage.

1. Test the API endpoint in the browser to verify its data.

1. Change the page number offset of the API URL to test if it's working.

### Examine the webpages and choose one that you want to work with

**Examining https://www.zalando.fr/promo-homme/**

### Use Google Chrome's DevTools to inspect the XHR network requests. Find out the API endpoint that serves data to the webpage.


**https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset=84&sort=popularity**

###  Test the API endpoint in the browser to verify its data

**Tested in Mozilla browser and it works**

### Change the page number offset of the API URL to test if it's working

**Changing the offset to any multiple of 84 works**

**https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset=168&sort=popularity**

**https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset=252&sort=popularity**
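The URLs above differ only in their `offset` value, so they can be generated rather than typed by hand. The following is a minimal sketch: `build_page_url` and `PAGE_SIZE` are hypothetical names introduced here, and the sketch assumes that page 1 corresponds to `offset=0`, which matches the `previous_page_path` and `next_page_path` values seen in the response for `offset=84`.

```python
from urllib.parse import urlencode

BASE_URL = "https://www.zalando.fr/api/catalog/articles"
PAGE_SIZE = 84  # the limit value observed in the DevTools request


def build_page_url(page, category="promo-homme"):
    """Return the API URL for a 1-based page number (assumes page 1 is offset 0)."""
    params = {
        "categories": category,
        "limit": PAGE_SIZE,
        "offset": (page - 1) * PAGE_SIZE,
        "sort": "popularity",
    }
    return f"{BASE_URL}?{urlencode(params)}"


# Page 3 reproduces the offset=168 URL tested above
print(build_page_url(3))
```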

## Reading the data

In the next cell, use Python to obtain data from the API endpoint you chose in the previous step. Workflow:

1. Import libraries.

1. Define the initial API endpoint URL.

1. Make a request to obtain the data of the 1st page. Flatten the data and store it in an empty object variable.

1. Find out the total page count in the 1st page data.

1. Use a FOR loop to make requests for the additional pages from 2 to page count. Append the data of each additional page to the flattened data object.

1. Print and review the data you obtained.
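The steps above can be sketched as a single function before touching the real API. This is only an illustration under stated assumptions: `collect_articles` and `fake_fetch` are hypothetical names, the fetching step is passed in as a function so the sketch runs offline, and the response shape (`pagination.page_count` plus an `articles` list) is assumed to match what the endpoint returns.

```python
import pandas as pd


def collect_articles(fetch_page, max_pages=None):
    """Fetch page 1, read the page count, then fetch and flatten the remaining pages."""
    first = fetch_page(1)
    page_count = first["pagination"]["page_count"]
    if max_pages is not None:
        page_count = min(page_count, max_pages)  # optional cap to limit requests

    frames = [pd.json_normalize(first["articles"])]
    for page in range(2, page_count + 1):
        frames.append(pd.json_normalize(fetch_page(page)["articles"]))
    return pd.concat(frames, ignore_index=True)


def fake_fetch(page):
    """Stand-in for a real request; returns one made-up article per page."""
    return {
        "pagination": {"page_count": 3},
        "articles": [{"sku": f"SKU-{page}", "price": {"original": "10,00 €"}}],
    }


demo = collect_articles(fake_fetch)
print(demo)
```

Passing the fetch function as an argument keeps the pagination logic separate from the network code, which makes it easier to test the loop without sending requests.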

### Import libraries

In [1]:
import json
import requests
import pandas as pd  # pd.json_normalize is used below, so the deprecated pandas.io.json import is not needed
import urllib.request

### Define the initial API endpoint URL.

In [22]:
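# Note: offset=84 skips the first 84 articles, so this URL returns the second page
# (the response reports query_path '/promo-homme/?p=2'); offset=0 would return the first page.
URL = 'https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset=84&sort=popularity'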

### Make request to obtain data of the 1st page. Flatten the data and store it in an empty object variable

In [23]:
headers = {'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/84.0.4147.105 Safari/537.36'}

req = requests.get(URL, headers=headers)

In [7]:
# Read the raw bytes and decode them; the with-block closes the connection automatically
with urllib.request.urlopen(URL) as fp:
    response = fp.read().decode("utf8")

In [8]:
results = json.loads(response)

In [9]:
results_flatten = pd.json_normalize(results)
results_flatten

Unnamed: 0,total_count,sort,articles,query_path,previous_page_path,next_page_path,page_gender,premium,filters,total_article_count,...,teaser.content.tracking_data.cf_tracking_params.slot_id,teaser.content.tracking_data.cf_tracking_params.flow_id,teaser.content.tracking_data.cf_tracking_params.shop_slot_id,teaser.content.tracking_data.cf_tracking_params.campaign_source_id,teaser.content.tracking_data.cf_tracking_params.request_id,teaser.content.tracking_data.cf_tracking_params.campaign_id,teaser.content.tracking_data.cf_tracking_params.creative_id,teaser.content.click_url,teaser.channel_name,teaser.slot_id
0,72013,popularity,"[{'sku': '1KE22O07Y-M11', 'name': 'RIOT BUTTON...",/promo-homme/?p=2,/promo-homme/,/promo-homme/?p=3,men,False,"[{'key': 'sizes', 'label': 'Taille', 'url_key'...",72028,...,21672e65-4b93-4cd0-9943-404c19eece9a,2-NzqeU1d0XDdNmj,catalog,K6h64IsdQ6q,785cbf89-a5b5-405e-a0d5-4bf970837065,23b73eb6-7923-421f-a54d-5abd95a88e36,64ac1450-b7b5-4f4b-8319-ed19fe1a50c7,https://www.zalando.fr/promo-homme/&Sale=True,CF,21672e65-4b93-4cd0-9943-404c19eece9a


In [28]:
results_flatten_articles = pd.json_normalize(results_flatten.articles[0])
results_flatten_articles.head()

Unnamed: 0,sku,name,sizes,url_key,media,brand_name,is_premium,family_articles,flags,product_group,delivery_promises,price.original,price.promotional,price.has_different_prices,price.has_different_original_prices,price.has_different_promotional_prices,price.has_discount_on_selected_sizes_only,outfits,amount,price.base_price
0,A0H15O02B-A11,GEL-KAYANO 5 OG - Baskets basses - white,"[37, 37.5, 40, 40.5, 41.5, 42, 42.5, 43.5, 44,...",asics-tiger-gel-kayano-baskets-basses-a0h15o02...,[{'path': 'A0/H1/5O/02/BA/11/A0H15O02B-A11@8.j...,ASICS SportStyle,False,[],"[{'key': 'discountRate', 'value': 'Jusqu’à -60...",shoe,[],"139,95 €","55,95 €",True,False,True,False,,,
1,CO412A04J-502,CHUCK TAYLOR ALL STAR HI - Baskets montantes -...,"[35, 36, 36.5, 37, 37.5, 38, 39, 39.5, 40, 41,...",converse-baskets-montantes-bleu-co412a04j-502,[{'path': 'CO/41/2A/04/J5/02/CO412A04J-502@19....,Converse,False,[],"[{'key': 'discountRate', 'value': '-20%', 'tra...",shoe,[],"69,95 €","55,95 €",False,False,False,False,"[{'id': '_tvFlJDlTBi', 'url_key': '/outfits/_t...",,
2,UR682H000-G11,BLOCK SWIM - Short de bain - cherry,"[S, M, L, XL, XXL]",urban-classics-block-swim-short-de-bain-ur682h...,[{'path': 'UR/68/2H/00/0G/11/UR682H000-G11@8.j...,Urban Classics,False,[],"[{'key': 'discountRate', 'value': '-20%', 'tra...",beach_wear,[],"19,99 €","15,99 €",False,False,False,False,,,
3,PI912C08F-Q11,Mocassins - black,"[40, 41, 45, 47]",pier-one-mocassins-black-pi912c08f-q11,[{'path': 'PI/91/2C/08/FQ/11/PI912C08F-Q11@6.j...,Pier One,False,[],"[{'key': 'discountRate', 'value': '-60%', 'tra...",shoe,[],"43,99 €","17,79 €",False,False,False,False,,,
4,PRJ32G000-S11,BEARD OIL - Huile à barbe - wood & spice,[],proraso-beard-oil-30ml-beard-oil-wood-and-spic...,[{'path': 'PR/J3/2G/00/0S/11/PRJ32G000-S11@4.j...,Proraso,False,[],"[{'key': 'discountRate', 'value': '-20%', 'tra...",beauty,[],"11,95 €","9,55 €",False,False,False,False,,30 ml,"31,83 € / 100 ml"


### Find out the total page count in the 1st page data

In [10]:
page_count = results['pagination']['page_count']
print(page_count)

858
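As a sanity check, the page count can be cross-checked against the article count. This short sketch assumes that `total_article_count` (72028 in the first-page output above) is spread over pages of 84 articles each, with the last page partially filled.

```python
import math

total_article_count = 72028  # value taken from the flattened first-page output
page_size = 84               # the limit parameter in the API URL

# Round up because a partially filled last page still counts as a page
expected_pages = math.ceil(total_article_count / page_size)
print(expected_pages)  # 858
```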


### Use a FOR loop to make requests for the additional pages from 2 to page count. Append the data of each additional page to the flattened data object.

### Print and review the data you obtained

In [24]:
# page_count2 is capped at 2 because looping over all 858 pages raised an IncompleteRead error
page_count2 = 2

frames = []
for page in range(page_count2):
    # The offset advances by 84 articles per iteration, starting from 0 (the first page)
    url = "https://www.zalando.fr/api/catalog/articles?categories=promo-homme&limit=84&offset=" + str(page * 84) + "&sort=popularity"
    # Fetch each page once; the earlier version also sent an unused requests.get call
    with urllib.request.urlopen(url) as fp:
        results = json.loads(fp.read().decode("utf8"))
    # Flatten the list of articles and index it by SKU
    articles = pd.json_normalize(results["articles"]).set_index("sku")
    frames.append(articles)

# DataFrame.append was removed in pandas 2.0, so concatenate the per-page frames instead
dataframe = pd.concat(frames)
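One possible way to get past an occasional `IncompleteRead` when looping over many pages is to retry the failed request a few times. The following is a hedged sketch, not a confirmed fix for the error seen here: `fetch_with_retries` and `flaky_fetch` are hypothetical names, and it assumes the exception raised is `http.client.IncompleteRead`, which is what `urllib.request` surfaces for truncated responses.

```python
import http.client
import time


def fetch_with_retries(fetch, url, attempts=3, delay=1.0):
    """Call fetch(url), retrying on IncompleteRead with a growing pause between tries."""
    for attempt in range(1, attempts + 1):
        try:
            return fetch(url)
        except http.client.IncompleteRead:
            if attempt == attempts:
                raise  # give up after the last attempt
            time.sleep(delay * attempt)


# Offline demonstration: a stand-in fetch that fails twice, then succeeds
calls = {"n": 0}


def flaky_fetch(url):
    calls["n"] += 1
    if calls["n"] < 3:
        raise http.client.IncompleteRead(b"")
    return "ok"


result = fetch_with_retries(flaky_fetch, "https://example.invalid", delay=0)
print(result, calls["n"])
```

In the loop, `fetch` would be a small function that opens the URL and returns the decoded JSON. A short pause between pages may also reduce the chance of truncated responses.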

In [16]:
dataframe

sku,name,sizes,url_key,media,brand_name,is_premium,family_articles,flags,product_group,delivery_promises,price.original,price.promotional,price.has_different_prices,price.has_different_original_prices,price.has_different_promotional_prices,price.has_discount_on_selected_sizes_only,outfits,amount
TOB22O018-A11,ORIGINAL TEE REGULAR FIT - T-shirt basique - c...,"[XS, S, M, L, XL, XXL]",tommy-jeans-original-tee-regular-fit-t-shirt-b...,[{'path': 'TO/B2/2O/01/8A/11/TOB22O018-A11@8.j...,Tommy Jeans,False,[],"[{'key': 'discountRate', 'value': '-10%', 'tra...",clothing,[],"23,95 €","21,55 €",False,False,False,False,,
NI112O0D2-A11,AIR MAX 270 REACT SE - Baskets basses - white/...,"[44.5, 45, 45.5, 46, 47, 47.5, 49.5]",nike-sportswear-air-max-270-react-se-baskets-b...,[{'path': 'NI/11/2O/0D/2A/11/NI112O0D2-A11@9.j...,Nike Sportswear,False,[],"[{'key': 'campaign', 'value': 'HOT DROP', 'tra...",shoe,[],"149,95 €","89,95 €",False,False,False,False,,
NI115O00M-Q11,P-6000 - Baskets basses - black/lemon/platinum...,"[35.5, 36, 36.5, 37.5, 38, 38.5, 39, 40, 40.5,...",nike-sportswear-p-6000-baskets-basses-ni115o00...,[{'path': 'NI/11/5O/00/MQ/11/NI115O00M-Q11@8.1...,Nike Sportswear,False,[],"[{'key': 'campaign', 'value': 'HOT DROP', 'tra...",shoe,[],"104,95 €","31,45 €",True,False,True,False,,
LA222S02S-B11,Pullover - viennesse,"[L, XL, XXL, 3XL, 4XL]",lacoste-pullover-viennesse-la222s02s-b11,[{'path': 'LA/22/2S/02/SB/11/LA222S02S-B11@13....,Lacoste,False,[],"[{'key': 'discountRate', 'value': '-50%', 'tra...",clothing,[],"119,95 €","59,95 €",False,False,False,False,,
LA222Q035-K11,Gilet - navy blue,"[XS, S, M, L, XL, 4XL]",lacoste-gilet-navy-blue-la222q035-k11,[{'path': 'LA/22/2Q/03/5K/11/LA222Q035-K11@7.j...,Lacoste,False,[],"[{'key': 'discountRate', 'value': 'Jusqu’à -65...",clothing,[],"149,95 €","51,95 €",True,False,True,False,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
OR322Q01D-Q11,CHICAGO CREW NECK - Pullover - black,"[S, L, XL, XXL]",shine-original-chicago-crew-neck-pullover-blac...,[{'path': 'OR/32/2Q/01/DQ/11/OR322Q01D-Q11@4.j...,Shine Original,False,[],"[{'key': 'discountRate', 'value': '-70%', 'tra...",clothing,[],"39,95 €","11,95 €",False,False,False,False,,
L0642G004-C11,Sweatshirt - gray,"[XS, S, L, XL, XXL, 3XL, 4XL]",lacoste-sport-sweatshirt-gray-l0642g004-c11,[{'path': 'L0/64/2G/00/4C/11/L0642G004-C11@2.j...,Lacoste Sport,False,[],"[{'key': 'discountRate', 'value': 'Jusqu’à -30...",clothing,[],"99,95 €","69,99 €",True,False,True,False,,
M2Y12O00D-A11,DARSON - Baskets basses - white/green,"[43, 44, 46]",madden-by-steve-madden-darson-baskets-basses-w...,[{'path': 'M2/Y1/2O/00/DA/11/M2Y12O00D-A11@11....,Madden by Steve Madden,False,[],"[{'key': 'discountRate', 'value': '-50%', 'tra...",shoe,[],"64,95 €","32,45 €",False,False,False,False,,
TO112C02F-A12,EASY SUMMER - Espadrilles - white,"[42, 43, 44, 45]",tommy-hilfiger-granada-espadrilles-to112c02f-a12,[{'path': 'TO/11/2C/02/FA/12/TO112C02F-A12@18....,Tommy Hilfiger,False,[],"[{'key': 'discountRate', 'value': '-30%', 'tra...",shoe,[],"49,95 €","34,95 €",False,False,False,False,,


## Bonus

Extract the following information from the data:

* The trending brand.

* The product(s) with the highest discount.

* The sum of discounts of all goods (sum_discounted_prices divided by sum_original_prices).

In [None]:
#I didn't do it, sorry!
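A minimal sketch of how the bonus questions could be approached is shown below. It runs on five rows copied from the output above rather than on the full data, and it makes some labeled assumptions: "trending brand" is interpreted as the brand that appears most often, the discount rate is computed as `1 - promotional / original`, and `parse_price` is a hypothetical helper that only handles prices formatted like `'139,95 €'`.

```python
import pandas as pd


def parse_price(text):
    """Convert a price string such as '139,95 €' to a float (assumed format)."""
    cleaned = "".join(text.replace("€", "").split())  # drop the symbol and any whitespace
    return float(cleaned.replace(",", "."))


# Five rows copied from the DataFrame output above
sample = pd.DataFrame(
    {
        "sku": ["TOB22O018-A11", "NI112O0D2-A11", "NI115O00M-Q11", "LA222S02S-B11", "LA222Q035-K11"],
        "brand_name": ["Tommy Jeans", "Nike Sportswear", "Nike Sportswear", "Lacoste", "Lacoste"],
        "price.original": ["23,95 €", "149,95 €", "104,95 €", "119,95 €", "149,95 €"],
        "price.promotional": ["21,55 €", "89,95 €", "31,45 €", "59,95 €", "51,95 €"],
    }
).set_index("sku")

sample["original"] = sample["price.original"].map(parse_price)
sample["promotional"] = sample["price.promotional"].map(parse_price)
sample["discount_rate"] = 1 - sample["promotional"] / sample["original"]

# Most frequent brand(s); mode() returns every brand tied for first place
trending_brands = sample["brand_name"].mode().tolist()

# Product(s) with the highest discount rate
top_discount = sample[sample["discount_rate"] == sample["discount_rate"].max()]

# Sum of discounted prices divided by sum of original prices
overall_ratio = sample["promotional"].sum() / sample["original"].sum()

print(trending_brands)
print(top_discount[["brand_name", "discount_rate"]])
print(round(overall_ratio, 4))
```

The same three expressions should work on the full `dataframe` once its price columns are parsed, although rows where `price.has_different_promotional_prices` is `True` may only show a starting price.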