# Challenge: Promotions

In this challenge, you'll develop codes to parse and analyze data returned from another API on Zalando such as [Promos homme (Men's Promotions)
](https://www.zalando.fr/promo-homme/) or [Promos femme (Women's Promotions)](https://www.zalando.fr/promo-femme/). The workflow is almost the same as in the guided lesson but you'll work with different data.

## Obtaining the link

Wrote your codes in the cell below to obtain the data from the API endpoint you choose. A recap of the workflow:

1. Examine the webpages and choose one that you want to work with.

1. Use Google Chrome's DevTools to inspect the XHR network requests. Find out the API endpoint that serves data to the webpage.

1. Test the API endpoint in the browser to verify its data.

1. Change the page number offset of the API URL to test if it's working.

In [1]:
# your code here
url = 'https://www.zalando.fr/api/catalog/articles?categories=promo-sport-homme&limit=84&offset=0&sort=popularity'

<Response [200]>

## Reading the data

In the next cell, use Python to obtain data from the API endpoint you chose in the previous step. Workflow:

1. Import libraries.

1. Define the initial API endpoint URL.

1. Make request to obtain data of the 1st page. Flatten the data and store it in an empty object variable.

1. Find out the total page count in the 1st page data.

1. Use a FOR loop to make requests for the additional pages from 2 to page count. Append the data of each additional page to the flatterned data object.

1. Print and review the data you obtained.

In [3]:
#1. IMPORT LIBRARIES
import json
import requests
import pandas as pd
from pandas.io.json import json_normalize

In [6]:
#2. DEFINE API ENDPOINT
url='https://www.zalando.fr/api/catalog/articles?categories=promo-enfant&limit=84&offset=0&sort=sale'
header={'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.117 Safari/537.36'}

response = requests.get(url, headers=header)

In [7]:
#3.1 MAKE REQUEST (for the 1rst page)

result = response.json()
result

{'total_count': 22054,
 'pagination': {'page_count': 263, 'current_page': 1, 'per_page': 84},
 'sort': 'sale',
 'articles': [{'sku': 'NI116D04L-Q11',
   'name': 'COURT BOROUGH MID BOOT WINTERIZED - Chaussures de skate - black/white',
   'price': {'original': '54,95\xa0€',
    'promotional': '43,95\xa0€',
    'has_different_prices': True,
    'has_different_original_prices': False,
    'has_different_promotional_prices': True,
    'has_discount_on_selected_sizes_only': False},
   'sizes': ['28', '28.5', '29.5', '30', '31.5', '32', '33.5', '34', '35'],
   'url_key': 'nike-sportswear-court-borough-mid-boot-winterized-baskets-montantes-blackwhite-ni116d04l-q11',
   'media': [{'path': 'NI/11/6D/04/LQ/11/NI116D04L-Q11@8.jpg',
     'role': 'DEFAULT',
     'packet_shot': False}],
   'brand_name': 'Nike Sportswear',
   'is_premium': False,
   'family_articles': [{'sku': 'NI116D04L-Q11',
     'url_key': 'nike-sportswear-court-borough-mid-boot-winterized-baskets-montantes-blackwhite-ni116d04l-q11

In [9]:
#3.2. FLATTEN THE DATA
flattened_data = json_normalize(result)
flattened_data

Unnamed: 0,total_count,sort,articles,query_path,next_page_path,page_gender,premium,filters,total_article_count,plusStatus,...,iconPaths.filters.standard_delivery_filter,iconPaths.filters.fast_delivery_filter,iconPaths.filters.zalando_plus,iconPaths.mobileFilters.standard_delivery_filter,iconPaths.mobileFilters.fast_delivery_filter,iconPaths.mobileFilters.zalando_plus,iconPaths.flags.slow_delivery_flag,iconPaths.flags.fast_delivery_flag,iconPaths.flags.plus_delivery_flag,iconPaths.flags.zalando_plus
0,22054,sale,"[{'sku': 'NI116D04L-Q11', 'name': 'COURT BOROU...",/promo-enfant/?order=sale,/promo-enfant/?p=2&order=sale,kids,False,"[{'key': 'sizes', 'label': 'Taille', 'url_key'...",22057,non-eligible,...,icons/truck.svg,icons/truck-fast.svg,icons/plus-short-1.svg,icons/truck.svg,icons/truck-fast.svg,icons/plus-short-1.svg,icons/clock.svg,icons/truck-fast-orange-3.svg,icons/plus-short-1.svg,icons/zalando-plus.svg


In [10]:
goods = json_normalize(flattened_data.articles[0])
goods

Unnamed: 0,sku,name,sizes,url_key,media,brand_name,is_premium,family_articles,flags,product_group,...,price.original,price.promotional,price.has_different_prices,price.has_different_original_prices,price.has_different_promotional_prices,price.has_discount_on_selected_sizes_only,tracking_information.metrigo_impression_urls,tracking_information.impression_beacon,tracking_information.source,amount
0,NI116D04L-Q11,COURT BOROUGH MID BOOT WINTERIZED - Chaussures...,"[28, 28.5, 29.5, 30, 31.5, 32, 33.5, 34, 35]",nike-sportswear-court-borough-mid-boot-winteri...,[{'path': 'NI/11/6D/04/LQ/11/NI116D04L-Q11@8.j...,Nike Sportswear,False,"[{'sku': 'NI116D04L-Q11', 'url_key': 'nike-spo...","[{'key': 'discountRate', 'value': 'Jusqu’à -20...",shoe,...,"54,95 €","43,95 €",True,False,True,False,[https://ccp-et.metrigo.zalan.do/event/sbv?z=b...,https://ccp-et.metrigo.zalan.do/event/sbv?z=be...,ccp,
1,NI114D0B9-Q14,AIR MAX 200 - Baskets basses - black/metallic ...,"[35.5, 36.5, 37.5, 38, 38.5, 40]",nike-sportswear-air-max-200-baskets-basses-ni1...,[{'path': 'NI/11/4D/0B/9Q/14/NI114D0B9-Q14@9.j...,Nike Sportswear,False,"[{'sku': 'NI114D0B9-Q14', 'url_key': 'nike-spo...","[{'key': 'discountRate', 'value': '-20%', 'tra...",shoe,...,"104,95 €","83,95 €",False,False,False,False,[https://ccp-et.metrigo.zalan.do/event/sbv?z=b...,https://ccp-et.metrigo.zalan.do/event/sbv?z=be...,ccp,
2,TO113I00G-K11,Bottines - blue,"[28, 29, 30, 33, 34, 35, 36, 37, 38, 40]",tommy-hilfiger-bottines-blue-to113i00g-k11,[{'path': 'TO/11/3I/00/GK/11/TO113I00G-K11@8.1...,Tommy Hilfiger,False,"[{'sku': 'TO113I00G-K11', 'url_key': 'tommy-hi...","[{'key': 'discountRate', 'value': '-30%', 'tra...",shoe,...,"94,95 €","66,45 €",True,False,True,False,[https://ccp-et.metrigo.zalan.do/event/sbv?z=b...,https://ccp-et.metrigo.zalan.do/event/sbv?z=be...,ccp,
3,AD116D007-A11,STAN SMITH - Baskets basses - white,"[28, 29, 30, 31, 32, 33, 34, 35, 28 1/2, 30 1/...",adidas-originals-stan-smith-baskets-basses-bla...,[{'path': 'AD/11/6D/00/7A/11/AD116D007-A11@12....,adidas Originals,False,"[{'sku': 'AD116D007-A11', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -30...",shoe,...,"54,95 €","38,45 €",True,False,True,False,,,,
4,AD126L00J-Q11,JACKET - Veste d'hiver - black/white,"[3-4a, 4-5a, 5-6a, 6-7a, 7-8a]",adidas-originals-jacket-veste-dhiver-blackwhit...,[{'path': 'AD/12/6L/00/JQ/11/AD126L00J-Q11@8.j...,adidas Originals,False,"[{'sku': 'AD126L00J-Q11', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -35...",clothing,...,"64,95 €","41,95 €",True,False,True,False,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
79,AD114D00P-A11,TOP TEN - Baskets montantes - white,"[19, 20, 21, 22, 23, 24, 25, 26, 27]",adidas-originals-top-ten-baskets-montantes-bla...,[{'path': 'AD/11/4D/00/PA/11/AD114D00P-A11@12....,adidas Originals,False,"[{'sku': 'AD114D00P-A11', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -20...",shoe,...,"44,95 €","35,95 €",True,False,True,False,,,,
80,AD116D008-A12,STAN SMITH - Baskets basses - white/bold pink,"[36, 38, 40, 35 1/2, 36 2/3, 37 1/3, 38 2/3]",adidas-originals-stan-smith-baskets-basses-ad1...,[{'path': 'AD/11/6D/00/8A/12/AD116D008-A12@12....,adidas Originals,False,"[{'sku': 'AD116D008-A12', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -30...",shoe,...,"64,95 €","45,45 €",True,False,True,False,,,,
81,NI114D0CX-B11,FORCE 1 - Baskets basses - wheat/light brown,"[29.5, 30, 31.5, 32, 33, 33.5, 34, 35]",nike-sportswear-force-1-baskets-basses-wheatli...,[{'path': 'NI/11/4D/0C/XB/11/NI114D0CX-B11@11....,Nike Sportswear,False,"[{'sku': 'NI114D0CX-B11', 'url_key': 'nike-spo...","[{'key': 'discountRate', 'value': 'Jusqu’à -20...",shoe,...,"54,95 €","43,95 €",True,False,True,False,,,,
82,NA824L0D7-M11,NKMMALIEN JACKET - Veste d'hiver - green gables,"[8a, 9a, 10a, 11a, 12a, 13a, 14a]",name-it-nkmmalien-jacket-veste-dhiver-green-ga...,[{'path': 'NA/82/4L/0D/7M/11/NA824L0D7-M11@9.j...,Name it,False,"[{'sku': 'NA824L0D7-M11', 'url_key': 'name-it-...","[{'key': 'discountRate', 'value': '-60%', 'tra...",clothing,...,"48,99 €","19,59 €",False,False,False,False,,,,


In [12]:
#4. FIND THE TOTAL NUMBER OF PAGES

page_number= result['pagination']['page_count']
page_number

263

In [14]:
#5. Append the data of each additional page to the flatterned data object.

complete = []
for i in range(page_number):
    offset=i*84
    url='https://www.zalando.fr/api/catalog/articles?categories=promo-enfant&limit=84&offset=0'+str(offset)+'&sort=sale'
    
    response = requests.get(url, headers=header)
    results = response.json()
    x = json_normalize(result)
    flattened_data1 = json_normalize(x.articles[0])
    complete.append(flattened_data1)

data = pd.concat(complete, sort=False)
data.set_index('sku',inplace=True)
data

Unnamed: 0_level_0,name,sizes,url_key,media,brand_name,is_premium,family_articles,flags,product_group,delivery_promises,price.original,price.promotional,price.has_different_prices,price.has_different_original_prices,price.has_different_promotional_prices,price.has_discount_on_selected_sizes_only,tracking_information.metrigo_impression_urls,tracking_information.impression_beacon,tracking_information.source,amount
sku,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1,Unnamed: 5_level_1,Unnamed: 6_level_1,Unnamed: 7_level_1,Unnamed: 8_level_1,Unnamed: 9_level_1,Unnamed: 10_level_1,Unnamed: 11_level_1,Unnamed: 12_level_1,Unnamed: 13_level_1,Unnamed: 14_level_1,Unnamed: 15_level_1,Unnamed: 16_level_1,Unnamed: 17_level_1,Unnamed: 18_level_1,Unnamed: 19_level_1,Unnamed: 20_level_1
NI116D04L-Q11,COURT BOROUGH MID BOOT WINTERIZED - Chaussures...,"[28, 28.5, 29.5, 30, 31.5, 32, 33.5, 34, 35]",nike-sportswear-court-borough-mid-boot-winteri...,[{'path': 'NI/11/6D/04/LQ/11/NI116D04L-Q11@8.j...,Nike Sportswear,False,"[{'sku': 'NI116D04L-Q11', 'url_key': 'nike-spo...","[{'key': 'discountRate', 'value': 'Jusqu’à -20...",shoe,[],"54,95 €","43,95 €",True,False,True,False,[https://ccp-et.metrigo.zalan.do/event/sbv?z=b...,https://ccp-et.metrigo.zalan.do/event/sbv?z=be...,ccp,
NI114D0B9-Q14,AIR MAX 200 - Baskets basses - black/metallic ...,"[35.5, 36.5, 37.5, 38, 38.5, 40]",nike-sportswear-air-max-200-baskets-basses-ni1...,[{'path': 'NI/11/4D/0B/9Q/14/NI114D0B9-Q14@9.j...,Nike Sportswear,False,"[{'sku': 'NI114D0B9-Q14', 'url_key': 'nike-spo...","[{'key': 'discountRate', 'value': '-20%', 'tra...",shoe,[],"104,95 €","83,95 €",False,False,False,False,[https://ccp-et.metrigo.zalan.do/event/sbv?z=b...,https://ccp-et.metrigo.zalan.do/event/sbv?z=be...,ccp,
TO113I00G-K11,Bottines - blue,"[28, 29, 30, 33, 34, 35, 36, 37, 38, 40]",tommy-hilfiger-bottines-blue-to113i00g-k11,[{'path': 'TO/11/3I/00/GK/11/TO113I00G-K11@8.1...,Tommy Hilfiger,False,"[{'sku': 'TO113I00G-K11', 'url_key': 'tommy-hi...","[{'key': 'discountRate', 'value': '-30%', 'tra...",shoe,[],"94,95 €","66,45 €",True,False,True,False,[https://ccp-et.metrigo.zalan.do/event/sbv?z=b...,https://ccp-et.metrigo.zalan.do/event/sbv?z=be...,ccp,
AD116D007-A11,STAN SMITH - Baskets basses - white,"[28, 29, 30, 31, 32, 33, 34, 35, 28 1/2, 30 1/...",adidas-originals-stan-smith-baskets-basses-bla...,[{'path': 'AD/11/6D/00/7A/11/AD116D007-A11@12....,adidas Originals,False,"[{'sku': 'AD116D007-A11', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -30...",shoe,[],"54,95 €","38,45 €",True,False,True,False,,,,
AD126L00J-Q11,JACKET - Veste d'hiver - black/white,"[3-4a, 4-5a, 5-6a, 6-7a, 7-8a]",adidas-originals-jacket-veste-dhiver-blackwhit...,[{'path': 'AD/12/6L/00/JQ/11/AD126L00J-Q11@8.j...,adidas Originals,False,"[{'sku': 'AD126L00J-Q11', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -35...",clothing,[],"64,95 €","41,95 €",True,False,True,False,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
AD114D00P-A11,TOP TEN - Baskets montantes - white,"[19, 20, 21, 22, 23, 24, 25, 26, 27]",adidas-originals-top-ten-baskets-montantes-bla...,[{'path': 'AD/11/4D/00/PA/11/AD114D00P-A11@12....,adidas Originals,False,"[{'sku': 'AD114D00P-A11', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -20...",shoe,[],"44,95 €","35,95 €",True,False,True,False,,,,
AD116D008-A12,STAN SMITH - Baskets basses - white/bold pink,"[36, 38, 40, 35 1/2, 36 2/3, 37 1/3, 38 2/3]",adidas-originals-stan-smith-baskets-basses-ad1...,[{'path': 'AD/11/6D/00/8A/12/AD116D008-A12@12....,adidas Originals,False,"[{'sku': 'AD116D008-A12', 'url_key': 'adidas-o...","[{'key': 'discountRate', 'value': 'Jusqu’à -30...",shoe,[],"64,95 €","45,45 €",True,False,True,False,,,,
NI114D0CX-B11,FORCE 1 - Baskets basses - wheat/light brown,"[29.5, 30, 31.5, 32, 33, 33.5, 34, 35]",nike-sportswear-force-1-baskets-basses-wheatli...,[{'path': 'NI/11/4D/0C/XB/11/NI114D0CX-B11@11....,Nike Sportswear,False,"[{'sku': 'NI114D0CX-B11', 'url_key': 'nike-spo...","[{'key': 'discountRate', 'value': 'Jusqu’à -20...",shoe,[],"54,95 €","43,95 €",True,False,True,False,,,,
NA824L0D7-M11,NKMMALIEN JACKET - Veste d'hiver - green gables,"[8a, 9a, 10a, 11a, 12a, 13a, 14a]",name-it-nkmmalien-jacket-veste-dhiver-green-ga...,[{'path': 'NA/82/4L/0D/7M/11/NA824L0D7-M11@9.j...,Name it,False,"[{'sku': 'NA824L0D7-M11', 'url_key': 'name-it-...","[{'key': 'discountRate', 'value': '-60%', 'tra...",clothing,[],"48,99 €","19,59 €",False,False,False,False,,,,


## Bonus

Extract the following information from the data:

* The trending brand.

* The product(s) with the highest discount.

* The sum of discounts of all goods (sum_discounted_prices divided by sum_original_prices).

In [26]:
#1. Trending brand

trending = data['brand_name'].value_counts().to_frame()

trending.head(1)

Unnamed: 0,brand_name
Nike Sportswear,9994


In [39]:
#2. Products with the higher discount
discounts = pd.DataFrame()
discounts['price.original'] = data['price.original'].str.replace('€', '', regex=True).str.replace(',', '.', regex=True)
discounts['price.promotional'] = data['price.promotional'].str.replace('€', '', regex=True).str.replace(',', '.', regex=True)

#change data type to float
discounts[['price.original','price.promotional']] = discounts[['price.original','price.promotional']].astype('float64')

# create discount column substracting original to promotional
discounts['discount'] = discounts['price.original'] - discounts['price.promotional']
discounts = discounts.sort_values('discount', ascending=False)

#Top 10 products
discounts.head(10)

Unnamed: 0_level_0,price.original,price.promotional,discount
sku,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99
QU124L020-K11,129.99,65.0,64.99


In [61]:
#3. Sum of discounts of all goods 
discounts_goods = data[['brand_name','price.original', 'price.promotional']]
discounts_goods['price.original'] = discounts_goods['price.original'].str.replace('€', '', regex=True).str.replace(',', '.', regex=True)
discounts_goods['price.promotional'] = discounts_goods['price.promotional'].str.replace('€', '', regex=True).str.replace(',', '.', regex=True)

#change data type to float
discounts_goods[['price.original','price.promotional']] = discounts_goods[['price.original','price.promotional']].astype('float64')
# create discount column substracting original to promotional
discounts_goods['discounts'] = discounts_goods['price.original'] - discounts_goods['price.promotional']

# group by brand name
discounts_goods = discounts_goods[['price.original', 'price.promotional', 'discounts', 'brand_name']].groupby('brand_name').sum()
discounts_goods = discounts_goods.sort_values('discounts', ascending=False)

discounts_goods

A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  This is separate from the ipykernel package so we can avoid doing imports until
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  after removing the cwd from sys.path.
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  if __name__ == '__main__':


Unnamed: 0_level_0,price.original,price.promotional,discounts
brand_name,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1
Nike Sportswear,802702.3,622626.2,180076.1
adidas Originals,151869.35,106475.55,45393.8
Quiksilver,88620.48,44312.87,44307.61
Levi's®,65657.95,50719.55,14938.4
Tommy Hilfiger,55190.55,41448.8,13741.75
Boboli,37793.1,26431.5,11361.6
River Island,27325.7,16095.6,11230.1
Name it,12884.37,5152.17,7732.2
Diesel,13136.85,5904.35,7232.5
Blue Seven,11821.85,4720.85,7101.0
