# Module 4: APIs
## Spotify
<img src="https://developer.spotify.com/assets/branding-guidelines/logo@2x.png" width=400></img>

In this module we will use APIs to retrieve information about the artists, albums and tracks available on Spotify. But first... what is an **API**?<br>
An API (*Application Programming Interface*) is a set of functions, methods, rules and definitions that let us build applications (in this case a scraper) that communicate with Spotify's servers. APIs are designed and developed by companies that want others to build applications (public or private) on top of their services. Spotify has public, well-documented APIs, which we will use throughout this project.
#### REST
A term you will almost certainly come across when searching for information online is **REST**, or *RESTful*. It stands for *representational state transfer*. If an API is REST or RESTful, it follows a particular set of architectural principles, such as a client/server communication protocol (here, HTTP) and, among other things, a defined set of operations known as **methods**. We have already been using the GET method to make requests to web servers.
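
To make the idea of **methods** concrete, here is a minimal sketch that builds (but never sends) two requests with Python's standard library. The URLs are the ones used later in this notebook; the variable names are illustrative only.

```python
from urllib.request import Request

# A GET request asks the server for a representation of a resource.
get_req = Request('https://api.spotify.com/v1/artists/6mdiAmATAx73kdxrNrnlao', method='GET')

# A POST request sends data to the server in the request body.
post_req = Request('https://accounts.spotify.com/api/token',
                   data=b'grant_type=client_credentials',
                   method='POST')

# Nothing is sent over the network here; we only inspect the objects.
for req in (get_req, post_req):
    print(req.get_method(), req.full_url)
```
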
#### Documentation
As mentioned above, APIs are designed by the same companies that want applications (public or private) to consume their services or data. That is why the way an API is used varies with the service we want to consume: using Spotify's APIs is not the same as using Twitter's. For this reason it is essential to read the available documentation, which is usually found in the developers section of each site. Here is the [link to Spotify's](https://developer.spotify.com/documentation/).
#### JSON
JSON stands for *JavaScript Object Notation*. It is a format for describing objects that became so popular that it is now considered language-independent. In fact, we will use it in this project even though we are working in Python, because it is the format in which we will receive the responses to the requests we make through the APIs. For our purposes it will be nothing more than a dictionary, with a few particularities that we will see throughout the course.
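
As a small illustration of how JSON text maps onto Python objects, the sketch below parses a string shaped like the API responses in this notebook using the standard `json` module. The sample text is made up for the example; `requests` does the equivalent for us when we call `r.json()`.

```python
import json

# JSON text as it might travel over the network (hypothetical sample).
raw = '{"name": "Iron Maiden", "popularity": 77, "followers": {"href": null, "total": 4768250}, "genres": ["metal", "rock"]}'

artist = json.loads(raw)              # JSON object -> dict

print(type(artist).__name__)          # dict
print(artist['followers']['href'])    # JSON null becomes None
print(artist['genres'][0])            # JSON arrays become lists

# Going the other way: dict -> JSON text
print(json.dumps({'ok': True, 'value': None}))
```
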



Useful links for the class:
- [Spotify documentation - Artists](https://developer.spotify.com/documentation/web-api/reference/artists/)
- [Iron Maiden on Spotify](https://open.spotify.com/artist/6mdiAmATAx73kdxrNrnlao)

In [1]:
import requests

In [2]:
id_im = '6mdiAmATAx73kdxrNrnlao'

In [3]:
url_base = 'https://api.spotify.com/v1'

In [4]:
ep_artist = '/artists/{artist_id}'

In [5]:
url_base+ep_artist.format(artist_id=id_im)

'https://api.spotify.com/v1/artists/6mdiAmATAx73kdxrNrnlao'

In [6]:
r = requests.get(url_base+ep_artist.format(artist_id=id_im))

In [7]:
r.status_code

401

In [8]:
r.json()

{'error': {'status': 401, 'message': 'No token provided'}}

In [9]:
token_url = 'https://accounts.spotify.com/api/token'

In [10]:
params = {'grant_type': 'client_credentials'}

In [11]:
headers = {'Authorization': 'Basic NDRiN2IzNmVjMTQ1NDY3ZjlhOWVlYWY3ZTQxN2NmOGI6N2I0YWE3YTBlZjQ4NDQwNDhhYjFkMjI0MzBhMWViMWY='}

In [12]:
r = requests.post(token_url, data=params, headers=headers)

In [13]:
r.status_code

200

In [14]:
r.json()

{'access_token': 'BQBnE69BRs2s6RKAR9SswQCJTTY7aS-7wARG5ix2Qin8c9HRdxj5B7i5Dn0jsvUZHq04uL2JH870DZ1auak',
 'token_type': 'Bearer',
 'expires_in': 3600,
 'scope': ''}
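
The `expires_in` field says how long the token can be used, in seconds. Below is a minimal sketch of keeping track of that, assuming we record the time at which the token was obtained. The helper name `token_expired` and the 60-second safety margin are hypothetical choices, not part of the Spotify API.

```python
import time

def token_expired(obtained_at, expires_in, now=None, margin=60):
    """Return True if the token is expired or about to expire.

    obtained_at: time.time() value recorded when the token was received
    expires_in:  lifetime in seconds, as reported by the token endpoint
    margin:      seconds subtracted as a safety buffer (assumed value)
    """
    if now is None:
        now = time.time()
    return now >= obtained_at + expires_in - margin

obtained_at = 1000.0   # fixed timestamps keep the example deterministic
print(token_expired(obtained_at, 3600, now=1000.0 + 10))     # False
print(token_expired(obtained_at, 3600, now=1000.0 + 3590))   # True
```
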

In [15]:
token = r.json()['access_token']
token

'BQBnE69BRs2s6RKAR9SswQCJTTY7aS-7wARG5ix2Qin8c9HRdxj5B7i5Dn0jsvUZHq04uL2JH870DZ1auak'

In [16]:
header = {"Authorization": "Bearer {}".format(token)}

In [17]:
r = requests.get(url_base+ep_artist.format(artist_id=id_im), headers=header)

In [18]:
r.status_code

200

In [19]:
r.json()

{'external_urls': {'spotify': 'https://open.spotify.com/artist/6mdiAmATAx73kdxrNrnlao'},
 'followers': {'href': None, 'total': 4768250},
 'genres': ['album rock', 'hard rock', 'metal', 'nwobhm', 'rock'],
 'href': 'https://api.spotify.com/v1/artists/6mdiAmATAx73kdxrNrnlao',
 'id': '6mdiAmATAx73kdxrNrnlao',
 'images': [{'height': 640,
   'url': 'https://i.scdn.co/image/4da0201eb9473be7d6dd138b81678e79dfd7eb02',
   'width': 640},
  {'height': 320,
   'url': 'https://i.scdn.co/image/7f99805fcfe3bf12e6c29977200c7e58c234c010',
   'width': 320},
  {'height': 160,
   'url': 'https://i.scdn.co/image/32b9989c0c47736535d76564ed6ae11ebb57948c',
   'width': 160}],
 'name': 'Iron Maiden',
 'popularity': 77,
 'type': 'artist',
 'uri': 'spotify:artist:6mdiAmATAx73kdxrNrnlao'}

In [20]:
url_busqueda = 'https://api.spotify.com/v1/search'

In [21]:
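# Note: requests percent-encodes the literal '+' as %2B (visible in the 'href' of the response below)
search_params = {'q': "Iron+Maiden", 'type':'artist', 'market':'AR'}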

In [22]:
busqueda = requests.get(url_busqueda, headers=header, params=search_params)

In [23]:
busqueda.status_code

200

In [24]:
busqueda.json()

{'artists': {'href': 'https://api.spotify.com/v1/search?query=Iron%2BMaiden&type=artist&market=AR&offset=0&limit=20',
  'items': [{'external_urls': {'spotify': 'https://open.spotify.com/artist/6mdiAmATAx73kdxrNrnlao'},
    'followers': {'href': None, 'total': 4768250},
    'genres': ['album rock', 'hard rock', 'metal', 'nwobhm', 'rock'],
    'href': 'https://api.spotify.com/v1/artists/6mdiAmATAx73kdxrNrnlao',
    'id': '6mdiAmATAx73kdxrNrnlao',
    'images': [{'height': 640,
      'url': 'https://i.scdn.co/image/4da0201eb9473be7d6dd138b81678e79dfd7eb02',
      'width': 640},
     {'height': 320,
      'url': 'https://i.scdn.co/image/7f99805fcfe3bf12e6c29977200c7e58c234c010',
      'width': 320},
     {'height': 160,
      'url': 'https://i.scdn.co/image/32b9989c0c47736535d76564ed6ae11ebb57948c',
      'width': 160}],
    'name': 'Iron Maiden',
    'popularity': 77,
    'type': 'artist',
    'uri': 'spotify:artist:6mdiAmATAx73kdxrNrnlao'},
   {'external_urls': {'spotify': 'https://open.
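
The `href` echoed back in the response contains `Iron%2BMaiden`, which suggests the literal `+` in the query was percent-encoded rather than treated as a space. The sketch below reproduces that with the standard library's `urlencode`. It is an assumption here that `requests` encodes `params` the same way, although the `href` above is consistent with it.

```python
from urllib.parse import urlencode

# A literal '+' in the value is escaped as %2B ...
with_plus = urlencode({'q': 'Iron+Maiden', 'type': 'artist'})
print(with_plus)    # q=Iron%2BMaiden&type=artist

# ... whereas a plain space is what gets encoded as '+'.
with_space = urlencode({'q': 'Iron Maiden', 'type': 'artist'})
print(with_space)   # q=Iron+Maiden&type=artist
```

So writing the query as `'Iron Maiden'`, with a space, is probably the safer form.
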

In [25]:
import pandas as pd

In [26]:
df = pd.DataFrame(busqueda.json()['artists']['items'])
df.head()

| | external_urls | followers | genres | href | id | images | name | popularity | type | uri |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | {'spotify': 'https://open.spotify.com/artist/6... | {'href': None, 'total': 4768250} | [album rock, hard rock, metal, nwobhm, rock] | https://api.spotify.com/v1/artists/6mdiAmATAx7... | 6mdiAmATAx73kdxrNrnlao | [{'height': 640, 'url': 'https://i.scdn.co/ima... | Iron Maiden | 77 | artist | spotify:artist:6mdiAmATAx73kdxrNrnlao |
| 1 | {'spotify': 'https://open.spotify.com/artist/5... | {'href': None, 'total': 1837} | [] | https://api.spotify.com/v1/artists/5jtCtv88wno... | 5jtCtv88wno9jIusR82dmQ | [{'height': 640, 'url': 'https://i.scdn.co/ima... | The Iron Maiden | 6 | artist | spotify:artist:5jtCtv88wno9jIusR82dmQ |
| 2 | {'spotify': 'https://open.spotify.com/artist/2... | {'href': None, 'total': 365} | [] | https://api.spotify.com/v1/artists/2YZr3Xthgy2... | 2YZr3Xthgy283qC7s4tVM1 | [{'height': 640, 'url': 'https://i.scdn.co/ima... | The Bolton Iron Maiden | 1 | artist | spotify:artist:2YZr3Xthgy283qC7s4tVM1 |
| 3 | {'spotify': 'https://open.spotify.com/artist/0... | {'href': None, 'total': 944} | [] | https://api.spotify.com/v1/artists/0VhddzEFAYb... | 0VhddzEFAYbACFHu6K6btD | [{'height': 640, 'url': 'https://i.scdn.co/ima... | Paul Dianno & Dennis Stratton from Iron Maiden | 6 | artist | spotify:artist:0VhddzEFAYbACFHu6K6btD |
| 4 | {'spotify': 'https://open.spotify.com/artist/7... | {'href': None, 'total': 582} | [] | https://api.spotify.com/v1/artists/7p7Pae1Xc78... | 7p7Pae1Xc78SpEAknWJwWL | [{'height': 640, 'url': 'https://i.scdn.co/ima... | Iron Maidnem (tribute to Iron Maiden) | 12 | artist | spotify:artist:7p7Pae1Xc78SpEAknWJwWL |


In [27]:
df.sort_values(by='popularity', ascending=False).iloc[0]['id']

'6mdiAmATAx73kdxrNrnlao'
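
The same selection can be made without a DataFrame. Here is a minimal sketch using the built-in `max`, with the names, popularity values and IDs copied from the table above. The variable names are illustrative.

```python
# Subset of the fields returned by the search, taken from the output above.
items = [
    {'id': '6mdiAmATAx73kdxrNrnlao', 'name': 'Iron Maiden', 'popularity': 77},
    {'id': '5jtCtv88wno9jIusR82dmQ', 'name': 'The Iron Maiden', 'popularity': 6},
    {'id': '2YZr3Xthgy283qC7s4tVM1', 'name': 'The Bolton Iron Maiden', 'popularity': 1},
    {'id': '0VhddzEFAYbACFHu6K6btD', 'name': 'Paul Dianno & Dennis Stratton from Iron Maiden', 'popularity': 6},
    {'id': '7p7Pae1Xc78SpEAknWJwWL', 'name': 'Iron Maidnem (tribute to Iron Maiden)', 'popularity': 12},
]

# Pick the artist with the highest popularity score.
most_popular = max(items, key=lambda artist: artist['popularity'])
print(most_popular['id'])   # 6mdiAmATAx73kdxrNrnlao
```
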

In [28]:
import base64

def get_token(client_id, client_secret):
    # Client Credentials flow: send "client_id:client_secret", Base64-encoded, in a Basic auth header
    encoded = base64.b64encode(bytes(client_id+':'+client_secret, 'utf-8'))
    params = {'grant_type':'client_credentials'}
    header={'Authorization': 'Basic ' + str(encoded, 'utf-8')}
    r = requests.post('https://accounts.spotify.com/api/token', headers=header, data=params)
    if r.status_code != 200:
        print('Request error.', r.json())
        return None
    print('Token valid for {} seconds.'.format(r.json()['expires_in']))
    return r.json()['access_token']

In [29]:
client_id = '44b7b36ec145467f9a9eeaf7e417cf8b'
client_secret = '7b4aa7a0ef4844048ab1d22430a1eb1f'

In [30]:
token = get_token(client_id, client_secret)

Token valid for 3600 seconds.


In [31]:
header = {"Authorization": "Bearer {}".format(token)}

In [32]:
id_im

'6mdiAmATAx73kdxrNrnlao'

In [33]:
artist_im = requests.get(url_base+ep_artist.format(artist_id=id_im), headers=header)
artist_im.status_code

200

In [34]:
artist_im.json()

{'external_urls': {'spotify': 'https://open.spotify.com/artist/6mdiAmATAx73kdxrNrnlao'},
 'followers': {'href': None, 'total': 4768250},
 'genres': ['album rock', 'hard rock', 'metal', 'nwobhm', 'rock'],
 'href': 'https://api.spotify.com/v1/artists/6mdiAmATAx73kdxrNrnlao',
 'id': '6mdiAmATAx73kdxrNrnlao',
 'images': [{'height': 640,
   'url': 'https://i.scdn.co/image/4da0201eb9473be7d6dd138b81678e79dfd7eb02',
   'width': 640},
  {'height': 320,
   'url': 'https://i.scdn.co/image/7f99805fcfe3bf12e6c29977200c7e58c234c010',
   'width': 320},
  {'height': 160,
   'url': 'https://i.scdn.co/image/32b9989c0c47736535d76564ed6ae11ebb57948c',
   'width': 160}],
 'name': 'Iron Maiden',
 'popularity': 77,
 'type': 'artist',
 'uri': 'spotify:artist:6mdiAmATAx73kdxrNrnlao'}

In [35]:
params = {'country': 'AR'}

In [36]:
albums_im = requests.get(url_base+ep_artist.format(artist_id=id_im)+'/albums', headers=header, params=params)
albums_im.status_code

200

In [37]:
albums_im.json()['items']

[{'album_group': 'album',
  'album_type': 'album',
  'artists': [{'external_urls': {'spotify': 'https://open.spotify.com/artist/6mdiAmATAx73kdxrNrnlao'},
    'href': 'https://api.spotify.com/v1/artists/6mdiAmATAx73kdxrNrnlao',
    'id': '6mdiAmATAx73kdxrNrnlao',
    'name': 'Iron Maiden',
    'type': 'artist',
    'uri': 'spotify:artist:6mdiAmATAx73kdxrNrnlao'}],
  'external_urls': {'spotify': 'https://open.spotify.com/album/3oFAX7PeOFbZnKiPmpUPv4'},
  'href': 'https://api.spotify.com/v1/albums/3oFAX7PeOFbZnKiPmpUPv4',
  'id': '3oFAX7PeOFbZnKiPmpUPv4',
  'images': [{'height': 640,
    'url': 'https://i.scdn.co/image/ab67616d0000b273f831658588b69a862c054861',
    'width': 640},
   {'height': 300,
    'url': 'https://i.scdn.co/image/ab67616d00001e02f831658588b69a862c054861',
    'width': 300},
   {'height': 64,
    'url': 'https://i.scdn.co/image/ab67616d00004851f831658588b69a862c054861',
    'width': 64}],
  'name': 'The Book of Souls: Live Chapter',
  'release_date': '2017-11-17',
  'r

In [38]:
[(album['id'], album['name']) for album in albums_im.json()['items']]

[('3oFAX7PeOFbZnKiPmpUPv4', 'The Book of Souls: Live Chapter'),
 ('4vSfHrq6XxVyMcJ6PguFR2', 'The Book of Souls'),
 ('44myoe2TKVeLrIhfcN7xVK', "Maiden England '88 (2013 Remaster)"),
 ('3LymDdEISKszJqeN2z9DBI', 'En Vivo!'),
 ('5YAfxU5OZKqAvE1GZPRQYY', 'The Final Frontier (2015 Remaster)'),
 ('3yTneaS0Z3xsAmCafaYjPw', 'Flight 666: The Original Soundtrack'),
 ('1gdB9kn59KSAVG5VQcjdHi', 'A Matter of Life and Death (2015 Remaster)'),
 ('2rVdAYBUuEnciOvQRrn4YL', 'Death on the Road'),
 ('2Y8x0EEu7il0K2gCQIqVRh', 'Dance of Death (2015 Remaster)'),
 ('4SiZSq9igWhpJoMzjxE1xE', 'Rock In Rio [Live]'),
 ('1hDF0QPIHVTnSJtxyQVguB', 'Brave New World (2015 Remaster)'),
 ('0axV6lvqshTCcGxT2AYiIK', 'Ed Hunter'),
 ('4olc018Cln2QaMRFy1sk7v', 'Virtual XI (2015 Remaster)'),
 ('3irqbaStVsDR9IEdg8Cdwz', 'The X Factor (2015 Remaster)'),
 ('3DRvgymMVPG0TQ2DugHJRb', 'A Real Live Dead One (Live; 2006 Remaster)'),
 ('12vTiEGN96ZQtGb3zfpKE8', 'A Real Live One'),
 ('77SZJrj0HAxMGvZtDzhvw7', 'Live at Donington (1998 Re

In [None]:
bnw_id = '1hDF0QPIHVTnSJtxyQVguB'

In [None]:
album_ep = '/albums/{album_id}'

In [None]:
album_params = {'market':'AR'}

In [None]:
bnw = requests.get(url_base+album_ep.format(album_id=bnw_id)+'/tracks', headers=header, params=album_params)
bnw

In [None]:
bnw.json()

In [None]:
bnw.json()['items']

In [None]:
[(track['id'], track['name']) for track in bnw.json()['items']]

## Class 5

In [None]:
def obtener_discografia(artist_id, token, return_name=False, page_limit=50, country=None):
    url = f'https://api.spotify.com/v1/artists/{artist_id}/albums'
    header = {'Authorization': f'Bearer {token}'}
    params = {'limit': page_limit,
              'offset': 0,
              'country': country}  # requests omits parameters whose value is None
    lista = []
    r = requests.get(url, params=params, headers=header)

    if r.status_code != 200:
        print('Request error.', r.json())
        return None

    if return_name:
        lista += [(item['id'], item['name']) for item in r.json()['items']]
    else:
        lista += [item['id'] for item in r.json()['items']]

    # Follow the 'next' link until the last page, where it is None
    while r.json()['next']:
        r = requests.get(r.json()['next'], headers=header) # The remaining parameters are already embedded in the URL
        if r.status_code != 200:  # An error body has no 'next' key, so stop here
            print('Request error.', r.json())
            return None
        if return_name:
            lista += [(item['id'], item['name']) for item in r.json()['items']]
        else:
            lista += [item['id'] for item in r.json()['items']]

    return lista

In [None]:
def obtener_tracks(album_id, token, return_name=False, page_limit=50, market=None):
    url = f'https://api.spotify.com/v1/albums/{album_id}/tracks'
    header = {'Authorization': f'Bearer {token}'}
    params = {'limit': page_limit,
              'offset': 0,
              'market': market}  # requests omits parameters whose value is None
    lista = []
    r = requests.get(url, params=params, headers=header)

    if r.status_code != 200:
        print('Request error.', r.json())
        return None

    if return_name:
        lista += [(item['id'], item['name']) for item in r.json()['items']]
    else:
        lista += [item['id'] for item in r.json()['items']]

    # Follow the 'next' link until the last page, where it is None
    while r.json()['next']:
        r = requests.get(r.json()['next'], headers=header) # The remaining parameters are already embedded in the URL
        if r.status_code != 200:  # An error body has no 'next' key, so stop here
            print('Request error.', r.json())
            return None
        if return_name:
            lista += [(item['id'], item['name']) for item in r.json()['items']]
        else:
            lista += [item['id'] for item in r.json()['items']]

    return lista
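
`obtener_discografia` and `obtener_tracks` share the same paging loop: read `items`, then follow `next` until it is `None`. As a sketch of how that loop could be factored out, the hypothetical helper `collect_pages` below takes the fetching step as a function, so it can be exercised here with fake pages instead of real requests. The page contents and URLs are invented for the example.

```python
def collect_pages(fetch_page, first_url):
    """Collect 'items' from every page, following 'next' links.

    fetch_page: callable taking a URL and returning the decoded JSON page (a dict)
    first_url:  URL of the first page
    """
    items = []
    url = first_url
    while url:                      # 'next' is None on the last page
        page = fetch_page(url)
        items.extend(page['items'])
        url = page['next']
    return items

# Fake two-page API, standing in for real HTTP calls.
fake_pages = {
    'page1': {'items': [{'id': 'a'}, {'id': 'b'}], 'next': 'page2'},
    'page2': {'items': [{'id': 'c'}], 'next': None},
}

ids = [item['id'] for item in collect_pages(fake_pages.__getitem__, 'page1')]
print(ids)   # ['a', 'b', 'c']
```

With real data, `fetch_page` could be something like `lambda url: requests.get(url, headers=header).json()`, ideally with the same status check used in the functions above.
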