## Spotipy API

Create a Spotify account and follow these steps to register an app: https://developer.spotify.com/documentation/general/guides/app-settings/

After the app is created, it appears on your dashboard:
https://developer.spotify.com/dashboard/applications

Click on it and you'll find the client id and client secret.

#### Authentication

In [3]:
import spotipy # install if needed
from spotipy.oauth2 import SpotifyClientCredentials

In [34]:
# Initialize Spotipy with the app's client credentials
sp = spotipy.Spotify(auth_manager=SpotifyClientCredentials(
    client_id=client_id, client_secret=client_sectret))

#### Searching songs with 'queries' with `sp.search`

In [9]:
results = sp.search(q='Jazzotron', limit=50)

Explore the object returned by the request:

In [10]:
#results

Explore a single song:

In [12]:
#results['tracks']['items'][0]

# Activity 1

In [13]:
artists = ['Jazzotron', 'Shazalakazoo', 'BalkanBeats SoundSystem', 'Neki', 'Damir out Loud']

In [14]:
my_music = []
for artist in artists:
    my_music.append(sp.search(artist, limit=10))


In [19]:
#my_music[4]

In [29]:
def spotify_search(artists_list):
    # Search every artist in the list passed in (not the global `artists`)
    return {artist: sp.search(artist, limit=50) for artist in artists_list}

In [31]:
#spotify_search(artists)

# Password security

In [32]:
import getpass

In [33]:
client_id = str(getpass.getpass('client_id?'))
client_sectret = str(getpass.getpass('client_secret?'))
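As an alternative to typing the credentials on every run, they can be read from environment variables and only prompted for when missing. The sketch below is a minimal illustration: `load_credentials` is a hypothetical helper name, and the variable names `SPOTIPY_CLIENT_ID` and `SPOTIPY_CLIENT_SECRET` are assumed to be set in your shell beforehand.

```python
import os
from getpass import getpass


def load_credentials(env=None, prompt=getpass):
    """Return (client_id, client_secret) without hard-coding them.

    Hypothetical helper: looks in the environment first and falls back
    to a hidden prompt, so secrets never end up saved in the notebook.
    """
    env = os.environ if env is None else env
    client_id = env.get("SPOTIPY_CLIENT_ID") or prompt("client_id? ")
    client_secret = env.get("SPOTIPY_CLIENT_SECRET") or prompt("client_secret? ")
    return client_id, client_secret


# Usage in the notebook (prompts only if the variables are not set):
# client_id, client_secret = load_credentials()
```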

Spotify songs are identified by either a "url", a "uri" or an "id". 

- The `id` is an alphanumeric code, and it's the core part of the identifier.

- The `uri` contains "spotify:track" before the id. A `uri` is useful because it can be searched manually in the Spotify app.

- The `url` is a link to the song on the Spotify web player.

We'll use the `uri` in this code-along, but feel free to use whatever best fits your needs.
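Since all three identifiers share the same id, converting between them is plain string handling. The helpers below are a small sketch, not part of Spotipy: the function names are made up for illustration, and the `https://open.spotify.com/track/` prefix is assumed from the pattern of the artist links returned by the API.

```python
def track_uri(track_id):
    """Build the uri form from a bare track id."""
    return f"spotify:track:{track_id}"


def track_url(track_id):
    """Build the web-player link from a bare track id."""
    return f"https://open.spotify.com/track/{track_id}"


def track_id_from(identifier):
    """Return the bare id from an id, a uri, or a web-player url."""
    if identifier.startswith("spotify:track:"):
        return identifier.split(":")[-1]
    if "open.spotify.com/track/" in identifier:
        # Drop any query string that a shared link may carry.
        return identifier.split("/track/")[1].split("?")[0]
    return identifier  # already a bare id


example_id = track_id_from("spotify:track:2kkdf4naY0ddS9WN4OZPxV")
print(example_id, track_url(example_id))
```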

#### Searching multiple artists

In [None]:
artists = ["Dire Straits", "Queen", "Ella Fitzgerald"]

In [35]:
results.keys()

dict_keys(['tracks'])

#### Exploring the tracks

Before writing a function to get the artists involved in a song, look at the structure of the response:

In [36]:
results['tracks']['href']

'https://api.spotify.com/v1/search?query=Jazzotron&type=track&offset=0&limit=50'

In [50]:
results['tracks']['next']

'https://api.spotify.com/v1/search?query=Jazzotron&type=track&offset=50&limit=50'

In [51]:
results['tracks']['total']

89

In [44]:
results['tracks'].keys()

dict_keys(['href', 'items', 'limit', 'next', 'offset', 'previous', 'total'])

To get the ids of the artists of a song, first look at the keys of a single track:

In [49]:
results['tracks']['items'][0].keys()

dict_keys(['album', 'artists', 'available_markets', 'disc_number', 'duration_ms', 'explicit', 'external_ids', 'external_urls', 'href', 'id', 'is_local', 'name', 'popularity', 'preview_url', 'track_number', 'type', 'uri'])

In [53]:
results['tracks']['items'][0]['album'].keys()

dict_keys(['album_type', 'artists', 'available_markets', 'external_urls', 'href', 'id', 'images', 'name', 'release_date', 'release_date_precision', 'total_tracks', 'type', 'uri'])

In [55]:
results['tracks']['items'][0]['album']['artists']

[{'external_urls': {'spotify': 'https://open.spotify.com/artist/1YumFgivFXVVg1AKBJKE5e'},
  'href': 'https://api.spotify.com/v1/artists/1YumFgivFXVVg1AKBJKE5e',
  'id': '1YumFgivFXVVg1AKBJKE5e',
  'name': 'Jazzotron',
  'type': 'artist',
  'uri': 'spotify:artist:1YumFgivFXVVg1AKBJKE5e'}]

In [59]:
results['tracks']['items'][0]['popularity']

39

In [60]:
results['tracks']['items'][0]['id']

'2kkdf4naY0ddS9WN4OZPxV'

In [61]:
results['tracks']['items'][0]['uri']

'spotify:track:2kkdf4naY0ddS9WN4OZPxV'

In [63]:
results['tracks']['items'][0]['href']

'https://api.spotify.com/v1/tracks/2kkdf4naY0ddS9WN4OZPxV'

In [64]:
results['tracks']['items'][0]['explicit']

False

# Activity 2

In [65]:
results['tracks']['items'][0]['artists']

[{'external_urls': {'spotify': 'https://open.spotify.com/artist/1YumFgivFXVVg1AKBJKE5e'},
  'href': 'https://api.spotify.com/v1/artists/1YumFgivFXVVg1AKBJKE5e',
  'id': '1YumFgivFXVVg1AKBJKE5e',
  'name': 'Jazzotron',
  'type': 'artist',
  'uri': 'spotify:artist:1YumFgivFXVVg1AKBJKE5e'},
 {'external_urls': {'spotify': 'https://open.spotify.com/artist/67CxTr6NMaF4v8X8rxXFIA'},
  'href': 'https://api.spotify.com/v1/artists/67CxTr6NMaF4v8X8rxXFIA',
  'id': '67CxTr6NMaF4v8X8rxXFIA',
  'name': 'Sofija Knezevic',
  'type': 'artist',
  'uri': 'spotify:artist:67CxTr6NMaF4v8X8rxXFIA'}]

In [69]:
for artist in results['tracks']['items'][0]['artists']:
    print(artist['id'])

1YumFgivFXVVg1AKBJKE5e
67CxTr6NMaF4v8X8rxXFIA


In [72]:
def get_artists(artists):
    # `artists` is the list stored under a track's 'artists' key
    return [artist['name'] for artist in artists]

In [73]:
def get_ids(artists):
    # `artists` is the list stored under a track's 'artists' key
    return [artist['id'] for artist in artists]

In [74]:
get_artists(results['tracks']['items'][0]['artists'])

['Jazzotron', 'Sofija Knezevic']

In [75]:
get_ids(results['tracks']['items'][0]['artists'])

['1YumFgivFXVVg1AKBJKE5e', '67CxTr6NMaF4v8X8rxXFIA']

### Playlists

We will need to collect a "database" of songs. Playlists are a good way to access relatively large amounts of songs.

In [76]:
playlist = sp.user_playlist_tracks("spotify", "0Swkgrsji8x4cQWoJ4kfo2")

In [77]:
playlist_2 = sp.user_playlist_tracks("spotify", "75GeX247deNdTX7zdyZDnU")


In [78]:
playlist_2["total"]

99

To extract all songs from a playlist, start by looking at the keys of the response:

In [79]:
playlist.keys()

dict_keys(['href', 'items', 'limit', 'next', 'offset', 'previous', 'total'])

To extract just the uris, look at the keys of a single playlist item:

In [83]:
playlist['items'][0].keys()

dict_keys(['added_at', 'added_by', 'is_local', 'primary_color', 'track', 'video_thumbnail'])

In [86]:
playlist['items'][0]['track']['name']

'Sandstorm'

In [None]:
for item in playlist['items']:
    print(item['track']['name'])

In [117]:
rock_playlist = sp.user_playlist_tracks("spotify", '37i9dQZF1DWXRqgorJj26U')

In [None]:
# for i in range(len(rock_playlist['items'])):
#     print(rock_playlist['items'][i]['track']['name'])

In [97]:
# song id
rock_playlist['items'][0]['track']['id']

'0hCB0YR03f6AmQaHbwWDe8'

In [90]:
# song name
rock_playlist['items'][0]['track']['name']

'Whole Lotta Love - 1990 Remaster'

In [96]:
# artist name
rock_playlist['items'][0]['track']['artists'][0]['name']

'Led Zeppelin'

In [104]:
# Collect the id, title and first artist of every track in the playlist
rock_items = rock_playlist['items']
rock_playlist_ids = [item['track']['id'] for item in rock_items]
rock_playlist_song_name = [item['track']['name'] for item in rock_items]
rock_playlist_artist = [item['track']['artists'][0]['name'] for item in rock_items]

rock_list_dict = {'id': rock_playlist_ids, 'title': rock_playlist_song_name, 'artist': rock_playlist_artist}
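The three lists above can also be built in one pass, as a list of row dictionaries that `pd.DataFrame` accepts directly. This is only a sketch: `playlist_rows` is a hypothetical helper, and the check for a missing track or id is an assumption about what playlist items may contain (for example local files), not something observed in this playlist.

```python
def playlist_rows(items):
    """Turn playlist items into rows with id, title and first artist."""
    rows = []
    for item in items:
        track = item.get("track")
        # Assumption: some entries may carry no track or no id; skip them.
        if not track or not track.get("id"):
            continue
        artists = track.get("artists") or [{}]
        rows.append({
            "id": track["id"],
            "title": track["name"],
            "artist": artists[0].get("name"),
        })
    return rows


# Tiny hand-made sample with the same shape as playlist['items'].
sample_items = [
    {"track": {"id": "0hCB0YR03f6AmQaHbwWDe8",
               "name": "Whole Lotta Love - 1990 Remaster",
               "artists": [{"name": "Led Zeppelin"}]}},
    {"track": None},
]
print(playlist_rows(sample_items))
```

With the real data this would be called as `playlist_rows(rock_playlist['items'])`.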



#### Pagination using "next"

When you collect songs from a playlist using `sp.user_playlist_tracks` or `sp.playlist_tracks`, you're limited by the `limit` parameter, which has a maximum (and default) value of 100. When the playlist has more than 100 songs, you have to collect them by navigating through the "pages" of the results.

The `offset` parameter sets the zero-based index of the first result to retrieve: with a `limit` of 100, an offset of 100 returns the second "page" of results, an offset of 200 returns the third page, and so on.

The function `sp.next()` does the same, but in a simpler way: it can be used on the results from any request to directly retrieve the results for the next page.

We can check whether there's a next page or not by accessing the key `next` on the results from any request.
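The pagination loop described here can be sketched as follows. `collect_all_items` is a hypothetical helper that receives the first page and a function returning the next page; in the notebook that function would be `sp.next`. The fake pages are made up so the example runs without contacting the API.

```python
def collect_all_items(first_page, fetch_next):
    """Follow the 'next' key page by page and gather every item."""
    items = list(first_page["items"])
    page = first_page
    while page.get("next"):
        page = fetch_next(page)
        if page is None:  # defensive: stop if no further page comes back
            break
        items.extend(page["items"])
    return items


# Made-up pages standing in for real responses.
fake_pages = {
    "page-2": {"items": ["c", "d"], "next": "page-3"},
    "page-3": {"items": ["e"], "next": None},
}
first = {"items": ["a", "b"], "next": "page-2"}
all_items = collect_all_items(first, lambda page: fake_pages[page["next"]])
print(len(all_items))
```

With a real client, the call would look like `collect_all_items(rock_playlist, sp.next)`.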

### Audio features

An explanation of the audio features is available here: https://developer.spotify.com/documentation/web-api/reference/tracks/get-audio-features/

In [125]:
rock_playlist_ids = []
for item in rock_playlist['items']:
    rock_playlist_ids.append(item['track']['id'])
    

In [133]:
#rock_playlist_ids

In [127]:
audio_feat_rock = sp.audio_features(tracks=rock_playlist_ids)

In [129]:
import pandas as pd

In [130]:
rock_features = pd.DataFrame(audio_feat_rock)

In [131]:
rock_features.head()

Unnamed: 0,danceability,energy,key,loudness,mode,speechiness,acousticness,instrumentalness,liveness,valence,tempo,type,id,uri,track_href,analysis_url,duration_ms,time_signature
0,0.412,0.902,9,-11.6,1,0.405,0.0484,0.131,0.405,0.422,89.74,audio_features,0hCB0YR03f6AmQaHbwWDe8,spotify:track:0hCB0YR03f6AmQaHbwWDe8,https://api.spotify.com/v1/tracks/0hCB0YR03f6A...,https://api.spotify.com/v1/audio-analysis/0hCB...,333893,4
1,0.55,0.824,2,-5.988,1,0.0334,0.448,0.000127,0.366,0.777,114.512,audio_features,7MRyJPksH3G2cXHN8UKYzP,spotify:track:7MRyJPksH3G2cXHN8UKYzP,https://api.spotify.com/v1/tracks/7MRyJPksH3G2...,https://api.spotify.com/v1/audio-analysis/7MRy...,214733,4
2,0.31,0.7,9,-5.678,1,0.047,0.011,0.00965,0.0828,0.763,188.386,audio_features,08mG3Y1vljYA6bvDt4Wqkj,spotify:track:08mG3Y1vljYA6bvDt4Wqkj,https://api.spotify.com/v1/tracks/08mG3Y1vljYA...,https://api.spotify.com/v1/audio-analysis/08mG...,255493,4
3,0.743,0.836,2,-6.465,1,0.116,0.0804,0.0,0.384,0.82,113.375,audio_features,39shmbIHICJ2Wxnk1fPSdz,spotify:track:39shmbIHICJ2Wxnk1fPSdz,https://api.spotify.com/v1/tracks/39shmbIHICJ2...,https://api.spotify.com/v1/audio-analysis/39sh...,188987,4
4,0.546,0.529,9,-13.6,1,0.0436,0.0629,0.000567,0.0383,0.574,151.727,audio_features,2X6gdRlGOQgfaXU9ALUQFQ,spotify:track:2X6gdRlGOQgfaXU9ALUQFQ,https://api.spotify.com/v1/tracks/2X6gdRlGOQgf...,https://api.spotify.com/v1/audio-analysis/2X6g...,271000,4


In [132]:
rock_features.shape

(100, 18)

In [141]:
rock_list_df = pd.DataFrame.from_dict(rock_list_dict)

In [142]:
rock_list_df.head()

Unnamed: 0,id,title,artist
0,0hCB0YR03f6AmQaHbwWDe8,Whole Lotta Love - 1990 Remaster,Led Zeppelin
1,7MRyJPksH3G2cXHN8UKYzP,American Girl,Tom Petty and the Heartbreakers
2,08mG3Y1vljYA6bvDt4Wqkj,Back In Black,AC/DC
3,39shmbIHICJ2Wxnk1fPSdz,Should I Stay or Should I Go - Remastered,The Clash
4,2X6gdRlGOQgfaXU9ALUQFQ,The Chain,Fleetwood Mac


In [154]:
rock_df = pd.concat([rock_list_df.drop(['id'], axis=1), rock_features.drop(['type'], axis=1)], axis=1)

In [155]:
rock_df.head()

Unnamed: 0,title,artist,danceability,energy,key,loudness,mode,speechiness,acousticness,instrumentalness,liveness,valence,tempo,id,uri,track_href,analysis_url,duration_ms,time_signature
0,Whole Lotta Love - 1990 Remaster,Led Zeppelin,0.412,0.902,9,-11.6,1,0.405,0.0484,0.131,0.405,0.422,89.74,0hCB0YR03f6AmQaHbwWDe8,spotify:track:0hCB0YR03f6AmQaHbwWDe8,https://api.spotify.com/v1/tracks/0hCB0YR03f6A...,https://api.spotify.com/v1/audio-analysis/0hCB...,333893,4
1,American Girl,Tom Petty and the Heartbreakers,0.55,0.824,2,-5.988,1,0.0334,0.448,0.000127,0.366,0.777,114.512,7MRyJPksH3G2cXHN8UKYzP,spotify:track:7MRyJPksH3G2cXHN8UKYzP,https://api.spotify.com/v1/tracks/7MRyJPksH3G2...,https://api.spotify.com/v1/audio-analysis/7MRy...,214733,4
2,Back In Black,AC/DC,0.31,0.7,9,-5.678,1,0.047,0.011,0.00965,0.0828,0.763,188.386,08mG3Y1vljYA6bvDt4Wqkj,spotify:track:08mG3Y1vljYA6bvDt4Wqkj,https://api.spotify.com/v1/tracks/08mG3Y1vljYA...,https://api.spotify.com/v1/audio-analysis/08mG...,255493,4
3,Should I Stay or Should I Go - Remastered,The Clash,0.743,0.836,2,-6.465,1,0.116,0.0804,0.0,0.384,0.82,113.375,39shmbIHICJ2Wxnk1fPSdz,spotify:track:39shmbIHICJ2Wxnk1fPSdz,https://api.spotify.com/v1/tracks/39shmbIHICJ2...,https://api.spotify.com/v1/audio-analysis/39sh...,188987,4
4,The Chain,Fleetwood Mac,0.546,0.529,9,-13.6,1,0.0436,0.0629,0.000567,0.0383,0.574,151.727,2X6gdRlGOQgfaXU9ALUQFQ,spotify:track:2X6gdRlGOQgfaXU9ALUQFQ,https://api.spotify.com/v1/tracks/2X6gdRlGOQgf...,https://api.spotify.com/v1/audio-analysis/2X6g...,271000,4


Above, we stored the ids of all the songs of a playlist in a list called `rock_playlist_ids`, retrieved the audio features of those songs with `sp.audio_features`, and combined them with the titles and artists in `rock_df`.

### Searching the audio features for a song

When the user inputs a song, you'll want to retrieve the audio features of that song. How can you do it?

1. Search the user input using the `sp.search()` function. This function works similarly to the search bar in the Spotify app: it uses Spotify's search engine, so it can handle names of songs or artists, and even certain typos.

2. Find the uri of the song that the API gives you back.

3. Use `sp.audio_features` to retrieve the audio features of the song.
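The middle step, pulling the uri out of the search response, can be sketched as below. `first_track_uri` is a hypothetical helper, and the sample response is a trimmed, hand-written stand-in for what `sp.search` returns; the calls to the API appear only as comments.

```python
def first_track_uri(search_results):
    """Return the uri of the best match, or None if nothing was found."""
    items = search_results.get("tracks", {}).get("items", [])
    return items[0]["uri"] if items else None


# Trimmed stand-in for a real search response.
sample_results = {
    "tracks": {"items": [{"name": "some song",
                          "uri": "spotify:track:2kkdf4naY0ddS9WN4OZPxV"}]}
}
print(first_track_uri(sample_results))

# With a real client the three steps would look roughly like this:
# results = sp.search(q=user_input, type="track", limit=1)
# uri = first_track_uri(results)
# features = sp.audio_features([uri])[0] if uri else None
```

Returning `None` for an empty search lets the caller ask the user to try again instead of failing with an `IndexError`.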

### Lab: Create your collection of songs & audio features

To move forward with the project, you need to create a collection of songs with their audio features, as large as possible!

These are the songs that we will cluster. And, later, when the user inputs a song, we will find the cluster to which the song belongs and recommend a song from the same cluster.

The more songs you have, the more accurate and diverse recommendations you'll be able to give. Although... you might want to make sure the collected songs are "curated" in a certain way. Try to find playlists of songs that are diverse, but also that meet certain standards.

The process of sending hundreds or thousands of requests can take some time - it's normal if you have to wait a few minutes (or, if you're ambitious, even hours) to get all the data you need.
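One way to keep long collection runs manageable is to send the ids in batches and pause briefly between requests. The sketch below assumes that `sp.audio_features` accepts at most 100 ids per call; `chunked` and `fetch_in_batches` are hypothetical helper names, and the pause length is an arbitrary choice rather than a documented requirement.

```python
import time


def chunked(seq, size=100):
    """Yield consecutive slices of `seq` with at most `size` elements."""
    for start in range(0, len(seq), size):
        yield seq[start:start + size]


def fetch_in_batches(ids, fetch, size=100, pause=0.0):
    """Call `fetch` on each batch of ids and concatenate the results."""
    results = []
    for batch in chunked(ids, size):
        results.extend(fetch(batch))
        time.sleep(pause)  # small wait between requests
    return results


# Stand-in for the API call so the example runs offline.
fake_ids = [f"id{i}" for i in range(250)]
fake_features = fetch_in_batches(fake_ids, lambda batch: [{"id": t} for t in batch])
print(len(fake_features))
```

With a real client this would be `fetch_in_batches(all_ids, sp.audio_features, pause=0.5)`.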

One idea for collecting as many songs as possible is to start with all the songs of a big, diverse playlist, then go through every artist in the playlist and grab every song from every album by that artist. The number of songs you collect per playlist will grow very quickly!
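A first step toward this idea is gathering the distinct artists of a playlist. The helper below is a sketch with a hypothetical name, `unique_artist_ids`, and runs on hand-made sample data; the follow-up calls to `sp.artist_albums` and `sp.album_tracks` are given only as comments and are untested here.

```python
def unique_artist_ids(items):
    """Map artist id -> artist name for every artist in the playlist items."""
    artists = {}
    for item in items:
        track = item.get("track") or {}
        for artist in track.get("artists", []):
            if artist.get("id"):
                # setdefault keeps the first name seen for each id
                artists.setdefault(artist["id"], artist.get("name"))
    return artists


# Hand-made sample with the same shape as playlist['items'].
sample_playlist_items = [
    {"track": {"artists": [{"id": "1YumFgivFXVVg1AKBJKE5e", "name": "Jazzotron"},
                           {"id": "67CxTr6NMaF4v8X8rxXFIA", "name": "Sofija Knezevic"}]}},
    {"track": {"artists": [{"id": "1YumFgivFXVVg1AKBJKE5e", "name": "Jazzotron"}]}},
]
print(unique_artist_ids(sample_playlist_items))

# Possible next steps with a real client (not run here):
# for artist_id in unique_artist_ids(playlist['items']):
#     albums = sp.artist_albums(artist_id)
#     for album in albums['items']:
#         tracks = sp.album_tracks(album['id'])
```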

In [37]:
def accum(s):
    # Repeat the i-th character (i + 1) times, capitalize each group,
    # and join the groups with '-'.
    groups = []
    for i in range(len(s)):
        groups.append((s[i] * (i + 1)).capitalize())
    return '-'.join(groups)
accum('ase')

'A-Ss-Eee'

In [44]:
'-'.join([letter.upper() + letter.lower()*number for number, letter in enumerate('ase')])

'A-Ss-Eee'