# Using the Yelp API - Codealong

## Introduction

Now that we've discussed HTTP requests and OAuth, it's time to practice applying those skills to a production-level API. In this codealong, we'll take you through the process of signing up for an OAuth token and then using it to make requests to the Yelp API!

## Objectives

You will be able to:

* Make requests using OAuth
* Use the `json` module to load and parse JSON documents
* Convert JSON to a pandas dataframe

## Generating Access Tokens

As discussed, many APIs require OAuth, which in turn requires an access token. Our first step, then, is to generate this login information so that we can start making some requests.

With that, let's go grab an access token from an API site and make some API calls!
Point your browser over to this [Yelp page](https://www.yelp.com/developers/v3/manage_app) and start creating an app in order to obtain an API access token:

![](./images/yelp_app.png)


You can either sign in to an existing Yelp account or create a new one if needed.

On the page you see above, simply fill out some sample information such as "Flatiron Edu API Example" for the app name, or whatever floats your boat. Afterward, you should be presented with an API key that you can use to make requests!

With that, let's set up our authentication tokens so that we can start making some API calls!

### Should I publicly share my passwords on GitHub?

When using an API that requires an API key or password, you should **NEVER** hardcode these values into your main file. When you upload your project to a public GitHub repository, anyone can read it. Assume that any sensitive information you put publicly on the internet will be found and abused.

To this end, how can we easily access our API key without opening ourselves up to vulnerabilities?

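There are many ways to store sensitive information, but here we will keep the key in a file inside a hidden folder in the home directory, outside of the project repository, so it never gets uploaded along with the code.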

#### Move to your home directory:

```
cd ~
```

#### Now make the `.secret/` directory:

```
mkdir .secret
```

This will create a new folder in your home directory where you can store files for any of the API information you have. 

Can you find the folder you just made in your terminal?  
NOTE: dot files and folders won't show up with plain `ls`; you must add the show-all flag: `ls -a`


#### Move into the newly created `.secret/` folder and create a file using VS Code or any text editor to store your Yelp API login info.

```
cd .secret/
code yelp_api.json
```

In this file, let's create a dictionary holding the API key that looks something like this:

`{"api_key": "input api key here!"}`

NOTE: Double quotes are important, because JSON does not accept single-quoted strings! You'll copy and paste the `api_key` value that Yelp grants you after you create your app.

Ok, so now we have a file in the `.secret` folder in our home directory. Safe and sound (mostly) from anyone trying to steal our info off GitHub.
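
As a hedged alternative to typing the file by hand, the same file could be written from Python. This is a minimal sketch and not part of the original walkthrough: the helper name `write_key_file` is hypothetical, and restricting the file to owner-only permissions with `os.chmod` is an added precaution that mainly has an effect on macOS and Linux. Writing with `json.dump` guarantees the double quotes mentioned above.

```python
import json
import os
from pathlib import Path


def write_key_file(api_key, directory=None):
    """Write {"api_key": ...} to yelp_api.json inside `directory`.

    `directory` defaults to ~/.secret, the folder created above.
    (Hypothetical helper, for illustration only.)
    """
    directory = Path(directory) if directory is not None else Path.home() / ".secret"
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / "yelp_api.json"
    with open(path, "w") as f:
        json.dump({"api_key": api_key}, f)  # json.dump always emits double quotes
    os.chmod(path, 0o600)  # owner read/write only (assumes POSIX-style permissions)
    return path
```

Calling `write_key_file("input api key here!")` with your real key would produce the same file as the editor-based steps.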

#### Finally, let's get our API key into our Jupyter notebook.

Since our file is just a regular JSON file, we can open it and use the `json` module to pull out the appropriate information from the `~/.secret/yelp_api.json` file.


In [18]:
import json

def get_keys(path):
    """Load the JSON file at `path` and return its contents as a dictionary."""
    with open(path) as f:
        return json.load(f)

> **Note**: Change the file path below to point to your own home directory.  
If you're not sure what your username is, check it with `pwd`.  
For example, my current working directory is `/Users/matthew.mitchell/Documents/dsc-using-yelp-api-codealong`  
So the line below would become:
`keys = get_keys("/Users/matthew.mitchell/.secret/yelp_api.json")`

In [19]:
keys = get_keys("/Users/maxsteele/.secret/yelp_api.json")

api_key = keys['api_key']

# While you may wish to print out the API key to check that it imported properly,
# be sure to clear the output before uploading to GitHub.
# Again, you don't want your key stolen!
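
To avoid typing a username into the path at all, the home directory can be resolved at runtime. The following is a minimal sketch under that assumption; `load_api_key` and its `path` argument are illustrative names, not part of the original notebook.

```python
import json
from pathlib import Path


def load_api_key(path=None):
    """Return the 'api_key' value from a JSON credentials file.

    If `path` is None, fall back to ~/.secret/yelp_api.json, resolved
    with Path.home() so that no username has to be hardcoded.
    """
    if path is None:
        path = Path.home() / ".secret" / "yelp_api.json"
    with open(path) as f:
        keys = json.load(f)
    if "api_key" not in keys:
        raise KeyError("expected an 'api_key' entry in {}".format(path))
    return keys["api_key"]
```

With the file created earlier in place, `api_key = load_api_key()` should behave like the two lines in the cell above.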

## An Example Request with OAuth <a id="oauth_request"></a>
https://www.yelp.com/developers/documentation/v3/get_started

In the next lesson, we'll further dissect how to read and translate online documentation like the link here. For now, let's simply look at an example request and break it into its constituent parts:

In [20]:
import requests
term = 'Mexican'
location = 'Astoria NY'
SEARCH_LIMIT = 10

url = 'https://api.yelp.com/v3/businesses/search'

headers = {
        'Authorization': 'Bearer {}'.format(api_key),
    }

url_params = {
                'term': term.replace(' ', '+'),
                'location': location.replace(' ', '+'),
                'limit': SEARCH_LIMIT
            }
response = requests.get(url, headers=headers, params=url_params)
print(response)
print(type(response.text))
print(response.text[:1000])

<Response [200]>
<class 'str'>
{"businesses": [{"id": "20G3V7x_XqKEwcgiB2RIDQ", "alias": "the-taco-spot-nyc-queens", "name": "The Taco Spot NYC", "image_url": "https://s3-media1.fl.yelpcdn.com/bphoto/yb-7najS1zIABdFaN82xXg/o.jpg", "is_closed": false, "url": "https://www.yelp.com/biz/the-taco-spot-nyc-queens?adjust_creative=6ObNOuQQ9GDPgs0E-sIgJA&utm_campaign=yelp_api_v3&utm_medium=api_v3_business_search&utm_source=6ObNOuQQ9GDPgs0E-sIgJA", "review_count": 4, "categories": [{"alias": "newmexican", "title": "New Mexican Cuisine"}, {"alias": "tacos", "title": "Tacos"}], "rating": 5.0, "coordinates": {"latitude": 40.771282, "longitude": -73.9220834}, "transactions": ["pickup", "delivery"], "location": {"address1": "25-17 Astoria Blvd", "address2": "", "address3": null, "city": "Queens", "zip_code": "11102", "country": "US", "state": "NY", "display_address": ["25-17 Astoria Blvd", "Queens, NY 11102"]}, "phone": "+13475271414", "display_phone": "(347) 527-1414", "distance": 496.8180983238797}

## Breaking Down the Request

As you can see, there are three main parts to our request.  
  
They are:
* The URL
* The header
* The parameters
  
The URL is fairly straightforward and is simply the base URL as described in the documentation (again, more details in the upcoming lesson).

The header is a dictionary of key-value pairs. In this case, we are using a fairly standard header used by many APIs. It has a strict form where 'Authorization' is the key and 'Bearer YourApiKey' is the value.

The parameters are the filters that we wish to pass into the query. These will be embedded into the URL when the request is made to the API. Similar to the header, they form key-value pairs. Valid parameter keys by which to structure your queries are described in the API documentation, which we'll look at further shortly. A final note concerns spaces: URLs cannot contain raw spaces, so in a query string they are conventionally encoded as "+". The code above makes that replacement by hand, but when parameters are passed through the `params` argument, `requests` URL-encodes each value itself, so the manual replacement is not required. (Note that the header isn't embedded in the URL, and as such, the space between 'Bearer' and YourApiKey is valid.)
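
To see how query parameters end up in the URL, the standard library's `urllib.parse.urlencode` can serve as a stand-in; `requests` applies comparable encoding to whatever is passed as `params`. This sketch only builds query strings and sends nothing over the network. The variable names are illustrative.

```python
from urllib.parse import urlencode

url = "https://api.yelp.com/v3/businesses/search"

# Values are passed as-is; the encoder turns the space into '+'
plain_params = {"term": "Mexican", "location": "Astoria NY", "limit": 10}
plain_query = urlencode(plain_params)
print(url + "?" + plain_query)

# A '+' typed by hand is treated as a literal plus sign and escaped as %2B
manual_params = {"term": "Mexican", "location": "Astoria+NY", "limit": 10}
manual_query = urlencode(manual_params)
print(url + "?" + manual_query)
```

The first query string contains `location=Astoria+NY`, while the second contains `location=Astoria%2BNY`, which is why leaving the encoding to the library is usually the simpler choice.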


## The Response

As before, our response object has both a status code and the data itself. With that, let's start with a little data exploration!

In [21]:
response.json().keys()

dict_keys(['businesses', 'total', 'region'])
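
Before exploring the payload further, it can be worth confirming that the request succeeded. The sketch below works on a status code and a raw JSON string like the `response.text` printed earlier; `parse_search_body` is a hypothetical helper, and the sample string is a shortened stand-in with made-up values, not real API output.

```python
import json


def parse_search_body(status_code, text):
    """Parse a search response body, raising if the request did not succeed."""
    if status_code != 200:
        raise RuntimeError("request failed with status {}".format(status_code))
    return json.loads(text)  # JSON string -> Python dictionary


# Shortened stand-in for response.text (made-up values, same top-level keys)
sample_text = (
    '{"businesses": [{"name": "Example Taqueria", "is_closed": false}], '
    '"total": 1, "region": {}}'
)
data = parse_search_body(200, sample_text)
print(list(data.keys()))
print(data["businesses"][0]["is_closed"])  # JSON false becomes Python False
```

With a live response, `response.status_code` and `response.text` would be the two arguments; `response.json()` performs the same parsing step.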

Now let's go a bit further and start to preview what's stored in each of the values for these keys.

In [22]:
data = response.json()  # Parse the JSON body once and reuse the result
for key, value in data.items():
    print(key)
    print(type(value))  # What type is it?
    print('\n\n')  # Separate out data

businesses
<class 'list'>



total
<class 'int'>



region
<class 'dict'>





Let's continue to preview these further to get a little better acquainted.

In [23]:
response.json()['businesses'][:2]

[{'id': '20G3V7x_XqKEwcgiB2RIDQ',
  'alias': 'the-taco-spot-nyc-queens',
  'name': 'The Taco Spot NYC',
  'image_url': 'https://s3-media1.fl.yelpcdn.com/bphoto/yb-7najS1zIABdFaN82xXg/o.jpg',
  'is_closed': False,
  'url': 'https://www.yelp.com/biz/the-taco-spot-nyc-queens?adjust_creative=6ObNOuQQ9GDPgs0E-sIgJA&utm_campaign=yelp_api_v3&utm_medium=api_v3_business_search&utm_source=6ObNOuQQ9GDPgs0E-sIgJA',
  'review_count': 4,
  'categories': [{'alias': 'newmexican', 'title': 'New Mexican Cuisine'},
   {'alias': 'tacos', 'title': 'Tacos'}],
  'rating': 5.0,
  'coordinates': {'latitude': 40.771282, 'longitude': -73.9220834},
  'transactions': ['pickup', 'delivery'],
  'location': {'address1': '25-17 Astoria Blvd',
   'address2': '',
   'address3': None,
   'city': 'Queens',
   'zip_code': '11102',
   'country': 'US',
   'state': 'NY',
   'display_address': ['25-17 Astoria Blvd', 'Queens, NY 11102']},
  'phone': '+13475271414',
  'display_phone': '(347) 527-1414',
  'distance': 496.81809832

In [24]:
response.json()['total']

689
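
The total (689) is far larger than the 10 businesses returned, because `limit` caps each response. Many APIs expose an offset-style parameter for stepping through the remaining results; the sketch below assumes a parameter named `offset` and should be checked against the Yelp documentation before use. It only computes parameter dictionaries and makes no requests; `page_params` and `max_results` are hypothetical names.

```python
def page_params(term, location, total, limit=10, max_results=50):
    """Yield one parameter dictionary per page of results.

    Assumes the API accepts 'limit' together with an 'offset' parameter
    (an assumption to verify in the API documentation).
    """
    upper = min(total, max_results)  # cap how many results are requested overall
    for offset in range(0, upper, limit):
        yield {"term": term, "location": location,
               "limit": limit, "offset": offset}


pages = list(page_params("Mexican", "Astoria NY", total=689, limit=10, max_results=30))
for params in pages:
    print(params)
```

Each dictionary could then be passed as `params` in a separate request, ideally with a pause between calls to stay within any rate limits.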

In [25]:
response.json()['region']

{'center': {'longitude': -73.92219543457031, 'latitude': 40.76688875374591}}

As you can see, we're primarily interested in the 'businesses' entry. 

Let's go ahead and create a dataframe from that.

In [26]:
import pandas as pd

df = pd.DataFrame.from_dict(response.json()['businesses'])
print(len(df)) #Print how many rows
print(df.columns) #Print column names
df.head() #Previews the first five rows. 
#You could also write df.head(10) to preview 10 rows or df.tail() to see the bottom

10
Index(['id', 'alias', 'name', 'image_url', 'is_closed', 'url', 'review_count',
       'categories', 'rating', 'coordinates', 'transactions', 'location',
       'phone', 'display_phone', 'distance', 'price'],
      dtype='object')


| | id | alias | name | image_url | is_closed | url | review_count | categories | rating | coordinates | transactions | location | phone | display_phone | distance | price |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 20G3V7x_XqKEwcgiB2RIDQ | the-taco-spot-nyc-queens | The Taco Spot NYC | https://s3-media1.fl.yelpcdn.com/bphoto/yb-7na... | False | https://www.yelp.com/biz/the-taco-spot-nyc-que... | 4 | [{'alias': 'newmexican', 'title': 'New Mexican... | 5.0 | {'latitude': 40.771282, 'longitude': -73.9220834} | [pickup, delivery] | {'address1': '25-17 Astoria Blvd', 'address2':... | 13475271414 | (347) 527-1414 | 496.818098 | |
| 1 | yvva7IYpD6J7OfKlCdQrkw | mi-espiguita-taqueria-astoria | Mi Espiguita Taqueria | https://s3-media2.fl.yelpcdn.com/bphoto/R6xTyU... | False | https://www.yelp.com/biz/mi-espiguita-taqueria... | 115 | [{'alias': 'mexican', 'title': 'Mexican'}] | 4.5 | {'latitude': 40.7612033639422, 'longitude': -7... | [pickup, delivery] | {'address1': '32-44 31st St', 'address2': '', ... | 17187775648 | (718) 777-5648 | 714.30108 | $ |
| 2 | jzVv_21473lAMYXIhVbuTA | de-mole-astoria-astoria | De Mole Astoria | https://s3-media4.fl.yelpcdn.com/bphoto/w56szE... | False | https://www.yelp.com/biz/de-mole-astoria-astor... | 372 | [{'alias': 'mexican', 'title': 'Mexican'}] | 4.0 | {'latitude': 40.7625999, 'longitude': -73.9129... | [pickup, delivery] | {'address1': '4220 30th Ave', 'address2': '', ... | 17187771655 | (718) 777-1655 | 915.607446 | $$ |
| 3 | AUyKmFjpaVLwc3awfUnqgQ | chela-and-garnacha-astoria | Chela & Garnacha | https://s3-media1.fl.yelpcdn.com/bphoto/ChVbA1... | False | https://www.yelp.com/biz/chela-and-garnacha-as... | 380 | [{'alias': 'mexican', 'title': 'Mexican'}, {'a... | 4.5 | {'latitude': 40.7557171543477, 'longitude': -7... | [pickup, delivery] | {'address1': '33-09 36th Ave', 'address2': '',... | 19178326876 | (917) 832-6876 | 1318.326547 | $$ |
| 4 | EHUNrnIgnhwTnpOm3gEESg | tacuba-astoria | Tacuba | https://s3-media2.fl.yelpcdn.com/bphoto/ijC8aZ... | False | https://www.yelp.com/biz/tacuba-astoria?adjust... | 658 | [{'alias': 'mexican', 'title': 'Mexican'}, {'a... | 4.0 | {'latitude': 40.75585, 'longitude': -73.92447} | [pickup, delivery] | {'address1': '35-01 36th St', 'address2': '', ... | 17187862727 | (718) 786-2727 | 1229.351705 | $$ |
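
Several columns in this preview, such as `coordinates` and `location`, still hold nested dictionaries. One possible way to make them easier to filter and sort is to flatten each business record before building the dataframe. This is a sketch using only the standard library; `flatten_business` is a hypothetical helper, and the sample record is abridged from the first result shown earlier.

```python
def flatten_business(business, nested_keys=("coordinates", "location")):
    """Return a copy of `business` with selected nested dicts lifted to the top level.

    A nested entry such as business['coordinates']['latitude'] becomes
    the flat key 'coordinates_latitude'.
    """
    flat = {}
    for key, value in business.items():
        if key in nested_keys and isinstance(value, dict):
            for sub_key, sub_value in value.items():
                flat["{}_{}".format(key, sub_key)] = sub_value
        else:
            flat[key] = value
    return flat


# Abridged version of the first business in the response above
sample = {
    "name": "The Taco Spot NYC",
    "rating": 5.0,
    "coordinates": {"latitude": 40.771282, "longitude": -73.9220834},
    "location": {"city": "Queens", "zip_code": "11102"},
}
flat_sample = flatten_business(sample)
print(flat_sample)
```

The flattened records could then be passed to pandas, for example `pd.DataFrame([flatten_business(b) for b in response.json()['businesses']])`, giving one column per nested field.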


## Summary <a id="sum"></a>

Congratulations! We've covered a lot here! We built on your previous knowledge of HTTP requests and OAuth to work with an enterprise API. Then we made a request and retrieved information in JSON format. We then transformed this data into a dataframe using the pandas package. In the next lab, we'll break down how to read API documentation!