## Python API Tutorial: Getting Started with APIs

In this Python API tutorial, we’ll learn how to retrieve data for data science projects. There are millions of APIs online which provide access to data. Websites like Reddit, Twitter, and Facebook all offer certain data through their APIs.

To use an API, you make a request to a remote web server, and retrieve the data you need.

But why use an API instead of a static CSV dataset you can download from the web? APIs are useful in the following cases:

## Advantages of APIs

<b>The data is changing quickly</b>. An example of this is stock price data. It doesn’t really make sense to regenerate a dataset and download it every minute — this will take a lot of bandwidth, and be pretty slow. 

<b>You want a small piece of a much larger set of data</b>. Reddit comments are one example. What if you want to just pull your own comments on Reddit? It doesn’t make much sense to download the entire Reddit database, then filter just your own comments.

<b>There is repeated computation involved</b>. Spotify has an API that can tell you the genre of a piece of music. You could theoretically create your own classifier, and use it to compute music categories, but you’ll never have as much data as Spotify does.

In cases like the ones above, an API is the right solution. In this blog post, we’ll be querying a simple API to retrieve data about the International Space Station (ISS).

## What is an API?

An API, or Application Programming Interface, is a server that you can retrieve data from and send data to using code. APIs are most commonly used to retrieve data, and that will be the focus of this beginner tutorial.

When we want to receive data from an API, we need to make a request. Requests are used all over the web. For instance, when you visited this blog post, your web browser made a request to the Dataquest web server, which responded with the content of this web page. 
<img src="api.PNG">

In order to work with APIs in Python, we need tools that will make those requests. In Python, the most common library for making requests and working with APIs is the requests library. The requests library isn’t part of the standard Python library, so you’ll need to install it to get started.

You can install requests with pip or, if you manage your Python packages with conda, with conda:

#### pip install requests
or 
#### conda install requests

Once you’ve installed the library, you’ll need to import it. Let’s start with that important step: 

In [3]:
import requests

Now that we’ve installed and imported the requests library, let’s start using it.

## Making Our First API Request

There are many different types of requests. The most commonly used one, a GET request, is used to retrieve data. Because we’ll just be working with retrieving data, our focus will be on making ‘get’ requests.

When we make a request, the response from the API comes with a response code which tells us whether our request was successful. Response codes are important because they immediately tell us if something went wrong.

To make a ‘GET’ request, we’ll use the requests.get() function, which requires one argument — the URL we want to make the request to. We’ll start by making a request to an API endpoint that doesn’t exist, so we can see what that response code looks like. 

In [4]:
response = requests.get("http://api.open-notify.org/this-api-doesnt-exist")


The get() function returns a response object. We can use the response.status_code attribute to retrieve the status code for our request. In this case, the status code is 404.


In [5]:
print(response.status_code)

404

The ‘404’ status code might be familiar to you — it’s the status code that a server returns if it can’t find the file we requested. In this case, we asked for this-api-doesnt-exist which (surprise, surprise) didn’t exist!

Let’s learn a little more about common status codes.

## API Status Codes

Status codes are returned with every request that is made to a web server. Status codes indicate information about what happened with a request. Here are some codes that are relevant to GET requests:
 - 200: Everything went okay, and the result has been returned (if any).
 - 301: The server is redirecting you to a different endpoint. This can happen when a company switches domain names, or an endpoint name is changed.
 - 400: The server thinks you made a bad request. This can happen when you don’t send along the right data, among other things.
 - 401: The server thinks you’re not authenticated. Many APIs require login credentials, so this happens when you don’t send the right credentials to access an API.
 - 403: The resource you’re trying to access is forbidden: you don’t have the right permissions to see it.
 - 404: The resource you tried to access wasn’t found on the server.
 - 503: The server is not ready to handle the request.
 
 
You might notice that all of the status codes that begin with a ‘4’ indicate some sort of error. The first digit of a status code indicates its category. This is useful: if your status code starts with a ‘2’ the request was successful, and if it starts with a ‘4’ or ‘5’ there was an error. If you’re interested, you can read more about [status codes here](https://developer.mozilla.org/en-US/docs/Web/HTTP/Status).
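
The first-digit rule described above can be sketched as a small helper. This is only an illustration: the function name `describe_status` and the category labels are our own, not part of the requests library.

```python
def describe_status(status_code):
    """Return a rough category for an HTTP status code based on its first digit."""
    categories = {
        1: "informational",
        2: "success",
        3: "redirection",
        4: "client error",
        5: "server error",
    }
    # Integer division by 100 isolates the first digit (for example, 404 // 100 == 4).
    return categories.get(status_code // 100, "unknown")


# Try it on a few of the codes listed above.
for code in (200, 301, 404, 503):
    print(code, describe_status(code))
```

In a notebook you could call `describe_status(response.status_code)` right after a request to get a quick, readable summary of what happened.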

## API Documentation


To make sure a request succeeds, it’s important to consult the API’s documentation. Documentation can seem scary at first, but the more you use it, the easier it gets. Here we request the http://api.open-notify.org/astros.json endpoint, which returns data about the people currently in space.

In [7]:
url = 'http://api.open-notify.org/astros.json'

In [8]:
response = requests.get(url)
response.status_code

200

In [9]:
print(response.json())

{'people': [{'name': 'Mark Vande Hei', 'craft': 'ISS'}, {'name': 'Oleg Novitskiy', 'craft': 'ISS'}, {'name': 'Pyotr Dubrov', 'craft': 'ISS'}, {'name': 'Thomas Pesquet', 'craft': 'ISS'}, {'name': 'Megan McArthur', 'craft': 'ISS'}, {'name': 'Shane Kimbrough', 'craft': 'ISS'}, {'name': 'Akihiko Hoshide', 'craft': 'ISS'}, {'name': 'Nie Haisheng', 'craft': 'Tiangong'}, {'name': 'Liu Boming', 'craft': 'Tiangong'}, {'name': 'Tang Hongbo', 'craft': 'Tiangong'}], 'number': 10, 'message': 'success'}


## Working with JSON Data in Python


JSON (JavaScript Object Notation) is the language of APIs. JSON is a way to encode data structures that ensures that they are easily readable by machines. JSON is the primary format in which data is passed back and forth to APIs, and most API servers will send their responses in JSON format.

You might have noticed that the JSON output we received from the API looked like it contained Python dictionaries, lists, strings and integers. You can think of JSON as being a combination of these objects represented as strings. Let’s look at a simple example: 

<img src="json.PNG">

Python has great JSON support with the json package. The json package is part of the standard library, so we don’t have to install anything to use it. We can both convert lists and dictionaries to JSON, and convert JSON strings back to lists and dictionaries. In the case of our astronaut data, it is a dictionary encoded to a string in JSON format.

The json library has two main functions:

 - json.dumps() — Takes in a Python object, and converts (dumps) it to a string.
 - json.loads() — Takes a JSON string, and converts (loads) it to a Python object.
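
To make the two functions concrete, here is a minimal round trip. The `astronaut` dictionary is made up purely for illustration and does not come from the API.

```python
import json

# A small, made-up dictionary used only for illustration.
astronaut = {"name": "Example Person", "craft": "ISS", "days_in_space": 42}

# dumps(): Python object -> JSON string
encoded = json.dumps(astronaut)
print(type(encoded))

# loads(): JSON string -> Python object
decoded = json.loads(encoded)
print(decoded["craft"])

# The round trip gives back a dictionary equal to the one we started with.
print(decoded == astronaut)
```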
 
The dumps() function is particularly useful as we can use it to print a formatted string which makes it easier to understand the JSON output, like in the diagram we saw above: 

In [10]:
import json

def jprint(obj):
    # create a formatted string of the Python JSON object
    text = json.dumps(obj, sort_keys=True, indent=4)
    print(text)
    

jprint(response.json())

{
    "message": "success",
    "number": 10,
    "people": [
        {
            "craft": "ISS",
            "name": "Mark Vande Hei"
        },
        {
            "craft": "ISS",
            "name": "Oleg Novitskiy"
        },
        {
            "craft": "ISS",
            "name": "Pyotr Dubrov"
        },
        {
            "craft": "ISS",
            "name": "Thomas Pesquet"
        },
        {
            "craft": "ISS",
            "name": "Megan McArthur"
        },
        {
            "craft": "ISS",
            "name": "Shane Kimbrough"
        },
        {
            "craft": "ISS",
            "name": "Akihiko Hoshide"
        },
        {
            "craft": "Tiangong",
            "name": "Nie Haisheng"
        },
        {
            "craft": "Tiangong",
            "name": "Liu Boming"
        },
        {
            "craft": "Tiangong",
            "name": "Tang Hongbo"
        }
    ]
}


Immediately we can understand the structure of the data more easily: we can see that there are ten people currently in space, with their names stored as dictionaries inside a list.
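
Once the structure is clear, pulling values out is ordinary dictionary and list work. The sketch below copies the response shown above into a variable named `data` (our own name) so that it runs without a network connection; with a live request you would use `response.json()` instead.

```python
from collections import Counter

# A copy of the response shown above, so the example runs offline.
data = {
    "message": "success",
    "number": 10,
    "people": [
        {"name": "Mark Vande Hei", "craft": "ISS"},
        {"name": "Oleg Novitskiy", "craft": "ISS"},
        {"name": "Pyotr Dubrov", "craft": "ISS"},
        {"name": "Thomas Pesquet", "craft": "ISS"},
        {"name": "Megan McArthur", "craft": "ISS"},
        {"name": "Shane Kimbrough", "craft": "ISS"},
        {"name": "Akihiko Hoshide", "craft": "ISS"},
        {"name": "Nie Haisheng", "craft": "Tiangong"},
        {"name": "Liu Boming", "craft": "Tiangong"},
        {"name": "Tang Hongbo", "craft": "Tiangong"},
    ],
}

# Collect every name from the list of dictionaries.
names = [person["name"] for person in data["people"]]

# Count how many people are aboard each craft.
per_craft = Counter(person["craft"] for person in data["people"])

print(names)
print(dict(per_craft))
```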


## Using an API with Query Parameters
The <a href="http://api.open-notify.org/astros.json">http://api.open-notify.org/astros.json</a> endpoint we used earlier does not take any parameters. We just send a GET request and the API sends back data about the people currently in space.

It’s very common, however, to have an API endpoint that requires us to specify parameters. An example of this is the <a href="http://api.open-notify.org/iss-pass.json">http://api.open-notify.org/iss-pass.json</a> endpoint. This endpoint tells us the next times that the International Space Station will pass over a given location on Earth.

If we look at the documentation, it specifies required lat (latitude) and lon (longitude) parameters.

We can supply these by adding an optional keyword argument, params, to our request. We can make a dictionary with these parameters, and then pass it into the requests.get function. Here’s what our dictionary would look like, using coordinates for New York City: 

In [15]:
parameters = {
    "lat": 40.71,
    "lon": -74
}

We can also do the same thing by adding the parameters directly to the URL, like this:

http://api.open-notify.org/astros.json?lat=40.71&lon=-74

It’s almost always preferable to set up the parameters as a dictionary, because requests takes care of details such as properly formatting the query parameters, and we don’t need to worry about inserting the values into the URL string.
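
To get a feel for the formatting that is handled for us, the sketch below builds a query string by hand with the standard library’s urllib.parse.urlencode. It illustrates the idea only and is not how requests works internally; the base URL is a hypothetical placeholder.

```python
from urllib.parse import urlencode

# Hypothetical base URL, used only to show how the pieces fit together.
base_url = "http://api.example.com/endpoint"

parameters = {
    "lat": 40.71,
    "lon": -74
}

# Turn the dictionary into a "key=value&key=value" query string.
query_string = urlencode(parameters)
full_url = f"{base_url}?{query_string}"
print(full_url)

# Values containing spaces or other special characters are escaped as well.
print(urlencode({"city": "New York"}))
```

Passing the dictionary through the params argument saves us from doing this escaping and string concatenation ourselves.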

Let’s make a request using these coordinates and see what response we get. 

In [18]:
res = requests.get('http://api.open-notify.org/astros.json?lat=40.71&lon=-74')
jprint(res.json())

{
    "message": "success",
    "number": 10,
    "people": [
        {
            "craft": "ISS",
            "name": "Mark Vande Hei"
        },
        {
            "craft": "ISS",
            "name": "Oleg Novitskiy"
        },
        {
            "craft": "ISS",
            "name": "Pyotr Dubrov"
        },
        {
            "craft": "ISS",
            "name": "Thomas Pesquet"
        },
        {
            "craft": "ISS",
            "name": "Megan McArthur"
        },
        {
            "craft": "ISS",
            "name": "Shane Kimbrough"
        },
        {
            "craft": "ISS",
            "name": "Akihiko Hoshide"
        },
        {
            "craft": "Tiangong",
            "name": "Nie Haisheng"
        },
        {
            "craft": "Tiangong",
            "name": "Liu Boming"
        },
        {
            "craft": "Tiangong",
            "name": "Tang Hongbo"
        }
    ]
}


## Understanding the Pass Times
According to the documentation, the JSON response from the pass times endpoint has the following structure:

 - A dictionary with three keys
 - The third key, response, contains a list of pass times
 - Each pass time is a dictionary with risetime (pass start time) and duration keys.

Let’s extract the pass times from our JSON object: 
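
A minimal sketch of that extraction, assuming a response with the structure described above. The `pass_data` dictionary is hypothetical: its values are invented, the names of the first two keys are assumptions, and we also assume that risetime is a Unix timestamp in seconds.

```python
from datetime import datetime, timezone

# Hypothetical sample with the structure described above; all values are invented.
pass_data = {
    "message": "success",                                            # assumed key name
    "request": {"latitude": 40.71, "longitude": -74, "passes": 2},   # assumed key name
    "response": [
        {"duration": 330, "risetime": 1600000000},
        {"duration": 629, "risetime": 1600005800},
    ],
}

# The list of pass times lives under the "response" key.
pass_times = pass_data["response"]

# Pull out each field, converting the timestamps (assumed to be Unix seconds) to UTC datetimes.
risetimes = [
    datetime.fromtimestamp(p["risetime"], tz=timezone.utc) for p in pass_times
]
durations = [p["duration"] for p in pass_times]

for rise, duration in zip(risetimes, durations):
    print(rise.isoformat(), duration)
```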

If you want to learn about more advanced API usage, see this [advanced API tutorial](https://www.dataquest.io/blog/last-fm-api-python/).