# Simple APIs 
## Random User and Fruityvice API Examples


Estimated time needed: **25** minutes

## Objectives

After completing this lab you will be able to:

*   Load and use the RandomUser API, using the `RandomUser()` class from the `randomuser` Python library
*   Load and use the Fruityvice API, using the `requests` Python library

---

The purpose of this notebook is to provide more examples of how to use simple APIs. API stands for **Application Programming Interface**: a software intermediary that allows two applications to talk to each other.

Advantages of using APIs:

- **Automation**
  - Less human effort is required
  - Workflows can be easily updated to become faster and more productive
- **Efficiency**
  - You can reuse the capabilities of an existing API instead of implementing the same functionality from scratch.

The disadvantage of using APIs:
- **Security**. A poorly integrated API is vulnerable to attacks, which can result in data breaches or losses with financial or reputational implications.

One of the applications we will use in this notebook is the Random User Generator. RandomUser is a free, open-source API that provides developers with randomly generated users to be used as placeholders for testing purposes. This makes the tool similar to Lorem Ipsum, except that it generates placeholder people instead of placeholder text. The API can return multiple results and lets you specify which user details to generate, such as gender, email, image, username, address, title, first and last name, and more. More information can be found in the [RandomUser documentation](https://randomuser.me/documentation?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkPY0101ENSkillsNetwork1005-2022-01-01#intro).

Another example of a simple API we will use in this notebook is Fruityvice. The Fruityvice API is a web service that provides data for all kinds of fruit. You can use Fruityvice to find out interesting information about fruit and educate yourself. The web service is completely free to use and contribute to.


## API 01: Random User Generator

### Package Installation

To start using the API, we need to install the `randomuser` library. We can use the shell (OS dependent) or magic (provided by IPython kernel) commands in Jupyter Notebook cells.

Check the environment in use before running the command cells below.

In [1]:
## Upgrade Pip
!python.exe -m pip install --upgrade pip



If we try to run the command above as a magic command instead (`%python.exe -m pip install --upgrade pip`), IPython raises a `UsageError` because `%python.exe` is not a defined magic.

If the current notebook runs in an Anaconda environment, use `conda install`. This may raise `PackagesNotFoundError`, which means that Conda cannot find the package in the active channels (see [Managing channels](https://docs.anaconda.com/navigator/tutorials/manage-channels/#)). Two possible solutions are:

1. Find the right channel that contains the package
2. Switch the kernel to the Python interpreter of the environment you intend to use

In [2]:
## Install package with Conda
## 1) Platform-dependent shell commands
# !conda install randomuser
## 2) Magic commands provided by IPython kernel
# %conda install randomuser

In [3]:
## Install package with Pip
## 1) Platform-dependent shell commands
# !pip install randomuser
## 2) Magic commands provided by IPython kernel
%pip install randomuser

Note: you may need to restart the kernel to use updated packages.


The shell command `!pip install randomuser` will produce the error below if it runs against the wrong environment. To fix this, switch the kernel to the Python interpreter of the environment you intend to use.

![shell-command-error-pip-install.png](../images/shell-command-error-pip-install.png)
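
One way to reduce the risk of installing into the wrong environment is to invoke pip through the interpreter that the kernel itself is running, which `sys.executable` reports. The following is a minimal sketch of that idea; the helper names `pip_install_command` and `install` are illustrative and not part of any library.

```python
import subprocess
import sys


def pip_install_command(package):
    """Build a pip command that targets the interpreter running this kernel."""
    return [sys.executable, "-m", "pip", "install", package]


def install(package):
    """Run pip in a subprocess; raises CalledProcessError if pip fails."""
    subprocess.check_call(pip_install_command(package))


# Inspect the command before deciding to run it
print(pip_install_command("randomuser"))
```

Calling `install("randomuser")` would then install the package into the same environment the notebook kernel uses.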

Make sure that both `randomuser` and `pandas` are installed in the environment.
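
A quick way to check this from inside the notebook is to ask the import system whether it can locate each package, without actually importing it. This is a small sketch using the standard library; the helper name `is_installed` is hypothetical.

```python
import importlib.util


def is_installed(module_name):
    """Return True if the current interpreter can locate the top-level module."""
    return importlib.util.find_spec(module_name) is not None


# Report the status of the two packages this notebook relies on
for name in ("randomuser", "pandas"):
    status = "installed" if is_installed(name) else "missing"
    print(f"{name}: {status}")
```

If either package is reported as missing, install it with one of the commands above and restart the kernel.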

### Brief Introduction

The Random User Generator API is used to generate random user data for application testing. For more information, see [Documentation for the Random User Generator API](https://randomuser.me/documentation).

#### Method Overview

For details on the RandomUser class and optional parameters for these methods, see the [documentation](https://connordelacruz.com/python-randomuser/randomuser.html).

#### Getter Methods

- `get_cell()`
- `get_city()`
- `get_dob()`
- `get_email()`
- `get_first_name()`
- `get_full_name()`
- `get_gender()`
- `get_id()`
- `get_id_number()`
- `get_id_type()`
- `get_info()`
- `get_last_name()`
- `get_login_md5()`
- `get_login_salt()`
- `get_login_sha1()`
- `get_login_sha256()`
- `get_nat()`
- `get_password()`
- `get_phone()`
- `get_picture()`
- `get_postcode()`
- `get_registered()`
- `get_state()`
- `get_street()`
- `get_username()`
- `get_zipcode()`

### Example

We will load the necessary libraries.

In [8]:
from randomuser import RandomUser
import pandas as pd

Create a random user object.

In [9]:
## Generate a single user
user = RandomUser()
print(user)

<randomuser.RandomUser object at 0x000002A677264460>


We can also get a list of random users using `generate_users()`.

In [10]:
## Generate multiple users
users = user.generate_users(10)
print(users)

[<randomuser.RandomUser object at 0x000002A677030B80>, <randomuser.RandomUser object at 0x000002A666347850>, <randomuser.RandomUser object at 0x000002A6663477F0>, <randomuser.RandomUser object at 0x000002A666347C40>, <randomuser.RandomUser object at 0x000002A666347520>, <randomuser.RandomUser object at 0x000002A666347F10>, <randomuser.RandomUser object at 0x000002A666347760>, <randomuser.RandomUser object at 0x000002A677206490>, <randomuser.RandomUser object at 0x000002A67721ABB0>, <randomuser.RandomUser object at 0x000002A677264550>]


The getter methods listed above return the fields needed to construct a dataset. For example, to get the full name, we call `get_full_name()`.

In [11]:
## Get user's full name
f_name = user.get_full_name()
print(f_name)

Fulgêncio da Rosa


Let's say we only need 10 users with full names and their email addresses. We can write a for-loop to print them.

In [13]:
## Get multiple users' info
for user in users:
	print(user.get_full_name(), "\t", user.get_email())

آرمیتا زارعی 	 armyt.zraay@example.com
Sergej Effenberger 	 sergej.effenberger@example.com
Susie Allen 	 susie.allen@example.com
Gabínio da Mata 	 gabinio.damata@example.com
Fredy Kropf 	 fredy.kropf@example.com
Fatma Özdoğan 	 fatma.ozdogan@example.com
Neelam Albayrak 	 neelam.albayrak@example.com
Chloe Moore 	 chloe.moore@example.com
Viljami Heinonen 	 viljami.heinonen@example.com
Jerome Henderson 	 jerome.henderson@example.com


## API 02: Fruityvice

Another, more common way to use APIs is through the `requests` library. We will obtain the [Fruityvice](https://fruityvice.com) API data using the `requests.get()` function. The data is in JSON format.

In [16]:
import requests
import json

In [17]:
## Get a web response
data = requests.get("https://fruityvice.com/api/fruit/all")
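
A network request can fail or return an error status, so it may be worth checking the response before parsing it. The sketch below wraps the call in a hypothetical helper, `fetch_json`, that sets a timeout, raises on 4xx/5xx responses, and decodes the body with the response's own `json()` method. The 10-second timeout is an arbitrary assumption.

```python
import requests


def fetch_json(url, timeout=10):
    """Fetch a URL and return its decoded JSON body.

    Raises requests.HTTPError if the server answers with a 4xx/5xx status.
    """
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()   # stop early on an error status
    return response.json()        # decode the JSON body into Python objects
```

With this helper, `fetch_json("https://fruityvice.com/api/fruit/all")` would return the parsed fruit records in one step.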

We will retrieve the results using the `json.loads()` function.

In [19]:
## Retrieve data in JSON format
results = json.loads(data.text)
print(results)

[{'genus': 'Malus', 'name': 'Apple', 'id': 6, 'family': 'Rosaceae', 'order': 'Rosales', 'nutritions': {'carbohydrates': 11.4, 'protein': 0.3, 'fat': 0.4, 'calories': 52, 'sugar': 10.3}}, {'genus': 'Prunus', 'name': 'Apricot', 'id': 35, 'family': 'Rosaceae', 'order': 'Rosales', 'nutritions': {'carbohydrates': 3.9, 'protein': 0.5, 'fat': 0.1, 'calories': 15, 'sugar': 3.2}}, {'genus': 'Persea', 'name': 'Avocado', 'id': 84, 'family': 'Lauraceae', 'order': 'Laurales', 'nutritions': {'carbohydrates': 8.53, 'protein': 2, 'fat': 14.66, 'calories': 160, 'sugar': 0.66}}, {'genus': 'Musa', 'name': 'Banana', 'id': 1, 'family': 'Musaceae', 'order': 'Zingiberales', 'nutritions': {'carbohydrates': 22, 'protein': 1, 'fat': 0.2, 'calories': 96, 'sugar': 17.2}}, {'genus': 'Rubus', 'name': 'Blackberry', 'id': 64, 'family': 'Rosaceae', 'order': 'Rosales', 'nutritions': {'carbohydrates': 9, 'protein': 1.3, 'fat': 0.4, 'calories': 40, 'sugar': 4.5}}, {'genus': 'Fragaria', 'name': 'Blueberry', 'id': 33, 'fam
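
To see what `json.loads()` produces without calling the API, the sketch below parses a short sample string. The sample is trimmed to two records copied from the output above; the variable names are illustrative.

```python
import json

# Sample response text, trimmed to two records from the output above
sample_text = """
[
  {"genus": "Malus", "name": "Apple", "id": 6, "family": "Rosaceae", "order": "Rosales",
   "nutritions": {"carbohydrates": 11.4, "protein": 0.3, "fat": 0.4, "calories": 52, "sugar": 10.3}},
  {"genus": "Musa", "name": "Banana", "id": 1, "family": "Musaceae", "order": "Zingiberales",
   "nutritions": {"carbohydrates": 22, "protein": 1, "fat": 0.2, "calories": 96, "sugar": 17.2}}
]
"""

fruits = json.loads(sample_text)  # a JSON array becomes a list of dicts

print(type(fruits).__name__, len(fruits))   # list 2
first = fruits[0]
print(first["name"], first["nutritions"]["sugar"])   # Apple 10.3
```

Each record is an ordinary dictionary, and the nested `nutritions` object is itself a dictionary inside it.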

We will convert our JSON data into a pandas DataFrame.

In [22]:
print(results)
df_results = pd.DataFrame(results)
print(df_results)

[{'genus': 'Malus', 'name': 'Apple', 'id': 6, 'family': 'Rosaceae', 'order': 'Rosales', 'nutritions': {'carbohydrates': 11.4, 'protein': 0.3, 'fat': 0.4, 'calories': 52, 'sugar': 10.3}}, {'genus': 'Prunus', 'name': 'Apricot', 'id': 35, 'family': 'Rosaceae', 'order': 'Rosales', 'nutritions': {'carbohydrates': 3.9, 'protein': 0.5, 'fat': 0.1, 'calories': 15, 'sugar': 3.2}}, {'genus': 'Persea', 'name': 'Avocado', 'id': 84, 'family': 'Lauraceae', 'order': 'Laurales', 'nutritions': {'carbohydrates': 8.53, 'protein': 2, 'fat': 14.66, 'calories': 160, 'sugar': 0.66}}, {'genus': 'Musa', 'name': 'Banana', 'id': 1, 'family': 'Musaceae', 'order': 'Zingiberales', 'nutritions': {'carbohydrates': 22, 'protein': 1, 'fat': 0.2, 'calories': 96, 'sugar': 17.2}}, {'genus': 'Rubus', 'name': 'Blackberry', 'id': 64, 'family': 'Rosaceae', 'order': 'Rosales', 'nutritions': {'carbohydrates': 9, 'protein': 1.3, 'fat': 0.4, 'calories': 40, 'sugar': 4.5}}, {'genus': 'Fragaria', 'name': 'Blueberry', 'id': 33, 'fam

The result is in a nested JSON format. The "nutritions" column contains multiple sub-columns so the data needs to be "flattened" or normalized.

In [23]:
df_norm = pd.json_normalize(results)
print(df_norm)

           genus          name  id           family             order  \
0          Malus         Apple   6         Rosaceae           Rosales   
1         Prunus       Apricot  35         Rosaceae           Rosales   
2         Persea       Avocado  84        Lauraceae          Laurales   
3           Musa        Banana   1         Musaceae      Zingiberales   
4          Rubus    Blackberry  64         Rosaceae           Rosales   
5       Fragaria     Blueberry  33         Rosaceae           Rosales   
6         Prunus        Cherry   9         Rosaceae           Rosales   
7      Vaccinium     Cranberry  87        Ericaceae          Ericales   
8   Selenicereus   Dragonfruit  80        Cactaceae    Caryophyllales   
9          Durio        Durian  60        Malvaceae          Malvales   
10    Sellowiana        Feijoa  76        Myrtaceae        Myrtoideae   
11         Ficus           Fig  68         Moraceae           Rosales   
12         Ribes    Gooseberry  69  Grossulariaceae
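
The difference between the two conversions is easier to see on a small sample. The sketch below uses two hand-written records that mimic the Fruityvice structure and compares the columns produced by `pd.DataFrame` and `pd.json_normalize`.

```python
import pandas as pd

# Two hand-written records with the same nested shape as the API data
sample = [
    {"name": "Apple", "family": "Rosaceae",
     "nutritions": {"calories": 52, "sugar": 10.3}},
    {"name": "Banana", "family": "Musaceae",
     "nutritions": {"calories": 96, "sugar": 17.2}},
]

nested = pd.DataFrame(sample)      # "nutritions" stays a single column of dicts
flat = pd.json_normalize(sample)   # nested keys become "nutritions.<key>" columns

print(list(nested.columns))
print(list(flat.columns))
```

After normalization, each nutrition value has its own column, such as `nutritions.calories`, which can be filtered and aggregated like any other column.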

Let's extract some information from this dataframe. Suppose we need to know the family and genus of a cherry.

In [16]:
cherry = df_norm.loc[df_norm["name"] == 'Cherry']
(cherry.iloc[0]['family']), (cherry.iloc[0]['genus'])

('Rosaceae', 'Prunus')
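
The same lookup can be done in one step by passing both the row filter and the wanted columns to `.loc`. The sketch below does this on a small hand-written frame and also handles a fruit that is not present. The helper name `lookup` and the sample data are illustrative.

```python
import pandas as pd

# Small stand-in for the normalized fruit dataframe
fruit_df = pd.DataFrame({
    "name": ["Apple", "Cherry", "Banana"],
    "family": ["Rosaceae", "Rosaceae", "Musaceae"],
    "genus": ["Malus", "Prunus", "Musa"],
})


def lookup(df, fruit_name, columns):
    """Return the requested columns for the first matching row, or None."""
    matches = df.loc[df["name"] == fruit_name, columns]
    if matches.empty:
        return None
    return tuple(matches.iloc[0])


print(lookup(fruit_df, "Cherry", ["family", "genus"]))   # ('Rosaceae', 'Prunus')
print(lookup(fruit_df, "Durian", ["family", "genus"]))   # None
```

Returning `None` for a missing fruit avoids the `IndexError` that `iloc[0]` raises on an empty selection.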

## Exercises

### Exercise 1

Generate photos of 5 random users.

In [25]:
## TODO Unable to do this
for user in users:
    print(user.get_picture())

https://randomuser.me/api/portraits/women/88.jpg
https://randomuser.me/api/portraits/men/56.jpg
https://randomuser.me/api/portraits/women/95.jpg
https://randomuser.me/api/portraits/men/20.jpg
https://randomuser.me/api/portraits/men/78.jpg
https://randomuser.me/api/portraits/women/57.jpg
https://randomuser.me/api/portraits/women/17.jpg
https://randomuser.me/api/portraits/women/43.jpg
https://randomuser.me/api/portraits/men/87.jpg
https://randomuser.me/api/portraits/men/98.jpg


<details><summary>Click here for the solution</summary>

```python
for user in users:
    print(user.get_picture())
```

</details>
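
`get_picture()` returns a URL rather than the image itself, so printing it only shows a link. One possible way to view the photos is to build a small HTML snippet with one `<img>` tag per URL. The sketch below does this with the standard library only; the helper name `build_gallery_html` and the 96-pixel width are assumptions for illustration.

```python
from html import escape


def build_gallery_html(urls, width=96):
    """Return an HTML snippet with one <img> tag per portrait URL."""
    tags = [
        f'<img src="{escape(url, quote=True)}" width="{width}" alt="random user portrait">'
        for url in urls
    ]
    return "\n".join(tags)


# Two URLs copied from the output above
portrait_urls = [
    "https://randomuser.me/api/portraits/women/88.jpg",
    "https://randomuser.me/api/portraits/men/56.jpg",
]
gallery = build_gallery_html(portrait_urls)
print(gallery)
```

In a notebook, wrapping `gallery` in `IPython.display.HTML` and passing it to `display` should render the pictures inline, assuming IPython is available. To limit the result to five users, slice the list first, for example `users[:5]`.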

To generate a table with information about the users, we can write a function that collects all the desired fields, e.g., name, gender, city, etc. The fields will depend on the requirements of the test to be performed. We call the getter methods listed at the beginning of this notebook, then return a pandas DataFrame with the users.

In [29]:
def get_users():
	users = []
	for user in RandomUser.generate_users(10):
		users.append({"Name": user.get_full_name(), "Gender": user.get_gender(), "City": user.get_city(), "State": user.get_state(), "Email": user.get_email(), "Date of Birth": user.get_dob(), "Picture": user.get_picture()})
	return pd.DataFrame(users)

In [30]:
get_users()

| | Name | Gender | City | State | Email | Date of Birth | Picture |
|---|---|---|---|---|---|---|---|
| 0 | Rigo Schmelzer | male | Schwedt/Oder | Mecklenburg-Vorpommern | rigo.schmelzer@example.com | 1982-03-15T01:25:22.420Z | https://randomuser.me/api/portraits/men/60.jpg |
| 1 | Paige Grant | female | Salisbury | Dorset | paige.grant@example.com | 1958-08-21T20:52:22.171Z | https://randomuser.me/api/portraits/women/26.jpg |
| 2 | Bently Ambrose | male | Armstrong | Nova Scotia | bently.ambrose@example.com | 1958-06-09T08:24:26.593Z | https://randomuser.me/api/portraits/men/42.jpg |
| 3 | Matthew Mcdonalid | male | Wexford | Tipperary | matthew.mcdonalid@example.com | 1970-09-08T14:38:29.333Z | https://randomuser.me/api/portraits/men/25.jpg |
| 4 | Marlon Guillaume | male | Niederried bei Interlaken | St. Gallen | marlon.guillaume@example.com | 1990-07-23T02:28:49.520Z | https://randomuser.me/api/portraits/men/12.jpg |
| 5 | Jasper Uthaug | male | Vormsund | Bergen | jasper.uthaug@example.com | 1959-08-21T11:07:24.554Z | https://randomuser.me/api/portraits/men/50.jpg |
| 6 | Ugo Simon | male | Strasbourg | Seine-Maritime | ugo.simon@example.com | 1951-07-17T12:48:11.335Z | https://randomuser.me/api/portraits/men/47.jpg |
| 7 | Jord Wijdeven | male | Puth | Zeeland | jord.wijdeven@example.com | 1971-07-05T03:54:04.963Z | https://randomuser.me/api/portraits/men/33.jpg |
| 8 | Dylan Thompson | male | Havelock | Alberta | dylan.thompson@example.com | 1968-02-03T01:56:23.719Z | https://randomuser.me/api/portraits/men/55.jpg |
| 9 | Romarigo Ferreira | male | São José do Rio Preto | Piauí | romarigo.ferreira@example.com | 1949-04-19T15:34:19.263Z | https://randomuser.me/api/portraits/men/33.jpg |


In [20]:
df_users = pd.DataFrame(get_users())  

Now we have a pandas DataFrame that can be used for whatever testing purposes the tester might have.

### Exercise 2

In this exercise, find out how many calories a banana contains.

In [21]:
# Write your code here


<details><summary>Click here for the solution</summary>

```python
# df_norm is the normalized fruit dataframe created earlier in this notebook
cal_banana = df_norm.loc[df_norm["name"] == 'Banana']
cal_banana.iloc[0]['nutritions.calories']
```

</details>


### Exercise 3

This [page](https://github.com/public-apis/public-apis#animals) contains a list of free public APIs. Choose any API that interests you and use it to load and extract some information, as shown in the examples above.
1. Using the `requests.get("url")` function, load your data.


In [22]:
# Write your code here


<details><summary>Click here for the solution</summary>

```python
data2 = requests.get("https://www.fishwatch.gov/api/species")
```

</details>


2. Retrieve the results using the `json.loads()` function.


In [23]:
# Write your code here


<details><summary>Click here for the solution</summary>

```python
results2 = json.loads(data2.text)
```

</details>


3. Convert the JSON data into a *pandas* DataFrame.


In [24]:
# Write your code here


<details><summary>Click here for the solution</summary>

```python
df3 = pd.DataFrame(results2)
df3
```

</details>


---

Author(s):

- [Svitlana Kramar](https://www.linkedin.com/in/svitlana-kramar)
  - Svitlana is a master's student in Data Science and Analytics at the University of Calgary who enjoys travelling, learning new languages and cultures, and loves spreading her passion for Data Science.

Other Contributor(s):

- N/A