
# APIs


This notebook provides further examples of how to use simple APIs. API stands for Application Programming Interface: a software intermediary that allows two applications to talk to each other.

The advantages of using APIs:
 *   **Automation**. Less human effort is required, and workflows can be easily updated to become faster and more
     productive.
 *   **Efficiency**. You can reuse the capabilities of an existing API instead of implementing the same
     functionality from scratch.
 
The disadvantage of using APIs:
 *   **Security**. A poorly integrated API is vulnerable to attacks, which can lead to data breaches or losses with financial or reputational implications.

One of the applications we will use in this notebook is the Random User Generator. RandomUser is a free, open-source API that provides developers with randomly generated users to use as placeholders for testing purposes. This makes the tool similar to Lorem Ipsum, but with placeholder people instead of placeholder text. The API can return multiple results in one request and lets you specify details of the generated users, such as gender, email, image, username, address, title, first and last name, and more. More information can be found in the [RandomUser documentation](https://randomuser.me/documentation#intro).
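As a rough illustration of "multiple results" and "user details", the sketch below builds a request URL for the RandomUser web API using its `results` and `gender` query parameters. The helper name `build_randomuser_url` is hypothetical and only constructs the URL; it does not send a request.

```python
from urllib.parse import urlencode

# Base endpoint of the RandomUser web API.
BASE_URL = "https://randomuser.me/api/"


def build_randomuser_url(results=1, gender=None):
    """Return a RandomUser request URL (hypothetical helper for illustration)."""
    params = {"results": results}
    if gender is not None:
        params["gender"] = gender
    return BASE_URL + "?" + urlencode(params)


print(build_randomuser_url(results=5, gender="female"))
# prints https://randomuser.me/api/?results=5&gender=female
```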

Another simple API we will use in this notebook is Fruityvice. The Fruityvice API is a web service that provides data for all kinds of fruit. You can use Fruityvice to find out interesting information about fruit and educate yourself. The web service is completely free to use and contribute to.


## Example 1: RandomUser API
Below are the Get Methods that we can call to generate user parameters. For more information on the parameters, please visit this [documentation](https://randomuser.me/documentation) page.


## **Get Methods**

- get_cell()
- get_city()
- get_dob()
- get_email()
- get_first_name()
- get_full_name()
- get_gender()
- get_id()
- get_id_number()
- get_id_type()
- get_info()
- get_last_name()
- get_login_md5()
- get_login_salt()
- get_login_sha1()
- get_login_sha256()
- get_nat()
- get_password()
- get_phone()
- get_picture()
- get_postcode()
- get_registered()
- get_state()
- get_street()
- get_username()
- get_zipcode()


To start using the API, you can install the `randomuser` library by running the `pip install` command.


In [1]:
!pip install randomuser
!pip install pandas

Collecting randomuser
  Downloading randomuser-1.6.tar.gz (5.0 kB)
  Preparing metadata (setup.py) ... done
Building wheels for collected packages: randomuser
  Building wheel for randomuser (setup.py) ... done
  Created wheel for randomuser: filename=randomuser-1.6-py3-none-any.whl size=5104 sha256=52ed259394bebdf8880840316e7d724c16312b0f45dfc99a072806f04408f113
  Stored in directory: /home/jupyterlab/.cache/pip/wheels/be/62/c8/71e1b48f4758ea5b78af7595d87178f628cde315a3326610ee
Successfully built randomuser
Installing collected packages: randomuser
Successfully installed randomuser-1.6
Collecting pandas
  Downloading pandas-2.3.3-cp312-cp312-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl.metadata (91 kB)
Collecting numpy>=1.26.0 (from pandas)
  Downloading numpy-2.3.4-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (62 kB)
Collecting tzdata>=2022.7 (from pandas)
  Downloading tzdata-2025.2-py2.py3-none-any.whl.metadata (1.4 kB)
Downloading 

Then, we will load the necessary libraries.


In [2]:
from randomuser import RandomUser
import pandas as pd

First, we will create a random user object, r.


In [4]:
r = RandomUser()
r

<randomuser.RandomUser at 0x7ac5442c0620>

Then, using the `generate_users()` function, we get a list of 10 random users.


In [5]:
some_list = r.generate_users(10)

In [6]:
some_list

[<randomuser.RandomUser at 0x7ac52c5cddc0>,
 <randomuser.RandomUser at 0x7ac52c5cddf0>,
 <randomuser.RandomUser at 0x7ac52c5cde20>,
 <randomuser.RandomUser at 0x7ac52c5cde50>,
 <randomuser.RandomUser at 0x7ac52c5cde80>,
 <randomuser.RandomUser at 0x7ac52c5cdeb0>,
 <randomuser.RandomUser at 0x7ac52c5cdee0>,
 <randomuser.RandomUser at 0x7ac52c5cdf10>,
 <randomuser.RandomUser at 0x7ac52c5cdf40>,
 <randomuser.RandomUser at 0x7ac52c5cdf70>]

The **"Get Methods"** listed at the beginning of this notebook return the values needed to construct a dataset. For example, to get the full name, we call the `get_full_name()` function.


In [8]:
name = r.get_full_name()
print(name)

Hildegard Lecomte
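Because every getter follows the same `get_*()` naming pattern, one possible way to collect several fields at once is to look the methods up by name. The sketch below assumes a user object exposing `get_full_name()` and `get_email()`; `StubUser` and `user_to_record` are hypothetical names, and the stub stands in for a real `RandomUser` object so the example runs offline.

```python
class StubUser:
    """Offline stand-in for a RandomUser object (assumption, illustration only)."""

    def get_full_name(self):
        return "Hildegard Lecomte"

    def get_email(self):
        return "hildegard.lecomte@example.com"  # made-up placeholder value


def user_to_record(user, columns):
    """Build a dict by calling the named getter on `user` for each column."""
    return {label: getattr(user, getter)() for label, getter in columns.items()}


# Map the column label we want to the getter that supplies its value.
columns = {"Name": "get_full_name", "Email": "get_email"}
record = user_to_record(StubUser(), columns)
print(record)
```

Adding a field then only requires one more entry in `columns`.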


Let's say we only need the full names and email addresses of the 10 users. We can write a `for` loop to print them.


In [9]:
for user in some_list:
    print(user.get_full_name(), " ", user.get_email())

Harper Harris   harper.harris@example.com
Rosa Johansen   rosa.johansen@example.com
Pilar Torres   pilar.torres@example.com
Anthony Kennedy   anthony.kennedy@example.com
Hümeyra Elsendoorn   humeyra.elsendoorn@example.com
Friederike Drees   friederike.drees@example.com
Harper Martin   harper.martin@example.com
Abby Jimenez   abby.jimenez@example.com
Colin Cox   colin.cox@example.com
Giulia Marie   giulia.marie@example.com
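If aligned columns are preferred, one option is to pad each name to a common width with an f-string format specifier. This is a small sketch using three name and email pairs copied from the output above, so it runs without calling the API.

```python
# Sample (name, email) pairs copied from the output above.
pairs = [
    ("Harper Harris", "harper.harris@example.com"),
    ("Rosa Johansen", "rosa.johansen@example.com"),
    ("Pilar Torres", "pilar.torres@example.com"),
]

# Pad every name to the width of the longest one so the emails line up.
width = max(len(name) for name, _ in pairs)
lines = [f"{name:<{width}}  {email}" for name, email in pairs]
print("\n".join(lines))
```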


## Exercise 1
In this exercise, retrieve the photos of the 10 random users.


In [11]:
## Write your code here
for user in some_list:
    print(user.get_picture())

https://randomuser.me/api/portraits/women/19.jpg
https://randomuser.me/api/portraits/women/49.jpg
https://randomuser.me/api/portraits/women/60.jpg
https://randomuser.me/api/portraits/men/58.jpg
https://randomuser.me/api/portraits/women/9.jpg
https://randomuser.me/api/portraits/women/39.jpg
https://randomuser.me/api/portraits/women/27.jpg
https://randomuser.me/api/portraits/women/59.jpg
https://randomuser.me/api/portraits/men/55.jpg
https://randomuser.me/api/portraits/women/38.jpg
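The getter returns picture URLs rather than image data. To actually view the portraits, one possible approach is to wrap each URL in an `<img>` tag. The helper `build_gallery` below is a hypothetical sketch that only produces an HTML string from two of the URLs printed above.

```python
from html import escape

# Portrait URLs copied from the exercise output.
picture_urls = [
    "https://randomuser.me/api/portraits/women/19.jpg",
    "https://randomuser.me/api/portraits/men/58.jpg",
]


def build_gallery(urls, size=96):
    """Return an HTML snippet with one <img> tag per URL (hypothetical helper)."""
    tags = [
        f'<img src="{escape(url, quote=True)}" width="{size}" height="{size}">'
        for url in urls
    ]
    return "\n".join(tags)


gallery_html = build_gallery(picture_urls)
print(gallery_html)
```

In a Jupyter notebook, passing `gallery_html` to `IPython.display.HTML` should render the images inline.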


To generate a table with information about the users, we can write a function that collects all desired parameters, for example name, gender, city, and so on. The parameters will depend on the requirements of the test to be performed. We call the Get Methods listed at the beginning of this notebook and then return a pandas DataFrame with the users.


In [12]:
def get_users():
    users = []
    for user in RandomUser.generate_users(10):
        users.append({
            "Name": user.get_full_name(),
            "Gender": user.get_gender(),
            "City": user.get_city(),
            "State": user.get_state(),
            "Email": user.get_email(),
            "DOB": user.get_dob(),
            "Picture": user.get_picture(),
        })

    return pd.DataFrame(users)

In [13]:
get_users()

|   | Name | Gender | City | State | Email | DOB | Picture |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Kadir Tuğluk | male | Hakkâri | Giresun | kadir.tugluk@example.com | 1972-02-02T03:03:54.840Z | https://randomuser.me/api/portraits/men/3.jpg |
| 1 | Yvonne Carpentier | female | Bottighofen | Obwalden | yvonne.carpentier@example.com | 1978-06-24T20:13:04.034Z | https://randomuser.me/api/portraits/women/62.jpg |
| 2 | Roope Tanner | male | Rusko | North Karelia | roope.tanner@example.com | 1986-06-27T05:16:58.173Z | https://randomuser.me/api/portraits/men/22.jpg |
| 3 | Reginald Fletcher | male | Bradford | West Midlands | reginald.fletcher@example.com | 1969-03-11T20:20:21.746Z | https://randomuser.me/api/portraits/men/50.jpg |
| 4 | Ivan Ferreira | male | Uberlândia | Sergipe | ivan.ferreira@example.com | 1989-09-26T03:09:03.080Z | https://randomuser.me/api/portraits/men/40.jpg |
| 5 | Anita Bailey | female | Hobart | Northern Territory | anita.bailey@example.com | 1985-06-20T08:16:42.860Z | https://randomuser.me/api/portraits/women/53.jpg |
| 6 | Conny Ernst | female | Kierspe | Baden-Württemberg | conny.ernst@example.com | 1972-10-25T13:24:12.443Z | https://randomuser.me/api/portraits/women/51.jpg |
| 7 | Avery Anderson | female | Beaumont | New Brunswick | avery.anderson@example.com | 1979-05-20T17:41:51.850Z | https://randomuser.me/api/portraits/women/0.jpg |
| 8 | Harrison Moore | male | Masterton | Southland | harrison.moore@example.com | 1988-09-15T14:46:44.294Z | https://randomuser.me/api/portraits/men/70.jpg |
| 9 | Tim Rey | male | Saint-Étienne | Jura | tim.rey@example.com | 1983-05-20T08:02:05.862Z | https://randomuser.me/api/portraits/men/90.jpg |


In [15]:
df1 = get_users()  # get_users() already returns a DataFrame
df1

|   | Name | Gender | City | State | Email | DOB | Picture |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Malou Jørgensen | female | Aarhus N | Danmark | malou.jorgensen@example.com | 1973-11-16T08:36:11.069Z | https://randomuser.me/api/portraits/women/83.jpg |
| 1 | August Rasmussen | male | Stenderup | Sjælland | august.rasmussen@example.com | 1992-05-31T03:02:21.604Z | https://randomuser.me/api/portraits/men/51.jpg |
| 2 | Nihal Sandalcı | male | Ağrı | Manisa | nihal.sandalci@example.com | 1984-03-15T12:19:50.072Z | https://randomuser.me/api/portraits/men/24.jpg |
| 3 | Mestan Yıldırım | female | Bitlis | Sivas | mestan.yildirim@example.com | 1945-11-08T16:30:58.392Z | https://randomuser.me/api/portraits/women/62.jpg |
| 4 | Elena Salewski | female | Hemau | Baden-Württemberg | elena.salewski@example.com | 1969-03-20T04:59:00.198Z | https://randomuser.me/api/portraits/women/24.jpg |
| 5 | Lucineide Santos | female | Paranaguá | Amazonas | lucineide.santos@example.com | 1993-01-05T16:20:44.237Z | https://randomuser.me/api/portraits/women/17.jpg |
| 6 | Noelle Sagmo | female | Bø | Sør-Trøndelag | noelle.sagmo@example.com | 1959-10-17T16:47:17.332Z | https://randomuser.me/api/portraits/women/45.jpg |
| 7 | Aleksi Wiitala | male | Vantaa | Kainuu | aleksi.wiitala@example.com | 1977-05-16T22:27:00.122Z | https://randomuser.me/api/portraits/men/73.jpg |
| 8 | Ritske Van den Eerenbeemt | male | Maastricht-Airport | Friesland | ritske.vandeneerenbeemt@example.com | 1993-12-09T22:56:13.465Z | https://randomuser.me/api/portraits/men/23.jpg |
| 9 | Oliver Ma | male | Stirling | Newfoundland and Labrador | oliver.ma@example.com | 1982-01-22T17:40:51.418Z | https://randomuser.me/api/portraits/men/14.jpg |


Now we have a *pandas* dataframe that can be used for any testing purposes that the tester might have.
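The `DOB` column holds timestamps as plain strings, so a test that depends on age needs to parse them first. The sketch below does this with the standard library for two values copied from the first table; the helper names and the fixed reference date are assumptions chosen for illustration.

```python
from datetime import date, datetime

# DOB strings in the format shown in the table above.
dob_strings = ["1972-02-02T03:03:54.840Z", "1978-06-24T20:13:04.034Z"]


def parse_dob(text):
    """Parse a timestamp with milliseconds and a trailing 'Z' into a date."""
    return datetime.strptime(text, "%Y-%m-%dT%H:%M:%S.%fZ").date()


def age_on(born, today):
    """Whole years between `born` and `today`."""
    before_birthday = (today.month, today.day) < (born.month, born.day)
    return today.year - born.year - before_birthday


reference_day = date(2025, 1, 1)  # fixed reference date, chosen arbitrarily
ages = [age_on(parse_dob(s), reference_day) for s in dob_strings]
print(ages)  # prints [52, 46]
```

On the DataFrame itself, `pd.to_datetime(df1["DOB"])` would be the usual way to convert the whole column.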


## Example 2: Fruityvice API

Another, more common way to use APIs is through the `requests` library.
We will start by importing all required libraries.


In [16]:
import requests
import json

We will obtain the [fruityvice](https://www.fruityvice.com) API data using the `requests.get("url")` function. The data is in JSON format.


In [21]:
data = requests.get("https://web.archive.org/web/20240929211114/https://fruityvice.com/api/fruit/all")
data.text

'[{"name":"Persimmon","id":52,"family":"Ebenaceae","order":"Rosales","genus":"Diospyros","nutritions":{"calories":81,"fat":0.0,"sugar":18.0,"carbohydrates":18.0,"protein":0.0}},{"name":"Strawberry","id":3,"family":"Rosaceae","order":"Rosales","genus":"Fragaria","nutritions":{"calories":29,"fat":0.4,"sugar":5.4,"carbohydrates":5.5,"protein":0.8}},{"name":"Banana","id":1,"family":"Musaceae","order":"Zingiberales","genus":"Musa","nutritions":{"calories":96,"fat":0.2,"sugar":17.2,"carbohydrates":22.0,"protein":1.0}},{"name":"Tomato","id":5,"family":"Solanaceae","order":"Solanales","genus":"Solanum","nutritions":{"calories":74,"fat":0.2,"sugar":2.6,"carbohydrates":3.9,"protein":0.9}},{"name":"Pear","id":4,"family":"Rosaceae","order":"Rosales","genus":"Pyrus","nutritions":{"calories":57,"fat":0.1,"sugar":10.0,"carbohydrates":15.0,"protein":0.4}},{"name":"Durian","id":60,"family":"Malvaceae","order":"Malvales","genus":"Durio","nutritions":{"calories":147,"fat":5.3,"sugar":6.75,"carbohydrates"
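The request above assumes the server answers quickly and successfully. A slightly more defensive sketch sets a timeout, checks the status code with `raise_for_status()`, and decodes the body with `response.json()`. The `fetch_json` helper and its `get` parameter are hypothetical: `get` exists only so the example can be run with a stub instead of a real network call.

```python
def fetch_json(url, get=None, timeout=10):
    """Fetch `url` and return the decoded JSON body.

    `get` is a hypothetical hook so the function can be tried without network
    access; by default it falls back to `requests.get`.
    """
    if get is None:
        import requests  # imported lazily so the stub path needs no dependency
        get = requests.get
    response = get(url, timeout=timeout)
    response.raise_for_status()  # raises an exception for 4xx/5xx responses
    return response.json()       # decodes the JSON body into Python objects


class StubResponse:
    """Minimal offline stand-in for a response object (assumption)."""

    def raise_for_status(self):
        return None

    def json(self):
        return [{"name": "Banana", "id": 1}]


# The URL is a placeholder; the stub never contacts it.
fruits = fetch_json("https://example.invalid/fruit",
                    get=lambda url, timeout: StubResponse())
print(fruits[0]["name"])
```

Calling `fetch_json(url)` without the `get` argument would use the real `requests.get`.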

We will parse the results using the `json.loads()` function.


In [23]:
results = json.loads(data.text)
print(results)

[{'name': 'Persimmon', 'id': 52, 'family': 'Ebenaceae', 'order': 'Rosales', 'genus': 'Diospyros', 'nutritions': {'calories': 81, 'fat': 0.0, 'sugar': 18.0, 'carbohydrates': 18.0, 'protein': 0.0}}, {'name': 'Strawberry', 'id': 3, 'family': 'Rosaceae', 'order': 'Rosales', 'genus': 'Fragaria', 'nutritions': {'calories': 29, 'fat': 0.4, 'sugar': 5.4, 'carbohydrates': 5.5, 'protein': 0.8}}, {'name': 'Banana', 'id': 1, 'family': 'Musaceae', 'order': 'Zingiberales', 'genus': 'Musa', 'nutritions': {'calories': 96, 'fat': 0.2, 'sugar': 17.2, 'carbohydrates': 22.0, 'protein': 1.0}}, {'name': 'Tomato', 'id': 5, 'family': 'Solanaceae', 'order': 'Solanales', 'genus': 'Solanum', 'nutritions': {'calories': 74, 'fat': 0.2, 'sugar': 2.6, 'carbohydrates': 3.9, 'protein': 0.9}}, {'name': 'Pear', 'id': 4, 'family': 'Rosaceae', 'order': 'Rosales', 'genus': 'Pyrus', 'nutritions': {'calories': 57, 'fat': 0.1, 'sugar': 10.0, 'carbohydrates': 15.0, 'protein': 0.4}}, {'name': 'Durian', 'id': 60, 'family': 'Malv
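To make the conversion concrete, the sketch below applies `json.loads()` to a shortened copy of the response text (the first record only). A JSON array becomes a Python list, and each JSON object becomes a dict that can be indexed by key.

```python
import json

# A shortened copy of the response text shown above (first record only).
sample_text = (
    '[{"name":"Persimmon","id":52,"family":"Ebenaceae","order":"Rosales",'
    '"genus":"Diospyros","nutritions":{"calories":81,"fat":0.0,"sugar":18.0,'
    '"carbohydrates":18.0,"protein":0.0}}]'
)

records = json.loads(sample_text)  # JSON array -> Python list
first = records[0]                 # JSON object -> Python dict
print(type(records).__name__, type(first).__name__)
print(first["name"], first["nutritions"]["calories"])
```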

We will convert our JSON data into a *pandas* DataFrame.


In [24]:
pd.DataFrame(results)

|   | name | id | family | order | genus | nutritions |
| --- | --- | --- | --- | --- | --- | --- |
| 0 | Persimmon | 52 | Ebenaceae | Rosales | Diospyros | {'calories': 81, 'fat': 0.0, 'sugar': 18.0, 'c... |
| 1 | Strawberry | 3 | Rosaceae | Rosales | Fragaria | {'calories': 29, 'fat': 0.4, 'sugar': 5.4, 'ca... |
| 2 | Banana | 1 | Musaceae | Zingiberales | Musa | {'calories': 96, 'fat': 0.2, 'sugar': 17.2, 'c... |
| 3 | Tomato | 5 | Solanaceae | Solanales | Solanum | {'calories': 74, 'fat': 0.2, 'sugar': 2.6, 'ca... |
| 4 | Pear | 4 | Rosaceae | Rosales | Pyrus | {'calories': 57, 'fat': 0.1, 'sugar': 10.0, 'c... |
| 5 | Durian | 60 | Malvaceae | Malvales | Durio | {'calories': 147, 'fat': 5.3, 'sugar': 6.75, '... |
| 6 | Blackberry | 64 | Rosaceae | Rosales | Rubus | {'calories': 40, 'fat': 0.4, 'sugar': 4.5, 'ca... |
| 7 | Lingonberry | 65 | Ericaceae | Ericales | Vaccinium | {'calories': 50, 'fat': 0.34, 'sugar': 5.74, '... |
| 8 | Kiwi | 66 | Actinidiaceae | Struthioniformes | Apteryx | {'calories': 61, 'fat': 0.5, 'sugar': 9.0, 'ca... |
| 9 | Lychee | 67 | Sapindaceae | Sapindales | Litchi | {'calories': 66, 'fat': 0.44, 'sugar': 15.0, '... |


The result is in a nested JSON format. The `nutritions` column contains multiple subcolumns, so the data needs to be "flattened", or normalized.


In [25]:
df2 = pd.json_normalize(results)

In [26]:
df2

|   | name | id | family | order | genus | nutritions.calories | nutritions.fat | nutritions.sugar | nutritions.carbohydrates | nutritions.protein |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Persimmon | 52 | Ebenaceae | Rosales | Diospyros | 81 | 0.0 | 18.0 | 18.0 | 0.0 |
| 1 | Strawberry | 3 | Rosaceae | Rosales | Fragaria | 29 | 0.4 | 5.4 | 5.5 | 0.8 |
| 2 | Banana | 1 | Musaceae | Zingiberales | Musa | 96 | 0.2 | 17.2 | 22.0 | 1.0 |
| 3 | Tomato | 5 | Solanaceae | Solanales | Solanum | 74 | 0.2 | 2.6 | 3.9 | 0.9 |
| 4 | Pear | 4 | Rosaceae | Rosales | Pyrus | 57 | 0.1 | 10.0 | 15.0 | 0.4 |
| 5 | Durian | 60 | Malvaceae | Malvales | Durio | 147 | 5.3 | 6.75 | 27.1 | 1.5 |
| 6 | Blackberry | 64 | Rosaceae | Rosales | Rubus | 40 | 0.4 | 4.5 | 9.0 | 1.3 |
| 7 | Lingonberry | 65 | Ericaceae | Ericales | Vaccinium | 50 | 0.34 | 5.74 | 11.3 | 0.75 |
| 8 | Kiwi | 66 | Actinidiaceae | Struthioniformes | Apteryx | 61 | 0.5 | 9.0 | 15.0 | 1.1 |
| 9 | Lychee | 67 | Sapindaceae | Sapindales | Litchi | 66 | 0.44 | 15.0 | 17.0 | 0.8 |
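To see where column names such as `nutritions.calories` come from, here is a plain-Python sketch of the flattening idea: nested dictionaries are walked recursively and their keys are joined with a dot. This illustrates the concept only and is not how pandas implements `json_normalize`; the `flatten` function is a hypothetical helper.

```python
def flatten(record, parent_key="", sep="."):
    """Flatten nested dicts into a single level with dotted keys."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            # Recurse into the nested dict, carrying the prefix along.
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat


# A trimmed record in the same shape as the API response.
banana = {
    "name": "Banana",
    "id": 1,
    "nutritions": {"calories": 96, "fat": 0.2},
}
print(flatten(banana))
```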


Let's see if we can extract some information from this dataframe. Suppose we need to know the family and genus of a cherry.


In [27]:
cherry = df2.loc[df2["name"] == 'Cherry']
cherry.iloc[0]['family'], cherry.iloc[0]['genus']

('Rosaceae', 'Prunus')
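Note that `.iloc[0]` raises an `IndexError` when the filter matches no rows, for example if the name is misspelled. The sketch below shows the same lookup on a plain list of records, returning `None` when nothing matches. The `find_fruit` helper is hypothetical, and the two sample records reuse values from the tables above.

```python
# A few records in the same shape as the API response.
fruits = [
    {"name": "Banana", "family": "Musaceae", "genus": "Musa"},
    {"name": "Pear", "family": "Rosaceae", "genus": "Pyrus"},
]


def find_fruit(records, name):
    """Return the first record whose name matches (case-insensitive), else None."""
    wanted = name.casefold()
    return next((r for r in records if r["name"].casefold() == wanted), None)


print(find_fruit(fruits, "pear"))
print(find_fruit(fruits, "Dragonfruit"))  # no match, so this prints None
```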

## Exercise 2
In this exercise, find out how many calories a banana contains.


In [36]:
banana = df2.loc[df2["name"] == "Banana"]
print(banana)
calories = banana.iloc[0]['nutritions.calories']
print(calories)

     name  id    family         order genus  nutritions.calories  \
2  Banana   1  Musaceae  Zingiberales  Musa                   96   

   nutritions.fat  nutritions.sugar  nutritions.carbohydrates  \
2             0.2              17.2                      22.0   

   nutritions.protein  
2                 1.0  
96


<details><summary>Click here for the solution</summary>

```python
cal_banana = df2.loc[df2["name"] == 'Banana']
cal_banana.iloc[0]['nutritions.calories']
```

</details>


## Exercise 3

This [page](https://mixedanalytics.com/blog/list-actually-free-open-no-auth-needed-apis/) contains a list of free public APIs for you to practice with. Let us work through the following example.

#### Official Joke API 
This API returns random jokes from a database. The following URL can be used to retrieve 10 random jokes.

https://official-joke-api.appspot.com/jokes/ten

1. Using the `requests.get("url")` function, load the data from the URL.


In [37]:
response = requests.get("https://official-joke-api.appspot.com/jokes/ten")

2. Retrieve the results using the `json.loads()` function.


In [41]:
json_rsp = json.loads(response.text)
print(json_rsp)

[{'type': 'general', 'setup': 'Who did the wizard marry?', 'punchline': 'His ghoul-friend', 'id': 296}, {'type': 'general', 'setup': 'What do you call a fat psychic?', 'punchline': 'A four-chin teller.', 'id': 205}, {'type': 'general', 'setup': 'My older brother always tore the last pages of my comic books, and never told me why.', 'punchline': 'I had to draw my own conclusions.', 'id': 386}, {'type': 'general', 'setup': 'What kind of pants do ghosts wear?', 'punchline': 'Boo jeans.', 'id': 256}, {'type': 'general', 'setup': 'What kind of tree fits in your hand?', 'punchline': 'A palm tree!', 'id': 257}, {'type': 'general', 'setup': 'What do you call a belt made out of watches?', 'punchline': 'A waist of time.', 'id': 4}, {'type': 'general', 'setup': 'What do you call a fashionable lawn statue with an excellent sense of rhythmn?', 'punchline': 'A metro-gnome', 'id': 204}, {'type': 'general', 'setup': 'What do you call two barracuda fish?', 'punchline': 'A Pairacuda!', 'id': 225}, {'typ

3. Convert the JSON data into a *pandas* DataFrame. Drop the type and id columns.


In [44]:
df = pd.DataFrame(json_rsp)
df.drop(columns=["type", "id"], inplace=True)
df

|   | setup | punchline |
| --- | --- | --- |
| 0 | Who did the wizard marry? | His ghoul-friend |
| 1 | What do you call a fat psychic? | A four-chin teller. |
| 2 | My older brother always tore the last pages of... | I had to draw my own conclusions. |
| 3 | What kind of pants do ghosts wear? | Boo jeans. |
| 4 | What kind of tree fits in your hand? | A palm tree! |
| 5 | What do you call a belt made out of watches? | A waist of time. |
| 6 | What do you call a fashionable lawn statue wit... | A metro-gnome |
| 7 | What do you call two barracuda fish? | A Pairacuda! |
| 8 | Why don't oysters give to charity? | Because they're shellfish. |
| 9 | Knock knock. \n Who's there? \n Opportunity. | That is impossible. Opportunity doesn’t come k... |


<details><summary>Click here for the solution</summary>

```python
df3 = pd.DataFrame(json_rsp)  # json_rsp is the parsed response from step 2
df3.drop(columns=["type","id"],inplace=True)
df3
```

</details>
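Dropping columns can also be expressed as keeping only the wanted ones. The sketch below does this on the raw list of dictionaries, using two jokes copied from the response above so that it runs offline.

```python
# Two records copied from the response above.
jokes = [
    {"type": "general", "setup": "Who did the wizard marry?",
     "punchline": "His ghoul-friend", "id": 296},
    {"type": "general", "setup": "What kind of pants do ghosts wear?",
     "punchline": "Boo jeans.", "id": 256},
]

# Keep only the wanted keys instead of dropping the unwanted ones.
keep = ("setup", "punchline")
trimmed = [{key: joke[key] for key in keep} for joke in jokes]
print(trimmed[0])
```

With a DataFrame, selecting `df[["setup", "punchline"]]` gives the same result without modifying `df` in place.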
