## RCS JSON

# JSON - JavaScript Object Notation
#### Specified and popularized by Douglas Crockford in the early 2000s.

* Goal - Human Readable, Machine Parsable

* Specification: https://www.json.org/

In [3]:
# If this were a string (text starting with "{"), it would be JSON; here it is a Python dict literal
mydata = {
    "firstName": "Jane",
    "lastName": "Doe",
    "hobbies": ["running", "sky diving", "dancing"],
    "age": 43,
    "children": [
        {
            "firstName": "Alice",
            "age": 7
        },
        {
            "firstName": "Bob",
            "age": 13
        }
    ]
}

In [6]:
type(mydata)

dict

In [4]:
mydata

{'age': 43,
 'children': [{'age': 7, 'firstName': 'Alice'},
  {'age': 13, 'firstName': 'Bob'}],
 'firstName': 'Jane',
 'hobbies': ['running', 'sky diving', 'dancing'],
 'lastName': 'Doe'}

The process of encoding JSON is usually called serialization. This term refers to the transformation of data into a series of bytes (hence serial) to be stored or transmitted across a network. You may also hear the term marshaling, but that’s a whole other discussion. Naturally, deserialization is the reciprocal process of decoding data that has been stored or delivered in the JSON standard.

All we’re talking about here is reading and writing. Think of it like this: encoding is for writing data to disk, while decoding is for reading data into memory.
Source: https://realpython.com/python-json/

In [1]:
import json

In [5]:
with open("data_file.json", "w") as write_file:
    json.dump(mydata, write_file)

In [7]:
# use json string in our program
json_string = json.dumps(mydata)
json_string

'{"firstName": "Jane", "lastName": "Doe", "hobbies": ["running", "sky diving", "dancing"], "age": 43, "children": [{"firstName": "Alice", "age": 7}, {"firstName": "Bob", "age": 13}]}'

In [8]:
type(json_string)

str

In [None]:
# In the example above, the JSON text and the Python object look almost the same, but there are some differences

![object](https://www.json.org/object.gif)

![Array](https://www.json.org/array.gif)

![Value](https://www.json.org/value.gif)

Simple Python objects are translated to JSON according to a fairly intuitive conversion.

| Python | JSON |
| --- | --- |
| dict | object |
| list, tuple | array |
| str | string |
| int, float (plus long in Python 2) | number |
| True | true |
| False | false |
| None | null |
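A small round trip makes the table concrete. The sketch below uses a made-up `record` dict (not part of the notebook's data) and shows two things the table implies: `True` and `None` are written as `true` and `null`, and a tuple comes back as a list.

```python
import json

# Hypothetical sample data, chosen to touch several rows of the table
record = {"name": "Jane", "scores": (1, 2.5), "active": True, "spouse": None}

encoded = json.dumps(record)
print(encoded)

decoded = json.loads(encoded)
# The tuple was written as a JSON array, so it is read back as a list
print(type(decoded["scores"]))
# The round trip is therefore not always identical to the original
print(decoded == record)
```

The last line prints `False` because a list never compares equal to a tuple, which is worth remembering when comparing data before and after saving.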

In [9]:
# The first option most people want to change is whitespace.
# The indent keyword argument sets the indentation size for nested structures.
# Compare the output of this cell and the next one, both using mydata defined above:

json.dumps(mydata)


'{"firstName": "Jane", "lastName": "Doe", "hobbies": ["running", "sky diving", "dancing"], "age": 43, "children": [{"firstName": "Alice", "age": 7}, {"firstName": "Bob", "age": 13}]}'

In [10]:
# very useful for visibility!
print(json.dumps(mydata, indent=4))

{
    "firstName": "Jane",
    "lastName": "Doe",
    "hobbies": [
        "running",
        "sky diving",
        "dancing"
    ],
    "age": 43,
    "children": [
        {
            "firstName": "Alice",
            "age": 7
        },
        {
            "firstName": "Bob",
            "age": 13
        }
    ]
}
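Besides `indent`, `json.dumps` accepts a few other formatting arguments. As a minimal sketch on a made-up `person` dict (an assumption, not the notebook's `mydata`), `sort_keys` and `separators` might be used like this:

```python
import json

# Hypothetical small sample, not the notebook's mydata
person = {"lastName": "Doe", "firstName": "Jane", "age": 43}

# sort_keys=True orders keys alphabetically, which keeps diffs stable
pretty = json.dumps(person, indent=2, sort_keys=True)
print(pretty)

# separators=(",", ":") drops the spaces after commas and colons
compact = json.dumps(person, separators=(",", ":"))
print(compact)
```

The compact form is useful when size matters (for example when sending data over a network), while the indented form is meant for people.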


In [11]:
with open("data_file.json", "w") as write_file:
    json.dump(mydata, write_file, indent=4)

In [12]:
with open("data_file.json", "r") as read_file:
    data = json.load(read_file)
data

{'age': 43,
 'children': [{'age': 7, 'firstName': 'Alice'},
  {'age': 13, 'firstName': 'Bob'}],
 'firstName': 'Jane',
 'hobbies': ['running', 'sky diving', 'dancing'],
 'lastName': 'Doe'}

In [13]:
type(data)

dict

Keep in mind that this method can return any of the allowed data types from the conversion table. This is only important if you’re loading in data you haven’t seen before. In most cases, the root object will be a dict or a list.
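To see why the root type matters, here is a short sketch that decodes several hand-written JSON documents (the `samples` list is illustrative only) and prints the Python type each one produces:

```python
import json

# Each string below is a complete, valid JSON document
samples = ['{"a": 1}', '[1, 2, 3]', '"hello"', '42', '3.14', 'true', 'null']

for text in samples:
    value = json.loads(text)
    print(f"{text!r:12} -> {type(value).__name__}")

# Collect the type names so they can be inspected in one place
root_types = [type(json.loads(text)).__name__ for text in samples]
```

When the data comes from an unknown source, checking `isinstance(value, dict)` or `isinstance(value, list)` before indexing avoids surprises.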

If you have received JSON data from another program, or otherwise have a string of JSON-formatted data in Python, you can deserialize it with loads(), which loads from a string:

In [14]:
json_string = """
{
    "researcher": {
        "name": "Ford Prefect",
        "species": "Betelgeusian",
        "relatives": [
            {
                "name": "Zaphod Beeblebrox",
                "species": "Betelgeusian"
            }
        ]
    }
}
"""
data = json.loads(json_string)
data

{'researcher': {'name': 'Ford Prefect',
  'relatives': [{'name': 'Zaphod Beeblebrox', 'species': 'Betelgeusian'}],
  'species': 'Betelgeusian'}}

In [15]:
type(data)

dict

In [17]:
data['researcher']['relatives'][0]['name']

'Zaphod Beeblebrox'

In [18]:
import json
import requests

In [4]:
## Let's get some data from https://jsonplaceholder.typicode.com/

In [19]:
response = requests.get("https://jsonplaceholder.typicode.com/todos")
todos = json.loads(response.text)


The same URL, https://jsonplaceholder.typicode.com/todos, can also be opened in a regular browser.

In [20]:
type(todos)

list

In [27]:
len(todos)

200

In [21]:
todos[:10]

[{'completed': False, 'id': 1, 'title': 'delectus aut autem', 'userId': 1},
 {'completed': False,
  'id': 2,
  'title': 'quis ut nam facilis et officia qui',
  'userId': 1},
 {'completed': False, 'id': 3, 'title': 'fugiat veniam minus', 'userId': 1},
 {'completed': True, 'id': 4, 'title': 'et porro tempora', 'userId': 1},
 {'completed': False,
  'id': 5,
  'title': 'laboriosam mollitia et enim quasi adipisci quia provident illum',
  'userId': 1},
 {'completed': False,
  'id': 6,
  'title': 'qui ullam ratione quibusdam voluptatem quia omnis',
  'userId': 1},
 {'completed': False,
  'id': 7,
  'title': 'illo expedita consequatur quia in',
  'userId': 1},
 {'completed': True,
  'id': 8,
  'title': 'quo adipisci enim quam ut ab',
  'userId': 1},
 {'completed': False,
  'id': 9,
  'title': 'molestiae perspiciatis ipsa',
  'userId': 1},
 {'completed': True,
  'id': 10,
  'title': 'illo est ratione doloremque quia maiores aut',
  'userId': 1}]

In [23]:
myl = [('Valdis', 40), ('Alice',35), ('Bob', 23),('Carol',70)]

In [None]:
# Lambda = anonymous function

In [None]:
def myfun(el):
    return el[1]
# same as myfun = lambda el: el[1]

In [28]:
sorted(myl, key = lambda el: el[1], reverse=True)

[('Carol', 70), ('Valdis', 40), ('Alice', 35), ('Bob', 23)]
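The sort key can be supplied in several equivalent ways. The sketch below compares a named function, a lambda, and `operator.itemgetter` from the standard library; the names `people` and `by_age` are made up for this illustration.

```python
from operator import itemgetter

people = [("Valdis", 40), ("Alice", 35), ("Bob", 23), ("Carol", 70)]

def by_age(el):
    """Return the second item of the tuple, used as the sort key."""
    return el[1]

with_def = sorted(people, key=by_age, reverse=True)
with_lambda = sorted(people, key=lambda el: el[1], reverse=True)
with_itemgetter = sorted(people, key=itemgetter(1), reverse=True)

print(with_def)
# All three approaches give the same ordering
print(with_def == with_lambda == with_itemgetter)
```

A lambda is convenient for a one-off key, while a named function is easier to reuse and to test.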

In [None]:
# Exercise: find the top 3 users with the most completed tasks!

# TIPS
# we need some sort of structure to store these user results before finding out top 3
# at least two good data structure choices here :)
# here the simplest might actually be the best if we consider userId values


In [29]:
users = [ el['userId'] for el in todos]
len(users),users[:15]

(200, [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1])

In [30]:
uniqusers = set(users)
uniqusers

{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}

In [32]:
# dictionary comprehension (a plain loop would work too)
users = { el['userId'] : 0 for el in todos} 

In [33]:
users

{1: 0, 2: 0, 3: 0, 4: 0, 5: 0, 6: 0, 7: 0, 8: 0, 9: 0, 10: 0}

In [35]:
users.keys()

dict_keys([1, 2, 3, 4, 5, 6, 7, 8, 9, 10])

In [None]:
users.values()

In [None]:
#{'completed': True,
# 'id': 8,
#  'title': 'quo adipisci enim quam ut ab',
#  'userId': 1}

In [36]:
# idiomatic: bool is a subclass of int, so False adds 0 and True adds 1
# (compact, though arguably less readable)
for el in todos:
    users[el['userId']] += el['completed']
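Adding a boolean to a counter works because of how Python defines `bool`. A tiny sketch, with a made-up `flags` list standing in for the `completed` field:

```python
# bool is a subclass of int, so True and False take part in arithmetic
print(isinstance(True, int))
print(True + True)

# Hypothetical flags standing in for the 'completed' field
flags = [True, False, True, True]
done = sum(flags)  # counts how many flags are True
print(done)
```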

In [35]:
# equivalent to the loop above and useful in more complicated cases;
# run only one of the two cells, otherwise every task is counted twice
for el in todos:
    if el['completed']:
        users[el['userId']] += 1

In [None]:
# a one-liner is also possible, as is a solution using collections.Counter
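One possible `Counter` version is sketched below. To keep it runnable without a network connection it uses `sample_todos`, a small made-up list with the same keys as the downloaded data; with the real data, `todos` would take its place.

```python
from collections import Counter

# Hypothetical stand-in for the downloaded todos, with the same keys
sample_todos = [
    {"userId": 1, "id": 1, "title": "a", "completed": True},
    {"userId": 1, "id": 2, "title": "b", "completed": False},
    {"userId": 2, "id": 3, "title": "c", "completed": True},
    {"userId": 2, "id": 4, "title": "d", "completed": True},
    {"userId": 3, "id": 5, "title": "e", "completed": True},
    {"userId": 3, "id": 6, "title": "f", "completed": True},
    {"userId": 3, "id": 7, "title": "g", "completed": True},
    {"userId": 4, "id": 8, "title": "h", "completed": False},
]

# Count one entry per completed task, keyed by userId
completed = Counter(el["userId"] for el in sample_todos if el["completed"])

# most_common(n) returns the n highest (key, count) pairs, highest first
top3 = completed.most_common(3)
print(top3)
```

Note that a user with no completed tasks (user 4 here) never enters the `Counter`, unlike the dictionary approach where every user starts at 0.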

In [36]:
users.items()

dict_items([(1, 11), (2, 8), (3, 7), (4, 6), (5, 12), (6, 6), (7, 9), (8, 11), (9, 8), (10, 12)])

In [39]:
list(users.items())

[(1, 11),
 (2, 8),
 (3, 7),
 (4, 6),
 (5, 12),
 (6, 6),
 (7, 9),
 (8, 11),
 (9, 8),
 (10, 12)]

In [37]:
userlist=list(users.items())

In [38]:
type(userlist[0])

tuple

In [40]:
# we pass an anonymous (lambda) function as the sort key
sorted(userlist, key=lambda el: el[1], reverse=True)[:3]

[(5, 12), (10, 12), (1, 11)]

In [46]:
# let's try a simpler way: a plain list indexed by userId

In [48]:
# 11 slots so that userIds 1..10 can be used directly as indexes; slot 0 stays unused
mylist = [0] * 11

In [49]:
for el in todos:
    if el['completed']:
        mylist[el['userId']] += 1

In [50]:
mylist

[0, 11, 8, 7, 6, 12, 6, 9, 11, 8, 12]

In [51]:
mylist.index(max(mylist))

5

In [None]:
# getting the 2nd and 3rd highest this way is harder: index(max(...)) only returns the first maximum
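One way around this, sketched under the assumption that the list index is the userId, is to pair each index with its count using `enumerate` and then sort the pairs or use `heapq.nlargest`. The `counts` list below is copied from the output shown earlier.

```python
import heapq

# Counts copied from the notebook output; index = userId, index 0 unused
counts = [0, 11, 8, 7, 6, 12, 6, 9, 11, 8, 12]

# enumerate pairs each index (userId) with its count
pairs = list(enumerate(counts))

# Option 1: full sort by count, highest first, then take three
top3_sorted = sorted(pairs, key=lambda p: p[1], reverse=True)[:3]

# Option 2: heapq.nlargest picks the three largest without a full sort
top3_heap = heapq.nlargest(3, pairs, key=lambda p: p[1])

print(top3_sorted)
print(top3_heap)
```

For ten users the difference is negligible; `heapq.nlargest` mainly pays off when the list is long and only a few top entries are needed.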