Dict to Dataframe Converter

This Python package converts JSON data (nested dictionaries) into tabular format, a common task in the daily work of engineers and data scientists. The tabular representation makes the data easier to analyze and manipulate.

The codebase was initially inspired by a solution provided on Stack Overflow and later refined to address various bugs before being published on PyPI. Thanks to my friend Emerson Leão for helping me fix this code during Carnival! 🍻

Install this package using the following command:

pip install dict2dataframe

1. Requirements

The minimum system requirements to run the scripts are:

Processor: 1 Core, 32-Bit, 1.4 GHz
Memory: 512 MB RAM
Storage: 50 MB free space
OS: Linux, Windows, macOS

Before using this project, you must install its requirements by executing the following commands*:

user@host:~$ cd dict2dataframe\
user@host:~$ python -m venv venv
user@host:~$ venv\Scripts\activate
(venv) user@host:~$ pip install -U pip setuptools wheel
(venv) user@host:~$ pip install -r requirements.txt

Run the following command* if you want to generate the .tar.gz source distribution that is uploaded to PyPI:

(venv) user@host:~$ python setup.py sdist

Run the following command* to generate the .whl (wheel) file, which can be uploaded to Databricks or another platform:

(venv) user@host:~$ python setup.py clean --all bdist_wheel

2. Usage

To use this package in your environment, just import the modules you want to use. Available modules are in the dict2dataframe/ directory. Below you can check out some examples of using this package.

Importing our sample data from file:

import json

with open("samples/data.json", mode="rt", encoding="utf-8") as file:
    data = json.load(file)

print(data)

This gives us a dictionary whose `values` key holds a list of nested records.

Let's use this data to exemplify the package's usage.
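
The contents of samples/data.json are not reproduced in this README. For readers who want to follow along without the file, the dictionary below is a hypothetical stand-in inferred from the outputs shown in the following sections. The real file may contain additional keys or a different layout.

```python
# Hypothetical stand-in for samples/data.json, reconstructed from the outputs
# shown in sections 2.1 and 2.2; the real file may differ.
data = {
    "values": [
        {"a": 1, "b": {"x": 10, "y": 20}, "c": 2, "d": [{"z": 30}]},
        {"a": 5, "b": {"x": 15, "y": 25}, "c": 6, "d": [{"z": 35}]},
        {"a": 9, "b": {"x": 20, "y": 30}, "c": 10, "d": [{"z": 40}]},
    ]
}

# Show the first record, which section 2.2 uses as its starting point.
print(data["values"][0])
```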

2.1. Converting a `dict` to a `pandas.DataFrame`

Using the following code, we can convert our dictionary into a table:

from dict2dataframe.core import dict2dataframe

df = dict2dataframe(data['values'])
print(df)

# Output:
#    a   c  b_x  b_y  d_z
# 0  1   2   10   20   30
# 1  5   6   15   25   35
# 2  9  10   20   30   40
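
To give a rough idea of where column names such as b_x and d_z come from, here is a minimal plain-Python sketch of flattening one nested record. It is not the package's implementation: the `flatten` helper, the "_" separator, and the handling of lists are assumptions made for illustration, and the resulting key order may differ from the column order shown above.

```python
def flatten(record, parent_key="", sep="_"):
    """Flatten nested dicts (and lists of dicts) into a single-level dict.

    Hypothetical helper, not part of dict2dataframe.
    """
    flat = {}
    for key, value in record.items():
        new_key = f"{parent_key}{sep}{key}" if parent_key else str(key)
        if isinstance(value, dict):
            # Descend into nested dicts, extending the key prefix.
            flat.update(flatten(value, new_key, sep))
        elif isinstance(value, list) and value and all(isinstance(v, dict) for v in value):
            # Assumption: lists hold a single dict; later items would overwrite
            # earlier ones if they shared keys.
            for item in value:
                flat.update(flatten(item, new_key, sep))
        else:
            flat[new_key] = value
    return flat


record = {"a": 1, "b": {"x": 10, "y": 20}, "c": 2, "d": [{"z": 30}]}
print(flatten(record))
# {'a': 1, 'b_x': 10, 'b_y': 20, 'c': 2, 'd_z': 30}
```

Applying such a function to every record yields a list of flat dictionaries, which is a shape that `pandas.DataFrame` accepts directly.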

2.2. Manipulating `dict` based on nested keys

Getting the handler set up:

from dict2dataframe.handlers import Dict

d = Dict(data=data['values'][0])

Grabbing a value:

keys = ["b", "x"]
value, value_exists = d.get(keys=keys)
print(value)

# Output:
# 10

Updating an existing value:

d.set(keys=keys, value=11)
value, value_exists = d.get(keys=keys)
print(value)

# Output:
# 11

Adding a new value:

keys = ["b", "z"]
d.add(keys=keys, value=12)
value, value_exists = d.get(keys=keys)
print(value)

# Output:
# 12

Removing an existing value:

keys = ["b", "y"]
d.remove(keys=keys)
value, value_exists = d.get(keys=keys)
print(value)

# Output:
# None
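
The example above prints None but never inspects `value_exists`. Presumably the flag is what separates a missing path from a path that stores None; this is an assumption about the package, not something the README confirms. The pattern itself can be sketched in plain Python with a hypothetical `get_nested` helper:

```python
def get_nested(data, keys):
    """Return (value, exists) for the path `keys` inside nested dicts.

    Hypothetical helper, not part of dict2dataframe.
    """
    current = data
    for key in keys:
        # Stop as soon as the path leaves a dict or hits a missing key.
        if not isinstance(current, dict) or key not in current:
            return None, False
        current = current[key]
    return current, True


record = {"a": 1, "b": {"x": 10}, "n": None}
print(get_nested(record, ["b", "x"]))  # (10, True)
print(get_nested(record, ["b", "y"]))  # (None, False): path is missing
print(get_nested(record, ["n"]))       # (None, True): None is actually stored
```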

Peeking at the updated dictionary:

print(d.data)

# Output:
# {'a': 1, 'b': {'x': 11, 'z': 12}, 'c': 2, 'd': [{'z': 30}]}

Listing out the nested keys-values:

print(list(d.items()))

# Output:
# [(['a'], 1), (['b', 'x'], 11), (['b', 'z'], 12), (['c'], 2), (['d'], [{'z': 30}])]
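
Note that in the listing above the walk descends into nested dictionaries but leaves the list under 'd' as a single value. A minimal sketch of such a traversal, assuming that behavior and using a hypothetical `iter_items` generator rather than the package's own code:

```python
def iter_items(data, prefix=None):
    """Yield (key_path, value) pairs, descending into nested dicts only.

    Hypothetical helper, not part of dict2dataframe.
    """
    prefix = prefix or []
    for key, value in data.items():
        path = prefix + [key]
        if isinstance(value, dict):
            # Nested dict: keep walking and extend the key path.
            yield from iter_items(value, path)
        else:
            # Anything else (including lists) is treated as a leaf value.
            yield path, value


record = {"a": 1, "b": {"x": 11, "z": 12}, "c": 2, "d": [{"z": 30}]}
print(list(iter_items(record)))
# [(['a'], 1), (['b', 'x'], 11), (['b', 'z'], 12), (['c'], 2), (['d'], [{'z': 30}])]
```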

2.3. Other features

There are a few more features in this package, but I'm too lazy to describe them. Dive into the handlers module and figure it out yourself.

* Windows OS syntax-based commands.
