Update docs with EpisodeData info #109

enerrio · 2023-07-09T19:35:03Z

Description

This PR adds a section to the Dataset Standards documentation page describing the EpisodeData data structure and the fields it contains. Also includes a small snippet of sampling episodes from a Minari dataset.

Type of change

This change requires a documentation update

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have run pytest -v and no errors are present.
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I solved any possible warnings that pytest -v has generated that are related to my code to the best of my knowledge.
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

balisujohn

One change, otherwise, looks good!

balisujohn · 2023-07-09T22:12:36Z

docs/content/dataset_standards.md

+| `id`              | `np.int64`   | ID of the episode.                                            |
+| `seed`            | `np.int64`   | Seed used to reset the episode.                               |
+| `total_timesteps` | `np.int64`   | Number of timesteps in the episode.                           |
+| `observations`    | `np.ndarray` | Observations for each timestep including initial observation. |


Observations and actions are not necessary represented as a single np.ndarray; it's worth looking at episodes from the various dummy envs used in test_dataset_creation.py, to get an idea of what is possible for observations and action types.

@balisujohn I updated the data types with the test spaces I saw in the test_dataset_creation.py and common.py files. Let me know what you think.

I found the types tuple, list and dict, and numpy.ndarray from when I added at line 74 of test_dataset_creation.py:

print(type(dataset[0].observations))

It might be fair to say List[str] though if so, I'd confirm that string spaces are the only context where lists occur.

I see, I updated data types. When it comes to list I only saw data type List[str] in datasets created in test_dataset_creation.py but left it as list in case there's some other dataset type I'm missing.

balisujohn · 2023-07-17T12:41:09Z

LGTM :^)

add section to docs describing EpisodeData

8f52280

balisujohn self-requested a review July 9, 2023 22:07

balisujohn requested changes Jul 9, 2023

View reviewed changes

enerrio added 2 commits July 9, 2023 17:03

update data types supported for EpisodeData

0d3f42a

updated available data types for obs/actions

49eed9e

balisujohn approved these changes Jul 17, 2023

View reviewed changes

balisujohn merged commit aecc2a1 into Farama-Foundation:main Jul 17, 2023
12 checks passed

enerrio deleted the update_dataset_doc branch July 17, 2023 17:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update docs with EpisodeData info #109

Update docs with EpisodeData info #109

enerrio commented Jul 9, 2023

balisujohn left a comment

balisujohn Jul 9, 2023 •

edited

enerrio Jul 10, 2023

balisujohn Jul 12, 2023

enerrio Jul 13, 2023 •

edited

balisujohn commented Jul 17, 2023

Update docs with EpisodeData info #109

Update docs with EpisodeData info #109

Conversation

enerrio commented Jul 9, 2023

Description

Type of change

Checklist:

balisujohn left a comment

Choose a reason for hiding this comment

balisujohn Jul 9, 2023 • edited

Choose a reason for hiding this comment

enerrio Jul 10, 2023

Choose a reason for hiding this comment

balisujohn Jul 12, 2023

Choose a reason for hiding this comment

enerrio Jul 13, 2023 • edited

Choose a reason for hiding this comment

balisujohn commented Jul 17, 2023

balisujohn Jul 9, 2023 •

edited

enerrio Jul 13, 2023 •

edited