Use public API for loading in prototype datasets tests #5212

pmeier · 2022-01-19T10:34:46Z

Follow-up to #5207 (comment)

cc @pmeier @bjuncek

facebook-github-bot · 2022-01-19T10:34:55Z

💊 CI failures summary and remediations

As of commit cbf128b (more details on the Dr. CI page):

✅ None of the CI failures appear to be your fault 💚

2/2 broken upstream at merge base 6512146 since Jan 14

🚧 2 ongoing upstream failures:

These were probably caused by upstream breakages that are not fixed yet.

unittest_windows_gpu_py3.8 since Jan 20 (6512146)
unittest_prototype since Jan 14 (adf8466)

This comment was automatically generated by Dr. CI.

pmeier · 2022-01-19T10:36:02Z

test/builtin_dataset_mocks.py

-    def _parse_mock_data(self, mock_data_fn):
-        def wrapper(info, root, config):
-            mock_infos = mock_data_fn(info, root, config)
+    def _parse_mock_data(self, config, mock_infos):


This is just GitHub not picking up on the indentation change. Previously this was a decorator, but I changed it into a regular method. The actual body has not changed.
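
A minimal sketch of the shape of that change, not the actual torchvision code: `normalize_mock_infos` is a hypothetical stand-in for the unchanged body, and the function names and argument values are placeholders chosen for illustration.

```python
def normalize_mock_infos(config, mock_infos):
    # Hypothetical stand-in for the unchanged body: accept either a bare
    # sample count or a dict and always return a dict.
    if isinstance(mock_infos, int):
        mock_infos = {"num_samples": mock_infos}
    return {"config": config, **mock_infos}


# Before: a decorator that wraps the mock data function and post-processes
# whatever it returns.
def parse_mock_data_decorator(mock_data_fn):
    def wrapper(info, root, config):
        mock_infos = mock_data_fn(info, root, config)
        return normalize_mock_infos(config, mock_infos)

    return wrapper


# After: a regular function that receives the already computed result.
def parse_mock_data(config, mock_infos):
    return normalize_mock_infos(config, mock_infos)


def mock_data_fn(info, root, config):
    # Placeholder mock data function that only reports a sample count.
    return 5 if config == "train" else 3


old = parse_mock_data_decorator(mock_data_fn)(None, None, "train")
new = parse_mock_data("train", mock_data_fn(None, None, "train"))
```

Both call styles produce the same result here; only the point at which the mock data function is invoked moves from inside the wrapper to the caller.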

pmeier · 2022-01-19T10:36:41Z

test/builtin_dataset_mocks.py

    num_samples = 5 if config.split == "train" else 3

-    path = root / f"{config.split}.txt"
+    path = root / f"{config.split}.csv"


This change and everything below it in this file fix actual bugs in our mock data generation that were hidden by our custom loading logic.

Do you know why the tests were passing before despite the files having the wrong name?

Previously, the resources were not collected by the regular logic but were instead "hand-fed" to _make_datapipe. As long as the data in mock_resources[0] corresponded to Dataset.resources(...)[0], the test suite didn't notice.
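
To illustrate why positional hand-feeding hides a wrong file name, here is a rough sketch under stated assumptions. None of these helpers exist in torchvision: `expected_resource_names` stands in for `Dataset.resources(...)`, and the two loader functions are hypothetical simplifications of the old and new test paths.

```python
import tempfile
from pathlib import Path


def expected_resource_names(split):
    # Hypothetical stand-in for Dataset.resources(...): the file names the
    # dataset expects to find on disk.
    return [f"{split}.csv"]


def load_hand_fed(mock_files):
    # Old style: mock files are passed in by position, so their names are
    # never compared with what the dataset expects.
    return [path.read_text() for path in mock_files]


def load_through_public_api(root, split):
    # New style: files are looked up by the names the dataset expects.
    return [(root / name).read_text() for name in expected_resource_names(split)]


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    wrong_name = root / "train.txt"  # the bug: wrong file extension
    wrong_name.write_text("a,b\n")

    # Succeeds even though the file has the wrong name.
    hand_fed = load_hand_fed([wrong_name])

    # Fails because "train.csv" does not exist.
    try:
        load_through_public_api(root, "train")
        name_based_failed = False
    except FileNotFoundError:
        name_based_failed = True
```

Only the name-based lookup surfaces the mismatch, which is consistent with these bugs becoming visible once the tests load through the public API.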


NicolasHug · 2022-01-20T10:13:19Z

test/builtin_dataset_mocks.py

    num_samples = 5 if config.split == "train" else 3

-    path = root / f"{config.split}.txt"
+    path = root / f"{config.split}.csv"


Do you know why the tests were passing before despite the files having the wrong name?


Summary:
* refactor prototype dataset tests to use public API for loading
* add explanation
* use loop alternative

Reviewed By: jdsgomes, prabhat00155

Differential Revision: D33739387

fbshipit-source-id: 1ea4f7e925d6c686fa937d6cab162076ed887bc3

pmeier · 2022-04-06T14:50:15Z

test/builtin_dataset_mocks.py

-        try:
-            self.info.check_dependencies()
-        except ModuleNotFoundError as error:
-            pytest.skip(str(error))


This should not have been removed. The tests should be runnable even if none or only some of the third-party dependencies are installed.
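
The skip-on-missing-dependency pattern from the removed lines can be sketched roughly as follows. This is an assumption-laden illustration, not the test suite's code: `check_dependencies` and `run_or_skip` are hypothetical helpers, and `unittest.SkipTest` from the standard library is swapped in for `pytest.skip` so the sketch runs without pytest installed.

```python
import importlib
import unittest


def check_dependencies(module_names):
    # Hypothetical helper: importing a missing module raises
    # ModuleNotFoundError, which propagates to the caller.
    for name in module_names:
        importlib.import_module(name)


def run_or_skip(module_names, test_fn):
    # Skip the test instead of failing when a dependency is missing.
    try:
        check_dependencies(module_names)
    except ModuleNotFoundError as error:
        # In a pytest suite this would be pytest.skip(str(error)).
        raise unittest.SkipTest(str(error)) from error
    return test_fn()


# A module from the standard library is always importable, so the test runs.
outcome = run_or_skip(["json"], lambda: "ran")

# A module name assumed not to be installed leads to a skip, not a failure.
try:
    run_or_skip(["surely_not_installed_dependency_xyz"], lambda: "ran")
    skipped = False
except unittest.SkipTest:
    skipped = True
```

With a guard like this, a missing third-party package marks the affected test as skipped while the rest of the suite still runs.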

pmeier added 2 commits January 19, 2022 10:57

refactor prototype dataset tests to use public API for loading

97ade0f

Merge branch 'main' into datasets/test-load-thorugh-api

aeabd33

pmeier added the module: tests and prototype labels Jan 19, 2022

pmeier requested a review from NicolasHug January 19, 2022 10:34

pytorch-probot bot added the ciflow/default label Jan 19, 2022

facebook-github-bot added the cla signed label Jan 19, 2022

pmeier commented Jan 19, 2022


Merge branch 'main' into datasets/test-load-thorugh-api

2c111aa

NicolasHug approved these changes Jan 20, 2022


add explanation

e4957de

pmeier commented Jan 20, 2022


pmeier added 2 commits January 20, 2022 16:18

use loop alternative

38aee72

Merge branch 'main' into datasets/test-load-thorugh-api

cbf128b

pmeier merged commit 4d08a67 into pytorch:main Jan 20, 2022

pmeier deleted the datasets/test-load-thorugh-api branch January 20, 2022 15:21

pmeier commented Apr 6, 2022
