
Disable flaky twitter bots dataset loading test. #3439

Merged: 1 commit merged into master from flaky_test on Aug 11, 2023
Conversation

justinxzhao
Collaborator

No description provided.

Comment on lines +18 to +27
# DISABLED: Flaky for tests, probably due to the dataset size.
# # Test loading dataset without 'split' and 'Unnamed: 0' columns in config.
# twitter_bots_config = ludwig.datasets._get_dataset_config("twitter_bots")
# assert isinstance(twitter_bots_config, DatasetConfig)

# twitter_bots_dataset = ludwig.datasets.get_dataset("twitter_bots", cache_dir=tmpdir)
# assert isinstance(twitter_bots_dataset, DatasetLoader)
# df = twitter_bots_dataset.load()
# assert df is not None
# assert len(df.columns) == 22 # Expected number of columns in Twitter bots dataset including split column.
Contributor

Should we skip the test? Or is this fine?
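
For reference, a minimal sketch of what an explicit skip would look like instead of commenting the block out; the test name and import paths below are assumed for illustration, while the calls and assertions mirror the disabled snippet above:

import pytest

import ludwig.datasets
# Assumed import paths; the real test module may pull these in differently.
from ludwig.datasets.dataset_config import DatasetConfig
from ludwig.datasets.loaders.dataset_loader import DatasetLoader


# An explicit skip keeps the flaky check visible in test reports
# instead of hiding it behind commented-out code.
@pytest.mark.skip(reason="Flaky, probably due to the dataset size.")
def test_load_twitter_bots_dataset(tmpdir):  # hypothetical test name
    twitter_bots_config = ludwig.datasets._get_dataset_config("twitter_bots")
    assert isinstance(twitter_bots_config, DatasetConfig)

    twitter_bots_dataset = ludwig.datasets.get_dataset("twitter_bots", cache_dir=tmpdir)
    assert isinstance(twitter_bots_dataset, DatasetLoader)
    df = twitter_bots_dataset.load()
    assert df is not None
    assert len(df.columns) == 22  # Expected column count, including the pre-set split column.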

Collaborator Author

There's an upper half of the test that exercises the basic mechanics of loading a dataset and hasn't been flaky, so I still want to keep the test overall.

The twitter bots block additionally checks that loading works when there's a pre-set split column, which seems incremental. My feeling is that it's not worth keeping it enabled since we already have the first test.

Contributor

Do you happen to remember what the purpose/intention of this test was? I remember we added this twitter bots component in the last year but I can't remember the reason for it. Just want to make sure it's ok to disable!

Going to temporarily approve for now

Collaborator Author

Looks like this test was added in https://github.com/ludwig-ai/ludwig/pull/3285/files by @connor-mccorm to verify loading when there's a pre-set split column. Removing this test seems low risk to me, but it's certainly checking behavior that isn't covered by the first test.

Since the test hasn't failed for a few CI runs now, my guess is that the original test failed because of a momentary Kaggle outage, similar to how our HF tests used to fail whenever there was an HF outage.

I'd be more inclined to merge this PR if we'd seen the test fail multiple times; since it has only failed once so far, this PR could be an overreaction.

@github-actions

Unit Test Results

6 files ±0    6 suites ±0    1h 25m 3s ⏱️ +3m 49s
33 tests ±0:  29 passed ✔️ ±0,  4 skipped 💤 ±0,  0 failed ±0
99 runs ±0:   87 passed ✔️ ±0, 12 skipped 💤 ±0,  0 failed ±0

Results for commit 51d6891. ± Comparison against base commit 91c28f8.

@justinxzhao justinxzhao reopened this Aug 11, 2023
@justinxzhao
Collaborator Author

Seems like this test has failed again in https://github.com/ludwig-ai/ludwig/actions/runs/5836017733/job/15828730546. Re-opening.

@justinxzhao justinxzhao merged commit 7ecdfa5 into master Aug 11, 2023
30 checks passed
@justinxzhao justinxzhao deleted the flaky_test branch August 11, 2023 20:12