Increase testing granularity for speedup #3242

ddobrinskiy · 2021-03-06T10:48:47Z

Using 9 batches instead of 3:

The current setup of 3 batches means that 20+ notebooks fall into the [5-7] batch, and it takes 9 minutes to test all the notebooks. Example: https://github.com/fastai/fastai/pull/3235/checks?check_run_id=1969103831

With minimal code changes, we can split the notebooks into 9 batches instead of 3, which should speed up testing.

Update: with the new configuration, actual notebook testing takes at most 1m20s (excluding Docker setup, cache download, et cetera).
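To illustrate why finer batches help, here is a minimal sketch in Python. It is not the CI configuration from this PR: the notebook names, the glob patterns, and the helper `split_into_batches` are all hypothetical, chosen only to mimic a numbering scheme where most notebooks share a few leading digits.

```python
from fnmatch import fnmatch


def split_into_batches(names, patterns):
    """Assign each notebook name to the first glob pattern it matches."""
    batches = {p: [] for p in patterns}
    for name in names:
        for p in patterns:
            if fnmatch(name, p):
                batches[p].append(name)
                break
    return batches


# Hypothetical notebook names: most of them cluster under prefixes 50-73.
prefixes = list(range(0, 10)) + list(range(50, 74)) + list(range(90, 93))
names = [f"{i:02d}_nb.ipynb" for i in prefixes]

# Assumed coarse scheme: 3 batches keyed on the first digit only.
first_digit = ("[0-4]", "[5-7]", "[8-9]")
coarse = [f"{a}*.ipynb" for a in first_digit]

# Assumed fine scheme: 9 batches keyed on the first and second digit.
second_digit = ("[0-3]", "[4-6]", "[7-9]")
fine = [f"{a}{b}*.ipynb" for a in first_digit for b in second_digit]

coarse_batches = split_into_batches(names, coarse)
fine_batches = split_into_batches(names, fine)

# The slowest CI job is bounded by the largest batch.
print("largest coarse batch:", max(len(v) for v in coarse_batches.values()))
print("largest fine batch:", max(len(v) for v in fine_batches.values()))
```

Since the batches run in parallel, total wall time tracks the largest batch, so shrinking that batch matters more than the total job count.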

Use 9 batches instead of 3

ddobrinskiy · 2021-03-06T11:21:07Z

Hmmm, not sure what the problem is.

The tests fail for nbs/04_data.external.ipynb, saying that a file is not found; full traceback below.

The CALTECH URL is reachable at the moment: https://s3.amazonaws.com/fast-ai-imageclas/caltech_101.tgz

Reading the notebook, I get the feeling that maybe this cell should not be present anyway?

Error in /__w/fastai/fastai/nbs/04_data.external.ipynb:
An error occurred while executing the following cell:
------------------
url = URLs.CALTECH_101
untar_data(url)
_add_check(url, URLs.path(url))
------------------

---------------------------------------------------------------------------
FileNotFoundError                         Traceback (most recent call last)
/__w/fastai/fastai/fastai/data/external.py in <module>
      1 url = URLs.CALTECH_101
      2 untar_data(url)
----> 3 _add_check(url, URLs.path(url))

/__w/fastai/fastai/fastai/data/external.py in _add_check(url, fname)
      3     "Internal function to update the internal check file with `url` and check on `fname`."
      4     checks = json.load(open(Path(__file__).parent/'checks.txt', 'r'))
----> 5     checks[url] = _check_file(fname)
      6     json.dump(checks, open(Path(__file__).parent/'checks.txt', 'w'), indent=2)

/__w/fastai/fastai/fastai/data/external.py in _check_file(fname)
      7 def _check_file(fname):
      8     "internal function to get the hash of the local file at `fname`."
----> 9     size = os.path.getsize(fname)
     10     with open(fname, "rb") as f: hash_nb = hashlib.md5(f.read(2**20)).hexdigest()
     11     return [size,hash_nb]

/usr/lib/python3.8/genericpath.py in getsize(filename)
     48 def getsize(filename):
     49     """Return the size of a file, reported by os.stat()."""
---> 50     return os.stat(filename).st_size
     51 
     52 

FileNotFoundError: [Errno 2] No such file or directory: '/github/home/.fastai/archive/caltech_101.tgz'

URLs.CALTECH_101 this cell looks like it should not be here
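For reference, the failure mode in the traceback can be reproduced outside the notebook. The sketch below mirrors the size-and-hash logic visible in `_check_file`, but adds an existence guard. The name `check_file_if_present` is hypothetical and not part of fastai, and the temporary files merely stand in for the archive path.

```python
import hashlib
import os
import tempfile


def check_file_if_present(fname):
    """Return [size, md5 of the first 1 MiB], or None if `fname` does not exist."""
    if not os.path.exists(fname):
        # Without this guard, os.path.getsize raises FileNotFoundError.
        return None
    size = os.path.getsize(fname)
    with open(fname, "rb") as f:
        hash_nb = hashlib.md5(f.read(2**20)).hexdigest()
    return [size, hash_nb]


with tempfile.TemporaryDirectory() as tmp:
    # Stand-in for an archive that was downloaded successfully.
    present = os.path.join(tmp, "archive.tgz")
    with open(present, "wb") as f:
        f.write(b"hello")
    present_result = check_file_if_present(present)

    # Stand-in for the missing archive from the traceback.
    missing_result = check_file_if_present(os.path.join(tmp, "missing.tgz"))
```

This only shows where the error comes from; whether the cell should exist at all is a separate question.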

ddobrinskiy · 2021-03-06T11:35:35Z

On further investigation, I can see that the cell in question was added (commit) by @jph00 a couple of months ago.

I don't understand why it did not fail for other PRs.

jph00 · 2021-03-07T21:18:50Z

Thanks!

increase testing granularity

b1db8fb

Use 9 batches instead of 3

ddobrinskiy requested a review from hamelsmu as a code owner March 6, 2021 10:48

fix CI matrix setup

a1ee136

ddobrinskiy changed the title ~~increase testing granularity~~ Increase testing granularity for speedup Mar 6, 2021

ddobrinskiy marked this pull request as draft March 6, 2021 10:57

restart GitHub Actions

edd6725

David Dobrinskiy added 2 commits March 6, 2021 14:24

remove manual check that fails tests

1968932

URLs.CALTECH_101 this cell looks like it should not be here

clean nbs

2cb7fb6

ddobrinskiy mentioned this pull request Mar 6, 2021

Increase testing granularity for speedup (v2) #3243

Closed

ddobrinskiy marked this pull request as ready for review March 6, 2021 11:42

ddobrinskiy requested a review from jph00 as a code owner March 6, 2021 11:42

restart GitHub Actions

897fa90

jph00 merged commit 00d8863 into fastai:master Mar 7, 2021

hamelsmu added the enhancement label May 1, 2021
