CORe50 Dataset + reporthook for download in Utils #340

vlomonaco · 2017-11-23T02:56:10Z

Hi! This PR adds support for the CORe50 dataset, recently published at CoRL 2017.

I've also added a simple report hook function (credit to Shichao) to the Utils.py file that shows the download status, which can be useful for big datasets.

Note: this implementation is not very efficient, since it is similar to the ImageFolder class, but it works well on devices with little RAM. I plan to update the code with a more efficient pre-loading strategy in the near future.

alykhantejani

Hi @vlomonaco,

I haven't had time to check the PR fully yet, but instead of adding a report hook function to utils why not use tqdm (if available)?

i.e.

try:
    from tqdm import tqdm
except ImportError:
    tqdm = lambda x:x

Then you can just wrap the loop in tqdm

You can also add something to the README/docstring about optional packages for torchvision (to enhance usability but not required to use it)

vlomonaco · 2017-12-01T16:33:47Z

Hi @alykhantejani! Thank you for the feedback!

Maybe I misunderstood your suggestion but using tqdm with urllib is not that straightforward. You still have to create a report hook function since there's no external loop in a urllib request (see an example here).

Moreover, with just a 10-line function we can also provide download details to users who don't have tqdm installed. Your call!
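For reference, the two positions can be combined. The following is only a sketch, not code from this PR: `make_tqdm_hook` and `download` are hypothetical names, and the hook assumes nothing about the progress bar beyond a `total` attribute and an `update(n)` method, which tqdm bars provide. It still needs a hook function, as noted above, because `urlretrieve` exposes no loop to wrap.

```python
import urllib.request

try:
    from tqdm import tqdm
except ImportError:  # tqdm is optional
    tqdm = None


def make_tqdm_hook(bar):
    """Return a urlretrieve-compatible report hook that advances `bar`."""
    last_count = [0]  # mutable cell so the inner function can update it

    def hook(count, block_size, total_size):
        if total_size > 0:
            bar.total = total_size
        # urlretrieve passes the cumulative block count, so feed the delta.
        bar.update((count - last_count[0]) * block_size)
        last_count[0] = count

    return hook


def download(url, fpath):
    """Download `url` to `fpath`, with a progress bar when tqdm is installed."""
    if tqdm is None:
        urllib.request.urlretrieve(url, fpath)
        return
    with tqdm(unit='B', unit_scale=True) as bar:
        urllib.request.urlretrieve(url, fpath, reporthook=make_tqdm_hook(bar))
```

Users without tqdm get a silent download here; a plain-text hook could be substituted in that branch.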

vlomonaco · 2019-05-30T07:09:20Z

Will this PR ever get merged? It's been pending for more than a year now...

fmassa

Sorry for the long delay in reviewing!

I've made some comments, let me know what you think.

Also, do you think it would be possible to add some tests for the dataset loading, similar to #976?

fmassa · 2019-05-31T16:55:11Z

torchvision/datasets/core50.py

+        root = self.root
+
+        # Downloading the dataset and filelists
+        for name in (self.img_size, 'filelists'):


we have recently added functionality to extract files in datasets/utils.py, can you use those instead?
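The helpers in datasets/utils.py are not shown in this thread, so the following is a standard-library stand-in that only illustrates the shape such an extraction step might take. `extract_zip` and its parameters are hypothetical names, not the torchvision API.

```python
import os
import zipfile


def extract_zip(archive_path, to_path=None, remove_finished=False):
    """Extract a zip archive, by default next to the archive itself."""
    if to_path is None:
        to_path = os.path.dirname(archive_path)
    with zipfile.ZipFile(archive_path, 'r') as z:
        z.extractall(to_path)
    if remove_finished:
        # Optionally delete the archive once its contents are on disk.
        os.remove(archive_path)
    return to_path
```

In the dataset, the download loop would call a shared helper like this instead of handling zip files inline.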

fmassa · 2019-05-31T17:00:04Z

torchvision/datasets/core50.py

+        'batches_filelists.zip': 'e3297508a8998ba0c99a83d6b36bde62'
+    }
+
+    def __init__(self, root, check_integrity=True, scenario='ni', train=True,


I don't think we need a check_integrity argument in the constructor; the integrity check should run by default.
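A minimal sketch of what an always-on check could look like, assuming the class keeps a mapping from file name to expected MD5 as in the diff above. `file_md5` and `verify_files` are hypothetical helper names; the existing `check_integrity(fpath, md5)` in datasets/utils.py would presumably fill the same role.

```python
import hashlib
import os


def file_md5(fpath, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    md5 = hashlib.md5()
    with open(fpath, 'rb') as f:
        for chunk in iter(lambda: f.read(chunk_size), b''):
            md5.update(chunk)
    return md5.hexdigest()


def verify_files(root, md5_by_filename):
    """Raise RuntimeError if any expected file is missing or has a wrong MD5."""
    for name, expected in md5_by_filename.items():
        fpath = os.path.join(root, name)
        if not os.path.isfile(fpath) or file_md5(fpath) != expected:
            raise RuntimeError('Dataset file missing or corrupted: ' + name)
```

The constructor would then call `verify_files` unconditionally rather than behind a flag.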

fmassa · 2019-05-31T17:02:40Z

torchvision/datasets/utils.py

@@ -18,6 +20,20 @@ def check_integrity(fpath, md5):
    return True


+def reporthook(count, block_size, total_size):
+    global start_time


I don't think start_time is defined anywhere in this file.

Let's remove this function altogether for now, and maybe send another PR adding it if necessary?
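If the hook does come back in a later PR, one way to avoid the undefined global is to capture the start time in a closure. This is a sketch under that assumption: `make_timed_reporthook` is a hypothetical name, and the `stream` and `clock` parameters exist only to make the function testable.

```python
import sys
import time


def make_timed_reporthook(stream=sys.stdout, clock=time.time):
    """Return a urlretrieve-compatible hook that reports progress and speed."""
    start_time = clock()  # captured once, so no module-level global is needed

    def reporthook(count, block_size, total_size):
        elapsed = max(clock() - start_time, 1e-6)  # guard against division by zero
        downloaded = count * block_size
        speed_kb = downloaded / 1024 / elapsed
        if total_size > 0:
            percent = min(100.0, downloaded * 100.0 / total_size)
            msg = '\r{:5.1f}%, {:.0f} KB/s, {:.0f} s elapsed'.format(
                percent, speed_kb, elapsed)
        else:
            # The server did not report a size, so only show the byte count.
            msg = '\r{} bytes, {:.0f} KB/s, {:.0f} s elapsed'.format(
                downloaded, speed_kb, elapsed)
        stream.write(msg)
        stream.flush()

    return reporthook
```

A fresh hook is created per download, e.g. `urlretrieve(url, fpath, reporthook=make_timed_reporthook())`, so consecutive downloads do not share a timer.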

fmassa · 2019-05-31T17:05:00Z

torchvision/datasets/core50.py

+        the test set).
+
+    """
+    ntrain_batch = {


This is a bit confusing, as it's not used (with this name) anywhere in the code, and is referenced as max-batch in the documentation.

If you want to keep this around, I think you should add some asserts in the __init__ checking that batch is within ntrain_batch
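As a rough illustration of such a check: `validate_batch` is a hypothetical helper, zero-based batch indices are an assumption, and the mapping is passed in because its real values are not shown in this thread. It raises `ValueError` rather than using `assert` so the check survives `python -O`.

```python
def validate_batch(scenario, batch, ntrain_batch):
    """Check that `batch` is a valid index for `scenario`.

    `ntrain_batch` maps a scenario name to its number of training batches.
    Batch indices are assumed to be zero-based.
    """
    if scenario not in ntrain_batch:
        raise ValueError('Unknown scenario {!r}; expected one of {}'.format(
            scenario, sorted(ntrain_batch)))
    max_batch = ntrain_batch[scenario]
    if not 0 <= batch < max_batch:
        raise ValueError('batch must be in [0, {}) for scenario {!r}, got {}'.format(
            max_batch, scenario, batch))
```

Calling this at the top of `__init__` would also give the class attribute a visible use, which addresses the naming confusion.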

fmassa · 2019-05-31T17:10:39Z

torchvision/datasets/core50.py

+
+        if train:
+            self.fpath = os.path.join(
+                scenario.upper() + '_' + suffix, 'run' + str(run),


nit: maybe something like

'{}_{}'.format(scenario.upper(), suffix), 'run{}'.format(run), 'train_batch_{:02d}_filelist.txt'.format(batch)
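Wrapped in a function, the suggestion above could look like the sketch below. `train_filelist_path` is a hypothetical name, and the value used for `suffix` in the usage note is made up, since the diff does not show how it is derived.

```python
import os


def train_filelist_path(scenario, suffix, run, batch):
    """Build the relative path of a training file list from its components."""
    return os.path.join(
        '{}_{}'.format(scenario.upper(), suffix),
        'run{}'.format(run),
        'train_batch_{:02d}_filelist.txt'.format(batch),
    )
```

For example, `train_filelist_path('ni', 'inc', 0, 3)` joins the components `NI_inc`, `run0` and `train_batch_03_filelist.txt`; the `{:02d}` specifier handles the zero padding.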

fmassa · 2019-05-31T17:14:40Z

torchvision/datasets/core50.py

+        path = os.path.join(self.root, self.filenames['filelists'][:-4],
+                            self.fpath)
+        with open(path, 'r') as f:
+            for i, line in enumerate(f):


It looks like i is not used. Maybe you could use f.readlines() instead?
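A sketch of the loop without the unused index. The file-list format is not shown in this thread, so the example assumes each line holds a relative image path followed by an integer label, separated by a space; `read_filelist` is a hypothetical name. Iterating over the file object directly behaves like `f.readlines()` here but avoids building the intermediate list.

```python
def read_filelist(path):
    """Parse a file list into (image_path, label) pairs.

    Assumes one '<relative image path> <integer label>' entry per line.
    """
    samples = []
    with open(path, 'r') as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            img_path, label = line.rsplit(' ', 1)
            samples.append((img_path, int(label)))
    return samples
```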

vlomonaco added 6 commits November 22, 2017 20:52

CORe50 Dataset added. Utils imporved with reporthook during download.

8507390

CORe50 Dataset added. Utils imporved with reporthook during download.

49cb92b

CORe50 Dataset added. Utils improved with reporthook during download.

430312b

CORe50 Dataset added. Utils improved with reporthook during download.

a36eaef

CORe50 Dataset added. Utils improved with reporthook during download.

426cf54

CORe50 Dataset added. Utils improved with reporthook during download.

4f874cc

alykhantejani reviewed Dec 1, 2017

fmassa requested changes May 31, 2019

pmeier self-assigned this Apr 8, 2022

rajveerb pushed a commit to rajveerb/vision that referenced this pull request Nov 30, 2023

Update README.md (pytorch#340)

ed40d49

* Update README.md * Update README.md * Update README.md
