
Update Dataloader with default parameter device #65402

Closed

Conversation

jeejakp12
Contributor

pin_memory has an optional device parameter that specifies
which device to pin memory for. Without this change, the
DataLoader works only with the CUDA backend. To support
other backends that provide pinned memory, the DataLoader
is updated with device as an optional parameter.

Fixes #{issue number}
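
As a rough illustration of the intended usage (a minimal sketch, assuming the parameter ends up named pin_memory_device as settled later in this review; the "xpu" backend string is purely illustrative):

import torch
from torch.utils.data import DataLoader, TensorDataset

dataset = TensorDataset(torch.randn(100, 8))

# Default behaviour: pinning targets the CUDA backend.
cuda_loader = DataLoader(dataset, batch_size=16, pin_memory=True)

# With the new optional parameter, a non-CUDA backend that supports
# pinned memory can be selected explicitly.
xpu_loader = DataLoader(dataset, batch_size=16, pin_memory=True,
                        pin_memory_device="xpu")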

@facebook-github-bot
Contributor

Hi @jeejakp12!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g. your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

@facebook-github-bot
Contributor

facebook-github-bot commented Sep 21, 2021


💊 CI failures summary and remediations

As of commit c955627 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.


@soulitzer soulitzer added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label Sep 22, 2021
@wconstab wconstab self-requested a review September 23, 2021 14:32
@@ -45,18 +45,30 @@ def _pin_memory_loop(in_queue, out_queue, device_id, done_event):
del r # save memory


-def pin_memory(data):
+def pin_memory(data, device=""):
Contributor

is device="" an idiom that we use in other places? or is device=None better?

Contributor

should be device=None, as it's more Pythonic

if (len(device) == 0):
return {k: pin_memory(sample) for k, sample in data.items()}
else:
return {k: pin_memory(sample, device) for k, sample in data.items()}
Contributor

@VitalyFedyunin is it preferable to check device == None here and omit device from the call to pin_memory? Or should we push the device=None check into pin_memory and always pass the device parameter (and simplify this code)?

Contributor

You can always pass device=device in recursive calls, without additional checks.

Contributor Author

will fix it.
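
A minimal sketch of the suggested simplification, assuming device=None as the default (other container branches are omitted, and Tensor.pin_memory accepting an optional device argument is assumed here):

import torch

def pin_memory(data, device=None):
    # Pass device through unconditionally; device=None keeps the
    # default (CUDA) behaviour, so no extra checks are needed at
    # each call site.
    if isinstance(data, torch.Tensor):
        return data.pin_memory(device)
    elif isinstance(data, dict):
        return {k: pin_memory(sample, device) for k, sample in data.items()}
    elif hasattr(data, "pin_memory"):
        return data.pin_memory()
    return data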

@@ -45,18 +45,30 @@ def _pin_memory_loop(in_queue, out_queue, device_id, done_event):
del r # save memory


-def pin_memory(data):
+def pin_memory(data, device=""):
Contributor

should be device=None, as it's more Pythonic

 elif hasattr(data, "pin_memory"):
-    return data.pin_memory()
+        return data.pin_memory()
Contributor

unnecessary indent

Contributor Author

will fix it

 if isinstance(data, torch.Tensor):
-    return data.pin_memory()
+    if (len(device) == 0):
Contributor

Should be device is None

Contributor Author

will fix it

@@ -154,6 +156,7 @@ class DataLoader(Generic[T_co]):
 pin_memory: bool
 drop_last: bool
 timeout: float
+device: str
Contributor

As device here applies to pin_memory only, please call it pin_memory_device

Contributor Author

will fix it.
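
A toy sketch of the rename (the real DataLoader takes many more parameters, omitted here):

from typing import Generic, TypeVar

T_co = TypeVar("T_co", covariant=True)

class DataLoader(Generic[T_co]):
    # The device applies only to memory pinning, hence the narrower
    # name pin_memory_device rather than a general device attribute.
    pin_memory: bool
    pin_memory_device: str

    def __init__(self, dataset, pin_memory: bool = False,
                 pin_memory_device: str = "") -> None:
        self.dataset = dataset
        self.pin_memory = pin_memory
        self.pin_memory_device = pin_memory_device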

@@ -491,7 +496,13 @@ def __init__(self, loader: DataLoader) -> None:
 self._index_sampler = loader._index_sampler
 self._num_workers = loader.num_workers
 self._prefetch_factor = loader.prefetch_factor
-self._pin_memory = loader.pin_memory and torch.cuda.is_available()
+# for CUDA, behaviour is default. for other backends
Contributor

Add an exception when pin_memory_device is specified but pin_memory is false

Contributor Author

Added a warning message for the case where pin_memory_device is set and the pin_memory flag is set to false, as this is a valid case.
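
A minimal sketch of that check, written as a hypothetical standalone helper (the actual logic lives inline in the iterator's __init__, and the exact warning text may differ):

import warnings

def resolve_pin_memory(pin_memory, pin_memory_device, cuda_available):
    # Warn rather than raise when a pin device is given but pinning
    # is disabled, because that combination is considered valid.
    if pin_memory_device and not pin_memory:
        warnings.warn("pin_memory_device is set but pin_memory is "
                      "False, so the device setting will have no effect")
    if not pin_memory_device:
        # Default behaviour: pin only when CUDA is available.
        return pin_memory and cuda_available
    return pin_memory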

@VitalyFedyunin
Contributor

Please add tests
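
For example, a test along these lines might cover it (illustrative only; "cuda" stands in for whichever pinned-memory backend is under test):

import unittest
import torch
from torch.utils.data import DataLoader, TensorDataset

class TestPinMemoryDevice(unittest.TestCase):
    @unittest.skipUnless(torch.cuda.is_available(), "CUDA required")
    def test_pin_memory_with_device(self):
        loader = DataLoader(TensorDataset(torch.randn(8, 4)),
                            batch_size=2, pin_memory=True,
                            pin_memory_device="cuda")
        # Each batch should come back in pinned (page-locked) memory.
        for (batch,) in loader:
            self.assertTrue(batch.is_pinned())

if __name__ == "__main__":
    unittest.main()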

@facebook-github-bot
Contributor

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Facebook open source project. Thanks!


@jeejakp12 jeejakp12 force-pushed the origin/jeeja_dataloader_change branch 2 times, most recently from fd55645 to b5488ce Compare October 4, 2021 04:52
@pytorch-probot

pytorch-probot bot commented Oct 4, 2021

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/jeejakp12/pytorch/blob/a1de099fdbe1d7679f429c3a3636a33e3991af52/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default

Workflows Labels (bold enabled) Status
Triggered Workflows
linux-bionic-py3.6-clang9 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/noarch, ciflow/xla ✅ triggered
linux-docs ciflow/all, ciflow/cpu, ciflow/default, ciflow/docs, ciflow/linux ✅ triggered
linux-vulkan-bionic-py3.6-clang9 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/vulkan ✅ triggered
linux-xenial-cuda11.3-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/default, ciflow/linux ✅ triggered
linux-xenial-py3-clang5-mobile-build ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile ✅ triggered
linux-xenial-py3-clang5-mobile-custom-build-static ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile ✅ triggered
linux-xenial-py3.6-clang7-asan ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/sanitizers ✅ triggered
linux-xenial-py3.6-clang7-onnx ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/onnx ✅ triggered
linux-xenial-py3.6-gcc5.4 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
linux-xenial-py3.6-gcc7 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
linux-xenial-py3.6-gcc7-bazel-test ciflow/all, ciflow/bazel, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single-full-jit ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
win-vs2019-cpu-py3 ciflow/all, ciflow/cpu, ciflow/default, ciflow/win ✅ triggered
win-vs2019-cuda11.3-py3 ciflow/all, ciflow/cuda, ciflow/default, ciflow/win ✅ triggered
Skipped Workflows
caffe2-linux-xenial-py3.6-gcc5.4 ciflow/all, ciflow/cpu, ciflow/linux 🚫 skipped
docker-builds ciflow/all 🚫 skipped
ios-12-5-1-arm64 ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-coreml ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-custom-ops ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-full-jit ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-metal ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-x86-64 ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-x86-64-coreml ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-x86-64-full-jit ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
libtorch-linux-xenial-cuda10.2-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux 🚫 skipped
libtorch-linux-xenial-cuda11.3-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux 🚫 skipped
linux-bionic-cuda10.2-py3.9-gcc7 ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow 🚫 skipped
linux-docs-push ciflow/all, ciflow/cpu, ciflow/linux, ciflow/scheduled 🚫 skipped
macos-10-15-py3-arm64 ciflow/all, ciflow/macos 🚫 skipped
macos-10-15-py3-lite-interpreter-x86-64 ciflow/all, ciflow/macos 🚫 skipped
macos-11-py3-x86-64 ciflow/all, ciflow/macos 🚫 skipped
parallelnative-linux-xenial-py3.6-gcc5.4 ciflow/all, ciflow/cpu, ciflow/linux 🚫 skipped
periodic-libtorch-linux-bionic-cuda11.5-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-libtorch-linux-xenial-cuda11.1-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-linux-bionic-cuda11.5-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled, ciflow/slow, ciflow/slow-gradcheck 🚫 skipped
periodic-linux-xenial-cuda11.1-py3.6-gcc7-debug ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-win-vs2019-cuda11.1-py3 ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win 🚫 skipped
periodic-win-vs2019-cuda11.5-py3 ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win 🚫 skipped

You can add a comment to the PR and tag @pytorchbot with the following commands:
# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and trigger the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow

For more information, please take a look at the CI Flow Wiki.

@jeejakp12
Contributor Author

@VitalyFedyunin @wconstab I have addressed the review comments.

@wconstab
Contributor

wconstab commented Oct 5, 2021

Looks good to me, but I'll let @VitalyFedyunin stamp.

@jeejakp12
Contributor Author

@VitalyFedyunin I have addressed the review comments; can you please check whether the reworked patch is fine?

@jeejakp12 jeejakp12 force-pushed the origin/jeeja_dataloader_change branch from b5488ce to 7488722 Compare October 26, 2021 11:08
@facebook-github-bot
Contributor

@VitalyFedyunin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@VitalyFedyunin
Contributor

Sorry it is taking so much time to land; infra system issues. Can you please rebase (to fix merge conflicts)?

@jeejakp12 jeejakp12 force-pushed the origin/jeeja_dataloader_change branch from 7488722 to a1de099 Compare December 14, 2021 05:27
@jeejakp12
Contributor Author

@VitalyFedyunin I have rebased and pushed the patch. Thanks

@jeejakp12
Contributor Author

@wconstab @VitalyFedyunin can this patch be merged? It is already approved.

@VitalyFedyunin
Contributor

Sure, can you please rebase it once again to avoid merge conflicts?

pin_memory has an optional device parameter that specifies
which device to pin memory for. Without this change, the
DataLoader works only with the CUDA backend. To support
other backends that provide pinned memory, the DataLoader
is updated with device as an optional parameter.

Signed-off-by: Jeeja <jeejakp@habana.ai>
@jeejakp12 jeejakp12 force-pushed the origin/jeeja_dataloader_change branch from a1de099 to c955627 Compare March 24, 2022 03:49
@jeejakp12
Contributor Author

@VitalyFedyunin rebased and updated the patch.

@jeejakp12
Contributor Author

@VitalyFedyunin can this patch be merged? I have already rebased it.

@facebook-github-bot
Contributor

@VitalyFedyunin has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot pushed a commit that referenced this pull request Apr 21, 2022
Summary:
pin_memory has an optional device parameter that specifies
which device to pin memory for. Without this change, the
DataLoader works only with the CUDA backend. To support
other backends that provide pinned memory, the DataLoader
is updated with device as an optional parameter.

Fixes #{issue number}

Pull Request resolved: #65402

Reviewed By: zou3519

Differential Revision: D32282204

Pulled By: VitalyFedyunin

fbshipit-source-id: e2e09876969af108d0db38af7c2d1b2f1cfa9858
@github-actions

Hey @jeejakp12.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

@VitalyFedyunin VitalyFedyunin added topic: improvements topic category release notes: dataloader release notes category labels Apr 27, 2022
Labels
cla signed open source release notes: dataloader release notes category topic: improvements topic category triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

7 participants