
add 1st pytorch example #1180

Open
wants to merge 7 commits into base: main

Conversation

@radekosmulski (Contributor) commented Jul 5, 2023

This adds the first PyTorch example 🥳

Additionally, I propose that in DLRMModel we rename `dim` to `embedding_dim`. This aligns us with the TF API and, which is probably more important, is more informative to the reader: it is the language used in the paper, and otherwise it is unclear what the value refers to (the output of the model? of the MLP hidden layers?). `embedding_dim` removes that ambiguity.
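To illustrate the proposed rename (a hypothetical signature for discussion only, not the actual Merlin API), the keyword change would look roughly like this:

```python
# Hypothetical sketch of the proposed keyword rename in DLRMModel.
# The class and signature here are illustrative, not Merlin's real API.
class DLRMModel:
    def __init__(self, *, embedding_dim, bottom_block=None):
        # `embedding_dim` (formerly `dim`) is the size of each feature
        # embedding, matching the TF API and the DLRM paper's wording.
        self.embedding_dim = embedding_dim
        self.bottom_block = bottom_block

# Before the rename a caller would write DLRMModel(dim=64); after:
model = DLRMModel(embedding_dim=64)
```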

@github-actions (bot) commented Jul 5, 2023

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-1180

@radekosmulski radekosmulski marked this pull request as draft July 5, 2023 03:25
@radekosmulski radekosmulski marked this pull request as ready for review July 5, 2023 05:12
@review-notebook-app (bot)

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

@@ -0,0 +1,364 @@
{
Contributor:

Why do we propose `with Loader(train, batch_size=1024) as loader:`, which differs from our TensorFlow examples?



Contributor:

This is something @edknv suggested; it ensures that the background thread gets removed. I am working on a way to move this inside our model/trainer code, because I think the context manager approach works for a single GPU but probably won't work in a multi-GPU setting.
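To illustrate why the context manager helps (a minimal standalone sketch, not Merlin's actual Loader), `__exit__` guarantees the background producer thread is signalled and joined, even if training raises:

```python
import queue
import threading

# Toy data loader illustrating the `with Loader(...) as loader:` pattern.
# All names here are hypothetical; this is not Merlin's implementation.
class Loader:
    def __init__(self, data, batch_size):
        self.data, self.batch_size = data, batch_size
        self._queue = queue.Queue(maxsize=4)
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._produce, daemon=True)

    def _produce(self):
        # Background thread: push batches until done or asked to stop.
        for i in range(0, len(self.data), self.batch_size):
            if self._stop.is_set():
                return
            self._queue.put(self.data[i:i + self.batch_size])

    def __enter__(self):
        self._thread.start()
        return self

    def __exit__(self, *exc):
        self._stop.set()     # signal the producer to exit
        self._thread.join()  # guarantee the thread is cleaned up
        return False         # never swallow exceptions from the body

    def __iter__(self):
        n_batches = -(-len(self.data) // self.batch_size)  # ceil division
        for _ in range(n_batches):
            yield self._queue.get()

with Loader(list(range(10)), batch_size=4) as loader:
    batches = [b for b in loader]
```

Without the `with` block, an exception mid-epoch could leave the producer thread alive, which is exactly the leak the context manager prevents.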

@@ -0,0 +1,364 @@
{
Contributor:

I am not sure we have the next notebook available for PyTorch. We might need to reference the TensorFlow one, OR remove the next-steps section, OR link to the other training examples.



Contributor Author:

Good point! Removed the cell for now; will add it back once we have more examples.

"id": "23d9bf34",
"metadata": {},
"source": [
"<img src=\"https://developer.download.nvidia.com/notebooks/dlsw-notebooks/merlin_models_01-getting-started/nvidia_logo.png\" style=\"width: 90px; float: right;\">\n",
Contributor:

we might need to update the logo?

Contributor Author:

Yes, absolutely! Good point; created a new tracking logo.

@@ -0,0 +1,348 @@
{
@rnyak (Contributor) commented Jul 5, 2023:

Line #4.    model.initialize(train_loader)

Can we add some explanation of why we need the `model.initialize()` step?



Contributor Author:

Added a note on the functionality of `initialize`.
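For readers wondering what such a step typically does: a lazy-initialization pattern peeks at one batch from the loader to infer input shapes before building layers. A toy sketch (hypothetical names throughout, not Merlin's implementation):

```python
# Toy illustration of a `model.initialize(loader)`-style step:
# inspect one batch to infer input dimensions before building layers.
# This mirrors the general idea only; it is not Merlin's actual code.
class LazyModel:
    def __init__(self):
        self.input_dim = None  # unknown until we see real data

    def initialize(self, loader):
        batch = next(iter(loader))       # peek at a single batch
        self.input_dim = len(batch[0])   # infer feature count from one row
        return self

# One batch containing two rows of three features each.
loader = [[(1.0, 2.0, 3.0), (4.0, 5.0, 6.0)]]
model = LazyModel().initialize(loader)
```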

@@ -0,0 +1,348 @@
{
@rnyak (Contributor) commented Jul 5, 2023:

Do we need this entire block again?



Contributor Author:

I am not sure -- I was just copying over what we have on the TF side, where we follow the same pattern.

@radekosmulski radekosmulski requested a review from rnyak July 6, 2023 08:35
4 participants