
Excellent work (mae.ipynb)! #1
Closed
sayakpaul opened this issue Nov 16, 2021 · 7 comments

sayakpaul (Collaborator) commented Nov 16, 2021

@ariG23498 this is fantastic stuff. Super clean, readable, and coherent with the original implementation. A couple of suggestions that would likely make things even better:

  • Since you have already implemented the masking visualization utilities, how about making them part of the PatchEncoder itself? That way you could let it accept a test image, apply random masking, and plot the result, just like you are doing in the earlier cells. I believe this would make the notebook cleaner.
  • AdamW (tfa.optimizers.AdamW) is a better choice when it comes to training Transformer-based models.
  • Are we taking the loss on the correct component? I remember you mentioning that it is dealt with differently.

After these points are addressed I will take a crack at porting the training loop to TPUs along with other performance monitoring callbacks.

ariG23498 (Owner) commented Nov 16, 2021

Since you have already implemented the masking visualization utilities, how about making them part of the PatchEncoder itself? That way you could let it accept a test image, apply random masking, and plot the result, just like you are doing in the earlier cells. I believe this would make the notebook cleaner.

This makes sense. I will try incorporating this.
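
A rough, untested sketch of what I have in mind. The patch size, image size, mask ratio, and method names below are placeholders, not what the notebook will finally use:

```python
import matplotlib.pyplot as plt
import tensorflow as tf
from tensorflow.keras import layers


class PatchEncoder(layers.Layer):
    def __init__(self, num_patches=64, projection_dim=128,
                 mask_proportion=0.75, **kwargs):
        super().__init__(**kwargs)
        self.num_patches = num_patches
        self.num_mask = int(mask_proportion * num_patches)
        self.projection = layers.Dense(projection_dim)

    def get_random_indices(self, batch_size):
        # Per-image random permutation of patch indices;
        # the first `num_mask` indices are treated as masked.
        rand = tf.argsort(
            tf.random.uniform((batch_size, self.num_patches)), axis=-1
        )
        return rand[:, : self.num_mask], rand[:, self.num_mask :]

    def show_masked_image(self, patches, patch_size=16, image_size=128):
        # Zero out the masked patches of the first image in the batch
        # and plot the result, like the standalone utility does now.
        mask_indices, _ = self.get_random_indices(batch_size=1)
        masked = tf.tensor_scatter_nd_update(
            patches[0],
            indices=mask_indices[0][:, tf.newaxis],
            updates=tf.zeros((self.num_mask, patches.shape[-1]), dtype=patches.dtype),
        )
        rows = image_size // patch_size
        grid = tf.reshape(masked, (rows, rows, patch_size, patch_size, 3))
        image = tf.reshape(
            tf.transpose(grid, [0, 2, 1, 3, 4]), (image_size, image_size, 3)
        )
        plt.imshow(tf.cast(image, tf.uint8))
        plt.axis("off")
        plt.show()
```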

AdamW (tfa.optimizers.AdamW) is a better choice when it comes to training Transformer-based models.

Noted!
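
Something along these lines should do; the learning rate and weight decay values are placeholders:

```python
# Sketch: swapping Adam for AdamW from TensorFlow Addons.
import tensorflow_addons as tfa

optimizer = tfa.optimizers.AdamW(learning_rate=1e-3, weight_decay=1e-4)
# ...then pass `optimizer` to model.compile() as usual.
```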

Are we taking the loss on the correct component? I remember you mentioning that it is dealt with differently.

Yes, you are right! You will find the correct loss implementation in the mae_loss.ipynb notebook.
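
For anyone reading along: the gist is that the reconstruction loss is taken only on the masked patches, as in the MAE paper. A minimal sketch of that idea, with tensor names that are assumptions and not the notebook's actual code:

```python
import tensorflow as tf


def masked_reconstruction_loss(patches, decoder_patches, mask_indices):
    # patches:          (batch, num_patches, patch_dim) original patches
    # decoder_patches:  (batch, num_patches, patch_dim) reconstructed patches
    # mask_indices:     (batch, num_mask) indices of the masked patches
    original_masked = tf.gather(patches, mask_indices, batch_dims=1)
    predicted_masked = tf.gather(decoder_patches, mask_indices, batch_dims=1)
    # MSE computed over the masked patches only.
    return tf.reduce_mean(tf.square(original_masked - predicted_masked))
```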

TODO:

  • Incorporate AdamW
  • Put the visualization utility inside PatchEncoder
  • Get everything inside a single notebook
  • Reuse shared logic inside the MAE class as suggested by @sayakpaul

ariG23498 self-assigned this Nov 16, 2021
ariG23498 (Owner) commented

After these points are addressed I will take a crack at porting the training loop to TPUs along with other performance monitoring callbacks.

Can't wait for this baby to train! 😃

sayakpaul (Collaborator, Author) commented

This happens as soon as you are okay with the mae_loss.ipynb implementation.

Taking a look now.

sayakpaul (Collaborator, Author) commented

@ariG23498 mae_loss.ipynb looks good. Since train_step() and test_step() share a large block of code, let's make another class method implementing that shared logic so that it can be reused. The method could be parameterized on a training argument to distinguish between training and inference.
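
Something along these lines, just a structural sketch with placeholder names rather than the actual MAE forward pass:

```python
import tensorflow as tf
from tensorflow import keras


class MAE(keras.Model):
    # Placeholder constructor: the real class also holds the patching and
    # masking layers; only what the shared-step sketch needs is shown here.
    def __init__(self, encoder, decoder, **kwargs):
        super().__init__(**kwargs)
        self.encoder = encoder
        self.decoder = decoder

    def calculate_loss(self, images, training=False):
        # Logic shared by train_step() and test_step(), switched by `training`.
        latent = self.encoder(images, training=training)
        reconstructed = self.decoder(latent, training=training)
        return self.compiled_loss(images, reconstructed)

    def train_step(self, images):
        with tf.GradientTape() as tape:
            loss = self.calculate_loss(images, training=True)
        grads = tape.gradient(loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(grads, self.trainable_variables))
        return {"loss": loss}

    def test_step(self, images):
        return {"loss": self.calculate_loss(images, training=False)}
```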

ariG23498 (Owner) commented

I have updated the TODO accordingly.

ariG23498 (Owner) commented

@sayakpaul I have pushed a single notebook, MaskedAutoEncoders.ipynb, in commit 7e8788a and have deleted all the other notebooks for clarity. All the TODOs are done. Please provide your feedback on the same.

ariG23498 (Owner) commented

Closing this.
#2 takes care of all the TODOs.
