[RFC] Batteries Included - Phase 3 #6323

Open · 6 of 16 tasks

datumbox opened this issue Jul 27, 2022 · 52 comments

@datumbox
Contributor

datumbox commented Jul 27, 2022

🚀 The feature

Note: To track the progress of the project, check out this board.

This is the 3rd phase of TorchVision's modernization project (see phases 1 and 2). We aim to keep TorchVision relevant by ensuring it provides, off the shelf, all the necessary primitives, model architectures and recipe utilities to produce SOTA results for the supported Computer Vision tasks.

1. New Primitives

To enable our users to reproduce the latest state-of-the-art research we will enhance TorchVision with the following data augmentations, layers, losses and other operators:

Data Augmentations

Losses

Operators added in PyTorch Core

2. New Architectures & Model Iterations

To ensure that our users have access to the most popular SOTA models, we will add the following architectures along with pre-trained weights:

Image Classification

Video Classification

3. Improved Training Recipes & Pre-trained models

To ensure that our users have access to strong baselines and SOTA weights, we will improve our training recipes to incorporate the newly released primitives and offer improved pre-trained models:

Reference Scripts

Pre-trained weights

  • Improve the accuracy of Video models

Other Candidates

There are several other Operators (#5414), Losses (#2980), Augmentations (#3817) and Models (#2707) proposed by the community. Here are some potential candidates that we could implement depending on bandwidth. Contributions are welcome for any of the below:

cc @datumbox @vfdev-5

@datumbox
Contributor Author

Tagging a few of the regular contributors in case they are interested in specific items:
@abhi-glitchhg @federicopozzi33 @frgfm @lezwon @oke-aditya @xiaohu2015 @yassineAlouini @zhiqwang

Feel free to propose additional candidates.

@oke-aditya
Contributor

I will be happy to take losses :) dice loss first.
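
For reference, a minimal soft-Dice sketch for binary segmentation, assuming raw logits and {0, 1} masks of the same shape; the name, signature and smoothing default are illustrative rather than TorchVision's final API:

```python
import torch

def dice_loss(inputs: torch.Tensor, targets: torch.Tensor, smooth: float = 1.0) -> torch.Tensor:
    # Soft Dice: 1 - 2*|A ∩ B| / (|A| + |B|), computed per sample then averaged.
    # `inputs` are raw logits, `targets` are {0, 1} masks of the same shape.
    probs = inputs.sigmoid().flatten(1)
    targets = targets.flatten(1).float()
    intersection = (probs * targets).sum(dim=1)
    denom = probs.sum(dim=1) + targets.sum(dim=1)
    return (1.0 - (2.0 * intersection + smooth) / (denom + smooth)).mean()
```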

@frgfm
Contributor

frgfm commented Jul 27, 2022

Like Christmas in July hehe 😁

I have a few questions though:

  • for losses, I have already implemented the poly loss on my end. Do you want us to do the Python implementation only, or also the C++ / CUDA binding? (I saw that there are some in PyTorch core, so I'm not sure what the target is here)
  • about optimizers, I also have implementations for LARS & LAMB; should we open PRs directly on core, or do we need to contact them in a dedicated issue beforehand?
  • about models, happy to go for the implementation, but I don't have the gear to train them on ImageNet with the usual procedure. Do we need to train them as well?

Looking forward to helping with the next release :)

@federicopozzi33
Contributor

federicopozzi33 commented Jul 28, 2022

I'd like to take the Polynomial LR scheduler!

About schedulers: should we open PRs directly on core, or do we need to contact them in a dedicated issue beforehand?
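
For context, the scheduler in question implements polynomial decay, which scales the base learning rate by (1 − t/T)^power; a rough standalone sketch of the rule (function name illustrative):

```python
def polynomial_lr(base_lr: float, step: int, total_steps: int, power: float = 1.0) -> float:
    # lr(t) = base_lr * (1 - t / T) ** power; power = 1.0 is plain linear decay.
    decay = (1.0 - min(step, total_steps) / total_steps) ** power
    return base_lr * decay
```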

@datumbox
Contributor Author

Thanks for offering to help! We are lucky to have you guys! :)

@frgfm excellent questions. Let me try to provide some more context here:

  • Losses are going to be added to TorchVision, for now with a Python-only implementation. Ideally we should reuse Core's existing methods that have C++/CUDA bindings as much as possible. That's particularly true for PolyLoss, where we want to reuse Core's cross_entropy rather than rewriting it with pure tensor ops (see the sketch after this list). Unless I missed a development on Core (in which case please correct me and provide a reference), neither Poly nor Dice is planned to be added there.
  • For the 2 optimizers and 1 scheduler, the PRs should go directly to Core, but we will provide help to maximize the chances of getting them landed. Due to the nature of the PRs, there is a higher risk of them not getting merged, but I've already spoken with some Core devs about it and I'm hopeful we can get them in.
  • For the models, the plan is to follow the process of our new model contribution guidelines. The TL;DR is that yes, we want the PR to contain weights proving that at least a tiny variant of the model works as expected, but then we will help you train the rest by running the e2e training on our internal infra.
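
To illustrate the point about reusing Core's cross_entropy, here is a rough sketch of the Poly-1 formulation from the PolyLoss paper (L = CE + ε(1 − p_t)); the function name and defaults are illustrative, not the final API:

```python
import torch
import torch.nn.functional as F

def poly1_cross_entropy(logits: torch.Tensor, targets: torch.Tensor,
                        epsilon: float = 1.0, reduction: str = "mean") -> torch.Tensor:
    # Poly-1: cross-entropy plus a first-order polynomial term epsilon * (1 - p_t).
    # Reusing F.cross_entropy keeps Core's C++/CUDA-backed implementation.
    ce = F.cross_entropy(logits, targets, reduction="none")
    p_t = logits.softmax(dim=-1).gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    loss = ce + epsilon * (1.0 - p_t)
    if reduction == "mean":
        return loss.mean()
    if reduction == "sum":
        return loss.sum()
    return loss
```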

@abhi-glitchhg
Contributor

Hello all 👋
A bit late to the party!

Is mosaic still available? If so, I would like to try it.

@datumbox
Contributor Author

datumbox commented Aug 3, 2022

@abhi-glitchhg Mosaic is available. It's a bit unclear at the moment how it should be implemented, as there are multiple approaches seen online. I would prefer it if we could implement it as a Transform (rather than a Dataset or preloader etc.), potentially similar to what we do for MixUp or SimpleCopyPaste. I think it would be best to disconnect its addition from the Transforms V2 initiative and add it first on the references. Then @pmeier and @vfdev-5 can propose moving forward with it using the new API.

To test the new transform we can use a similar approach as in #5825. The contributor provided enough visual proof that the transform works as expected, and I then helped verify it by training models on our internal infra. Let me know if that makes sense to you.
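
To make the Transform idea above concrete, here is a rough sketch of the core 4-image mosaic operation; a real implementation would add random centers, cropping and resizing, and the names and fixed 2×2 layout are illustrative:

```python
import torch

def mosaic(images, boxes_list):
    # Paste four equally sized CxHxW images into a 2Hx2W canvas and shift
    # each image's boxes (xyxy format) by the offset of its quadrant.
    c, h, w = images[0].shape
    canvas = torch.zeros(c, 2 * h, 2 * w, dtype=images[0].dtype)
    offsets = [(0, 0), (0, w), (h, 0), (h, w)]
    out_boxes = []
    for img, boxes, (dy, dx) in zip(images, boxes_list, offsets):
        canvas[:, dy:dy + h, dx:dx + w] = img
        shifted = boxes.clone()
        shifted[:, [0, 2]] += dx  # shift x coordinates
        shifted[:, [1, 3]] += dy  # shift y coordinates
        out_boxes.append(shifted)
    return canvas, torch.cat(out_boxes)
```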

@datumbox
Contributor Author

datumbox commented Aug 3, 2022

BTW, @lezwon just let us know that he is busy, so AutoAugment for Detection is also up for grabs if someone wants it. See #6224.

@yassineAlouini
Contributor

@datumbox I can work on MobileViT. 👌

@federicopozzi33
Contributor

I wanted to ask how helpful it would be to implement network architectures without training them, validating the implementations just by using/adapting/porting the weights released by the authors.

I can provide some specific examples if needed.

@datumbox
Contributor Author

datumbox commented Aug 9, 2022

@federicopozzi33 Though the final training (especially of the large variants) is done by our team, we typically request that contributors train at least one variant of the architecture to prove it works. The hardest part of such contributions is often reproducing the accuracies of the paper, which is why we request this. We are known to be flexible though, especially if a contributor has experience implementing and contributing similar architectures to us. Another approach is to partner with another contributor who has access to infra and co-author the PR. That's the approach @xiaohu2015 and @zhiqwang took for the FCOS model.

@datumbox
Contributor Author

@yassineAlouini I realized that my fat finger gave you a thumbs down instead of thumbs up on your comment to work on MobileViT. Sorry about that. Are you still interested in it?

@yassineAlouini
Contributor

@datumbox Yes I am and I understood that you meant 👍 instead so all is good. I should start on Friday. 👌

@federicopozzi33
Contributor

federicopozzi33 commented Aug 11, 2022

I'd like to take on the LARS optimizer, but I have a question: to test the correctness of the optimizer, is it required to reproduce the experiments of the paper?

@datumbox
Contributor Author

@federicopozzi33 I don't think it's required to reproduce experiments but we would need to be very careful to ensure the optimizer works the same as a reference implementation. If reproducing experiments is necessary, I can run them for you.

@frgfm you said you already had implementations; are you still interested and do you have the time to contribute? Perhaps you could work with Federico.

Let me know your preferences guys and we can come up with a plan. Because the contribution will land on Core, we would need to align with their practices. The earlier PR on PolynomialLR went super smoothly, so we can try replicating that approach.
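
For reference, the layer-wise scaling that makes LARS what it is can be sketched in a few lines (momentum omitted for brevity; names and defaults are illustrative, and a Core-ready version would subclass torch.optim.Optimizer):

```python
import torch

@torch.no_grad()
def lars_step(params, lr: float, weight_decay: float = 0.0,
              trust_coefficient: float = 0.001, eps: float = 1e-8):
    # LARS (You et al., 2017): rescale each layer's step by a trust ratio
    # proportional to ||w|| / ||g|| so large layers don't take oversized
    # steps at very large batch sizes. Weight decay is folded into the gradient.
    for p in params:
        if p.grad is None:
            continue
        g = p.grad.add(p, alpha=weight_decay)
        w_norm = torch.linalg.norm(p)
        g_norm = torch.linalg.norm(g)
        if w_norm > 0 and g_norm > 0:
            trust_ratio = float(trust_coefficient * w_norm / (g_norm + eps))
        else:
            trust_ratio = 1.0
        p.add_(g, alpha=-lr * trust_ratio)
```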

@federicopozzi33
Contributor

federicopozzi33 commented Aug 11, 2022

> @federicopozzi33 I don't think it's required to reproduce experiments but we would need to be very careful to ensure the optimizer works the same as a reference implementation. If reproducing experiments is necessary, I can run them for you.

Ok. There should be some reference implementations.

> @frgfm you said you already had implementations; are you still interested and do you have the time to contribute? Perhaps you could work with Federico.
>
> Let me know your preferences guys and we can come up with a plan. Because the contribution will land on Core, we would need to align with their practices. The earlier PR on PolynomialLR went super smoothly, so we can try replicating that approach.

Oh sorry, I looked at the main thread without paying attention to the other messages.

@frgfm let me know if you're still interested in contributing; I can pick other issues without any problem :)

@datumbox
Contributor Author

@federicopozzi33 @frgfm @yassineAlouini @abhi-glitchhg @oke-aditya It would be great if you could either open issues for the items you plan to work on, or open dummy (empty) initial PRs for them, so that we can link them from the ticket and know which work is assigned to whom.

This would allow other community members to pick up work. I would also recommend assigning one task to each person so that we can progress the work faster and without blocking others who want to contribute (though I'm happy to group together things that make sense, such as the losses or the optimizers, if that's what we want).

@yassineAlouini
Contributor

Will do @datumbox 👌

@yassineAlouini
Contributor

#6404 @datumbox I'm not sure if this is the proper way to do it (it's the first time for me); please comment/enhance if you have some time. I will start exploring the code soon.

@oke-aditya
Contributor

Yes. Fortunately I'm well and in good health, so I will take dice loss and this one. 😊

@YosuaMichael
Contributor

Thanks a lot @oke-aditya for the help!

@ambujpawar
Contributor

I see that Mixup for Detection [1, 2] is still available.
Can I pick it up?

@datumbox
Contributor Author

datumbox commented Oct 7, 2022

@ambujpawar It is! Would you be happy to give the new Transforms API a try (it's at torchvision.prototype.transforms), or do you prefer to stick with hacking together an implementation based on what we have in the classification references?
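
For context, the core of a detection-aware MixUp is quite small; a rough sketch assuming two same-sized image tensors and target dicts with "boxes"/"labels" keys (the real transform would be written against torchvision.prototype.transforms):

```python
import torch

def detection_mixup(img1, targets1, img2, targets2, alpha: float = 1.5):
    # Blend the two images with a Beta-sampled weight and keep the union
    # of both sets of boxes/labels; the boxes themselves are unchanged.
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    mixed = lam * img1.float() + (1.0 - lam) * img2.float()
    target = {
        "boxes": torch.cat([targets1["boxes"], targets2["boxes"]]),
        "labels": torch.cat([targets1["labels"], targets2["labels"]]),
    }
    return mixed, target
```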

@ambujpawar
Contributor

I don't have a preference for now, but I think the new transforms API would be nicer, right?
So I'll go with that one.

@datumbox
Contributor Author

datumbox commented Oct 7, 2022

Sounds great! Could you create an issue or a dummy PR so that we can assign it to you and keep track of this item more easily?

@federicopozzi33
Contributor

federicopozzi33 commented Oct 17, 2022

Since the LARS optimizer is still available, I would like to pick it up. @datumbox, is that ok with you?

@Atharva-Phatak

@datumbox I would like to take LAMB optimizer.

@datumbox
Contributor Author

@federicopozzi33 I think your message fell through the cracks... I apologise; would you like to pick it up?

@Atharva-Phatak sounds great, I assigned the issue you started to you. Note that this is meant to be upstreamed to PyTorch Core. Ping me when you have an early version so we can do an early check before involving the PyTorch Core engineers. :)
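
For anyone following along, the gist of LAMB is an Adam-style update rescaled per layer by a trust ratio ||w|| / ||update||; a rough functional sketch (a Core-ready version would subclass torch.optim.Optimizer; names and defaults are illustrative):

```python
import torch

@torch.no_grad()
def lamb_step(params, state: dict, lr: float = 1e-3, betas=(0.9, 0.999),
              eps: float = 1e-6, weight_decay: float = 0.01):
    # LAMB (You et al., 2020): compute a bias-corrected Adam update, add
    # decoupled weight decay, then rescale the step per layer by ||w|| / ||u||.
    beta1, beta2 = betas
    for p in params:
        if p.grad is None:
            continue
        st = state.setdefault(p, {"step": 0,
                                  "exp_avg": torch.zeros_like(p),
                                  "exp_avg_sq": torch.zeros_like(p)})
        st["step"] += 1
        st["exp_avg"].mul_(beta1).add_(p.grad, alpha=1 - beta1)
        st["exp_avg_sq"].mul_(beta2).addcmul_(p.grad, p.grad, value=1 - beta2)
        m_hat = st["exp_avg"] / (1 - beta1 ** st["step"])
        v_hat = st["exp_avg_sq"] / (1 - beta2 ** st["step"])
        update = m_hat / (v_hat.sqrt() + eps) + weight_decay * p
        w_norm = torch.linalg.norm(p)
        u_norm = torch.linalg.norm(update)
        trust_ratio = float(w_norm / u_norm) if (w_norm > 0 and u_norm > 0) else 1.0
        p.add_(update, alpha=-lr * trust_ratio)
```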

@federicopozzi33
Contributor

> @federicopozzi33 I think your message fell through the cracks... I apologise; would you like to pick it up?

No problem. Yes, I'm still interested. I should open the draft PR directly in the PyTorch repo, right?

@datumbox
Contributor Author

@federicopozzi33 Yes, that sounds great! Feel free to ping me like last time to go through the checks together, and when we are mostly ready I'll ping the Core engineers to get their input. :)

@Atharva-Phatak

@datumbox Are you recommending that I make a draft PR and we go over the changes, and then we file the main PR for pytorch-core, right?

@datumbox
Contributor Author

@Atharva-Phatak Yes sounds good.

You can start a draft PR on core and add me as a reviewer to discuss details before looping other devs in. If you have specific details in mind, you can also post them on the issue you raised at TorchVision. That's what we did previously with @federicopozzi33 and it worked fine (see #4438 (comment)). I'm quite flexible on adjusting the way this works for you; just make sure you mark the PR as a draft to indicate it's work in progress.

@frgfm
Contributor

frgfm commented Nov 1, 2022

Oops, I realized I hadn't opened the PRs for LARS & LAMB on core 😅
Sorry about that 🙃 Do you prefer to keep working on these? Or should I open the PRs?

@Atharva-Phatak

@frgfm I thought no one was working on LAMB, hence I took it up. If it's okay, I would like to keep working on it. 😃

@frgfm
Contributor

frgfm commented Nov 1, 2022

Of course it is!
What about LARS @federicopozzi33? :)

@federicopozzi33
Contributor

federicopozzi33 commented Nov 1, 2022

> Of course it is!
>
> What about LARS @federicopozzi33? :)

Same as @Atharva-Phatak. Moreover, I've already started working on that, so I'd like to continue.

@frgfm
Contributor

frgfm commented Nov 1, 2022

Roger that! In case one of you encounters trouble, let me know, as I implemented these a while back 👍 (cf. https://github.com/frgfm/Holocron/tree/main/holocron/optim)

@ambujpawar
Contributor

I am (almost) finished with Mixup for Detection. I would like to pick up Deformable DeTR next, since it's not taken yet.
Shall I create an issue and a draft PR for it, as done previously?

@datumbox
Contributor Author

datumbox commented Nov 7, 2022

@ambujpawar I was wondering if you could perhaps help with the normal DeTR first. Work on it previously started in #5922 but was not completed. Let me know if that's of interest.

@ambujpawar
Contributor

ambujpawar commented Nov 7, 2022

Sure, it seems interesting. I thought other contributors were already working on it, which is why I chose a different one.
I will pick up the normal DeTR then :)

I see a PR for DeTR but not an issue. Shall I create one?

@datumbox
Contributor Author

datumbox commented Nov 7, 2022

Yes, good idea: create an issue and perhaps ping the devs who are on the PR to see if there are opportunities for collaboration. We've done shared PRs before (for FCOS, see #4961), so this might also work here. Otherwise we can find another ticket for you. I just want to make sure we add DeTR soon, as it would be the first Transformer-based detection model, something missing from TorchVision at the moment.

@weike382

weike382 commented Jan 2, 2023

So good. Looking forward to the realization of the MTV candidates.

@deepwilson

@datumbox I have some free time and would like to contribute. I see that the DETR implementation is also not moving ahead.
If you are aware of any other tasks, please do let me know.
Thanks!

@deepwilson

@oke-aditya Thanks. :) Are there any open topics? I can see that many of the topics/tasks are already taken.

@datumbox
Contributor Author

@deepwilson It's a tough period for the team, as it doesn't have enough resources. I myself have changed jobs, so it's harder to follow up with every ongoing initiative. It would be very nice to finally add DETR to the library, but training it might be a challenge. Not sure if @pmeier or @vfdev-5 have any good issues where they could use community help?
