add example of self-supervised SimCLR training - V2 #50

DevinCheung · 2021-12-02T11:45:09Z

The previous version used Nvidia DALI to create the dataloader. I found that the data augmentations in DALI differ from those in torchvision, and as a result the desired performance could not be achieved. In this version, the dataloader is implemented with colossalai.nn.data and torchvision. The final linear evaluation accuracy reaches up to 85.4%.

FrankLeeeee · 2021-12-08T03:45:01Z

Hi @DevinCheung , sorry for my late reply and thank you very much for your contribution. The current version is still an unstable beta and we are updating the API for better usability. Could you integrate your example with the latest API? I will notify you when the new API is merged into the main branch. :)

FrankLeeeee · 2021-12-08T03:48:28Z

Meanwhile, it would be good to rebase the commits into one commit so that the git log looks cleaner. Since this PR is newer, I will close #42.

DevinCheung · 2021-12-08T06:33:08Z

Hi, @FrankLeeeee Sure, please notify me when the updated API is ready :)

DevinCheung · 2021-12-10T01:43:38Z

Hi, @FrankLeeeee I noticed that the API has been updated. Are all the updates finished?

FrankLeeeee · 2021-12-10T02:16:42Z

@DevinCheung Hi Devin. Yes, the API is merged. The documentation and example code will be updated by today. You may refer to the new doc later.

DevinCheung · 2021-12-11T15:57:30Z

Hi, Frank @FrankLeeeee I have integrated my example into the latest API. Thanks for your evaluation. Please feel free to contact me if anything is needed. :)

FrankLeeeee · 2021-12-13T07:35:48Z

Hi @DevinCheung , thanks for your update! SimCLR looks like a fantastic example for us. However, could you refer to the new documentation and the examples folder to sync with the latest coding style? We hope to introduce minimal interference to the way a normal PyTorch user writes code, so we no longer put the model, optimizer, loss, etc. in the config file unless someone really needs it (e.g. for pipeline model partitioning) and knows how to build it from config. It would be great if you could refactor your code so that things look consistent :)

Meanwhile, rebasing the commits into only one commit will do us a great favour as the git log will look cleaner and more meaningful! Feel free to reach me if you need any sort of help!

DevinCheung · 2021-12-13T09:55:52Z

Hi @FrankLeeeee , sorry, I did not quite get it. Did you mean I need to modify my config file for consistency? Specifically, that it should not contain the definitions of "model, optimizer, loss, etc."? In my case, I do define my own models and a loss function, and I arranged the config file in a style similar to the "vit-b16" example. It would be appreciated if you could clarify which parts to modify :)

FrankLeeeee · 2021-12-15T02:10:03Z

Hi @DevinCheung , sorry for my late reply. You may take a look at the vit example, which uses the latest API, and follow its style. Perhaps I wasn't clear last time: by definitions of "model, optimizer, loss, etc." I meant their configuration in the config file. We would like to keep the configuration file lightweight so that it does not interfere with the user's habit of writing PyTorch code. Do let me know if you have any issues :)
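
To illustrate the lightweight-config idea discussed above, here is a minimal sketch of what such a file might contain. The file name, variable names, and values are hypothetical placeholders chosen for illustration; they are not taken from this PR or from the vit example.

```python
# config.py (hypothetical file name): plain hyperparameters only.
# All names and values below are illustrative placeholders,
# not settings taken from this PR.
BATCH_SIZE = 128
NUM_EPOCHS = 100
LEARNING_RATE = 1e-3

# Deliberately absent: model, optimizer, and loss definitions.
# Under the style described above, those would be built in the
# training script with ordinary PyTorch code rather than from config.
```

The point of the sketch is the omission: the config carries only scalar settings, so a PyTorch user can read the training script without learning a config-driven builder.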

DevinCheung · 2021-12-15T10:46:03Z

Hi, @FrankLeeeee The example has been synced to the latest coding style. Please take a look when you have time, and @ me if further modification is needed :)

FrankLeeeee

Hi @DevinCheung , this is an awesome example and I have viewed all the file changes! There are some areas I think are worth our attention.

- Atypical paths should be avoided, for example ../../../../../datasets in your code. Try to use ./dataset or os.environ['DATA']. Please also state in your README.md where the data will be downloaded.
- setup.py should usually not be changed. I understand that you probably commented it out because you encountered some issue during installation. You might want to create an issue on GitHub and we will look into it.
- The README can be more detailed, as many people do not have a self-supervised learning background and may not understand what is going on in this example. More details (e.g. a short introduction or a hyperlink to a web page) would help, such as definitions of linear evaluation, PreAct, t-SNE, etc.
- To keep a clean git log for future backtracking, you should first squash your commits into one commit. Then you can rebase onto your upstream main branch.

Hope you will find these suggestions useful :)
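
The dataset-path suggestion above can be sketched as follows. This is a minimal illustration, not the code from this PR: the helper name `resolve_data_root` is hypothetical, and the fallback to `./dataset` when `DATA` is unset or empty is an assumption (a bare `os.environ['DATA']` would raise `KeyError` if the variable is missing).

```python
import os
from pathlib import Path


def resolve_data_root(default="./dataset"):
    """Return the dataset directory (hypothetical helper).

    Uses the DATA environment variable when it is set and non-empty,
    otherwise falls back to a path relative to the working directory.
    """
    return Path(os.environ.get("DATA") or default)


if __name__ == "__main__":
    # Example: print where the dataset would be stored or downloaded.
    print(resolve_data_root())
```

Running the script with `DATA` exported prints that directory; without it, the relative `dataset` folder is used, which avoids deep `../` chains that depend on where the repository is checked out.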

DevinCheung · 2021-12-16T14:01:10Z

Hi, @FrankLeeeee , the README is now more detailed and the dataset path has been standardized. Please have a look :)

DevinCheung · 2021-12-20T11:35:23Z

@FrankLeeeee

FrankLeeeee · 2021-12-21T00:07:24Z

@DevinCheung Thanks for your contribution, I will close this PR.

DevinCheung added 2 commits December 1, 2021 01:54

add example of self-supervised SimCLR training

216a9b4

simclr v2, replace nvidia dali dataloader

e64fca3

DevinCheung added 2 commits December 11, 2021 15:12

updated

9d6c588

Merge branch 'hpcaitech:main' into taskv2-branch

d7876f8

DevinCheung added 3 commits December 15, 2021 10:12

Merge branch 'hpcaitech:main' into taskv2-branch

1ade53d

sync to latest code writing style

fe79fcf

sync to latest code writing style and modify README

f9de7fe

FrankLeeeee requested changes Dec 16, 2021

FrankLeeeee added the documentation Improvements or additions to documentation label Dec 16, 2021

DevinCheung added 2 commits December 16, 2021 21:25

Merge branch 'hpcaitech:main' into taskv2-branch

0079d0c

detail README & standardize dataset path

eaf2517

DevinCheung requested a review from FrankLeeeee December 18, 2021 09:49

FrankLeeeee approved these changes Dec 21, 2021

FrankLeeeee merged commit 648f806 into hpcaitech:main Dec 21, 2021
