Conversation

@ksalama ksalama (Contributor) commented Dec 2, 2020

No description provided.

@ksalama ksalama (Contributor, Author) commented Dec 2, 2020

@fchollet - Thank you so much for merging my previous PR. I am not sure why the introduction section was not added to the MD file. I created this PR to add the introduction.

Co-authored-by: 8bitmp3 <19637339+8bitmp3@users.noreply.github.com>
@8bitmp3 8bitmp3 (Contributor) left a comment

Hey @ksalama I have a few proposals to improve this intro, if you don't mind.

I think here you're describing the results of an experiment (by saying it "outperforms"). Maybe it'd be more useful for the readers to first learn about the gist of this supervised contrastive learning (SCL) and how it works in 1-2 sentences.

Then, you could finish off this small introductory paragraph with the "outperforms" statement, while also being explicit about how SCL outperforms traditional vanilla cross-entropy supervised learning (judging by Tables 2 and 3 on page 7 of https://arxiv.org/pdf/2004.11362.pdf, SCL outperforms it in terms of accuracy by a clear margin).

I think the cool thing about SCL is that it extends the previous (?) self-supervised approach to supervised learning - you should definitely highlight it here ("the self-supervised batch contrastive approach to the fully-supervised" - page 1, https://arxiv.org/pdf/2004.11362.pdf). SCL "contrasts the set of all samples from the same class as positives against the negatives from the remainder of the batch" (page 2, figure 3).

On page 4 under "Method", the paper actually summarizes what SCL does:

"Given an input batch of data, we first apply data augmentation twice to obtain two copies of the batch. Both copies are forward propagated through the encoder network to obtain a 2048-dimensional normalized embedding. During training, this representation is further propagated through a projection network that is discarded at inference time. The supervised contrastive loss is computed on the outputs of the projection network. To use the trained model for classification, we train a linear classifier on top of the frozen representations using a cross-entropy loss."

I recommend you summarize it in a human-friendly way to appeal to non-academics.

I can assist you with that, if you need help.

Anyway, these are just suggestions.

Comment on lines 13 to 15
[Supervised Contrastive Learning](https://arxiv.org/abs/2004.11362)
(Prannay Khosla et al.) is a training methodology that outperforms
plain crossentropy-supervised training on classification tasks.

ksalama (Contributor, Author) replied:

@8bitmp3 Thanks a lot for the suggestion. Please feel free to propose an intro text that you think would be simple and useful, and I will be happy to commit your suggestion.

8bitmp3 (Contributor) replied:

@ksalama Would it be fair to say that the supervised contrastive learning paper introduces a way of training that includes the supervised contrastive loss? Also, we could say that the method offers a two-stage framework that enhances image classification performance (borrowed from: https://github.com/sayakpaul/Supervised-Contrastive-Learning-in-TensorFlow-2). I also like how they worded it here: "Learn how to map the normalized encoding of samples belonging to the same category closer and the samples belonging to the other classes farther." (https://wandb.ai/authors/scl/reports/Improving-Image-Classifiers-With-Supervised-Contrastive-Learning--VmlldzoxMzQwNzE). We could rephrase it with attribution. cc @fchollet
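That "map same-class samples closer, other classes farther" wording is essentially what the loss does on the L2-normalized projections. A rough sketch of such a loss (a simplified take on the paper's formulation, not necessarily what the example implements):

```python
import tensorflow as tf

def supervised_contrastive_loss(labels, projections, temperature=0.1):
    # Normalize so that dot products between projections are cosine similarities.
    projections = tf.math.l2_normalize(projections, axis=1)
    logits = tf.matmul(projections, projections, transpose_b=True) / temperature

    batch_size = tf.shape(projections)[0]
    diag = tf.eye(batch_size)

    # Positives: other samples in the batch that share the anchor's label.
    labels = tf.reshape(labels, [-1, 1])
    positives = tf.cast(tf.equal(labels, tf.transpose(labels)), tf.float32) - diag

    # Exclude each anchor's similarity with itself from the softmax denominator.
    logits = logits - 1e9 * diag
    log_prob = logits - tf.reduce_logsumexp(logits, axis=1, keepdims=True)

    # Average the log-probability over each anchor's positives, then negate.
    num_positives = tf.maximum(tf.reduce_sum(positives, axis=1), 1.0)
    mean_log_prob_pos = tf.reduce_sum(positives * log_prob, axis=1) / num_positives
    return -tf.reduce_mean(mean_log_prob_pos)
```

Minimizing this pulls projections of same-class samples together and pushes the rest of the batch away.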

@8bitmp3 8bitmp3 (Contributor) replied on Dec 6, 2020

Bear with me, there's a talk by one of the sponsors at the NeurIPS conference today - and probably another next week by the paper's authors - that covers contrastive and supervised contrastive learning. I'll take some notes and revise the intro to make it more useful for the readers. cc @fchollet

ksalama (Contributor, Author) replied:

@8bitmp3 Sounds good. I would merge this basic introduction into the .md file so that the example page on the website has an introduction (as it currently doesn't have one!). Then we can update the introduction as you suggest.

Co-authored-by: 8bitmp3 <19637339+8bitmp3@users.noreply.github.com>

  [Supervised Contrastive Learning](https://arxiv.org/abs/2004.11362)
  (Prannay Khosla et al.) is a training methodology that outperforms
- plain crossentropy-supervised training on classification tasks.
+ supervised training on classification tasks with cross-entropy.
fchollet (Contributor) replied:

In the Keras API, "crossentropy" is a single word
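For example, the built-in loss classes spell it that way:

```python
from tensorflow import keras

# One word in the Keras API:
loss = keras.losses.SparseCategoricalCrossentropy()
```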

8bitmp3 (Contributor) replied:

Got it @fchollet

@fchollet fchollet (Contributor) left a comment

Thanks for the PR. Any changes should be applied first to the .py file, then replicated in the .md and .ipynb files.

@fchollet fchollet (Contributor) commented Dec 4, 2020

Also note that I have fixed the issue with the intro not showing up. It happened because the intro was part of the same block of text as the header. I've added a test that makes sure we catch this sort of issue in the future.
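In other words, in the example's .py source the intro has to live in its own text block, separate from the header block at the top. Roughly (field values elided, assuming the usual keras.io example layout):

```python
"""
Title: Supervised Contrastive Learning
Author: ...
Date created: ...
Last modified: ...
Description: ...
"""

"""
## Introduction

[Supervised Contrastive Learning](https://arxiv.org/abs/2004.11362)
(Prannay Khosla et al.) is a training methodology that outperforms ...
"""
```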

@ksalama ksalama (Contributor, Author) commented Dec 6, 2020

@fchollet - I have updated the introduction in the .py, .md, and .ipynb files.

@fchollet fchollet (Contributor) left a comment

LGTM otherwise

@ksalama ksalama (Contributor, Author) commented Dec 8, 2020

@fchollet - I have updated the intro in the three files.

@fchollet fchollet (Contributor) left a comment

LGTM, thank you

@fchollet fchollet merged commit de7ea52 into keras-team:master Dec 8, 2020
ksalama added a commit to ksalama/keras-io that referenced this pull request Dec 19, 2020
Add an introduction section to the MD file (keras-team#321)