New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[FLAVA] Make projections part of the core model #106

Closed

ankitade wants to merge 10 commits into gh/ankitade/5/base from gh/ankitade/5/head

Contributor

ankitade commented Jun 21, 2022 •

edited

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan

pytest
python -m flava.train config=flava/configs/pretraining/debug.yaml
python -m flava.finetune config=flava/configs/finetuning/qnli.yaml

Stack from ghstack (oldest at bottom):

Differential Revision: D37481127


          Temp CL

738111a

[ghstack-poisoned]

This was referenced Jun 21, 2022

Moving flava model to its own folder #96

Closed

[FLAVA] Separate out text and image encoders #102

Closed

[FLAVA]Change some initialization orders and corresponding tests #105

Closed

facebook-github-bot added the CLA Signed label

ankitade added a commit that referenced this pull request


          Temp CL

182c75e

ghstack-source-id: 1b7477c156c2ff5466909f44c1959218d5dbc651
Pull Request resolved: #106


          Update on "Temp CL"

2934b63

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          Temp CL

e4709b3

ghstack-source-id: 0bca6e6b8df1f0c3098790dd90d8f60c8022b04c
Pull Request resolved: #106

codecov-commenter commented Jun 23, 2022 •

edited

Codecov Report

❗ No coverage uploaded for pull request base (gh/ankitade/5/base@3f7009e). Click here to learn what that means.
The diff coverage is n/a.

@@                  Coverage Diff                  @@
##             gh/ankitade/5/base     #106   +/-   ##
=====================================================
  Coverage                      ?   93.04%           
=====================================================
  Files                         ?       47           
  Lines                         ?     2776           
  Branches                      ?        0           
=====================================================
  Hits                          ?     2583           
  Misses                        ?      193           
  Partials                      ?        0

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 3f7009e...4bcee67. Read the comment docs.


          Update on "Temp CL"

c59460e

[ghstack-poisoned]

ankitade mentioned this pull request

[Flava] Add ckpt loading and accuracy metric to finetuning #119

Closed

ankitade added a commit that referenced this pull request


          Temp CL

ffdbc6a

ghstack-source-id: 4c0738f0a0d96b6cc7ec47f924ba00b7008ddc77
Pull Request resolved: #106


          Update on "Temp CL"

43f7cff

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

901662f

ghstack-source-id: a97330da33e2f83ddd0070427d01647d4948f64c
Pull Request resolved: #106

ankitade changed the title ~~Temp CL~~ [FLAVA] Make projections part of the core model


          Update on "[FLAVA] Make projections part of the core model"

89b126e

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan
1. pytest
2.  python -m flava.train config=flava/configs/pretraining/debug.yaml
3. python -m flava.finetune config=flava/configs/finetuning/qnli.yaml
 



[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

c801910

ghstack-source-id: f8b9173211262db95c897ab5827b862e595cdd7e
Pull Request resolved: #106


          Update on "[FLAVA] Make projections part of the core model"

f805ce2

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan
1. pytest
2.  python -m flava.train config=flava/configs/pretraining/debug.yaml
3. python -m flava.finetune config=flava/configs/finetuning/qnli.yaml
 



[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

72fe0af

ghstack-source-id: 4844b172f52d41979f5dcd74323105152d90df69
Pull Request resolved: #106

Contributor Author

ankitade commented Jun 28, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


          Update on "[FLAVA] Make projections part of the core model"

0d43ce3

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan
1. pytest
2.  python -m flava.train config=flava/configs/pretraining/debug.yaml
3. python -m flava.finetune config=flava/configs/finetuning/qnli.yaml
 

Differential Revision: [D37481127](https://our.internmc.facebook.com/intern/diff/D37481127)

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

6fac11d

ghstack-source-id: e6b230c54929fdf73321869b783a88bcf7fcaca4
Pull Request resolved: #106

This was referenced Jul 4, 2022

change order of itm loss init #131

Draft

[FLAVA]Move itm head to flava model for pretraining #132

Draft

ankitade requested review from apsdehal, ebsmothers, RdoubleA and langong347

July 13, 2022 06:25

ankitade marked this pull request as ready for review

July 13, 2022 06:26

ebsmothers approved these changes

View reviewed changes


          Update on "[FLAVA] Make projections part of the core model"

44c8379

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan
1. pytest
2.  python -m flava.train config=flava/configs/pretraining/debug.yaml
3. python -m flava.finetune config=flava/configs/finetuning/qnli.yaml
 

Differential Revision: [D37481127](https://our.internmc.facebook.com/intern/diff/D37481127)

[ghstack-poisoned]

Contributor Author

ankitade commented Jul 23, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


          Update on "[FLAVA] Make projections part of the core model"

2ba6ff6

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan
1. pytest
2.  python -m flava.train config=flava/configs/pretraining/debug.yaml
3. python -m flava.finetune config=flava/configs/finetuning/qnli.yaml
 

Differential Revision: [D37481127](https://our.internmc.facebook.com/intern/diff/D37481127)

[ghstack-poisoned]

Contributor Author

ankitade commented Jul 23, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

4 similar comments

Contributor Author

ankitade commented Jul 23, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 24, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 24, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 25, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ankitade mentioned this pull request

[FLAVA] Move masked prediction head to flava_for_pretraining #195

Draft


          Update on "[FLAVA] Make projections part of the core model"

4bcee67

Move projections from the contrastive loss to the core model
This will allow users to use the model (instead of the pretraining model) for doing zero shot
Also moved to using the translated the checkpoint.

Test plan
1. pytest
2.  python -m flava.train config=flava/configs/pretraining/debug.yaml
3. python -m flava.finetune config=flava/configs/finetuning/qnli.yaml
 

Differential Revision: [D37481127](https://our.internmc.facebook.com/intern/diff/D37481127)

[ghstack-poisoned]

Contributor Author

ankitade commented Jul 26, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot closed this in

679f359

facebook-github-bot deleted the gh/ankitade/5/head branch

July 29, 2022 14:17

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment