# Does this repo work for all BERT models? #56

Open
janezzzz opened this issue Nov 17, 2020 · 1 comment
Labels
question Further information is requested

Comments

@janezzzz

Hi, first of all thank you so much, that's a really helpful repo!

I'm now working with some pretrained BERT models, so I'm wondering whether this flop_counter works for all types of BERT models. I noticed there is an example using BertForSequenceClassification with input_res=(2, 128). But when I try other BERT models, choosing input_res could be a big problem. Is there a baseline for BERT models like the one you mention in the README (similar to the image networks)? Or is there a lookup table? Thank you very much!

@sovrasov
Owner

sovrasov commented Dec 2, 2020

What do you mean when you say input_res could be a big problem?
If the input shape varies, then the number of FLOPs also varies, and only the average value across a dataset makes sense.
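
To illustrate the averaging idea, here is a minimal sketch that uses only the standard library. Everything in it is an assumption made for illustration: `toy_encoder_macs` is a rough analytic estimate for a BERT-like encoder (matrix multiplications only), not the output of this repo's counter, and `average_cost` and the sample lengths are hypothetical. In practice the per-sample cost would come from running the counter once per distinct sequence length.

```python
from statistics import mean
from typing import Callable, Iterable


def toy_encoder_macs(seq_len: int, hidden: int = 768, ffn: int = 3072, layers: int = 12) -> int:
    """Rough multiply-accumulate count for a BERT-like encoder (an assumption, not a measurement).

    Only matrix multiplications are counted; embeddings, softmax,
    layer norm and biases are ignored.
    """
    projections = 4 * seq_len * hidden * hidden   # Q, K, V and output projections
    attention = 2 * seq_len * seq_len * hidden    # QK^T scores and weighted sum over V
    feed_forward = 2 * seq_len * hidden * ffn     # the two linear layers of the FFN
    return layers * (projections + attention + feed_forward)


def average_cost(count_fn: Callable[[int], int], seq_lens: Iterable[int]) -> float:
    """Average a per-sample cost over the sequence lengths seen in a dataset."""
    return mean(count_fn(n) for n in seq_lens)


# Hypothetical token counts for five samples of a dataset.
lengths = [12, 37, 64, 128, 19]
avg = average_cost(toy_encoder_macs, lengths)
fixed = toy_encoder_macs(128)
print(f"average: {avg / 1e9:.2f} GMACs, every input padded to 128: {fixed / 1e9:.2f} GMACs")
```

Because the attention term grows quadratically with sequence length, a single number measured at a fixed length such as 128 overstates the cost for a dataset made mostly of shorter inputs, which is why the dataset average is the more meaningful figure.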

@sovrasov sovrasov added the question Further information is requested label Dec 2, 2020