When the prompt exceeds the length #61

jsk1107 · 2022-11-21T15:06:50Z

Hello.

Q1.
My custom dataset has about 300 classes, and this causes an error. I think the error occurs because the logits have shape [:, :, 256] while some indices in my positive map are greater than 256. Am I right?

Q2.
In #37 it was said: "If the prompt exceeds the length, you can take a look at the inference codes about how we deal with the LVIS dataset (~1200 classes)."
Would converting my COCO annotations to LVIS annotations solve the error in Q1? If so, is there an API that converts between the two annotation formats?

Haotian-Zhang · 2022-11-21T23:39:29Z

@jsk1107 Thank you for the questions! Yes, the error is due to the maximum length of the text encoder input, which is 256. You can convert your labels directly to LVIS format. The COCO and LVIS label files do not differ much. The only other thing you need to do is modify the datasets section of the config file so that the model knows what kind of data you are specifying.
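
For readers who hit the same limit, one way to keep each prompt under the 256-length budget is to split the class list into several shorter prompts and run them one at a time. The following is a minimal sketch of that idea, not the repository's actual LVIS inference code. The function names, the ". " separator, and the crude token counter are all assumptions made for illustration; a real BERT tokenizer splits rare words into several pieces, so pass its counting function as `count_tokens` for accurate budgeting.

```python
import re
from typing import Callable, List

MAX_TOKENS = 256       # text-encoder limit mentioned in this thread
SPECIAL_TOKENS = 2     # assumption: room reserved for [CLS] and [SEP]


def rough_token_count(text: str) -> int:
    """Crude stand-in for a tokenizer: counts word runs and punctuation marks.

    This underestimates WordPiece token counts for rare words.
    """
    return len(re.findall(r"\w+|[^\w\s]", text))


def chunk_class_names(
    names: List[str],
    max_tokens: int = MAX_TOKENS - SPECIAL_TOKENS,
    count_tokens: Callable[[str], int] = rough_token_count,
    sep: str = ". ",   # hypothetical separator between class names
) -> List[List[str]]:
    """Greedily group class names so each joined prompt fits in max_tokens."""
    chunks: List[List[str]] = []
    current: List[str] = []
    for name in names:
        candidate = current + [name]
        if current and count_tokens(sep.join(candidate)) > max_tokens:
            # Adding this name would overflow the budget: start a new chunk.
            chunks.append(current)
            current = [name]
        else:
            # A single over-long name still gets a chunk of its own.
            current = candidate
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    class_names = [f"class {i}" for i in range(300)]  # placeholder names
    for chunk in chunk_class_names(class_names):
        prompt = ". ".join(chunk)
        print(len(chunk), rough_token_count(prompt))
```

With this approach, each chunk is scored separately, so the class index predicted within a chunk has to be mapped back to the global category id (for example, by keeping the offset of each chunk in the original list).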

jsk1107 · 2022-11-22T06:41:42Z

@Haotian-Zhang Problem solved. Thank you!

I have one more question. How should spaces be handled when entering prompts?

e.g. aerosolcan / aerosol can / aerosol_can
e.g. airconditioner / air conditioner / air_conditioner

WordNet seems to use underscores rather than spaces, and the LVIS annotations use underscores too. Is it good practice to use underscores for correct prompting?

Haotian-Zhang · 2022-11-22T21:31:38Z

Hi @jsk1107, I don't think underscores make sense in our case. We use BERT as our text encoder, and it was pre-trained on natural-language sentences. Besides, our detection prompt also uses spaces between the class categories. Please let us know if you have further concerns.
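
Following that advice, underscore-style names (as found in WordNet or LVIS annotations) can be converted to natural-language form before the prompt is built. The snippet below is a minimal sketch under stated assumptions: `normalize_class_name` and `build_prompt` are hypothetical helpers, not part of the repository, and the ". " separator is an assumption.

```python
from typing import Iterable


def normalize_class_name(name: str) -> str:
    """Replace underscores with spaces and collapse repeated whitespace."""
    return " ".join(name.replace("_", " ").split())


def build_prompt(names: Iterable[str], sep: str = ". ") -> str:
    """Join normalized class names into one prompt string.

    The separator is an assumption; use whatever your config expects.
    """
    return sep.join(normalize_class_name(n) for n in names)


if __name__ == "__main__":
    print(build_prompt(["aerosol_can", "air_conditioner"]))
```

Fused spellings such as "aerosolcan" cannot be split automatically this way, so those names need to be corrected by hand in the category list.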

jsk1107 · 2022-11-25T06:01:40Z

@Haotian-Zhang Thanks! I'll close this issue.

nanfei666 · 2023-06-08T08:00:45Z

@Haotian-Zhang Hi, I am not sure which part of the config file should be modified if I have a COCO-format annotation file with many more classes. Could you point it out for me? Thank you.

vishalvijay99 · 2023-09-20T07:51:58Z

@jsk1107 What changes did you make? Can you please elaborate?

jsk1107 closed this as completed Nov 25, 2022
