Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about the description #2

Open
CIawevy opened this issue Dec 19, 2023 · 3 comments
Open

Question about the description #2

CIawevy opened this issue Dec 19, 2023 · 3 comments

Comments

@CIawevy
Copy link

CIawevy commented Dec 19, 2023

Reckless to ask,I don’t understand how to enable Myriad with the ability to describe IAD, it seems like you freeze the LLM module,So how can you enable the MiniGPT to describe IAD? With the designed prompt and embedding,the conclusion that " the MiniGPT4 can make such description" seems to be impossible?
image

@tzjtatata
Copy link
Owner

Refer to read our paper on arxiv(https://arxiv.org/abs/2310.19070). We encode the anomaly maps predicted by vision experts with our proposed Expert Perception modules into both LLM and Qformer(MiniGPT4) as instructions.
The paper will be updated to the newest version. We are also working hard to prepare our model, data and codes for reproduction. Sorry for waiting.

@CIawevy
Copy link
Author

CIawevy commented Dec 20, 2023

Thanks for your reply! And I am also looking forward to the new version of your excellent work!
Still wondering about the details in the visual-language pairs in IAD datasets, mentioned in the paper.
Do you use the same Image Description in AnomalyGPT? as illustrated bellow (extract from AnomalyGPT paper)
image
How do you construct the Answer of the training data?

@tzjtatata
Copy link
Owner

No, we do not use complex constructed answers for IAD instruction.
Simple "yes, there is a defect/are anomalies." are pretty good for MiniGPT-4. Additional information might be confusion for training MiniGPT-4. BUT we also try multi-task joint learning including counting, detection and object type classification for training.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants