plug in problem #11

kenxxxxx · 2024-03-05T12:56:36Z

ViT-Lens outputs a 1×768 tensor for each modality input, right? If so, where in InstructBLIP should I plug it in? Thanks!

StanLei52 · 2024-03-05T14:06:36Z

Hi, thank you for your question.

For the InstructBLIP and SEED-LLaMA integration, we train ViT-Lens with EVA-CLIP-g/14 (embedding dim: 1408), which is different from ViT-Lens-L (based on ViT-Large). Since this is the same ViT that InstructBLIP and SEED-LLaMA use, we directly plug in the Lens and the modality module before the ViT layers.
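
To make the wiring concrete, here is a minimal shape-level sketch in Python. It is not the ViT-Lens code: `Stage`, `check_chain`, and the stage names are hypothetical, and only the two widths (1408 and 768) come from this thread. It also assumes the downstream InstructBLIP components expect 1408-wide features, since they share the same ViT.

```python
from dataclasses import dataclass
from typing import List, Optional

EVA_CLIP_G_WIDTH = 1408  # embedding dim of EVA-CLIP-g/14, as stated in this thread
VIT_LENS_L_OUT = 768     # per-input output size quoted in the question


@dataclass(frozen=True)
class Stage:
    """One block in the pipeline, described only by its feature widths (hypothetical helper)."""
    name: str
    in_dim: Optional[int]  # None means the stage consumes raw modality data
    out_dim: int


def check_chain(stages: List[Stage], consumer_in_dim: int) -> List[str]:
    """Return the stage order if every width lines up, otherwise raise ValueError."""
    if not stages:
        raise ValueError("at least one stage is required")
    for prev, nxt in zip(stages, stages[1:]):
        if nxt.in_dim is not None and prev.out_dim != nxt.in_dim:
            raise ValueError(
                f"{prev.name} outputs {prev.out_dim}, but {nxt.name} expects {nxt.in_dim}"
            )
    if stages[-1].out_dim != consumer_in_dim:
        raise ValueError(
            f"{stages[-1].name} outputs {stages[-1].out_dim}, "
            f"but the consumer expects {consumer_in_dim}"
        )
    return [s.name for s in stages]


# Order described above: the modality module and the Lens sit before the ViT layers.
integration = [
    Stage("modality_module", None, EVA_CLIP_G_WIDTH),
    Stage("lens", EVA_CLIP_G_WIDTH, EVA_CLIP_G_WIDTH),
    Stage("vit_layers", EVA_CLIP_G_WIDTH, EVA_CLIP_G_WIDTH),
]

# Assumption: the consumer after the ViT expects EVA-CLIP-g/14-width features.
order = check_chain(integration, consumer_in_dim=EVA_CLIP_G_WIDTH)
print(" -> ".join(order))
```

Under the same assumption, passing a single stage with `out_dim=VIT_LENS_L_OUT` to `check_chain` raises `ValueError`, which is the mismatch behind the original question: the 768-wide ViT-Lens-L output is not what the integration feeds downstream.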

kenxxxxx · 2024-03-05T14:31:57Z

Thanks for your answer! Do you plan to upload the models used for integration?

StanLei52 · 2024-03-05T15:51:43Z

Yes. Some checkpoints (3D) are currently available on Hugging Face. I am cleaning up the code for the integration pipeline and plan to release it within one month, as my bandwidth is limited.

kenxxxxx · 2024-03-06T08:18:37Z

Okay thanks!

cfeng16 · 2024-04-04T04:35:55Z

Nice work! Is there any update on the release of the code?

kenxxxxx closed this as completed Mar 6, 2024

StanLei52 mentioned this issue Mar 10, 2024: InstructBLIP and SEED Implementation #14 (Open)
