where to use #44

bibibabibo26 · 2024-05-20T09:11:59Z

hello, in the inference.py you offered in #14, I see the multi-modal input tokens for LLM, it includes bbox token, but I can't find where you replace the bbox token or you use the image feature which got from clip and interpolate. Can you explain it for me? thank you.

bibibabibo26 · 2024-05-20T09:26:09Z

I find it in SPILlavaLlamaModel

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

where to use #44

where to use #44

bibibabibo26 commented May 20, 2024 •

edited

Loading

bibibabibo26 commented May 20, 2024

where to use #44

where to use #44

Comments

bibibabibo26 commented May 20, 2024 • edited Loading

bibibabibo26 commented May 20, 2024

bibibabibo26 commented May 20, 2024 •

edited

Loading