LLaVA-based architecture #11

Open
silence143 opened this issue Mar 27, 2024 · 1 comment

@silence143

How is the Octopus dataset organized and trained on the LLaVA architecture? LLaVA doesn't support in-context learning, and if we merge all subtasks into a multi-turn conversation, another problem arises: LLaVA would input all of the subtasks' image embeddings at once, which seems hard to avoid.
So how do you deal with that? Do you input no images and only use the environment information? Could you provide a demo.json showing how the dataset is organized for the LLaVA architecture? Thanks a lot.
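For reference, standard LLaVA instruction tuning expects a JSON list of entries, each carrying an image and a multi-turn `conversations` list whose human turns contain an `<image>` placeholder token. Below is a minimal Python sketch of what a merged multi-subtask entry might look like; the subtask text, plan strings, and the multi-image `image` list are hypothetical (vanilla LLaVA uses a single image string per entry), and this is not the actual Octopus format.

```python
import json

# Hypothetical merged multi-subtask entry in a LLaVA-style "conversations"
# layout. Field contents are illustrative only and do not reflect the real
# Octopus dataset; vanilla LLaVA uses a single image string, not a list.
demo_entry = {
    "id": "episode_0001",
    # One image per subtask; this is what would make LLaVA embed all
    # subtask images for the sample at once.
    "image": ["subtask_0.png", "subtask_1.png"],
    "conversations": [
        {"from": "human",
         "value": "<image>\nEnvironment: kitchen. Subtask 1: find the mug."},
        {"from": "gpt",
         "value": "plan: walk_to(counter); grasp(mug)"},
        {"from": "human",
         "value": "<image>\nSubtask 2: place the mug in the sink."},
        {"from": "gpt",
         "value": "plan: walk_to(sink); place(mug, sink)"},
    ],
}

# Write a one-entry demo.json in the layout sketched above.
with open("demo.json", "w") as f:
    json.dump([demo_entry], f, indent=2)
```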

@Jingkang50
Collaborator

Jingkang50 commented Mar 27, 2024

Thank you for your interest in our work!

The LLaVA version of Octopus will be released once the official video version of LLaVA is released, as we used some internal code from that project. The release should be soon, but I am not quite sure about the exact date.
