More large LLM potentials needed for the community! #188

dragen1860 · 2024-06-04T07:52:01Z

Dear authors:
It's very promising to witness that the stronger Mistral-7b llm models enhance the capability of video understanding. We would eager to see more potentials performed by replacing the llm with more strong models such as llama3, Yi-34b, InternLM. Specificlly, please try to evaluate some llm models such as 34b, 70b and let the community know whether it helps. Thank for such a great project.

Andy1621 · 2024-06-07T10:49:04Z

Good idea! But current codebase is not friendly for larger LLM. We have updated VideoChat2-HD, which use large resolution for better results.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More large LLM potentials needed for the community! #188

More large LLM potentials needed for the community! #188

dragen1860 commented Jun 4, 2024

Andy1621 commented Jun 7, 2024

More large LLM potentials needed for the community! #188

More large LLM potentials needed for the community! #188

Comments

dragen1860 commented Jun 4, 2024

Andy1621 commented Jun 7, 2024