More large LLM potentials needed for the community! #188

Open
dragen1860 opened this issue Jun 4, 2024 · 1 comment

Comments

@dragen1860

Dear authors:
It's very promising to see that the stronger Mistral-7B LLM enhances video understanding. We are eager to see how much more can be gained by replacing the LLM with stronger models such as Llama 3, Yi-34B, or InternLM. Specifically, please try evaluating larger LLMs, such as 34B and 70B models, and let the community know whether it helps. Thanks for such a great project.

@Andy1621
Collaborator

Andy1621 commented Jun 7, 2024

Good idea! But the current codebase is not friendly to larger LLMs. We have updated VideoChat2-HD, which uses a higher resolution for better results.
