support Sail-VL2 models #5921
Summary of Changes
Hello @hjh0119, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request adds full support for the SAIL-VL2 series of multimodal models. It registers these models and their associated templates, implements model-specific generation logic for compatibility, and updates the user-facing documentation. A new template class handles multimodal input encoding so that these vision-language models work within the existing framework.
Code Review
This pull request adds support for Sail-VL2 models, including model and template type definitions, model registration, a new template class, documentation updates, and corresponding tests. My review focuses on improving the documentation, correcting a type hint, fixing an incorrect assertion, adding a missing tag for the new model, and simplifying the test code for better clarity and correctness. Overall, the changes are well-structured, but a few refinements are needed.
/gemini review
Code Review
This pull request adds support for the Sail-VL2 model family. The changes include adding model definitions, a new template implementation, and corresponding documentation and tests. The implementation reuses the internvl model architecture, which is a good practice. My review focuses on improving code correctness, maintainability, and clarity. I've identified a critical bug in an assertion, a potential device placement issue, and several opportunities to improve code clarity with better naming and comments, especially regarding a specific dependency version constraint.
No description provided.