Skip to content

Conversation

@DoctorKey
Copy link
Contributor

What does this PR do?

This PR introduces Ovis-Image into the diffusers library. Ovis-Image integrates a diffusion-based visual decoder with the Ovis 2.5 multimodal backbone, leveraging a text-centric training pipeline that combines large-scale pre-training with carefully tailored post-training refinements. Despite its compact architecture, Ovis-Image achieves text rendering performance on par with significantly larger open models such as Qwen-Image and approaches closed-source systems like Seedream and GPT4o.

@DoctorKey
Copy link
Contributor Author

Ovis-Image has been released:

@yiyixuxu
Copy link
Collaborator

yiyixuxu commented Dec 1, 2025

@bot /style

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Collaborator

@yiyixuxu yiyixuxu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks so much for the PR! I left a few feedbacks, and I think we can merge this very soon

Congrats on the release!! Sorry, we overlooked the PR (it was the thanksgiving holiday in US)
We will reach out to set up a collaboration channel for your future release.

@DoctorKey
Copy link
Contributor Author

Thank you for the detailed feedback. I have addressed all comments in this commit. Please let me know if further adjustments are needed!
@yiyixuxu

Copy link
Collaborator

@yiyixuxu yiyixuxu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks!

@yiyixuxu
Copy link
Collaborator

yiyixuxu commented Dec 2, 2025

can you run make style and make fix-copies

and do you want to add docs and test in a follow up PR?

we need to add these docs:

test I saw you already created a folder:)

@DoctorKey
Copy link
Contributor Author

I have run make style and make fix-copies in this commit.

The docs have been added in this commit.

We want to add test in a follow up PR.

@yiyixuxu yiyixuxu merged commit 4f136f8 into huggingface:main Dec 2, 2025
9 of 11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants