Xinchen Zhang Cominclip

Tsinghua University | ByteDance Seed

29 followers · 26 following

Tsinghua University
Beijing
https://cominclip.github.io/

Cominclip/README.md

Hi there 👋

🎓 I’m a first-year master's student at IIGROUP at Tsinghua University, supervised by Prof. Yujiu Yang.
💬 I'm currently a research intern at ByteDance Seed, focusing on Multimodal Large Language Models.
🌱 I am working closely with Dr. Ling Yang and Prof. Bin Cui from DAIR Lab at Peking University.
🔭 My research interests lie in Controllable Text-to-Image/Video Generation and Multimodal Large Language Models.

Pinned

YangLing0818/RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook · 1.8k stars · 102 forks

YangLing0818/IterComp

[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Python · 177 stars · 11 forks

YangLing0818/RealCompo

[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models

Python · 115 stars · 4 forks

Gen-Verse/HermesFlow

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Python · 52 stars · 3 forks

mini-sora/minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python · 1.3k stars · 151 forks

BoxDiff-XL

Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)

Python · 22 stars · 1 fork