Skip to content
View Cominclip's full-sized avatar

Organizations

@Gen-Verse

Block or report Cominclip

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Cominclip/README.md

Hi there 👋

  • 🎓 I’m a first-year master student at IIGROUP in Tsinghua University, supervised by Prof. Yujiu Yang.
  • 💬 I'm now a research intern at ByteDance Seed, focus on Multimodal Large Language Models
  • 🌱 I am currently working closely with Dr. Ling Yang and Prof. Bin Cui from DAIR Lab in Peking University.
  • 🔭 My research interests lie in Controllable Text-to-Image/Video Generation and Multimodal Large Language Models.

Pinned Loading

  1. YangLing0818/RPG-DiffusionMaster Public

    [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

    Jupyter Notebook 1.8k 102

  2. YangLing0818/IterComp Public

    [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

    Python 177 11

  3. YangLing0818/RealCompo Public

    [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models

    Python 115 4

  4. Gen-Verse/HermesFlow Public

    HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

    Python 52 3

  5. mini-sora/minisora Public

    MiniSora: A community aims to explore the implementation path and future development direction of Sora.

    Python 1.3k 151

  6. BoxDiff-XL Public

    Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)

    Python 22 1