Skip to content

Popular repositories Loading

  1. Tune-A-Video Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python 4.3k 387

  2. Awesome-Video-Diffusion Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    4.1k 244

  3. computer_use_ootb computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    Python 1.4k 135

  4. Show-o Show-o Public

    [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1.3k 55

  5. ShowUI ShowUI Public

    [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    Python 1.1k 66

  6. Show-1 Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python 1k 61

Repositories

Showing 10 of 89 repositories
  • MovieAgent Public

    MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning

    Python 52 2 1 0 Updated Mar 13, 2025
  • ShowUI Public

    [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    Python 1,086 Apache-2.0 66 2 0 Updated Mar 13, 2025
  • TPDiff Public

    TPDiff: Temporal Pyramid Video Diffusion Model

    9 1 0 0 Updated Mar 13, 2025
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    4,132 244 2 0 Updated Mar 13, 2025
  • VLog Public

    [CVPR 2025] Video Narration as Vocabulary & Video as Long Document

    Python 557 27 7 0 Updated Mar 13, 2025
  • Show-o Public

    [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1,252 Apache-2.0 55 39 1 Updated Mar 12, 2025
  • GUI-Thinker Public

    Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

    Python 50 2 1 0 Updated Mar 12, 2025
  • MovieSeq Public

    [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences

    Jupyter Notebook 36 1 0 0 Updated Mar 11, 2025
  • SMS Public

    Balanced Image Stylization with Style Matching Score

    12 0 0 0 Updated Mar 11, 2025
  • SAM-I2V Public
    0 Apache-2.0 0 0 0 Updated Mar 10, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.