
Popular repositories

  1. Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python, 4.3k stars, 388 forks

  2. Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    4.2k stars, 245 forks

  3. computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    Python, 1.4k stars, 146 forks

  4. Show-o Public

    [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python, 1.3k stars, 56 forks

  5. ShowUI Public

    [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    Python, 1.1k stars, 70 forks

  6. Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python, 1.1k stars, 61 forks

Repositories

Showing 10 of 90 repositories
  • MovieAgent Public

    MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning

    Python, 146 stars, 17 forks, 7 open issues, 0 open pull requests, updated Mar 26, 2025
  • Awesome-MLLM-Hallucination Public

    📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

    613 stars, 22 forks, 1 open issue, 0 open pull requests, updated Mar 26, 2025
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    4,196 stars, 245 forks, 1 open issue, 0 open pull requests, updated Mar 26, 2025
  • FAR Public

    Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

    Python, 81 stars, MIT license, 1 fork, 0 open issues, 0 open pull requests, updated Mar 26, 2025
  • Awesome-Robotics-Diffusion Public

    (In progress) A curated list of recent robot learning papers incorporating diffusion models for robotics tasks.

    89 stars, 3 forks, 0 open issues, 0 open pull requests, updated Mar 25, 2025
  • computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    Python, 1,449 stars, Apache-2.0 license, 146 forks, 27 open issues, 6 open pull requests, updated Mar 25, 2025
  • Exo2Ego-V Public
    Python, 33 stars, Apache-2.0 license, 0 forks, 0 open issues, 0 open pull requests, updated Mar 24, 2025
  • Show-o Public

    [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python, 1,281 stars, Apache-2.0 license, 56 forks, 38 open issues, 1 open pull request, updated Mar 24, 2025
  • GUI-Thinker Public

    Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

    Python, 53 stars, 4 forks, 1 open issue, 0 open pull requests, updated Mar 23, 2025
  • SAM-I2V Public
    2 stars, Apache-2.0 license, 0 forks, 1 open issue, 0 open pull requests, updated Mar 22, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.
