Skip to content
View invictus717's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report invictus717

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. MetaTransformer MetaTransformer Public

    Meta-Transformer for Unified Multimodal Learning

    Python 1.4k 110

  2. AILab-CVC/UniRepLKNet AILab-CVC/UniRepLKNet Public

    [CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

    Python 823 50

  3. csuhan/OneLLM csuhan/OneLLM Public

    [CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

    Python 463 23

  4. AILab-CVC/M2PT AILab-CVC/M2PT Public

    [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

    Python 69 4

  5. BiDiff/bidiff BiDiff/bidiff Public

    [CVPR'24] Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

    Python 141 5

  6. InteractiveVideo InteractiveVideo Public

    InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

    Python 115 8