Skip to content

Pinned Loading

  1. OmniMMI OmniMMI Public

    [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

    Python 13

  2. OpenOmniNexus OpenOmniNexus Public

    a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.

    Python 19 2

  3. M4 M4 Public

    [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

    Python 9

Repositories

Showing 4 of 4 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…