-
Tencent Youtu Lab
- Shanghai
- lzw123@mail.ustc.edu.cn
Pinned Loading
-
VITA-MLLM/VITA
VITA-MLLM/VITA Public✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
-
Open-GroundingDino
Open-GroundingDino PublicThis is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
-
VITA-MLLM/VITA-Audio
VITA-MLLM/VITA-Audio Public✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.