Language Technology Lab at Alibaba DAMO Academy

DAMO-SeaLLMs Public

[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

JavaScript 160 15

VideoLLaMA3 Public

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 623 42

CoI-Agent Public

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 440 27

Inf-CLIP Public

[CVPR 2025] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.

Python 234 11

multimodal_textbook Public

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 145 16

VideoRefer Public

[CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

Python 169 8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Language Technology Lab at Alibaba DAMO Academy

Pinned Loading

Repositories

People

Top languages

Most used topics