-
CSAIL, MIT
- Cambridge, Massachusetts
- https://people.csail.mit.edu/yungsung/
- @YungSungChuang
Highlights
- Pro
Stars
A series of math-specific large language models of our Qwen2 series.
Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"
Witness the aha moment of VLM with less than $3.
Fully open reproduction of DeepSeek-R1
An open-source implementaion for fine-tuning Llama3.2-Vision series by Meta.
Let your Claude able to think
Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
This repository contains the official implementation for the AAAI25 paper "From Words to Worth: Newborn Article Impact Prediction with LLM".
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Efficient Triton Kernels for LLM Training
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
Official repository for LongChat and LongEval
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)