heng-hw

heng-hw

Achievements

Gotta Hear Them All: Sound Source Aware Vision to Audio Generation.

Python 59 Updated Mar 9, 2025

MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities

14 2 Updated Feb 16, 2025

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Python 43 5 Updated Aug 27, 2022

An easy-to-use debug print tool for deep learning projects in python. PyPi: https://pypi.org/project/pydprint/

Python 9 1 Updated Feb 25, 2022

Project page: https://3dmedpt.github.io/

Python 49 7 Updated Jan 13, 2022