Research Associate @ Imperial College London. I work on scaling efficiently large audio-visual models.
-
Imperial College London
- London
- https://umbertocappellazzo.github.io/
- @Umberto_Senpai
- in/umberto-cappellazzo-116093150
Pinned Loading
-
Llama-AVSR
Llama-AVSR Public[ICASSP 2025] Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners".
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.