MagicAvatar: Multimodal Avatar Generation and Animation

Jianfeng Zhang* · Hanshu Yan* · Zhongcong Xu* · Jiashi Feng · Jun Hao Liew†
ByteDance Inc.

Paper PDF · Project Page


Introducing MagicAvatar, a multimodal framework that converts various input modalities (text, video, and audio) into motion signals, which are then used to generate or animate an avatar.

For more general video editing applications, please also check out our latest work, MagicEdit!


If you find our work useful, please consider citing:

    author    = {Zhang, Jianfeng and Yan, Hanshu and Xu, Zhongcong and Feng, Jiashi and Liew, Jun Hao},
    title     = {MagicAvatar: Multimodal Avatar Generation and Animation},
    booktitle = {arXiv},
    year      = {2023}

    author    = {Liew, Jun Hao and Yan, Hanshu and Zhang, Jianfeng and Xu, Zhongcong and Feng, Jiashi},
    title     = {MagicEdit: High-Fidelity and Temporally Coherent Video Editing},
    booktitle = {arXiv},
    year      = {2023}