This is the official repo for the paper: NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion.
NÜWA is a unified multimodal pre-trained model that can generate new or manipulate existing visual data (i.e., images and videos) for 8 visual synthesis tasks (as shown above).