Imagen's key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and image-text alignment much more than increasing the size of the image diffusion model.
They also find that while T5-XXL and CLIP text encoders perform similarly on simple benchmarks such as MS-COCO, human evaluators prefer T5-XXL over CLIP in both image-text alignment and image fidelity on DrawBench, a set of challenging and compositional prompts.
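A minimal sketch of how a frozen T5 encoder can produce the text embeddings that a diffusion model conditions on. This is not Imagen's actual code; it assumes the Hugging Face `transformers` library, and uses `t5-small` as a stand-in for the much larger T5-XXL from the paper.

```python
# Sketch: extract frozen T5 text-encoder features for conditioning (assumed setup).
import torch
from transformers import T5Tokenizer, T5EncoderModel

tokenizer = T5Tokenizer.from_pretrained("t5-small")
encoder = T5EncoderModel.from_pretrained("t5-small").eval()  # frozen, no fine-tuning

prompt = "A photo of a corgi riding a skateboard"
tokens = tokenizer(prompt, return_tensors="pt", padding="max_length",
                   max_length=77, truncation=True)

with torch.no_grad():
    # (batch, seq_len, d_model) token-level embeddings; a text-to-image
    # diffusion model would cross-attend to these as its conditioning signal.
    text_embeddings = encoder(**tokens).last_hidden_state

print(text_embeddings.shape)  # torch.Size([1, 77, 512]) for t5-small
```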
So should we try to visualize the features of the T5 text encoder?
T5: https://github.com/google-research/text-to-text-transfer-transformer
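One possible way to visualize those features, continuing from the sketch above: project the token-level embeddings to 2D with PCA and label each point with its token. This is purely illustrative (the visualization method and the scikit-learn/matplotlib dependency are my assumptions, not something specified in the issue or the T5 repo).

```python
# Sketch: PCA scatter plot of T5 token embeddings (illustrative assumption).
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

emb = text_embeddings[0].numpy()                  # (seq_len, d_model)
n_real = int(tokens["attention_mask"][0].sum())   # drop padding positions
emb = emb[:n_real]

coords = PCA(n_components=2).fit_transform(emb)
labels = tokenizer.convert_ids_to_tokens(tokens["input_ids"][0][:n_real].tolist())

plt.scatter(coords[:, 0], coords[:, 1])
for (x, y), tok in zip(coords, labels):
    plt.annotate(tok, (x, y), fontsize=8)
plt.title("PCA of T5 token embeddings")
plt.show()
```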