Hi! First of all, thanks for releasing such a great model and accompanying paper. Could you clarify a few design choices in SDXL?
- Why do you use both the previous CLIP-L and the new OpenCLIP ViT-bigG? Have you tried using only the latter? Wouldn't it be enough on its own?
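  To make sure I understand the setup I'm asking about: my reading of the paper is that the penultimate hidden states of both text encoders are concatenated along the channel axis to form the cross-attention context. A minimal sketch of that (shapes only, random placeholders instead of real encoder outputs):

  ```python
  import numpy as np

  seq_len = 77  # token sequence length used by both encoders

  # placeholder per-token hidden states (penultimate layer outputs)
  clip_l = np.random.randn(seq_len, 768)      # CLIP ViT-L/14: 768-dim
  open_clip = np.random.randn(seq_len, 1280)  # OpenCLIP ViT-bigG/14: 1280-dim

  # channel-wise concatenation -> (77, 2048) context fed to cross-attention
  context = np.concatenate([clip_l, open_clip], axis=-1)
  print(context.shape)  # (77, 2048)
  ```

  If only ViT-bigG were used, the context would drop to 1280 channels, so I'm curious whether the CLIP-L branch was kept mainly for quality or for compatibility reasons.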
- The crop-conditioning, while avoiding the generation of too many cropped images, seems to produce more duplication: the object of interest appears everywhere in the frame instead of as a single instance. See these comparisons. I wonder why you don't use multi-aspect (i.e. rectangular) training during the whole training process, rather than only during fine-tuning.
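  For context, here is my understanding of the crop-conditioning mechanism I'm referring to: the top-left crop coordinates are mapped through sinusoidal (Fourier) embeddings and added to the model's conditioning alongside the timestep embedding. A rough sketch, with the embedding dimension chosen arbitrarily for illustration:

  ```python
  import numpy as np

  def fourier_embed(value, dim=256, max_period=10000.0):
      # standard sinusoidal embedding of a scalar, as used for
      # timestep / micro-conditioning signals
      half = dim // 2
      freqs = np.exp(-np.log(max_period) * np.arange(half) / half)
      args = value * freqs
      return np.concatenate([np.cos(args), np.sin(args)])

  # crop conditioning: embed the (top, left) crop offsets and concatenate
  c_top, c_left = 0, 0  # (0, 0) would signal an uncropped training image
  crop_emb = np.concatenate([fourier_embed(c_top), fourier_embed(c_left)])
  print(crop_emb.shape)  # (512,)
  ```

  At inference, setting (0, 0) steers the model toward uncropped-looking outputs, which is where I suspect the duplication artifacts I describe above come in.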