You do say [how to train a new model from a dataset](https://github.com/huggingface/diffusers/tree/main/examples). You don't say how to create a dataset. Is it just a directory of images? Do they need to be sized somehow? Is there a subdirectory tree? Are there text files containing metadata?