Various Training Results So Far #106
afiaka87 started this conversation in Show and tell
I trained dalle-pytorch on a couple different datasets a few times. How "successful" the results are is up to you, but one thing I learned is that we need a lot more image-text pairs to get to the level of accuracy that OpenAI has.
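For context on what one of these training steps actually looks like, here is a minimal sketch following the `DiscreteVAE` / `DALLE` usage from the dalle-pytorch README. The hyperparameters are the README's illustrative values, not necessarily the settings used for any of the runs below.

```python
import torch
from dalle_pytorch import DiscreteVAE, DALLE

# Discrete VAE that turns a 256x256 image into a 32x32 = 1024-token grid.
vae = DiscreteVAE(
    image_size = 256,
    num_layers = 3,
    num_tokens = 8192,
    codebook_dim = 512,
    hidden_dim = 64,
    num_resnet_blocks = 1,
    temperature = 0.9
)

dalle = DALLE(
    dim = 1024,
    vae = vae,
    num_text_tokens = 10000,   # vocabulary size of the text tokenizer
    text_seq_len = 256,
    depth = 12,
    heads = 16,
    dim_head = 64,
    attn_dropout = 0.1,
    ff_dropout = 0.1
)

# One training step on a dummy batch of tokenized captions + images.
text = torch.randint(0, 10000, (4, 256))
images = torch.randn(4, 3, 256, 256)

loss = dalle(text, images, return_loss = True)
loss.backward()
```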
~1 Epoch on COCO, OpenImagesV6, and the OpenAI blog post image-text pairs I scraped.
On this one, I think the model overfit to the OpenAI image-text pairs, which did more harm than good to its generalizability. (A loader sketch for mixing datasets like these follows the link.)
https://wandb.ai/afiaka87/OpenImagesV6/reports/Training-on-COCO-OpenImage-Blogpost--Vmlldzo1NDE3NjU
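Combining datasets like these mostly comes down to exporting each source into a common image + caption-file layout and concatenating them. A minimal sketch, assuming a hypothetical layout of `xxx.jpg` next to `xxx.txt` (the folder names and layout are made up for illustration, not the actual preprocessing used for this run):

```python
from pathlib import Path

from PIL import Image
from torch.utils.data import ConcatDataset, Dataset
from torchvision import transforms


class ImageCaptionFolder(Dataset):
    """Pairs each image with a same-named .txt caption file."""

    def __init__(self, root, image_size=256):
        self.paths = sorted(p for p in Path(root).glob("*.jpg")
                            if p.with_suffix(".txt").exists())
        self.transform = transforms.Compose([
            transforms.Resize(image_size),
            transforms.CenterCrop(image_size),
            transforms.ToTensor(),
        ])

    def __len__(self):
        return len(self.paths)

    def __getitem__(self, idx):
        path = self.paths[idx]
        image = self.transform(Image.open(path).convert("RGB"))
        caption = path.with_suffix(".txt").read_text().strip()
        return image, caption


# Hypothetical directory names -- each source exported to the same layout.
mixed = ConcatDataset([
    ImageCaptionFolder("data/coco2014"),
    ImageCaptionFolder("data/openimages_v6"),
    ImageCaptionFolder("data/openai_blog_scrape"),
])
```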
~1 Epoch on COCO2014
https://wandb.ai/afiaka87/OpenImagesV6/reports/OpenImagesV6-COCO--Vmlldzo1MzgyNTI
~1 Epoch on the OpenAI blog post image-text pairs I scraped.
https://wandb.ai/afiaka87/ellie_miller/reports/Project-Dashboard--Vmlldzo1NDQ5MzM
~1 Epoch on COCO and OpenImagesV6 (no blog post images)
https://wandb.ai/afiaka87/oai_coco/reports/Trained-on-COCO-2014-and-OpenImagesV6--Vmlldzo1NDQ5NDg
Roughly 3 full epochs on the OpenAI blog scrape and COCO2014
This one crashed during its first run, so it is split into two reports. It uses a larger batch_size of 48, so it needs fewer iterations than the other runs to finish an epoch (rough arithmetic after the links).
https://wandb.ai/afiaka87/openai_blog/reports/OpenAIBlog-Scrape-and-COCO2014-part-1---Vmlldzo1NDQ5NjU
https://wandb.ai/afiaka87/openai_blog/reports/OpenAIBlog-Scrape-and-COCO2014--Vmlldzo1NDQ5NTQ
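The iterations-per-epoch point is just dataset size divided by batch size. A quick sketch, assuming ~200k pairs per epoch (borrowing the COCO2014 figure quoted below) and 16 as a stand-in for the smaller batch size of the other runs, which isn't documented here:

```python
# Assumed numbers: ~200k pairs per epoch; 16 is a placeholder for the
# smaller batch size used by the other runs (not stated in this post).
num_pairs = 200_000
for batch_size in (16, 48):
    steps = num_pairs // batch_size
    print(f"batch_size={batch_size}: ~{steps} iterations per epoch")
# batch_size=16: ~12500 iterations per epoch
# batch_size=48: ~4166 iterations per epoch
```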
(Colab and an A100) Four runs on COCO2014 (200k images), trained for 5 to 8 epochs each.
The 1024-token model allows us to train in Colab! Two runs were on P100s, one on a V100, and one on an A100 I had access to. (Quick token-budget sketch after the link.)
https://wandb.ai/afiaka87/dalle_train_coco2018/reports/Various-Runs-on-COCO2014--Vmlldzo1NDQ5NzY
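On the "1024-token" point: assuming this refers to the image token sequence (a 256x256 image downsampled by a 3-layer discrete VAE to a 32x32 grid), the per-sample attention context works out as below; the text_seq_len value is illustrative.

```python
# Token budget per sample, assuming "1024-token model" means the image
# token sequence from the discrete VAE (an interpretation, not stated
# explicitly above).
image_size = 256
num_layers = 3                            # VAE downsampling stages
fmap = image_size // (2 ** num_layers)    # 256 / 8 = 32
image_seq_len = fmap * fmap               # 32 * 32 = 1024 image tokens

text_seq_len = 256                        # illustrative value
total_seq_len = text_seq_len + image_seq_len
print(total_seq_len)                      # 1280 tokens of context per sample
```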