Add YttmTokenizer, ImageTextDataset from @rom1504, Single-GPU trainin… #3

afiaka87 · 2021-12-07T21:08:06Z

…g script

rom1504 · 2021-12-07T21:42:23Z

x_clip/dataset.py

+from torch.utils.data import Dataset
+
+
+class ImageTextDataset(Dataset):


I think there might be some value to making an independent package with such vision/text dataset readers (this one and webdataset at least), and depending on it here and in dalle pytorch (and in clip retrieval and probably a bunch of other places)
what do you think?

agreed. at this point it's a bit strange that pytorch doesn't have something for this by now - are you aware of anything?

besides from discussions at pytorch/pytorch#38419 no I don't know
I think pytorch has been taking the approach of letting the user build their own things on the data side

although as a reminder image+text dataset is something that has begun being useful in 2021, it's not that old yet :)

MicPie · 2021-12-10T18:41:22Z

train.py

+        for batch_idx, (text, images) in current_epoch_pbar:
+            with autocast(enabled=args.amp):
+                text, images = map(lambda t: t.cuda(), (text, images))
+                mask = torch.ones_like(text).bool()


A recent update adapted the text loss to incorporate a text mask:

x-clip/x_clip/x_clip.py

Line 314 in ac62779

text_to_image = masked_mean(text_to_image, text_to_image_mask, dim = -1)

To make use of this, the dataset could return the accompanying text masks to utilize the masked mean.
Really nice work! :-)

afiaka87 · 2021-12-16T19:10:51Z

@lucidrains Let me know if there's any glaring mistakes but this should provide similar functionality to what we had in dalle-pytorch. Main thing missing is webdataset support and multi-GPU, but I figured folks may want to start using this and I don't know how long it will take me to implement that.

Romain made a decent point about how everyone seems to just rewrite/copy-paste the text-image dataloader but unfortunately I can't commit to maintaining a pip package for that either.

afiaka87 · 2021-12-20T21:39:10Z

@MicPie Thanks for the DDP code. I've rebased your branch onto this one so we can hopefully get that upstream.

afiaka87 · 2022-01-01T10:46:56Z

Apologies, have not had the time to get this branch working. Closing for now.

Add YttmTokenizer, ImageTextDataset from @rom1504, Single-GPU trainin…

dee5e98

…g script

rom1504 reviewed Dec 7, 2021

View reviewed changes

MicPie reviewed Dec 10, 2021

View reviewed changes

Clay Mullis added 3 commits December 16, 2021 13:04

Merge upstream, Keep YTTM and Simple tokenizers

72ea32a

fix import

a5d2f16

Allow use of simple or yttm tokenizer in train.py

2389a58

afiaka87 marked this pull request as ready for review December 16, 2021 19:08

Clay Mullis and others added 10 commits December 16, 2021 13:23

Fix unfinished tokenize method

7a6b8e4

Adaptations CLIP model class for ddp training 1.

7833706

Draft ddp training 1.

88bb5d9

Draft ddp training 2 incl. dumming dataset and bug fixes.

f037153

Draft ddp training 3 incl. webdataset and fixes.

cf83155

Fix data timing measurement point.

211b549

Add checkdataloading argument to debug dataloading.

db3a65d

Rebase training branch onto ddp branch from MiCPie

b13e2f9

Allow use of simple or yttm tokenizer in train.py

16b6560

Merge remote w local

96dab07

afiaka87 closed this Jan 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add YttmTokenizer, ImageTextDataset from @rom1504, Single-GPU trainin… #3

Add YttmTokenizer, ImageTextDataset from @rom1504, Single-GPU trainin… #3

afiaka87 commented Dec 7, 2021

rom1504 Dec 7, 2021 •

edited

Loading

afiaka87 Dec 8, 2021

rom1504 Dec 8, 2021

MicPie Dec 10, 2021 •

edited

Loading

afiaka87 commented Dec 16, 2021 •

edited

Loading

afiaka87 commented Dec 20, 2021

afiaka87 commented Jan 1, 2022

		from torch.utils.data import Dataset


		class ImageTextDataset(Dataset):

Add YttmTokenizer, ImageTextDataset from @rom1504, Single-GPU trainin… #3

Add YttmTokenizer, ImageTextDataset from @rom1504, Single-GPU trainin… #3

Conversation

afiaka87 commented Dec 7, 2021

rom1504 Dec 7, 2021 • edited Loading

Choose a reason for hiding this comment

afiaka87 Dec 8, 2021

Choose a reason for hiding this comment

rom1504 Dec 8, 2021

Choose a reason for hiding this comment

MicPie Dec 10, 2021 • edited Loading

Choose a reason for hiding this comment

afiaka87 commented Dec 16, 2021 • edited Loading

afiaka87 commented Dec 20, 2021

afiaka87 commented Jan 1, 2022

rom1504 Dec 7, 2021 •

edited

Loading

MicPie Dec 10, 2021 •

edited

Loading

afiaka87 commented Dec 16, 2021 •

edited

Loading