Reconstruction results #20

Marcelo5444 · 2023-08-10T17:54:57Z

Hi, First of all thanks for you work.

Working with vit small, I see that results are far away from VQGAN, did you stop training when reached convergence? Do you think there is more room to improve the model performance/

Results with vit-small

input image

ghost · 2023-08-11T12:41:26Z

Can you show me your code for reconstruction？I also meet this problem that reconstruction results of the ViT-VQGAN on ImageNet are very terrible.

Marcelo5444 · 2023-08-11T19:11:55Z

config = OmegaConf.load('configs/imagenet_vitvq_small.yaml')
model = initialize_from_config(config.model)
model.init_from_ckpt('/home/marcelo/Downloads/imagenet_vitvq_small.ckpt')

def preprocess(img):
s = min(img.size)

if s < 256:
    raise ValueError(f'min dim for image {s} < 256')

r = 1024 / s
s = (round(r * img.size[1]), round(r * img.size[0]))
img = TF.resize(img, s, interpolation=PIL.Image.LANCZOS)
img = TF.center_crop(img, output_size=2 * [256])
img = torch.unsqueeze(T.ToTensor()(img), 0)
return img

original=Image.open('/home/marcelo/Downloads/212861459-e4113b34-622d-4602-afe4-f20e2d79425c.png')
image=preprocess(original)
image = image[:,:3,:,:]

quant, _ = model.encode(image)
dec = model.decode(quant)

ghost · 2023-08-12T13:21:03Z

Actually, I think the reason is the bad model checkpoint. Your script is right. I measure the rFID, it is far away from VQGAN. I also train the model on ImageNet, but it still works badly. From: ***@***.***> Date: Sat, Aug 12, 2023, 03:04 Subject: [External] Re: [thuanz123/enhancing-transformers] Reconstruction results (Issue #20) To: "thuanz123/enhancing-transformers"< ***@***.***> Cc: ***@***.***>, "Comment"< ***@***.***> The same as the one in the colab notebook — Reply to this email directly, view it on GitHub <#20 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A7ZMO7CSSW4EOBN4SOSLJVDXUZ63PANCNFSM6AAAAAA3L3JDYQ> . You are receiving this because you commented.Message ID: ***@***.***>

Marcelo5444 · 2023-08-16T12:48:10Z

So, after your training, you obtain a better model weights that improve the reconstruction?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reconstruction results #20

Reconstruction results #20

Marcelo5444 commented Aug 10, 2023 •

edited

Loading

ghost commented Aug 11, 2023

Marcelo5444 commented Aug 11, 2023

ghost commented Aug 12, 2023 via email

Marcelo5444 commented Aug 16, 2023

Reconstruction results #20

Reconstruction results #20

Comments

Marcelo5444 commented Aug 10, 2023 • edited Loading

ghost commented Aug 11, 2023

Marcelo5444 commented Aug 11, 2023

ghost commented Aug 12, 2023 via email

Marcelo5444 commented Aug 16, 2023

Marcelo5444 commented Aug 10, 2023 •

edited

Loading