
bfloat16 cannot utilize some codes #114
Closed

AmitMY opened this issue Mar 11, 2024 · 6 comments
AmitMY commented Mar 11, 2024

When using FSQ with [8, 5, 5, 5] levels and bfloat16 training specified in pytorch-lightning, the codebook utilization plateaus just below 50%, whereas with float32 training it approaches 100%.

I don't know whether this is an issue with the implementation or an inherent limitation of FSQ; either way, I would suggest that this library force float32 for the quantization step.

Example:

```python
import torch

# bfloat16 keeps only 8 significant bits, so integers above 256 are not all
# exactly representable: neighboring values collapse onto the same code
torch.tensor([1000, 1001, 1002, 1003], dtype=torch.bfloat16).to(torch.int32)
# tensor([1000, 1000, 1000, 1004], dtype=torch.int32)
```
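A rough sketch of the suggested workaround (my guess at the idea, not the library's actual code, and omitting the offset that real FSQ applies for even level counts): upcast to float32 before bounding and rounding, then cast back.

```python
import torch

def fsq_quantize_fp32(z: torch.Tensor, levels: torch.Tensor) -> torch.Tensor:
    # hypothetical helper: do FSQ bounding + rounding in float32 so that
    # half-precision inputs do not collapse neighboring codes
    orig_dtype = z.dtype
    z = z.float()                            # upcast so round() is exact
    half_l = (levels.float() - 1) * 0.5      # half-width per dimension
    bounded = torch.tanh(z) * half_l         # squash into [-half_l, half_l]
    codes = torch.round(bounded)             # snap to the nearest level
    return (codes / half_l).to(orig_dtype)   # normalize, restore input dtype

z = torch.randn(2, 4, dtype=torch.bfloat16)
out = fsq_quantize_fp32(z, torch.tensor([8, 5, 5, 5]))
```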

AmitMY added a commit to AmitMY/vector-quantize-pytorch that referenced this issue Mar 16, 2024
lucidrains added a commit that referenced this issue Mar 19, 2024
lucidrains (Owner) commented
@AmitMY hey Amit! i put in a quick fix in 1.14.4

curious how well FSQ is performing for you otherwise. are you training an autoencoder?
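For context, basic FSQ usage with this library looks roughly like this (adapted from the README; the input width must equal len(levels), and the shapes here are illustrative):

```python
import torch
from vector_quantize_pytorch import FSQ

quantizer = FSQ(levels = [8, 5, 5, 5])

x = torch.randn(1, 1024, 4)   # last dim matches the number of levels
xhat, indices = quantizer(x)  # quantized output and integer codes
```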

AmitMY (Author) commented Mar 23, 2024

Hi! I was waiting for some compute to try this, but it actually fails (the network is now BFloat16, while the input is cast to float):

```
RuntimeError: mat1 and mat2 must have the same dtype, but got Float and BFloat16
```
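If I understand the failure correctly, this is roughly what happens (a hypothetical minimal repro, not the library's code):

```python
import torch
import torch.nn as nn

# the quantizer now upcasts its output to float32, but the next
# layer's weights are still bfloat16, so the matmul dtypes disagree
proj = nn.Linear(4, 4).to(torch.bfloat16)
z = torch.randn(2, 4, dtype=torch.bfloat16)
z = z.float()  # what the quick fix effectively does to the activations
proj(z)        # RuntimeError: mat1 and mat2 must have the same dtype

# casting back after quantization, e.g. z.to(orig_dtype), would avoid this
```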

FSQ is performing amazingly well for me. Basically 100% codebook util, and the autoencoder can predict the input very well. I did have to normalize my data, but once that was done it was smooth sailing.

lucidrains (Owner) commented

@AmitMY besides code utilization, have you tried running it against regular VQ as an ablation / to compare?

AmitMY (Author) commented Mar 29, 2024

I only tried regular VQ at the beginning, saw that FSQ was better/more stable for my problem, and then scaled up the data/model size - so no, for my current problem I did not fully compare FSQ and VQ.

lucidrains (Owner) commented

@AmitMY ah got it, no biggie. just curious

lucidrains (Owner) commented Apr 16, 2024

@AmitMY finally had the chance to train FSQ myself yesterday evening and wow, it works great! so much more stable than VQ
