Canine model and High VRAM usage #115

Open

Qubitium opened this issue Feb 11, 2024 · 4 comments

Comments

Qubitium commented Feb 11, 2024

@bminixhofer We are observing very high VRAM usage with the CANINE model. The wtp-canine-s-12l-no-adapters fp32 weights are only about 515 MB, so we naively expected batch=1 in fp16 mode to use roughly 257.5 MB of VRAM for the weights plus runtime/inference overhead. We didn't expect batch=1 to use 1.3 GB of VRAM. The input is a ~230 KB text file.

Is this a bug, or is this normal for the CANINE architecture? If it's the norm, is there anything we can do to reduce the memory footprint? Thanks.

from wtpsplit import WtP

wtp = WtP("wtp-canine-s-12l-no-adapters")
wtp.half().to(device="cuda")
batch    VRAM (GB)
1        1.309
2        1.335
4        1.385
6        1.428
8        1.487
10       1.542
12       1.583
14       1.639
16       1.688
32       2.094
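
For what it's worth, here is a minimal sketch (not from the original report) of how such a peak-VRAM number could be measured with PyTorch's allocator statistics; the "input.txt" filename is a hypothetical stand-in for the ~230 KB file mentioned above.

import torch
from wtpsplit import WtP

wtp = WtP("wtp-canine-s-12l-no-adapters")
wtp.half().to("cuda")

# Hypothetical stand-in for the ~230 KB input file mentioned above.
with open("input.txt", encoding="utf-8") as f:
    text = f.read()

# Reset the peak counter (weights already on the GPU still count toward it),
# run one split, and report the peak. Note this only tracks PyTorch
# allocations; nvidia-smi additionally shows CUDA context overhead.
torch.cuda.reset_peak_memory_stats()
sentences = wtp.split(text)
print(f"peak VRAM during split: {torch.cuda.max_memory_allocated() / 1024**3:.3f} GB")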
bminixhofer (Owner) commented Feb 28, 2024

Hi, thanks for these benchmarks! And sorry for being slow to respond.

You could debug this by checking how much memory the vanilla CANINE (https://huggingface.co/google/canine-s) takes for a forward pass vs. a forward pass of the WtP model (see e.g. here: https://github.com/bminixhofer/wtpsplit/?tab=readme-ov-file#advanced-usage).

If there's a discrepancy there, I'll investigate it. It's possible that CANINE just needs a lot of memory, though; I'm not super happy with that architecture and will upgrade the models to a different arch soon(ish).
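
As a rough comparison, here is a minimal sketch (assuming the standard transformers API; the short placeholder input is not the 230 KB file from the report) for measuring the peak VRAM of a single fp16 forward pass of vanilla google/canine-s:

import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/canine-s")
model = AutoModel.from_pretrained("google/canine-s").half().to("cuda").eval()

# CANINE operates on Unicode code points; 2048 is its maximum sequence length.
inputs = tokenizer("placeholder text " * 200, return_tensors="pt",
                   truncation=True, max_length=2048).to("cuda")

torch.cuda.reset_peak_memory_stats()
with torch.no_grad():
    model(**inputs)
print(f"peak VRAM for one CANINE forward pass: "
      f"{torch.cuda.max_memory_allocated() / 1024**3:.3f} GB")

If vanilla CANINE alone already peaks near the numbers in the table above, the memory comes from the architecture itself rather than from the WtP wrapper.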

Qubitium (Author) commented

Will do. Btw, if you need GPU compute to train the next model, I can provide you with an A100 80 GB+. You can ping me on Twitter at qbitium.

bminixhofer (Owner) commented

Thanks! And that's very generous. I'm deferring to @markus583 since he is doing the training, but we are using TPUs, so there is probably no need.

markus583 (Collaborator) commented

Very generous indeed! Thanks, but the TPUs are very strong. I'd also be very curious whether there is a discrepancy.
