Skip to content

Fixes regression where WP1 models OOM#28

Merged
lapp0 merged 1 commit intowp-1.5from
fix-wp1-oom
Mar 9, 2026
Merged

Fixes regression where WP1 models OOM#28
lapp0 merged 1 commit intowp-1.5from
fix-wp1-oom

Conversation

@Clydingus
Copy link

Discovered when trying to load backwards compatible WP1 model on Biome running on 4090, WP1 models consistently OOMed on loading, introduced by most recent PR that loads weights directly on device.

Per Andrew: looks like it loads in fp32 then casts to bf16, increasing vram

@lapp0 lapp0 merged commit bf90520 into wp-1.5 Mar 9, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants