Bypass accelerate #1955

Qubitium · 2025-10-01T01:41:10Z

@avtc This Pr will fix your PVML env crash. I reproduced it on my system and now bypassing the thread safety in Accelerate entirely. Core issue is accelerate.utils.modeling.clear_device_cache is thread unsafe due to:

calls device level torch.cuda.empty_cache() without proper thread ctx and not checking if ops (call paths) are actually cuda related
mutates os.environ without locks

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>

Qubitium added 2 commits October 1, 2025 01:40

bypass accelerate's thread unsafe clear_device_cache

a80ed19

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>

formt

3de8467

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>

Qubitium marked this pull request as ready for review October 1, 2025 01:41

Qubitium mentioned this pull request Oct 1, 2025

os.env/torch.cuda.empty_cache are not thread safe huggingface/accelerate#3801

Open

2 tasks

Qubitium merged commit 32dbaf0 into main Oct 1, 2025
5 checks passed

Qubitium deleted the bypass-accelerate branch October 1, 2025 02:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bypass accelerate #1955

Bypass accelerate #1955

Uh oh!

Qubitium commented Oct 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Bypass accelerate #1955

Bypass accelerate #1955

Uh oh!

Conversation

Qubitium commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Qubitium commented Oct 1, 2025 •

edited

Loading