Conversation

@Qubitium
Collaborator

No description provided.

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>
@Qubitium
Collaborator Author

Qubitium commented Sep 27, 2025

@avtc Ignore the PR title. This adds a rough (estimated) memory-deallocation check and counter so we only call torch.cuda.empty_cache when we detect that roughly 1/4 of your GPU's max VRAM has been deallocated recently. The default value is "auto", set at the bottom of memory.py. Right now "auto" resolves to min GPU VRAM / 4. You can override this value if "auto" doesn't work for you.

The goal is to call it as rarely as possible, since it is extremely slow.

Pass the DEBUG=1 env var before you call gptqmodel to get the memory dealloc/alloc debug logs.
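The threshold logic described above can be sketched roughly as follows. This is a minimal illustration, not the actual gptqmodel implementation: the class name `DeallocTracker`, the `on_dealloc` hook, and the injectable `empty_cache` callback are all hypothetical, chosen so the counter logic can be shown (and tested) without a CUDA device. The real code would call `torch.cuda.empty_cache()` directly.

```python
class DeallocTracker:
    """Hypothetical sketch: batch expensive cache flushes behind a
    byte counter, flushing only after ~1/4 of max VRAM is freed."""

    def __init__(self, max_vram_bytes, threshold="auto", empty_cache=None):
        # "auto" mirrors the PR's default: min GPU VRAM / 4.
        self.threshold = max_vram_bytes // 4 if threshold == "auto" else threshold
        self.freed = 0  # bytes deallocated since the last flush
        # Injectable for testing; in practice this would be
        # torch.cuda.empty_cache.
        self._empty_cache = empty_cache or (lambda: None)

    def on_dealloc(self, nbytes):
        """Record freed bytes; flush once the threshold is reached."""
        self.freed += nbytes
        if self.freed >= self.threshold:
            self._empty_cache()  # the slow call, now amortized
            self.freed = 0
            return True  # cache was flushed
        return False
```

Example use: with a 16 GiB GPU, small deallocations accumulate silently, and only once 4 GiB total has been freed does the (slow) flush fire.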

@Qubitium Qubitium marked this pull request as ready for review September 27, 2025 05:27
@Qubitium Qubitium merged commit f0491ff into main Sep 27, 2025
5 checks passed
@Qubitium Qubitium deleted the input branch September 27, 2025 05:27