Conversation
Pull request overview
This PR addresses a Windows-native stack overflow when dynamically loading deep CLIP/T5 graphs by introducing a non-recursive guard for ModelPatcherDynamic._load_list, wiring it into the CLIP wrapper entrypoints, and bumping the package version.
Changes:
- Added `clip_dynamic_load_list_guard.py` to patch `comfy.model_patcher.ModelPatcherDynamic._load_list` with a depth-aware, non-recursive traversal for `for_dynamic=True`.
- Registered the guard during CLIP loader overrides in `wrappers.py`.
- Bumped the `pyproject.toml` version from `2.6.1` to `2.6.2`.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| `clip_dynamic_load_list_guard.py` | Implements and registers the non-recursive `_load_list` guard for deep dynamic-loading graphs on Windows. |
| `wrappers.py` | Calls guard registration during CLIP loader overrides so the patched behavior is active during CLIP dynamic loading. |
| `pyproject.toml` | Version bump to ship the fix. |
```python
sort_criteria = (module_offload_mem >= 64 * 1024, -module_offload_mem)
loading.append(sort_criteria + (module_mem, n, m, params))
```
_safe_dynamic_load_list() appends sort_criteria + (module_mem, n, m, params), producing a 6-tuple whose first element is a boolean. Elsewhere in this repo, _load_list() results are assumed to be a 5-tuple where the first element is numeric module_offload_mem (see distorch_2.unpack_load_item). If ModelPatcherDynamic._load_list follows the same contract (as ComfyUI 0.6.0+ does), this will break callers that expect bytes in position 0 and/or len(item)==5. Please align the return item shape/order to the original ComfyUI _load_list tuple contract for your target ComfyUI version.
Suggested change:

```diff
-sort_criteria = (module_offload_mem >= 64 * 1024, -module_offload_mem)
-loading.append(sort_criteria + (module_mem, n, m, params))
+# Append a 5-tuple matching the original ModelPatcherDynamic._load_list
+# contract: (module_offload_mem, module_mem, name, module, params).
+loading.append((module_offload_mem, module_mem, n, m, params))
+# Preserve the original sort behavior without changing the tuple shape.
+# First, prefer modules that require offloading (module_offload_mem >= 64KB),
+# then sort by descending module_offload_mem.
+loading.sort(key=lambda item: (item[0] >= 64 * 1024, -item[0]))
```
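A quick sketch of why this suggestion is behavior-preserving (the byte counts, names, and stubbed `None`/`[]` fields below are made up for illustration): sorting plain 5-tuples with an explicit key yields the same ordering the boolean-prefixed `sort_criteria` tuples did, while keeping position 0 a numeric byte count and `len(item) == 5` for callers like `distorch_2.unpack_load_item`.

```python
# Hypothetical items in the suggested 5-tuple contract:
# (module_offload_mem, module_mem, name, module, params).
KB64 = 64 * 1024
items = [
    (10, 1, "small", None, []),
    (KB64, 2, "at_threshold", None, []),
    (5 * KB64, 3, "large", None, []),
    (0, 4, "zero", None, []),
]

# Old shape: the sort_criteria prefix makes 7-tuples, sorted lexicographically.
old_order = sorted((m[0] >= KB64, -m[0]) + m for m in items)
# Suggested shape: same criteria expressed as an explicit sort key.
new_order = sorted(items, key=lambda item: (item[0] >= KB64, -item[0]))

old_names = [t[4] for t in old_order]  # name sits after the 2-element prefix
new_names = [t[2] for t in new_order]
print(old_names == new_names)          # True: identical ordering
print(len(new_order[0]), new_order[0][0])  # 5 10: 5-tuple, numeric slot 0
```

Modules below the 64 KB threshold sort first (ascending over the `False` group), then offload-eligible modules in descending size, exactly as before.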
```python
_PATCH_MARKER = "_mgpu_issue21_clip_dynamic_load_list_guard"
_MODULE_THRESHOLD = 200
_DEPTH_THRESHOLD = 200
```
The patch marker and log messages reference "Issue21" (e.g. _PATCH_MARKER = "_mgpu_issue21_..." and log prefixes), but this PR is specifically for #178. This makes operational logs and future debugging misleading; consider renaming the marker and log text to match #178 (or use a generic, issue-agnostic label).
```python
def guarded_load_list(self, for_dynamic=False, default_device=None):
    if not for_dynamic:
        return original(self, for_dynamic=for_dynamic, default_device=default_device)

    if _graph_requires_guard(self.model):
        logger.info("[MultiGPU Issue21] Using non-recursive ModelPatcherDynamic._load_list guard")
        return _safe_dynamic_load_list(self, default_device=default_device)

    return original(self, for_dynamic=for_dynamic, default_device=default_device)
```
guarded_load_list() always runs _graph_requires_guard(self.model) for for_dynamic=True, which traverses the entire module graph, and then _safe_dynamic_load_list() traverses it again when the guard is needed. If _load_list(for_dynamic=True) is called frequently, this adds measurable overhead. Consider caching the guard decision per model (e.g., storing a flag on self.model or self) so the graph walk happens at most once per model instance.
This fixes the Windows-native stack overflow in the deep CLIP/T5 dynamic loading path tracked in #178.
What changed
- Added `clip_dynamic_load_list_guard.py` to override `ModelPatcherDynamic._load_list` with a non-recursive traversal when the graph is deep enough to trip the Windows C stack
- Registered the guard from `override_class_clip` and `override_class_clip_no_device` in `wrappers.py`
- Bumped `pyproject.toml` from `2.6.1` to `2.6.2`

Validation
- Reproduced the failure on the `_load_list(for_dynamic=True)` path without the guard
- `ruff check .` passes
- `pylint --rcfile pyproject.toml` passes at `10.00/10`

Files
- `clip_dynamic_load_list_guard.py`
- `wrappers.py`
- `pyproject.toml`

This branch contains only the #178 fix relative to `main`.

Cheers!