
ControlNet implementation suggestion #139

Open · wants to merge 70 commits into main
Conversation


@JasonS09 JasonS09 commented Apr 5, 2023

Hello! I couldn't wait any longer for a ControlNet implementation for this plugin (much needed for me, and the constant swapping between the webui and Krita was driving me crazy), so I worked on implementing ControlNet for the plugin on my end, and I managed to make it work.

This ControlNet implementation uses the official API endpoints for txt2img and img2img (which I thought would be convenient, since I read you're planning to switch to the official API in the future), but I still didn't touch much of the existing code. The logic I implemented only takes effect when at least one ControlNet unit is enabled. Here is a list of features:

  • Allows different sources of input images for annotators: users can import an image from disk or paste an image from the clipboard. If neither input is used, the plugin will automatically use the selected image as input for the annotator.

  • Allows annotator preview.

  • Allows the annotator input to be swapped from RGB to BGR and/or color-inverted (requires more testing).

  • Parameters change dynamically depending on the chosen annotator.

  • Allows using the selection directly as input without preprocessing (choose the "none" preprocessor).

  • txt2img, img2img, and inpainting with ControlNet.

  • Uses the official API when ControlNet is activated, but the extension backend when it's not (a request sketch follows this list).
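For reference, here's a minimal sketch of what a txt2img request with a single ControlNet unit looks like through the official API (the endpoint and the `alwayson_scripts` payload shape follow the A1111 and sd-webui-controlnet APIs; the prompt, model name, and image path are placeholders, not the plugin's actual values):

```python
# Hypothetical example of a txt2img call with one ControlNet unit via the
# official A1111 API. Concrete values are placeholders.
import base64
import requests

with open("input.png", "rb") as f:
    input_image = base64.b64encode(f.read()).decode()

payload = {
    "prompt": "a castle on a hill",
    "steps": 20,
    "alwayson_scripts": {
        "controlnet": {
            "args": [{
                "input_image": input_image,
                "module": "canny",            # preprocessor; "none" skips it
                "model": "control_sd15_canny",  # placeholder model name
                "weight": 1.0,
            }]
        }
    },
}
r = requests.post("http://127.0.0.1:7860/sdapi/v1/txt2img", json=payload)
images = r.json()["images"]  # base64-encoded result images
```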

Limitations:

  • Fixed annotator config: the config of each ControlNet unit is stored individually between sessions, but every time the user switches preprocessor, the current preprocessor's config is discarded (this happens per unit).

  • For inpainting, the mask should be drawn in black (recommended). This is because I found that the official API, for some reason, erases all content of the mask if there is transparency in the image (it expects a white mask on a black background). So the current implementation converts transparency to white and then inverts the colors (see the sketch after this list). If the user draws a white mask for inpainting, it may be ignored, because it will be inverted to black.

  • The current approach for removing unmasked content for inpainting is very slow. Switching to a new one involving transparency masks is suggested.
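For illustration, the transparency-to-white-then-invert conversion described above could look something like this with Pillow (a sketch of the idea, not the PR's actual code; file names are placeholders):

```python
# Sketch: flatten transparency to white, then invert, so the API receives
# the white-on-black mask it expects.
from PIL import Image, ImageOps

mask = Image.open("user_mask.png").convert("RGBA")
white_bg = Image.new("RGBA", mask.size, (255, 255, 255, 255))
flattened = Image.alpha_composite(white_bg, mask).convert("L")
api_mask = ImageOps.invert(flattened)  # black strokes become a white mask
api_mask.save("api_mask.png")
```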

I have done some testing on my end and everything seems in order, but I'd suggest testing it further, and either informing me about a specific bug or fixing it yourself ;)


Rogal80 commented Apr 6, 2023

Hi, can you write a basic breakdown tutorial on how to use it, with a step-by-step example?


JasonS09 commented Apr 6, 2023

Sure! I'll work on it when I finish my current project.


rexelbartolome commented Apr 8, 2023

Hello, I managed to make it work, but found a bug: after changing the Canny high and low thresholds, they reset to 200 and 100 after a few seconds.

[video attachment: krita_r85ymXdohZ.mp4]

This might be difficult to implement, but hopefully in the future we can paste the annotated preview into Krita, so we can erase the lines that shouldn't be followed, etc. That also paves the way for not annotating the ControlNet input every time, to save resources (afaik that's how it works? correct me if I'm wrong): you could annotate once, reimport the annotated image as the ControlNet input, then remove the preprocessor once it's done.

Buttons like "use as input and remove preprocessor" and "paste to Krita" (similar to how an img2img generation is fitted to the selected region) would be great :)

And of course, thanks for implementing ControlNet itself! I was also looking for an implementation but only found some for Photoshop 😭 I might be able to help with documentation too :)

Edit: I just found out that img2img ControlNet isn't working:

```
Error running process: D:\stable-diffusion\empire-install2\stable-diffusion-webui\extensions\sd-webui-controlnet\scripts\controlnet.py
Traceback (most recent call last):
  File "D:\stable-diffusion\empire-install2\stable-diffusion-webui\modules\scripts.py", line 417, in process
    script.process(p, *script_args)
  File "D:\stable-diffusion\empire-install2\stable-diffusion-webui\extensions\sd-webui-controlnet\scripts\controlnet.py", line 628, in process
    unit = self.parse_remote_call(p, unit, idx)
  File "D:\stable-diffusion\empire-install2\stable-diffusion-webui\extensions\sd-webui-controlnet\scripts\controlnet.py", line 540, in parse_remote_call
    unit.enabled = selector(p, "control_net_enabled", unit.enabled, idx, strict=True)
AttributeError: 'str' object has no attribute 'enabled'
```

txt2img ControlNet works fine though 🤔

@Interpause (Owner)

> Images are not upscaled in the backend, they're just scaled by the plugin once received (unless you check hires fix for txt2img). I have yet to find a workaround for this (there is an insinuation of upscaling in the frontend code, so I decided to wait for your opinion on this instead. I admit I'm not really sure how this upscaling thing works in this plugin).

The webUI's highres fix used to be bad, so I kept the upscaling system from the original plugin. But recently (as in a few months ago), the webUI highres fix was improved. The upscaling system is done completely by the custom backend; the frontend "scaling" code handles downscaling the image or increasing the canvas size, depending on whether there is a canvas selection. I don't think it is necessary to try and get the upscaling system working with the official endpoints, since it would be complicated (it would probably need a second API call to upscale).

```python
qmainwindow.tabifyDockWidget(dockers[TAB_SDCOMMON], dockers[TAB_PREVIEW])
qmainwindow.tabifyDockWidget(dockers[TAB_TXT2IMG], dockers[TAB_IMG2IMG])
qmainwindow.tabifyDockWidget(dockers[TAB_TXT2IMG], dockers[TAB_INPAINT])
qmainwindow.tabifyDockWidget(dockers[TAB_TXT2IMG], dockers[TAB_UPSCALE])
dockers[TAB_SDCOMMON].raise_()
dockers[TAB_INPAINT].raise_()
```

```python
def remove_unmasked_content_for_inpaint(img, mask):
```
@Interpause (Owner)

This is very slow for large images; a better approach is to insert the mask as a transparency mask: https://api.kde.org/krita/html/classTransparencyMask.html.

JasonS09 (Author) commented Apr 18, 2023

Currently working on this, but I can't find a way to import this class into the script. It doesn't seem to work from the krita module; no error is shown, but it messes up the whole plugin.

Edit: nvm, found the way.

@JasonS09 (Author)

Hello. It seems Google Colab has changed their ToS to prohibit the use of remote UIs. This means I'm unable to perform testing and use the plugin, so unfortunately I'll have to stop development on this implementation. Anyone interested can rescue it and continue development.


That's so unfortunate :(

Is it possible for you to use RunPod instead of Colab, @JasonS09? It's generally affordable to use, and I can even sponsor your GPU hours if you want.

JasonS09 (Author) commented Apr 21, 2023

Hey, thank you for the offer. Honestly, I don't want to pay for this since I'm broke (I've got 17 dollars in my bank account and no income). If you can sponsor it, that could be an option, but I'm not sure if that service provider allows remote connections to the UI. Last time I tried with Paperspace, I couldn't use it as a backend (it simply didn't work, even with the --api flag), and after a while I got kicked from my session and was unable to log in again. I suspect they banned me for trying to use the UI as a backend.

EDIT: I reinstalled the webui on my local machine, and it somehow works better now. It's still really slow, but I think it will do the job. I'm going to be away this weekend, then I'll continue the work.

@drhead

> * Creating a transparency mask with `self.doc.createNode(name, "transparencyMask")` in `script.py` is not working properly. It creates a node, but you can't do anything with it (it's not really a transparency mask).
>
> * `self.doc.createTransparencyMask(name)` doesn't seem to be a thing, even though you can [find it in the documentation](https://api.kde.org/krita/html/classDocument.html#abbd8e5ca62dd2952623c2d5cbc6faf5f).
>
> * Alternatively, it is possible to create one by calling the action `self.app.action("add_new_transparency_mask")`. However, you seemingly can't use `setPixelData()` [to draw the mask into the mask layer](https://api.kde.org/krita/html/classNode.html#a4e0b624db748aa8cf63ba84131dfc1a7). Or at least all my attempts to do so have failed.
>
> * So this only leaves me with one option I can think of: create a paint layer first, set the mask pixel data, then convert it into a transparency mask. However, there is another issue with this approach. Setting the active node with `self.doc.setActiveNode(layer)` will work, but not for actions like `self.app.action("convert_to_transparency_mask")`. They completely ignore the current active node even if it's explicitly set, making it really difficult to work with.

Well... I've independently verified most of these.

On my most successful attempt so far, where I tried to retrofit `transparency_mask_inserter(self)` to handle these masks, I managed to get four functioning transparency masks on a batch and wrote data to one of them, though it did not appear correct (possibly because it needs to be converted to grayscale first?). I did that by passing the mask all the way into that function and writing it right after `add_mask_action.trigger()`, which is far less than ideal... Based on this experience, though, I suspect that the issues you're encountering with setting pixel data on a layer created through `app.action` are more related to the race conditions that the inserter function is meant to work around.

I'm going to see if I can set pixel data on masks without completely butchering the mask inserter function, I'll let you know if I find anything out.

drhead commented Jun 6, 2023

I've made a pull request on your branch for a partially working implementation... It is not very clean, mostly due to being shoehorned into old API code.

`self.app.action("add_new_transparency_mask")` does work; it produces a correct transparency mask layer. But it does this on another thread, so if you try to set the pixel data immediately, you get a race condition. You also have to convert the data to grayscale for it to write correctly, which isn't really documented explicitly anywhere. So my provisional solution is: add the transparency mask to the layer, wait for it to appear, then write the data (sketched below).

If you inpaint from a selection with "Add transparency mask" enabled, it will work. However, the second you touch one of the layers, the inpainting mask gets overwritten with solid white. You can prove that it actually works by converting one of the layers to a paint layer; you will see the proper mask data that way. Why it gets erased when you unhide the layer or its parents, though, I have no idea; perhaps it has something to do with the method of trying to work around the race condition by recursion. Once that is solved through whatever means (most likely by moving this into its own function and simply waiting for there to be a new child layer), that'll be a working implementation and one merge blocker down. I will try to work on it more later to see if I can make this into a fully working implementation.

This API is going to give me nightmares. I see references to the issues with the other methods of adding transparency masks from years ago, and we're stuck with the one that causes a race condition.
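A rough sketch of that provisional workaround, for anyone following along (the polling scheme and all names here are illustrative rather than the PR's actual code, and as described below this approach had issues and was later superseded):

```python
# Illustrative only: trigger the action, poll until the new mask child
# appears (it is created on another thread), then write grayscale data.
from krita import Krita
from PyQt5.QtCore import QTimer

app = Krita.instance()
doc = app.activeDocument()
layer = doc.activeNode()
n_before = len(layer.childNodes())

# Placeholder mask data: 8-bit grayscale, one byte per pixel (fully opaque).
gray_bytes = b"\xff" * (doc.width() * doc.height())

app.action("add_new_transparency_mask").trigger()

def write_mask_when_ready():
    children = layer.childNodes()
    if len(children) <= n_before:
        QTimer.singleShot(50, write_mask_when_ready)  # mask not there yet
        return
    mask = children[-1]  # assumes the new mask is appended last
    mask.setPixelData(gray_bytes, 0, 0, doc.width(), doc.height())
    doc.refreshProjection()

write_mask_when_ready()
```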

@JasonS09 (Author)

> I've made a pull request on your branch for a partially working implementation...

Can you point me to it? I can't find it.

@drhead

> > I've made a pull request on your branch for a partially working implementation...
>
> Can you point me to it? I can't find it.

I seem to have opened the pull request on my own fork... Should be fixed now.

@drhead

I figured it out. Fully working implementation.

`setPixelData` on a node is broken for some inexplicable reason. But you can `setPixelData` on a `Selection`, and then call the action to create a transparency mask, because the selection will become the content of that mask. This also seems to handily make every race condition irrelevant (see the sketch below).

I am going to clean it up a bit, and then I will PR what should be a complete implementation.
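For reference, a minimal sketch of that selection-based approach (names are illustrative; `Selection.setPixelData` takes 8-bit grayscale data, one byte per pixel):

```python
# Sketch: write the mask into a Selection, activate it, then trigger the
# action. The new transparency mask is initialized from the active
# selection, so no pixel data has to be written to the node afterwards.
from krita import Krita, Selection

app = Krita.instance()
doc = app.activeDocument()

w, h = doc.width(), doc.height()
mask_bytes = b"\xff" * (w * h)  # placeholder: fully selected / fully visible

sel = Selection()
sel.setPixelData(mask_bytes, 0, 0, w, h)
doc.setSelection(sel)

app.action("add_new_transparency_mask").trigger()
doc.refreshProjection()
```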

@JasonS09 (Author)

Hello! I'm aware of the changes and bug fixes to do; it's just that I have been working on another commercial project, so I haven't had time to fix them. I'll probably be back in a couple of days, though. If anyone wants to do the work themselves, I've got no problem with it.


Miraihi commented Apr 13, 2023

Hello @JasonS09, thank you for your work on the ControlNet implementation.
Unfortunately, I get the script error `AssertionError: Raw data size:1441792, Expected size:7680000` right after the processing is done. I'm using the DirectML fork of the SD A1111 interface and the memory monitor is disabled, though, so that may be the problem.

EDIT: Okay, it seems like this is a problem with the extension's upscaling algorithm, just as you mentioned. I've unchecked "Disable base/max size" and now it works fine, though I have to constrain the image size to what I usually work with in the SD web UI. Still a massive improvement for the workflow.

@JasonS09 (Author)

> Hello, I managed to make it work, but found a bug: after changing the Canny high and low thresholds, they reset to 200 and 100 after a few seconds.

I think this bug has been fixed.


drhead commented Jun 7, 2023

@Interpause With the changes I'm working on internally to implement post-img2img upscaling, I'm somewhat close to bringing the new API to full feature parity with the old API. The old API code isn't getting in the way too much, but it would be nice to be able to tidy things up without it. Do you want me to delete it in this PR, or handle that in a separate PR afterwards to allow for more testing? There are a few race conditions, possibly down to slower hardware, that I'm not sure are completely dealt with.


JasonS09 commented Jun 7, 2023

> @Interpause With the changes I'm working on internally to implement post-img2img upscaling...

It's nice you're bringing this up. I'm working on a new branch, implementing tiled diffusion and tiled VAE as well; I'm planning to use them as an upscaler. I don't know if this idea is interesting enough to be in the core repo (it's more of a personal whim), but I just wanted to inform you.

@Dekker3D

As the last reply was a month and a half ago, I'd just like to note that I'm looking for a GIMP or Krita plugin that will let me use ControlNets seamlessly in a workflow combining 2D sketches and generated imagery with inpainting. This PR looks very promising to me, and I hope it gets added soon.

@JasonS09 (Author)

> As the last reply was a month and a half ago, I'd just like to note that I'm looking for a GIMP or Krita plugin that will let me use ControlNets seamlessly in a workflow combining 2D sketches and generated imagery with inpainting. This PR looks very promising to me, and I hope it gets added soon.

I'm currently working on an adaptation of this plugin incorporating ComfyUI. It will have ControlNet and other new features (still a work in progress, though). I'm going to continue its development as soon as I finish implementing the reference-only preprocessor for Comfy (it's taking a while, not gonna lie).

In the meantime, you can work with my fork for Automatic1111. It's functional right now; we're just waiting for the merge.

Labels: enhancement (New feature or request)
Successfully merging this pull request may close: ControlNet support
8 participants