DirectML Execution #507

NullSenseStudio · 2023-01-07T20:52:31Z

Allows using non-NVIDIA GPU's on Windows. Currently only implemented for prompt_to_image. Doesn't support half precision so most users will likely need to make use of attention slicing and/or sequential CPU offload.

This is made as an alternative to ONNX, which has less models available, doesn't support memory optimizations like attention slicing or sequential CPU offload, and runs 3-4 times slower (on my machine at least).

I have encountered one odd bug where generating would start to only produce full white images, which requires releasing the generator to make it go away for a time.

NullSenseStudio · 2023-01-08T02:55:31Z

Now working for other actions. Depth_to_image doesn't follow the depth map at all, but can still generate rather abstract images. Seamless axes are broken and will cause the final image to be mostly a single color.

…rch-directml

carson-katri · 2023-01-08T15:29:36Z

I added a workflow_dispatch trigger so you can manually create builds for AMD users to test.

https://github.com/carson-katri/dream-textures/actions/runs/3867749838

.github/workflows/package-release.yml

Co-authored-by: Carson Katri <Carson.katri@gmail.com>

carson-katri · 2023-01-08T19:42:01Z

Started another run: https://github.com/carson-katri/dream-textures/actions/runs/3868659385

NullSenseStudio · 2023-01-09T18:51:42Z

I'm going to attempt to make half precision be mostly possible. There's not as much compatibility as there is with float32 but there might be some meaningful memory saving if I cast between float16 and float32 where necessary. After I'd like to see if there's anyone in the discord wanting to test it.

faxcorp · 2023-01-10T12:48:09Z

Hello, can you please give some sort of info on how to test this on my AMD 5700xt? Do I need to build this branch or something? Thank you very much

NullSenseStudio · 2023-01-10T13:24:30Z

You can open the link that Carson provided then scroll down into Artifacts and download dream_textures-windows-directml. I believe the zip file you get contains another zip file so you'll have to extract that then install the extracted zip into Blender.

NullSenseStudio · 2023-01-11T01:54:48Z

Half precision was a great success! It doesn't give the same image with the same seed and prompt as full precision like CUDA can with half precision enabled or not, but I'm not going to worry about that.

I also believe I've fixed the white image bug, as long as there isn't anything else that causes it.

New run: https://github.com/carson-katri/dream-textures/actions/runs/3888885287

NullSenseStudio · 2023-01-11T16:01:44Z

New run with fixed model download: https://github.com/carson-katri/dream-textures/actions/runs/3894477733

inpainting, upscaling, depth with color

NullSenseStudio · 2023-01-22T21:48:51Z

Very pleased with the new 0.1.13.1.dev230119 version. Eliminates most of the patches and gives further performance improvements. Used to need around 36 seconds in the denoising loop (25 steps) and now that's down around 22 seconds. Near 39% time savings on a GTX 1070, hopefully there'll be similar gains on AMD cards.

I've modified model handling so that the frontend model id only uses the model's name rather than the full path, this should prevent issues caused from having special characters in account names. I haven't been able to replicate the bug so I'm not entirely sure it'll work.

I've also modified model revision selection to have better preference towards the main and fp16 revisions when the preferred one isn't found. I was having issues with it selecting the onnx revision that I use to compare against.

NullSenseStudio · 2023-01-22T22:03:19Z

Let me know if you have any issues with the changes or that it somehow causes bugs on macOS. I don't think more user testing is needed before release, but maybe it should be tested due to the version change.

generator_process/actions/prompt_to_image.py

Co-Authored-By: Carson Katri <Carson.katri@gmail.com>

fixes #528

carson-katri · 2023-01-30T03:47:30Z

Is this ready for merge?

NullSenseStudio · 2023-01-30T03:57:19Z

Yes, unless if you want to do more user testing before release.

carson-katri · 2023-01-30T03:59:13Z

I think more testing can wait for the next set of pre-release builds.

Mhowser · 2023-01-30T12:07:54Z

Thanks so much for doing this! Will full precision eventually be possible in the future for AMD GPUs?

proof of concept

a9ed322

NullSenseStudio added the enhancement New feature or request label Jan 7, 2023

This was linked to issues Jan 7, 2023

AMD support thru onnx? #340

Closed

Implement AMD and Intel GPU Pipeline Through torch-directml #504

Closed

NullSenseStudio added 2 commits January 7, 2023 21:10

other actions

7e63f3c

add package release workflow

f82c043

Merge branch 'main' of github.com:carson-katri/dream-textures into to…

9a3dc7c

…rch-directml

carson-katri reviewed Jan 8, 2023

View reviewed changes

.github/workflows/package-release.yml Outdated Show resolved Hide resolved

Update .github/workflows/package-release.yml

7ad0388

Co-authored-by: Carson Katri <Carson.katri@gmail.com>

fix seamless axes

32fb258

half precision

7806ff4

fix model download

5c2703a

half precision fixes

93c8fd7

inpainting, upscaling, depth with color

carson-katri added this to the v0.0.10 milestone Jan 15, 2023

NullSenseStudio added 4 commits January 18, 2023 11:44

more half precision fixes

c16cf8d

Merge branch 'main' into torch-directml

ee25824

modify revision selection

7293799

remove obsolete patches

054e71b

NullSenseStudio marked this pull request as ready for review January 22, 2023 20:42

remove #486 upscale timing

20c333b

NullSenseStudio requested a review from carson-katri January 22, 2023 22:03

carson-katri reviewed Jan 25, 2023

View reviewed changes

generator_process/actions/prompt_to_image.py Show resolved Hide resolved

NullSenseStudio and others added 3 commits January 26, 2023 12:00

validate snapshot before selection

8e5d218

Co-Authored-By: Carson Katri <Carson.katri@gmail.com>

tqdm update missing in pipelines

24fbccb

fix upscaling non-square images

4b20da2

fixes #528

NullSenseStudio linked an issue Jan 26, 2023 that may be closed by this pull request

Upscaling of non-square images results in interlaced patterns #528

Closed

carson-katri approved these changes Jan 27, 2023

View reviewed changes

carson-katri merged commit 7aa8499 into main Jan 30, 2023

carson-katri deleted the torch-directml branch January 30, 2023 03:57

NullSenseStudio mentioned this pull request Feb 2, 2023

Please consider supporting OneAPI for Intel GPU's #456

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DirectML Execution #507

DirectML Execution #507

NullSenseStudio commented Jan 7, 2023 •

edited

NullSenseStudio commented Jan 8, 2023

carson-katri commented Jan 8, 2023 •

edited

carson-katri commented Jan 8, 2023

NullSenseStudio commented Jan 9, 2023

faxcorp commented Jan 10, 2023

NullSenseStudio commented Jan 10, 2023

NullSenseStudio commented Jan 11, 2023

NullSenseStudio commented Jan 11, 2023

NullSenseStudio commented Jan 22, 2023

NullSenseStudio commented Jan 22, 2023

carson-katri commented Jan 30, 2023

NullSenseStudio commented Jan 30, 2023

carson-katri commented Jan 30, 2023 •

edited

Mhowser commented Jan 30, 2023

DirectML Execution #507

DirectML Execution #507

Conversation

NullSenseStudio commented Jan 7, 2023 • edited

NullSenseStudio commented Jan 8, 2023

carson-katri commented Jan 8, 2023 • edited

carson-katri commented Jan 8, 2023

NullSenseStudio commented Jan 9, 2023

faxcorp commented Jan 10, 2023

NullSenseStudio commented Jan 10, 2023

NullSenseStudio commented Jan 11, 2023

NullSenseStudio commented Jan 11, 2023

NullSenseStudio commented Jan 22, 2023

NullSenseStudio commented Jan 22, 2023

carson-katri commented Jan 30, 2023

NullSenseStudio commented Jan 30, 2023

carson-katri commented Jan 30, 2023 • edited

Mhowser commented Jan 30, 2023

NullSenseStudio commented Jan 7, 2023 •

edited

carson-katri commented Jan 8, 2023 •

edited

carson-katri commented Jan 30, 2023 •

edited