
Image2image - swift #116

Merged: 15 commits into apple:main on Feb 9, 2023
Conversation

@littleowl (Contributor) commented Jan 28, 2023

Adds image2image functionality.

This is only the Swift portion. A separate library that generates the model for the VAE Encoder exists here: #115

The original PR with both Swift and Python libs can be found here: #73

Sadly, I could not complete the requested Python simplification; I believe it is better not to do so. Apologies for the delay in splitting up the PR. There are not many changes from the previous PR apart from removing the Python code and rebasing the project.

On the Swift side, an Encoder class is added, along with changes to the scheduler, pipeline, and CLI to support an input image and a strength parameter. CGImage creation from MLShapedArray is moved into its own file, together with a new function that creates an MLShapedArray from a CGImage. Image loading and preparation is currently handled and optimized with vImage.

This should work with both schedulers that we have so far. (Thanks @pcuenca for the fix that makes it work with DPMSolverMultistepScheduler.)
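For readers skimming the Swift changes, here is a rough idea of what a CGImage-to-MLShapedArray conversion involves (a minimal sketch assuming a planar RGB layout normalized to [-1, 1]; the function name is illustrative, and the actual code in this PR uses vImage, so its details may differ):

    import CoreGraphics
    import CoreML

    // Sketch only, not the PR's exact implementation: convert a CGImage into an
    // MLShapedArray<Float32> with shape [1, 3, height, width], scaled to [-1, 1],
    // the kind of layout a VAE encoder input expects.
    func makeShapedArray(from image: CGImage) -> MLShapedArray<Float32> {
        let width = image.width
        let height = image.height
        let bytesPerRow = width * 4
        var rgba = [UInt8](repeating: 0, count: height * bytesPerRow)

        // Draw the image into a plain RGBA8 buffer so pixel access is predictable.
        rgba.withUnsafeMutableBytes { buffer in
            let context = CGContext(
                data: buffer.baseAddress,
                width: width,
                height: height,
                bitsPerComponent: 8,
                bytesPerRow: bytesPerRow,
                space: CGColorSpaceCreateDeviceRGB(),
                bitmapInfo: CGImageAlphaInfo.noneSkipLast.rawValue
            )!
            context.draw(image, in: CGRect(x: 0, y: 0, width: width, height: height))
        }

        // Rearrange interleaved RGBA into planar RGB and scale [0, 255] to [-1, 1].
        var scalars = [Float32](repeating: 0, count: 3 * width * height)
        for y in 0..<height {
            for x in 0..<width {
                let pixelIndex = (y * width + x) * 4
                for channel in 0..<3 {
                    scalars[channel * width * height + y * width + x] =
                        Float32(rgba[pixelIndex + channel]) / 127.5 - 1.0
                }
            }
        }
        return MLShapedArray<Float32>(scalars: scalars, shape: [1, 3, height, width])
    }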

  • I agree to the terms outlined in CONTRIBUTING.md

@atiorh (Collaborator) commented Jan 28, 2023

Thank you @littleowl! I reviewed #115, and leaving the noise situation as is should be fine; please see my comment there :) I believe the only implication of my #115 review for this PR is that the model I/O names should be changed from camelCase to underscores, and that is it. Thank you again for your amazing contributions!

@alejandro-isaza (Collaborator) left a comment:

Looks good, just a couple of small things.

@littleowl (Contributor, Author) commented:

@atiorh, @alejandro-isaza - Thanks for your reviews! I believe that I resolved all the comments. Please let me know if there is anything more that I can do.

@littleowl littleowl requested review from alejandro-isaza and atiorh and removed request for msiracusa, alejandro-isaza and atiorh January 31, 2023 06:50
@alejandro-isaza (Collaborator) left a comment:

The header comment needs to be fixed. Other than that just a couple more nits.

@littleowl (Contributor, Author) commented:

@alejandro-isaza, thanks again for the review. The additional comments have been resolved.

@atiorh atiorh requested a review from msiracusa January 31, 2023 19:01
@alejandro-isaza (Collaborator) left a review comment on swift/StableDiffusion/pipeline/StableDiffusionPipeline+SampleInput.swift:

    }

    /// Image generation configuration
    public struct Configuration: Hashable {

Sorry, one more thing. Let's rename this file StableDiffusionPipeline.Configuration.swift

@atiorh (Collaborator) commented Feb 1, 2023

@littleowl In addition to the file name change that @alejandro-isaza asked for, if you could just add a quick note in the Example CLI Usage section to indicate that an image argument is now supported, that would be great!
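For reference, the kind of CLI invocation being documented would look roughly like this (the --image flag is mentioned later in this thread; treat the rest as a sketch and consult the CLI's --help output for the exact flag names):

    swift run StableDiffusionSample "<prompt>" \
        --resource-path <path-to-converted-models> \
        --image <path-to-input-image> \
        --strength 0.5 \
        --output-path <output-directory>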

@atiorh (Collaborator) commented Feb 2, 2023

I just tested the CLI with different images and it seems that we need to guard against or fix a few things:
1. When the --image file's resolution does not match the VAE encoder's input resolution, we get zsh: trace trap.
2. Loading a PNG image with the correct resolution works, but trying to load the same image encoded as JPEG fails without a sufficient explanation.

@atiorh (Collaborator) left a comment:

@littleowl I just realized that we left scattered comments after the initial review, apologies! Here are the two remaining items before we can merge:

  • Rename swift/StableDiffusion/pipeline/StableDiffusionPipeline+SampleInput.swift to swift/StableDiffusion/pipeline/StableDiffusionPipeline.Configuration.swift
  • Add a specific error message when the input image's file format and/or resolution do not match expectations

@pj4533 commented Feb 6, 2023

I know you all have this awesome PR under control, but I just wanted to say that I successfully converted the v2 base model's VAEEncoder using main (with the Python image2image PR), then converted the rest of v2 base using this PR, and am currently generating image2image results with my own Swift code. So from an end-user perspective: well done!

@saiedg commented Feb 8, 2023

@pj4533 well done! I'm so jealous! how can I try??

@littleowl (Contributor, Author) commented:

Again, sorry for the delay @atiorh.
  • Added error detection; it needs to happen right before prediction because the model input is dynamic.
  • Fixed a hard-coded instance of 512.
  • Renamed the file.
  • Fixed JPEG support by using Cocoa to load images.
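The error detection and Cocoa-based loading described above could look roughly like the following (a hypothetical sketch; names such as loadInputImage and ImageError are illustrative, not the PR's actual identifiers):

    import AppKit
    import CoreGraphics
    import Foundation

    // Illustrative error type with descriptive messages; not the PR's actual API.
    enum ImageError: Error, CustomStringConvertible {
        case unreadable(URL)
        case sizeMismatch(expected: (width: Int, height: Int), actual: (width: Int, height: Int))

        var description: String {
            switch self {
            case .unreadable(let url):
                return "Could not load an image from \(url.path). Supported formats include PNG and JPEG."
            case .sizeMismatch(let expected, let actual):
                return "Input image is \(actual.width)x\(actual.height) " +
                    "but the encoder expects \(expected.width)x\(expected.height)."
            }
        }
    }

    // Load via NSImage so any format Cocoa understands (PNG, JPEG, ...) works,
    // then validate the resolution against the encoder's expected input size.
    func loadInputImage(at url: URL, expectedWidth: Int, expectedHeight: Int) throws -> CGImage {
        guard let nsImage = NSImage(contentsOf: url),
              let cgImage = nsImage.cgImage(forProposedRect: nil, context: nil, hints: nil) else {
            throw ImageError.unreadable(url)
        }
        guard cgImage.width == expectedWidth, cgImage.height == expectedHeight else {
            throw ImageError.sizeMismatch(
                expected: (expectedWidth, expectedHeight),
                actual: (cgImage.width, cgImage.height)
            )
        }
        return cgImage
    }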

@atiorh (Collaborator) commented Feb 9, 2023

Thanks for your hard work @littleowl, this is going to benefit many developers! I just tested your changes and they are working like a charm. Merging now.

@atiorh atiorh merged commit fa7bbdc into apple:main Feb 9, 2023
@saiedg commented Feb 10, 2023

@littleowl congratulations on merging! I'm sooo excited to use it but don't know how. Can you please add instructions in the readme or make a YouTube video?

@ynagatomo commented:
Here is a sample:
[screenshot: Screenshot 2023-02-10 at 15 50 52]

@martinlexow commented:
Thanks for your exceptional work @littleowl and thank you for sharing the example @ynagatomo!

Can you please explain what the strength parameter does in this context? In my observation it seems to just decrease the steps (roughly steps × strength), and it also isn't obvious to me why strength has to be less than 1.0 to trigger the imageToImage mode.
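For context, the usual image-to-image convention (described here as a sketch of the general technique, not a statement about this pipeline's exact implementation) is that strength controls how far into the noise schedule the input image is pushed before denoising begins. Only the remaining part of the schedule is run, which is why the effective step count looks like steps × strength, and strength = 1.0 noises the input completely, degenerating to plain text-to-image:

    // Sketch of the common img2img convention; names are illustrative.
    let stepCount = 50            // requested inference steps
    let strength: Float = 0.7     // 0.0 keeps the input as is, 1.0 ignores it entirely

    // The input image is noised up to this point in the schedule, and only the
    // remaining portion is denoised, so roughly stepCount * strength steps run.
    let effectiveSteps = min(Int(Float(stepCount) * strength), stepCount)
    let startStepIndex = stepCount - effectiveSteps   // skip the earliest (noisiest) steps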

@ynagatomo commented:
An experiment on strength:
[image: i2i_1280]
(code: https://github.com/ynagatomo/ImgGenSD2)
