
Conversation

@hlky
Contributor

@hlky hlky commented Oct 9, 2024

What does this PR do?

This PR provides a slight performance improvement to `Euler`, `EDMEuler`, and `FlowMatchHeun` by moving creation of the `randn` noise tensor inside the `gamma > 0` condition.

`gamma`, calculated as `min(s_churn / (len(self.sigmas) - 1), 2**0.5 - 1) if s_tmin <= sigma <= s_tmax else 0.0`, is generally `0.0`, since `s_churn`, `s_tmin`, and `s_tmax` are typically left at their default values of `0.0`, `0.0`, and `inf` respectively.
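With those defaults the expression collapses to `0.0` on every step. A standalone sketch of the computation (not the exact scheduler code, just the formula quoted above):

```python
def compute_gamma(s_churn: float, s_tmin: float, s_tmax: float,
                  sigma: float, num_sigmas: int) -> float:
    # Churn is only applied when sigma falls inside [s_tmin, s_tmax];
    # otherwise gamma is 0.0, and with the default s_churn=0.0 the
    # min(...) term is 0.0 as well.
    if s_tmin <= sigma <= s_tmax:
        return min(s_churn / (num_sigmas - 1), 2**0.5 - 1)
    return 0.0

# Default settings: gamma is always 0.0, so any pre-sampled noise is unused.
print(compute_gamma(0.0, 0.0, float("inf"), sigma=1.0, num_sigmas=50))  # 0.0
```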

Similarly, in `KDPM2Ancestral` we move creation of the noise inside the second-order path.
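The change can be sketched as follows; `randn_tensor` here is a stand-in for the diffusers helper, and the update rule is simplified for illustration rather than copied from the scheduler:

```python
import torch

def randn_tensor(shape, generator=None, device=None, dtype=None):
    # Stand-in for diffusers' randn_tensor utility; the real helper has
    # extra handling for devices and lists of generators.
    return torch.randn(shape, generator=generator, device=device, dtype=dtype)

def step_before(sample, sigma, gamma, s_noise=1.0, generator=None):
    # Before: noise is allocated on every step, even when gamma == 0.
    noise = randn_tensor(sample.shape, generator=generator,
                         device=sample.device, dtype=sample.dtype)
    sigma_hat = sigma * (gamma + 1)
    if gamma > 0:
        sample = sample + noise * s_noise * (sigma_hat**2 - sigma**2) ** 0.5
    return sample, sigma_hat

def step_after(sample, sigma, gamma, s_noise=1.0, generator=None):
    # After: the randn_tensor call moves inside the gamma > 0 branch,
    # skipping the allocation on the common gamma == 0 path.
    sigma_hat = sigma * (gamma + 1)
    if gamma > 0:
        noise = randn_tensor(sample.shape, generator=generator,
                             device=sample.device, dtype=sample.dtype)
        sample = sample + noise * s_noise * (sigma_hat**2 - sigma**2) ** 0.5
    return sample, sigma_hat
```

Note that with `gamma == 0` both variants return the sample unchanged, but only the first advances the random generator, which is why seeded test outputs change.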

Who can review?

@yiyixuxu

@vladmandic
Contributor

@hlky love the attention you're giving schedulers.

btw, a bit off-topic, but thought of mentioning it here:
this is executed on each step, and it's definitely causing a torch sync:

sample = sample.to(torch.float32)

@hlky hlky changed the title Slight performance improvement to Euler Slight performance improvement to Euler, EDMEuler, FlowMatchHeun, KDPM2Ancestral Oct 9, 2024
@hlky
Contributor Author

hlky commented Oct 9, 2024

Thanks! Looks like that cast and the mentioned precision issues come from `rescale_betas_zero_snr`, so perhaps the cast could be moved under `if self.config.rescale_betas_zero_snr`, or a different value could be used (as in Comfy and Forge) and the cast removed.
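A minimal sketch of that suggestion, assuming a scheduler-like object whose config carries the `rescale_betas_zero_snr` flag (the class and attribute layout here are illustrative, not the actual diffusers code):

```python
import torch

class SchedulerSketch:
    def __init__(self, rescale_betas_zero_snr: bool = False):
        self.rescale_betas_zero_snr = rescale_betas_zero_snr

    def step(self, sample: torch.Tensor) -> torch.Tensor:
        # Upcast only when the zero-SNR rescaling path needs float32
        # precision; the common path keeps the incoming dtype and avoids
        # a per-step dtype conversion.
        if self.rescale_betas_zero_snr:
            sample = sample.to(torch.float32)
        # ... rest of the step computation would go here ...
        return sample
```

With the flag off (the common case), `sample` passes through in its original dtype; with it on, behavior matches the current unconditional cast.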

@vladmandic
Contributor

rescale_betas_zero_snr

that option is so rarely used that moving the cast so it only runs when it's true should be more than ok.

Collaborator

@yiyixuxu yiyixuxu left a comment


thanks!

@yiyixuxu
Collaborator

yiyixuxu commented Oct 10, 2024

some of the tests failed, as expected, because the random seeds are now used differently
can we:

  1. add a simple slow test that runs a pipeline with each of the schedulers we changed and makes sure the output is as expected (even though it might differ from main even with the same seed)
  2. update the tests!

@hlky
Contributor Author

hlky commented Oct 10, 2024

  1. Do we want new slow tests like

     ```python
     def test_stable_diffusion_lms(self):
         sd_pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4", safety_checker=None)
         sd_pipe.scheduler = LMSDiscreteScheduler.from_config(sd_pipe.scheduler.config)
         sd_pipe = sd_pipe.to(torch_device)
         sd_pipe.set_progress_bar_config(disable=None)
         inputs = self.get_inputs(torch_device)
         image = sd_pipe(**inputs).images
         image_slice = image[0, -3:, -3:, -1].flatten()
         assert image.shape == (1, 512, 512, 3)
         expected_slice = np.array([0.10542, 0.09620, 0.07332, 0.09015, 0.09382, 0.07597, 0.08496, 0.07806, 0.06455])
         assert np.abs(image_slice - expected_slice).max() < 3e-3
     ```

     or just review the output to check it's still reasonable?
  2. I've updated the values in KDPM2AncestralDiscreteSchedulerTest.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@yiyixuxu yiyixuxu merged commit 9d06161 into huggingface:main Oct 15, 2024
15 checks passed
sayakpaul pushed a commit that referenced this pull request Dec 23, 2024
…`, `KDPM2Ancestral` (#9616)

* Slight performance improvement to Euler

* Slight performance improvement to EDMEuler

* Slight performance improvement to FlowMatchHeun

* Slight performance improvement to KDPM2Ancestral

* Update KDPM2AncestralDiscreteSchedulerTest

---------

Co-authored-by: YiYi Xu <yixu310@gmail.com>
