Conversation

@blessedcoolant
Collaborator

@blessedcoolant blessedcoolant commented Jan 22, 2024

What type of PR is this? (check all applicable)

  • Feature

Have you discussed this change with the InvokeAI team?

  • Yes

Have you updated all relevant documentation?

  • No

Description

  • This adds the newly released Depth Anything to InvokeAI. A new node, Depth Anything Processor, has been added to generate depth maps using this new technique: https://depth-anything.github.io

  • All related checkpoints will be downloaded automatically on first boot. The DINOv2 models will be loaded into your torch cache directory, and the checkpoints pertaining to Depth Anything will be downloaded to any/annotators/depth_anything.

  • Alternatively you can find the checkpoints here and download them to that folder: https://huggingface.co/spaces/LiheYoung/Depth-Anything/tree/main/checkpoints

  • This depth map can be used with any depth ControlNet model out there, but the folks at Depth Anything have also released a custom fine-tuned ControlNet model. From my limited testing, I still prefer the original depth model, because this one seems to produce weird artifacts. Not sure if that is a problem specific to Invoke or the model itself; I'll test more later. Place these in your ControlNet folder like your other ControlNets. You can get them here: https://huggingface.co/spaces/LiheYoung/Depth-Anything/tree/main/checkpoints_controlnet

  • Also available in the Linear UI.

  • Depth Anything has three models: large, base and small. I've defaulted the processor to small, but a user can change to the large model if they wish. Small is much faster but somewhat lower in quality.

  • Depth Anything is now the default processor for depth ControlNet models.
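The size-to-checkpoint resolution described above can be sketched roughly as follows. The function and constant names here are illustrative, not InvokeAI's actual API, though the checkpoint filenames follow the naming used in the Depth Anything repository:

```python
# Illustrative sketch only: DEPTH_ANYTHING_MODELS and resolve_checkpoint_path
# are hypothetical names, not InvokeAI's actual API. The checkpoint filenames
# follow the naming used in the Depth Anything repository.
from pathlib import Path

DEPTH_ANYTHING_MODELS = {
    "large": "depth_anything_vitl14.pth",
    "base": "depth_anything_vitb14.pth",
    "small": "depth_anything_vits14.pth",
}

def resolve_checkpoint_path(model_size: str, models_root: Path) -> Path:
    """Map a model size to its checkpoint under any/annotators/depth_anything."""
    if model_size not in DEPTH_ANYTHING_MODELS:
        raise ValueError(f"Unknown Depth Anything model size: {model_size!r}")
    return (models_root / "any" / "annotators" / "depth_anything"
            / DEPTH_ANYTHING_MODELS[model_size])
```

For example, `resolve_checkpoint_path("small", Path("models"))` would point at `models/any/annotators/depth_anything/depth_anything_vits14.pth`.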

Screenshots

[Screenshot: opera_o3jHnWxVRi]

Merge Plan

DO NOT MERGE YET. Test it first; I'm sure the model caching can be done better, because I don't think I've addressed that at all. I would appreciate it if @brandonrising, @lstein, or anyone else could take a look at that part of it.

@hipsterusername
Member

Should we make this the default for depth models?

@blessedcoolant
Collaborator Author

Should we make this the default for depth models?

Try it and see. I need to experiment with it a bit more to see if this is the better alternative. I'll test it further later today and decide whether it should be the default.

@blessedcoolant
Collaborator Author

Added to the Linear UI too. Not the default. Can change if people feel it's better.

@blessedcoolant
Collaborator Author

Added output resolution. Also changed the if statements to match-case because we're using Python 3.10 and above already. If for some reason this needs to be reverted, let me know.

Member

@hipsterusername hipsterusername left a comment


Works really well.

We should update it to be the default in a future PR.

@blessedcoolant
Collaborator Author

It's already in this PR. Should I undo?

@hipsterusername
Member

Oh, no, I think that's fine. We'll just want to RC.

@hipsterusername
Member

Does caching still need to be looked at @blessedcoolant ?

@blessedcoolant
Collaborator Author

Does caching still need to be looked at @blessedcoolant ?

I think so. One quick glance from them should sort it. I didn't see any glaring issues, but maybe @lstein wants to handle the models better and store them in a better location. Right now I put them in the controlnets/annotators folder, which throws warnings about incompatible models on boot because they're not recognized. I figured I'd let Lincoln take a stab at that.

@blessedcoolant
Collaborator Author

I've moved the models to any/annotators/depth_anything to stop the model manager from trying to parse them as ControlNet models on boot, which was happening in the previous location.

Collaborator

@lstein lstein left a comment


I don't see a problem with the caching per se, but we've generally avoided downloading big model files at unexpected times. Will these models be used frequently enough to make them "core" models that are downloaded at install time? We do have a way of indicating that one model depends on others, which we use for the relationship between IP Adapters and their encoders, and could use the same syntax to indicate that depth ControlNet models require the presence of the Depth Anything models.
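Conceptually, the dependency mechanism described boils down to a mapping from a model to the models it requires. The sketch below is purely hypothetical, not InvokeAI's actual dependency syntax, and all the model names are illustrative:

```python
# Purely hypothetical sketch of a model-dependency mapping like the one
# described (mirroring the IP Adapter -> image encoder relationship); this is
# NOT InvokeAI's actual dependency syntax, and the model names are illustrative.

MODEL_DEPENDENCIES = {
    "ip_adapter_sd15": ["ip_adapter_sd_image_encoder"],
    "depth_anything_controlnet": ["depth_anything_vits14"],
    "control_sd15_depth": [],
}

def required_models(model_name: str) -> list[str]:
    """Return the models that must be installed before model_name can be used."""
    return list(MODEL_DEPENDENCIES.get(model_name, []))
```

An installer could consult such a mapping to fetch dependencies up front instead of at first use.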

@Millu
Contributor

Millu commented Jan 23, 2024

Not sure what was going on the first time, but it works great on macOS!

@hipsterusername
Copy link
Member

I don't see a problem with the caching per se, but we've generally avoided downloading big model files at unexpected times. Will these models be used frequently enough to make them "core" models that are downloaded at install time? We do have a way of indicating that one model depends on others, which we use for the relationship between IP Adapters and their encoders, and could use the same syntax to indicate that depth ControlNet models require the presence of the Depth Anything models.

I anticipate we want these to operate similarly to how we're handling MiDaS right now: as a dependency installed early on. I think we can optimize the downloading outside of this PR, but we should make sure to do so.

Thanks @Millu for testing on Mac. Can you make sure we follow up with an issue to track getting this model into an earlier install/config process?

@hipsterusername hipsterusername enabled auto-merge (rebase) January 24, 2024 04:31
@hipsterusername
Member

@blessedcoolant can merge after the lint issues are fixed.

auto-merge was automatically disabled January 24, 2024 13:24

Rebase failed

@blessedcoolant blessedcoolant merged commit 68da5c6 into invoke-ai:main Jan 24, 2024
@blessedcoolant blessedcoolant deleted the depth-anything branch February 12, 2024 17:35