Non-interactive model download (support `HUGGINGFACE_TOKEN`) #1578

ebr · 2022-11-27T11:04:17Z

This PR adds the following improvements:

Support for the `HUGGINGFACE_TOKEN` environment variable

Ensures that the SD weights download proceeds in a truly non-interactive fashion when the --yes flag is used and a valid authentication token is present in any of the supported locations (env vars or cached).

Tested as follows:

HUGGING_FACE_HUB_TOKEN env var is not empty:
- with --yes: weights download proceeds; token is not persisted in cache
- without --yes: weights download proceeds without an additional prompt; token is not persisted in cache
HUGGINGFACE_TOKEN env var is not empty:
- same behaviour as with the official var (token not persisted)
Env vars not present without --yes flag: user is prompted for the token, token is cached only if valid (same as current behaviour)
Env vars not present with --yes flag: weights download is skipped (same as current behaviour)
In any case where the token is invalid (i.e. login to HF does not succeed), the token will not be cached.

Additionally:

fixes a bug where outdir location was not printed due to missing f-string
fixes spelling of --outdir argument in a message
implements a way of exposing download failures at the end of configuration run. if model download failed or was skipped, the postscript message was misleadingly suggesting that the application was ready to use.

keturn · 2022-11-27T16:43:35Z

huggingface_hub.login(token) could be a more-explicit alternative to resetting os.environ inside the script.

Also note that they've changed the settings on some of the repos so now not all of the Stable Diffusion models require a token. As of last time I checked earlier this weekend, the CompVis/ ones don't (SD 1.4) but the runwayml/ ones do (SD 1.5 + SD 1.5 inpainting).

ebr · 2022-11-27T18:58:36Z

huggingface_hub.login(token) could be a more-explicit alternative to resetting os.environ inside the script.

That is how I initially implemented it, but the behaviour becomes inconsistent between using the officially supported env var HUGGING_FACE_HUB_TOKEN and the de-facto "community preferred" HUGGINGFACE_TOKEN. Calling the .login() method causes the token to be saved in ~/.huggingface/token, whereas using the official env. var does not.

ebr · 2022-11-27T22:23:01Z

Moving the initialization file is proving tricky if we want to maintain backwards compatibility with the current setup, because we want to move it into the runtime dir, but users might already have a conflicting --root-dir argument in an existing ~/.invokeai file. Also, setting it from Globals before checking for a user-specified runtime dir creates a catch-22.

TLDR; Perhaps the initfile should be dealt with in a separate PR so that this one isn't blocked for too long. Unless we decide to break backwards-compatibility, then it's easy

lstein · 2022-11-28T02:41:34Z

Moving the initialization file is proving tricky if we want to maintain backwards compatibility with the current setup, because we want to move it into the runtime dir, but users might already have a conflicting --root-dir argument in an existing ~/.invokeai file. Also, setting it from Globals before checking for a user-specified runtime dir creates a catch-22.

TLDR; Perhaps the initfile should be dealt with in a separate PR so that this one isn't blocked for too long. Unless we decide to break backwards-compatibility, then it's easy

Why not have an environment variable that contains the path to the init file?

As an aside, introducing Globals wasn't my favorite approach, but it was better than carrying a directory path across multiple nested functions.

Just to set expectations, unless the release gets delayed significantly I'm going to go with the existing configuration script for 2.2.0, and release the new and improved one that you're working on for 2.2.1. This is primarily because I've got a lot of documentation-writing to do and won't be able to do thorough testing on this. I very much appreciate your work on this bit of the code.

I also have a suggestion for a new feature. Either as a command-line option or as an automatic feature during interactive processing, it would be great if the script could scan the existing models directory for SD models it knows about (from INITIAL_MODELS.yaml). If it finds SD models that aren't in the config file, it should offer to add them. This will let people rebuild their config.yaml without having to ask the script to download the models and answer 'y' to each prompt.

lstein · 2022-11-28T02:46:10Z

That being said, if I do have time I will try very hard to get this in. The improvements look very good.

ebr · 2022-11-28T03:05:50Z

All sounds great to me. I actually might have a viable approach to handling the .invokeai config file location in a backwards-compatible manner, but certainly wouldn't want any of this to delay the release. If anything, that would give me more time to do some more refactoring and properly test the changes. But I'm aiming try to get the final fixes in tonight.

ebr · 2022-11-28T04:48:47Z

I think I was able to wrangle the config file location into a good state; this is now ready for review.

@lstein would you like me to tackle the auto-scanning of models in this same PR, or open a new feature request for it? I'd prefer the latter as it might get messy to review otherwise, but it's your call, please let me know.

lstein · 2022-11-28T05:14:40Z

I think I was able to wrangle the config file location into a good state; this is now ready for review.

@lstein would you like me to tackle the auto-scanning of models in this same PR, or open a new feature request for it? I'd prefer the latter as it might get messy to review otherwise, but it's your call, please let me know.

Go ahead and start a new PR. I'm going to integrate what you've done here with a few minor changes I made on a different branch and then merge.

lstein

Looking good. Will do some functional testing prior to merge.

ebr · 2022-11-28T05:19:24Z

I seem to have oddly broken some tests, let me look into that 🤔

Edit: I can't reproduce these test failures locally. On closer look, this might be a transient failure: requests.exceptions.HTTPError: 504 Server Error: Gateway Time-out for url: https://huggingface.co/bert-base-uncased/resolve/main/tokenizer.json, but the configuration script exits without failure and so the application is left in a not-fully-configured state: OSError: Can't load tokenizer for 'openai/clip-vit-large-patch14'

lstein

I started testing and then realized that the default location for the init file has become the InvokeAI repo. This is when neither --root nor INVOKEAI_ROOT is provided.

I don't really want the init file to be stored in the repo because ultimately I want to be able to do a non-source install. When this happens the repo is going to be hidden deep inside a venv folder somewhere and the user will never be able to find the init file. This is really tricky to finesse because the init file tells invoke where the root directory is, but if the init file is inside the root directory how does invoke find it?

In addition, the repo is really cluttered already.

So I want to consult with you about the design decision here. Maybe a better thing to do is to create the initfile in the user's home directory, don't make it hidden, and give clear instructions how they can move it by creating an environment variable or pointing invoke.py to it with --root each time. Sadly, users are not savvy about setting environment variables.

ebr · 2022-11-28T06:30:23Z

I started testing and then realized that the default location for the init file has become the InvokeAI repo.

That sounds unexpected and not intentional. The logic I was trying to implement was to: 1. if an existing config file is found in the user's home directory, then keep using it, and 2. if no existing file is found, then create one in the runtime directory.

If the default location of the runtime directory happens to be the root of the repo, then yes, this is a problem. I think this stems from:

# This is usually overwritten by the command line and/or environment variables
- Globals.root = '.'
+ Globals.root = root_dir if (root_dir := os.getenv("INVOKEAI_ROOT")) else "."

But this behaviour should have remained unchanged if neither the env var nor the --root_dir are specified. I.e. Globals.root was already "." before this PR.

Agreed that we never want to mix the runtime dir and neither the cloned codebase nor especially the dev-installed Python module, as you mention. Would it make sense to instead default the runtime dir to be located at ~/invokeai/ instead of the more ambiguous "."?

This is really tricky to finesse because the init file tells invoke where the root directory is, but if the init file is inside the root directory how does invoke find it?

Definitely, that's the catch-22 situation I was talking about earlier. I think the best solution is to remove ambiguity by having a sane default that's more explicit and guaranteed to be outside of the codebase.

lstein · 2022-11-28T14:17:10Z

Yeah, I think we need to reverse the logic and have a predictable location of the invokeai directory rather than a predictable init file. The init file can then live inside invokeai.

lstein · 2022-11-28T14:19:37Z

Just marking this as draft again while we work out the strategy.

ebr · 2022-11-28T23:37:07Z

I also just noticed that the configure_invokeai.py, the initfile comments, and the CI tests all specify the --root option, while in args.py, CLI.py and elsewhere in the app we have --root_dir.

@lstein perhaps leaving this PR unmerged until after 2.2.0 is indeed for the best, so that we can clean it all up after the release dust settles. I will keep rebasing+updating it though. Do you agree?

(CLI argument handling and various entrypoints could use quite a bit of future refactoring + consolidation in general, IMHO)

ebr · 2022-11-29T06:58:27Z

I moved the init file / root dir changes to another branch and will do a separate PR for it, as it was getting quite messy and needs a larger refactor.

ebr · 2022-12-01T00:59:48Z

This is ready to be reviewed again, because the runtime dir / init file location refactoring will be dealt with separately, likely in #1615

appinteractive · 2022-12-01T22:09:33Z

Very useful for installation via UI 🍾

also implement a generic way of reporting issues at the end of installation

this mirrors the behaviour when using the officially supported env var

…ion in globals.py comment

lstein

This looks great. Thanks.

ebr marked this pull request as draft November 27, 2022 11:04

lstein self-requested a review November 27, 2022 15:28

ebr force-pushed the config-noninteractive-fixes branch from 848a700 to 0d4f297 Compare November 27, 2022 21:01

ebr marked this pull request as ready for review November 27, 2022 22:23

ebr marked this pull request as draft November 28, 2022 02:04

ebr force-pushed the config-noninteractive-fixes branch 3 times, most recently from fe44bd7 to db50446 Compare November 28, 2022 03:15

ebr marked this pull request as ready for review November 28, 2022 04:37

lstein approved these changes Nov 28, 2022

View reviewed changes

lstein requested changes Nov 28, 2022

View reviewed changes

lstein marked this pull request as draft November 28, 2022 14:19

ebr force-pushed the config-noninteractive-fixes branch 3 times, most recently from dca7715 to 2bfc89d Compare November 28, 2022 23:29

ebr mentioned this pull request Nov 28, 2022

[enhancement]: automatically scan model .ckpt files and update models.yaml #1607

Closed

1 task

ebr force-pushed the config-noninteractive-fixes branch from 2bfc89d to 77b468a Compare November 29, 2022 05:22

ebr force-pushed the config-noninteractive-fixes branch from 77b468a to 442d846 Compare November 29, 2022 06:49

ebr changed the title ~~Non-interactive model download + initfile location~~ Non-interactive model download Nov 29, 2022

ebr marked this pull request as ready for review November 29, 2022 06:58

ebr force-pushed the config-noninteractive-fixes branch 2 times, most recently from fdfa615 to 66d7c46 Compare December 1, 2022 00:53

ebr requested review from mauwii, tildebyte and CapableWeb as code owners December 1, 2022 00:53

ebr changed the base branch from development to main December 1, 2022 00:53

ebr changed the title ~~Non-interactive model download~~ Non-interactive model download (support HUGGINGFACE_TOKEN) Dec 1, 2022

tildebyte approved these changes Dec 2, 2022

View reviewed changes

ebr added 6 commits December 2, 2022 23:44

(config) fix f-string in prompt for output location

7210aa3

(config) try to authenticate to Huggingface more eagerly, using env vars

7a1f653

(config) make user aware of any problems downloading models

0797fcf

also implement a generic way of reporting issues at the end of installation

(config) do not cache HF token when using the non-canonical env var

4671ec0

this mirrors the behaviour when using the officially supported env var

(config) fix permissions on configure_invokeai.py, improve documentat…

d2c62e6

…ion in globals.py comment

(config) clarify why we're setting the env var

da39724

ebr force-pushed the config-noninteractive-fixes branch from c296d3a to da39724 Compare December 3, 2022 04:44

lstein approved these changes Dec 3, 2022

View reviewed changes

lstein merged commit c607d4f into invoke-ai:main Dec 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-interactive model download (support `HUGGINGFACE_TOKEN`) #1578

Non-interactive model download (support `HUGGINGFACE_TOKEN`) #1578

ebr commented Nov 27, 2022 •

edited

Loading

keturn commented Nov 27, 2022

ebr commented Nov 27, 2022 •

edited

Loading

ebr commented Nov 27, 2022

lstein commented Nov 28, 2022

lstein commented Nov 28, 2022

ebr commented Nov 28, 2022

ebr commented Nov 28, 2022

lstein commented Nov 28, 2022

lstein left a comment

ebr commented Nov 28, 2022 •

edited

Loading

lstein left a comment

ebr commented Nov 28, 2022 •

edited

Loading

lstein commented Nov 28, 2022

lstein commented Nov 28, 2022

ebr commented Nov 28, 2022

ebr commented Nov 29, 2022

ebr commented Dec 1, 2022

appinteractive commented Dec 1, 2022

lstein left a comment

Non-interactive model download (support HUGGINGFACE_TOKEN) #1578

Non-interactive model download (support HUGGINGFACE_TOKEN) #1578

Conversation

ebr commented Nov 27, 2022 • edited Loading

Support for the HUGGINGFACE_TOKEN environment variable

Additionally:

keturn commented Nov 27, 2022

ebr commented Nov 27, 2022 • edited Loading

ebr commented Nov 27, 2022

lstein commented Nov 28, 2022

lstein commented Nov 28, 2022

ebr commented Nov 28, 2022

ebr commented Nov 28, 2022

lstein commented Nov 28, 2022

lstein left a comment

Choose a reason for hiding this comment

ebr commented Nov 28, 2022 • edited Loading

lstein left a comment

Choose a reason for hiding this comment

ebr commented Nov 28, 2022 • edited Loading

lstein commented Nov 28, 2022

lstein commented Nov 28, 2022

ebr commented Nov 28, 2022

ebr commented Nov 29, 2022

ebr commented Dec 1, 2022

appinteractive commented Dec 1, 2022

lstein left a comment

Choose a reason for hiding this comment

Non-interactive model download (support `HUGGINGFACE_TOKEN`) #1578

Non-interactive model download (support `HUGGINGFACE_TOKEN`) #1578

ebr commented Nov 27, 2022 •

edited

Loading

Support for the `HUGGINGFACE_TOKEN` environment variable

ebr commented Nov 27, 2022 •

edited

Loading

ebr commented Nov 28, 2022 •

edited

Loading

ebr commented Nov 28, 2022 •

edited

Loading