Execute ScriptVaultSecret on demand instead of on load #82418

Urth · 2023-12-13T14:52:28Z

SUMMARY

A vault secret script can do whatever is needed to return a vault password. This can mean loading something from a local password store or accessing a remote password storage solution. When combining the password script with multiple vault_identity_list entries this adds a lot of unneeded loading of vault indentity passwords. Most of which may not be required to complete the ansible run.

This commit delays executing for Script and ClientScriptVaultSecret classes until the bytes password is accessed.

We currently have a vault_identity_list with 35 different entries. Each entry requires authorization and loading them takes a while making the startup time for ansible significant. Executing the password scripts on demand would solve 2 problems:

users don't have to change their configuration to exclude entries they cannot access.
the time to load unused passwords is removed from the execution time.

ISSUE TYPE

Feature Pull Request

ADDITIONAL INFORMATION

$ grep vault_id_match ansible.cfg 
vault_id_match = true
# Before
$ time ansible-vault decrypt < tmp.yaml
Decryption successful
...
real    0m13,144s
user    0m3,159s
sys     0m0,729s
# After
$ time ansible-vault decrypt < tmp.yaml
Decryption successful
...
real    0m0,494s
user    0m0,229s
sys     0m0,016s

A vault secret script can do whatever is needed to return a vault password. This can mean loading something from a local password store or accessing a remote password storage solution. When combining the password script with multiple vault_identity_list entries this adds a lot of unneeded loading of vault indentity passwords. Most of which may not be required to complete the ansible run. This commit delays executing for Script and ClientScriptVaultSecret classes until the bytes password is accessed.

mstud · 2024-02-19T09:26:09Z

This is relevant for us, too. When can we expect this to be merged?

nitzmahone · 2024-02-19T17:49:10Z

This could have some unintended side-effects that work against the desired behavior.

The gnarliest one off the top of my head: deferring the script execution until first secret access optimizes for "nothing runs", but when the first encounter with an encrypted file/value occurs post-fork (eg, an include_vars or any other DataLoader activity that occurs in a host-specific context), without a bunch of locking and extra bookkeeping to marshal the vault secret state back to the controller from workers, the secret scripts will be run repeatedly on every fork.

This also likely clashes with some upcoming changes around how undecryptable vaulted values are tracked, where we need to know if a value is decryptable before the first fork occurs (for the same reasons as above). That change would basically defeat this one anyway, at least the first time a vaulted value is encountered pre-fork.

It seems like you have some use cases for vault access that aren't handled well by its original design assumptions. Are you using multiple vault_id values concurrently, where some users have > 1 vault secrets in play, or do you maintain multiple encrypted copies of the data in question (using different secrets) and have your secret script map a run to 0-or-1 vault secrets and only load the content that you know will work for that secret? If it's the "multiple" case, would it be helpful if a single vault script could supply N vault-id/secret pairs instead of just one? That way, you could apply whatever error-handling/fallback logic you like (including parallelization, if necessary), and with a full knowledge of what else had already occurred.

MJJoker · 2024-02-20T05:45:32Z

Relevant for me too.
I have one vault-script that can access keepassxc, the vault-id is used to get he correct secret.
The same script (and keepassxc-DB) is used for a lot of plays, I do not know which secrets to return without the proper ids. So I would need multiple vault-ids.

As we are talking about secrets, I do not like the idea of returning "all the secrets" (or just multiple) or having them in memory just-in-case they are needed.

That sounds like a big change, but ansible could handle all the secrets/credentials in a separate instance/process and accessing them is communicating with this separate instance anyway.

Urth · 2024-02-20T11:41:18Z

Are you using multiple vault_id values concurrently, where some users have > 1 vault secrets in play. ... If it's the "multiple" case, would it be helpful if a single vault script could supply N vault-id/secret pairs instead of just one? That way, you could apply whatever error-handling/fallback logic you like (including parallelization, if necessary), and with a full knowledge of what else had already occurred.

Yes, we have multiple vault id's and they all map to the same script. Giving the script access to N wouldn't help much since we don't know which will be used and have to check all of them. What would help is if ansible could preprocess the secrets and collect the vault-id's that will be used and only request those.

ansibot added feature This issue/PR relates to a feature request. needs_triage Needs a first human triage before being processed. needs_revision This PR fails CI tests or a maintainer has requested a review/revision of the PR. labels Dec 13, 2023

Urth force-pushed the vault-lazy-loaded-password-script branch from c960e54 to c9dc0d3 Compare December 13, 2023 15:18

Urth force-pushed the vault-lazy-loaded-password-script branch from c9dc0d3 to 4ef61dc Compare December 13, 2023 15:46

ansibot removed the needs_revision This PR fails CI tests or a maintainer has requested a review/revision of the PR. label Dec 13, 2023

ansibot added the stale_ci This PR has been tested by CI more than one week ago. Close and re-open this PR to get it retested. label Jan 2, 2024

nitzmahone self-requested a review January 4, 2024 19:14

nitzmahone removed the needs_triage Needs a first human triage before being processed. label Jan 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Execute ScriptVaultSecret on demand instead of on load #82418

Execute ScriptVaultSecret on demand instead of on load #82418

Urth commented Dec 13, 2023

mstud commented Feb 19, 2024

nitzmahone commented Feb 19, 2024

MJJoker commented Feb 20, 2024

Urth commented Feb 20, 2024

Execute ScriptVaultSecret on demand instead of on load #82418

Are you sure you want to change the base?

Execute ScriptVaultSecret on demand instead of on load #82418

Conversation

Urth commented Dec 13, 2023

SUMMARY

ISSUE TYPE

ADDITIONAL INFORMATION

mstud commented Feb 19, 2024

nitzmahone commented Feb 19, 2024

MJJoker commented Feb 20, 2024

Urth commented Feb 20, 2024