[Azure DataStore] Handle storage options as secrets #1206

hayesgb · 2021-08-15T03:31:06Z

This add the ability to pass standard dictionary keys from fsspec's storage_options parameter into mlrun.run.get_dataitem() as secrets.

Enabling this will more easily allow users engaging in exploratory analysis to leverage the mlrun api to fetch data_items from Azure by enabling the following

storage_options={'account_name': "<NAME>", 'credential': <CREDENTIAL>}
df = mlrun.run.get_dataitem("az://CONTAINER/myfile.parquet", secrets=storage_options).as_df()

…ptions

…h Azure

Hedingber

@hayesgb Looks good to me, just note you have some conflict

hayesgb · 2021-08-15T17:46:50Z

Fixed merge conflict. Thanks!

Hedingber · 2021-08-16T10:49:14Z

@hayesgb I now noticed that the test file you added is almost an exact duplicate of test_azure_blob.py
I feel like we can merge them without too much effort
Looks like you'll need to make the verify_auth... method to behave a bit different whether it's env vars or storage options (can be determined by whether one of the params starts with AZURE_)
And in the tests simply always pass storage options to secrets, just in the case of env vars, it will be None
WDYT ?

hayesgb · 2021-08-17T03:18:26Z

@Hedingber -- I considered combining the tests, but was concerned about the interpretability. The two approaches serve entirely different use-cases, and I was worried that if the tests were blended (storing secrets or retrieving environmental variables vs user-passed credentials, which will most likely be done during exploratory analysis) it would lead to confusion and create potential maintainability issues. Thoughts?

Hedingber · 2021-08-17T19:44:12Z

@hayesgb I see your point but I feel like leaving the tests duplicated will hurt maintainability more 😬 , @theSaarco WDYT ?

hayesgb · 2021-08-17T23:22:29Z

@Hedingber -- No worries. I consolidated the tests into a single file and updated the PR. LMKWYT.

Hedingber · 2021-08-18T00:04:29Z

@hayesgb Looks great!
merged
Big appreciation for the detailed comment in the verify auth method 👏 👏

hayesgb added 4 commits August 14, 2021 22:19

Added ability to read standard storage_options dict keys to storage_o…

3e3162a

…ptions

Validates that passing a standard storage_options dict integrates wit…

545c299

…h Azure

Updated integration yaml, and added credential key to storage_options

af064d9

linting

78e1f2e

Hedingber approved these changes Aug 15, 2021

View reviewed changes

Hedingber requested a review from theSaarco August 15, 2021 16:54

Hedingber changed the title ~~Handle storage options as secrets~~ [Azure DataStore] Handle storage options as secrets Aug 15, 2021

Merge branch 'development' into handle_storage_options_as_secrets

1da1b95

hayesgb added 2 commits August 17, 2021 18:19

Consolidated tests for storage_options and environmental vars

091aeb7

linting

256cf0e

hayesgb requested a review from Hedingber August 17, 2021 23:22

Hedingber approved these changes Aug 18, 2021

View reviewed changes

Hedingber merged commit 936a829 into mlrun:development Aug 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Azure DataStore] Handle storage options as secrets #1206

[Azure DataStore] Handle storage options as secrets #1206

hayesgb commented Aug 15, 2021

Hedingber left a comment

hayesgb commented Aug 15, 2021

Hedingber commented Aug 16, 2021

hayesgb commented Aug 17, 2021 •

edited

Hedingber commented Aug 17, 2021

hayesgb commented Aug 17, 2021

Hedingber commented Aug 18, 2021 •

edited

[Azure DataStore] Handle storage options as secrets #1206

[Azure DataStore] Handle storage options as secrets #1206

Conversation

hayesgb commented Aug 15, 2021

Hedingber left a comment

Choose a reason for hiding this comment

hayesgb commented Aug 15, 2021

Hedingber commented Aug 16, 2021

hayesgb commented Aug 17, 2021 • edited

Hedingber commented Aug 17, 2021

hayesgb commented Aug 17, 2021

Hedingber commented Aug 18, 2021 • edited

hayesgb commented Aug 17, 2021 •

edited

Hedingber commented Aug 18, 2021 •

edited