Implement pluggable connection policy by BasPH · Pull Request #36291 · apache/airflow

BasPH · 2023-12-18T21:01:46Z

This PR implements a pluggable connection policy. The policy can apply validation rules, e.g. restricting access to a pre-defined list of connection ids in a secrets backend, or apply business logic such as prefixing the given connection id based on a property such as the DAG owner.

Example:

from airflow.exceptions import AirflowClusterPolicyViolation


def connection_policy(conn_id: str) -> str:
    """
    This connection policy prefixes the connection id with "bi/" in case the owner is set to "bi", and raises
    an AirflowClusterPolicyViolation in case an unknown owner is found.
    """
    # Maps a DAG owner to the prefix applied to every connection id it requests.
    dagowner_connid_prefixes = {"bi": "bi/"}

    # Local import, kept as in the original example.
    from airflow.operators.python import get_current_context

    # The policy runs inside a task, so the owning DAG is available from the task context.
    task_context = get_current_context()
    dag_owner = task_context["dag"].owner

    try:
        return dagowner_connid_prefixes[dag_owner] + conn_id
    except KeyError:
        raise AirflowClusterPolicyViolation(
            f"DAG owner '{dag_owner}' not found in list of owners to route a connection. "
            "Consult your cluster admin."
        )
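
The description also mentions restricting access to a pre-defined list of connection ids. A minimal sketch of that variant might look as follows. `ALLOWED_CONN_IDS` and its entries are hypothetical, and a local `PolicyViolation` class stands in for `AirflowClusterPolicyViolation` so the snippet runs without Airflow installed.

```python
class PolicyViolation(Exception):
    """Stand-in for airflow.exceptions.AirflowClusterPolicyViolation (assumption)."""


# Hypothetical allow-list; a real deployment would define its own ids.
ALLOWED_CONN_IDS = frozenset({"bi/warehouse", "bi/reporting"})


def connection_policy(conn_id: str) -> str:
    """Return conn_id unchanged if it is allow-listed, otherwise raise."""
    if conn_id not in ALLOWED_CONN_IDS:
        raise PolicyViolation(
            f"Connection id '{conn_id}' is not in the list of allowed connections. "
            "Consult your cluster admin."
        )
    # The policy always returns a connection id, even when it does not alter it.
    return conn_id
```

As the review comments further down point out, a check like this is a convenience and routing mechanism, not a security boundary.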

dstandish · 2023-12-18T21:43:55Z

airflow/policies.py



+@local_settings_hookspec
+def connection_policy(conn_id: str) -> str:  # type: ignore[empty-body]


I thought for a second that maybe we should call this connection_id_policy or something because, e.g. with task policy or dag policy, you get and can mutate the actual object -- but here we only touch the conn_id. So the naming is perhaps slightly inconsistent, but that's nitpicking.

I suppose we could call it connection_retrieval_policy? Or leave it 🤷

dstandish · 2023-12-18T21:58:00Z

airflow/policies.py

+
+    To support connection id alterations at runtime (typically prefixes are added), a connection id must be
+    returned, even when not altering the connection id.
+    """


I am not sure, but I also wonder whether we should add a disclaimer that this could provide a false sense of security? To me it makes sense to have this because people want something, and this is about as good as we can do right now. But, it's pretty easy to circumvent e.g. with secrets_backend_list[0].get_connection.

Ah, I just saw that we have the very same comment.
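
To illustrate the circumvention concern, here is a small self-contained sketch. Everything in it is a hypothetical stand-in: `InMemoryBackend`, the stored values, and `hook_get_connection` only mimic the shape of a secrets backend and a hook-level lookup, and are not Airflow's actual classes.

```python
class InMemoryBackend:
    """Hypothetical stand-in for a secrets backend."""

    def __init__(self, store):
        self.store = store

    def get_connection(self, conn_id):
        return self.store.get(conn_id)


# Stand-in for the list of configured secrets backends.
secrets_backend_list = [
    InMemoryBackend(
        {
            "bi/warehouse": "postgres://bi-host/warehouse",
            "finance/ledger": "postgres://finance-host/ledger",
        }
    )
]


def connection_policy(conn_id):
    """Hypothetical policy: only ids under the 'bi/' prefix are allowed."""
    if not conn_id.startswith("bi/"):
        raise PermissionError(f"Connection id '{conn_id}' is not allowed")
    return conn_id


def hook_get_connection(conn_id):
    """Hook-level lookup that applies the policy before touching a backend."""
    conn_id = connection_policy(conn_id)
    return secrets_backend_list[0].get_connection(conn_id)


# Through the hook-level entry point, the policy rejects the id.
try:
    hook_get_connection("finance/ledger")
    blocked = False
except PermissionError:
    blocked = True

# Calling the backend directly never passes through the policy.
leaked = secrets_backend_list[0].get_connection("finance/ledger")
```

In this sketch `blocked` ends up `True` while `leaked` still holds the finance connection, which is the gap the comment describes.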

dstandish · 2023-12-18T21:58:43Z

airflow/hooks/base.py

        :param conn_id: connection id
        :return: connection
        """
+        conn_id = settings.connection_policy(conn_id=conn_id)


I wonder if Connection.get_connection_from_secrets would be a better location

Possibly, but I don't think it matters that much?

The policy would have to be applied before the first request to any secrets system. We could apply it inside Connection.get_connection_from_secrets, but it would have to be the first statement in that method, because after that we start requesting secrets. That applies the policy one level deeper in the stack but logically it wouldn't make a difference.
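
The ordering constraint can be sketched with stand-ins. `RecordingBackend` and this simplified `get_connection_from_secrets` are hypothetical and only mirror the shape of the real method; the point is that the policy runs as the first statement, so no backend ever sees the unmodified id.

```python
from typing import Callable, Optional


class RecordingBackend:
    """Hypothetical in-memory backend that records every id it is asked for."""

    def __init__(self, store: dict[str, str]) -> None:
        self.store = store
        self.requested: list[str] = []

    def get_connection(self, conn_id: str) -> Optional[str]:
        self.requested.append(conn_id)
        return self.store.get(conn_id)


def get_connection_from_secrets(
    conn_id: str,
    backends: list[RecordingBackend],
    policy: Callable[[str], str],
) -> str:
    # First statement: apply the policy before any backend is queried.
    conn_id = policy(conn_id)
    for backend in backends:
        conn = backend.get_connection(conn_id)
        if conn is not None:
            return conn
    raise LookupError(f"The conn_id `{conn_id}` isn't defined")


backend = RecordingBackend({"bi/warehouse": "postgres://bi-host/warehouse"})
# Hypothetical policy that prefixes every id with "bi/".
result = get_connection_from_secrets("warehouse", [backend], lambda c: "bi/" + c)
```

After this call, `backend.requested` contains only the prefixed id `"bi/warehouse"`. Applying the policy one frame higher, in the hook, would produce the same sequence of backend requests.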

potiuk · 2023-12-18T22:07:01Z

This is an interesting idea, but I think we should be very clear in the documentation and description that it's not a security feature. Any DAG Author can simply instantiate the connection via creating the model and retrieving it. There is a risk that this will give the users a false sense of security if we are not explicit about it.

potiuk · 2023-12-18T22:09:09Z

Just to add: I can very easily imagine that, if we don't do it, someone will raise a security report that you can bypass it. And if someone does, and it's not clearly explained in our security model, then we will have to, well, fix it.

BasPH · 2023-12-19T20:26:37Z

@dstandish @potiuk I didn't intend this to be a security feature. I gave your comments some thought and decided to close the PR. Security isn't something I want to be fuzzy about and stating we put a secure-looking policy in Airflow with a disclaimer that it isn't a security feature somewhere in the docs just doesn't work for me.

BasPH added 2 commits December 18, 2023 21:47

Implement pluggable connection policy

7fa97b4

Simplify names

ff3190a

BasPH requested a review from potiuk as a code owner December 18, 2023 21:01

boring-cyborg bot added area:core-operators kind:documentation labels Dec 18, 2023

BasPH and others added 2 commits December 18, 2023 22:35

Precommit fix

b39c2fd

Merge branch 'main' into get_connection_cluster_policy

08fa073

dstandish reviewed Dec 18, 2023

dstandish approved these changes Dec 18, 2023

BasPH closed this Dec 19, 2023

BasPH wants to merge 4 commits into apache:main from BasPH:get_connection_cluster_policy
