Update _get_env_from to handle cases better #398

carolineechen · 2024-01-26T01:25:54Z

Improves two cases in _get_env_from, which is expected to return an Env:

1. The input is env = "base_env": it falls through all the if statements, so the string "base_env" is returned instead of the Env object.
2. The env is not saved in rns_client but exists on the cluster associated with the call: the env string is returned instead of the Env object.
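
To make the first case concrete, here is a minimal sketch of the fall-through. Env, SAVED_ENVS, and both functions are hypothetical stand-ins rather than runhouse's actual code, and constructing a fresh Env for "base_env" is only an assumption about one possible fix.

```python
from typing import Optional, Union


class Env:
    """Hypothetical stand-in for runhouse's Env; only the name matters here."""

    def __init__(self, name: str):
        self.name = name


# Hypothetical registry standing in for configs saved via rns_client.
SAVED_ENVS = {"my_env": Env("my_env")}


def get_env_fallthrough(env: Union[str, Env, None]) -> Union[str, Env, None]:
    """Mimics the reported problem: an unmatched string is returned unchanged."""
    if isinstance(env, Env) or env is None:
        return env
    if env in SAVED_ENVS:
        return SAVED_ENVS[env]
    return env  # "base_env" lands here and leaks out as a plain str


def get_env_strict(env: Union[str, Env, None]) -> Optional[Env]:
    """One possible contract: always return an Env (or None), never a str."""
    if isinstance(env, Env) or env is None:
        return env
    if env in SAVED_ENVS:
        return SAVED_ENVS[env]
    if env == "base_env":
        return Env(name=env)  # assumption: build a default Env for this name
    return None
```

With the strict variant a caller can rely on `result is None or isinstance(result, Env)`, so a missing env shows up at the call site rather than later as an attribute error on a string.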

carolineechen · 2024-01-26T01:26:07Z

Update _get_env_from to handle cases better #398 👈
main


carolineechen · 2024-01-26T20:48:10Z

runhouse/resources/envs/utils.py

+
+    warning = f"Could not locate Env from value: {env}"
+    warning = f"{warning} on cluster {system.name}" if system else warning
+    logging.warning(warning)


@dongreenberg can you double-check my logic here and whether this PR makes sense? Also, is there any case where it still makes sense to return the input env when it doesn't satisfy any of the conditions, or should we simply throw an error?

In most cases we expect an Env to be returned, and a string would throw an error elsewhere as soon as we try to access an Env property on it. But there could also be cases where the output is only used to populate a config, etc. I'm not sure whether returning a string is a problem when the corresponding Env can't be found.

dongreenberg

Tbh I think I'm missing some context, but there's some back and forth here with the cluster that seems a bit hairy to me. Let's chat about it live

dongreenberg · 2024-01-30T10:30:20Z

runhouse/resources/envs/utils.py

+    if isinstance(env, Resource) or env is None:
        return env

    from runhouse.resources.envs import Env


Suggested change

-    if isinstance(env, Resource) or env is None:
-        return env
-
-    from runhouse.resources.envs import Env
+    from runhouse.resources.envs import Env
+
+    if isinstance(env, Env) or env is None:
+        return env

dongreenberg · 2024-01-30T10:41:04Z

runhouse/resources/envs/utils.py

+            if system.get(env):
+                return system.get(env)
+
+            # handle case where env is passed in as full rns address
+            name, _ = rns_client.split_rns_name_and_path(
+                rns_client.resolve_rns_path(env)
+            )
+            if system.get(name):
+                return system.get(name)


Suggested change

-            if system.get(env):
-                return system.get(env)
-
-            # handle case where env is passed in as full rns address
-            name, _ = rns_client.split_rns_name_and_path(
-                rns_client.resolve_rns_path(env)
-            )
-            if system.get(name):
-                return system.get(name)
+            # handle case where env is passed in as full rns address
+            name, _ = rns_client.split_rns_name_and_path(
+                rns_client.resolve_rns_path(env)
+            ) if "/" in env else (env, None)
+            if system.get(name):
+                return system.get(name)

Just saves a get
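
As a rough illustration of the same idea, the sketch below normalizes the name first and then does a single lookup. It goes one step beyond the suggestion by storing the result instead of calling get again for the return value. CountingCluster, short_name, and lookup_env are hypothetical; short_name is only a placeholder for the rns_client resolve and split step, not its real behavior.

```python
from typing import Any, Dict, Optional


class CountingCluster:
    """Hypothetical stand-in for a cluster; counts how often get() is called."""

    def __init__(self, objects: Dict[str, Any]):
        self._objects = objects
        self.get_calls = 0

    def get(self, key: str) -> Optional[Any]:
        self.get_calls += 1
        return self._objects.get(key)


def short_name(env: str) -> str:
    """Placeholder for the rns resolve/split step: keep the last path segment."""
    return env.rsplit("/", 1)[-1] if "/" in env else env


def lookup_env(system: CountingCluster, env: str) -> Optional[Any]:
    """Look up an env on the cluster with exactly one get() call."""
    name = short_name(env)
    found = system.get(name)  # single lookup; the result is reused below
    return found or None
```

If each get is a round trip to the cluster, collapsing the check and the return into one call halves the traffic for every hit.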

dongreenberg · 2024-01-30T10:52:21Z

runhouse/resources/envs/utils.py

+
+    warning = f"Could not locate Env from value: {env}"
+    warning = f"{warning} on cluster {system.name}" if system else warning
+    logging.warning(warning)


tbh I'm nervous that passing system here is a little non-deterministic. Den and this system could have different configs for that env name, and the only thing determining which one is returned is the order of the checks we've implemented here. What case are we solving for where we want to check the cluster for the env config?

I also don't think I'd log a warning here. We use this method all the time in places where we don't necessarily expect an env to be found (but should construct one if it is), so it'll be pretty noisy for the user.
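
One way to picture both concerns is a resolver where the lookup order is an explicit argument and a miss is logged at debug level only. This is a sketch under assumptions, not the PR's code: resolve_env_config, the Source pairs, and the den and cluster dictionaries are all hypothetical.

```python
import logging
from typing import Callable, Optional, Sequence, Tuple

logger = logging.getLogger(__name__)

# Each source is a (label, lookup) pair; lookup returns a config dict or None.
Source = Tuple[str, Callable[[str], Optional[dict]]]


def resolve_env_config(name: str, sources: Sequence[Source]) -> Optional[dict]:
    """Return the first config found, checking sources in the order given."""
    for label, lookup in sources:
        config = lookup(name)
        if config is not None:
            logger.debug("Resolved env %r from %s", name, label)
            return config
    # A miss is expected for many callers, so keep it out of the user's way.
    logger.debug("No saved config for env %r; caller may construct one", name)
    return None


# Hypothetical stores: the same env name with a different config in each place.
den = {"my_env": {"reqs": ["numpy"]}}
cluster = {"my_env": {"reqs": ["torch"]}}

den_first = [("den", den.get), ("cluster", cluster.get)]
cluster_first = [("cluster", cluster.get), ("den", den.get)]
```

The same name resolves to different configs depending only on which list is passed, which is the ordering dependence described above. Making the order a parameter at least forces each caller to choose it deliberately.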

dongreenberg · 2024-01-30T10:54:00Z

runhouse/resources/hardware/cluster.py

@@ -317,7 +318,7 @@ def install_packages(
        from runhouse.resources.envs.env import Env

        self.check_server()
-        env = _get_env_from(env) or Env(name=env or Env.DEFAULT_NAME)
+        env = _get_env_from(env, system=self) or Env(name=env or Env.DEFAULT_NAME)


This makes me a bit nervous - if the env contains package objects, they will be serialized and sent back pointing to the cluster's local filesystem. It's also not totally clear what's happening here: we're getting the env object from the system by name so that we can then extract the name out of it?

dongreenberg · 2024-01-30T10:56:47Z

I also think perhaps the base_env and base default naming business is confusing me here. We should probably rename those or eliminate the overlap asap.

carolineechen force-pushed the cc/env-utils branch from f60026f to 60b739f on January 26, 2024 04:13

carolineechen added 2 commits January 25, 2024 23:14

handle case where _get_env_from receives "base_env"

60b739f

support case where env only lives on cluster

fbb6a6f

carolineechen requested a review from dongreenberg January 26, 2024 20:43
