fix(scheduling): numpy worker environs are not taking effect #2893

bojiang · 2022-08-11T16:02:43Z

Solution:

* Only allow scheduling_strategy to control the environment variables.
* Set up the environment before importing bentoml.

WARNING:
This is a breaking change for custom schedulers, a public API that is not documented.
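
For illustration, the second point of the solution could look like the sketch below: environment overrides are written to os.environ first, and libraries that read them at load time are imported only afterwards. The helper name apply_worker_env and the specific variables (OMP_NUM_THREADS, MKL_NUM_THREADS) are assumptions made for this example, not code from this PR.

```python
from __future__ import annotations

import os


def apply_worker_env(env: dict[str, object]) -> None:
    """Copy a worker's environment overrides into os.environ as strings."""
    for key, value in env.items():
        # os.environ only accepts str values, so convert ints explicitly.
        os.environ[key] = str(value)


# Hypothetical per-worker settings: thread-count variables that
# OpenMP/MKL-backed libraries typically read once, when they are loaded.
worker_env = {"OMP_NUM_THREADS": 1, "MKL_NUM_THREADS": 1}
apply_worker_env(worker_env)

# Only now import libraries that consult these variables at load time,
# for example:
# import numpy as np
```

If the import happened first, a later change to os.environ would likely be ignored by thread pools that were already initialized, which is consistent with the symptom in the PR title.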

codecov · 2022-08-11T16:05:46Z

Codecov Report

Merging #2893 (3280c4b) into main (3ec89ec) will decrease coverage by 1.53%.
The diff coverage is 21.66%.

❗ Current head 3280c4b differs from pull request most recent head f0b7386. Consider uploading reports for the commit f0b7386 to get more accurate results

@@            Coverage Diff             @@
##             main    #2893      +/-   ##
==========================================
- Coverage   70.88%   69.35%   -1.54%     
==========================================
  Files         103      103              
  Lines        9335     9374      +39     
==========================================
- Hits         6617     6501     -116     
- Misses       2718     2873     +155

| Impacted Files | Coverage Δ | |
|---|---|---|
| bentoml/_internal/yatai_client/__init__.py | `24.06% <6.06%> (-0.94%)` | ⬇️ |
| bentoml/_internal/yatai_rest_api_client/yatai.py | `31.25% <27.27%> (-0.64%)` | ⬇️ |
| bentoml/_internal/yatai_rest_api_client/schemas.py | `93.53% <100.00%> (+0.06%)` | ⬆️ |
| bentoml/_internal/utils/buildx.py | `0.00% <0.00%> (-49.00%)` | ⬇️ |
| bentoml/_internal/utils/docker.py | `34.48% <0.00%> (-34.49%)` | ⬇️ |
| bentoml/_internal/utils/circus/__init__.py | `60.00% <0.00%> (-30.00%)` | ⬇️ |
| bentoml/_internal/utils/platform.py | `66.66% <0.00%> (-8.34%)` | ⬇️ |
| bentoml/_internal/runner/container.py | `83.98% <0.00%> (-6.07%)` | ⬇️ |
| bentoml/_internal/runner/runner_handle/remote.py | `83.87% <0.00%> (-4.31%)` | ⬇️ |
| bentoml/_internal/runner/utils.py | `86.88% <0.00%> (-3.28%)` | ⬇️ |

... and 7 more

parano · 2022-08-14T21:38:15Z

bentoml_cli/server/runner.py

@@ -51,20 +58,10 @@ def main(
            - file:///path/to/unix.sock
            - fd://12
        working_dir: (Optional) the working directory
-        worker_id: (Optional) if set, the runner will be started as a worker with the given ID
+        worker_id: (Optional) if set, the runner will be started as a worker with the given ID. Important: begin from 1.


Please add argument documentation for worker_env_map.

parano · 2022-08-14T21:39:41Z

bentoml_cli/server/runner.py

+    required=False,
+    type=click.STRING,
+    default=None,
+    help="The environment variables to pass to the worker process. The format is a JSON string, e.g. '{0: {\"CUDA_VISIBLE_DEVICES\": 0}}'.",


Should we use a dotenv file instead of JSON? That seems easier for debugging purposes.

bojiang · Aug 15, 2022

The env map includes all environment variables for each worker:

{ 0: {"CUDA_VISIBLE_DEVICES": 0}, 1: {"CUDA_VISIBLE_DEVICES": 1}, }

That does not seem easy to represent with dotenv files.

ssheng · Aug 17, 2022

Agreed that passing JSON on the CLI isn't the most intuitive. Using .env files, we could allow multiple arguments, each a key-value pair:

--worker-env 0:worker_0.env --worker-env 1:worker_1.env

bojiang · Aug 17, 2022

@ssheng I think that's too complicated. This is not a public API.
The public APIs are the bentoml serve command and bentoml.serve.
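
One detail worth noting for the JSON form discussed in this thread: JSON object keys must be strings, so worker IDs have to be quoted in the CLI argument and converted back to integers after parsing. A minimal sketch of that step follows; parse_worker_env_map is a hypothetical helper for illustration and may not match how the PR actually parses the option.

```python
from __future__ import annotations

import json


def parse_worker_env_map(raw: str) -> dict[int, dict[str, str]]:
    """Parse a JSON worker-env map into {worker_id: {name: value}}."""
    decoded = json.loads(raw)
    if not isinstance(decoded, dict):
        raise ValueError("worker env map must be a JSON object")

    env_map: dict[int, dict[str, str]] = {}
    for worker_id, env in decoded.items():
        if not isinstance(env, dict):
            raise ValueError(f"env for worker {worker_id!r} must be a JSON object")
        # Keys arrive as strings; environment values must be strings too.
        env_map[int(worker_id)] = {name: str(value) for name, value in env.items()}
    return env_map


raw = '{"0": {"CUDA_VISIBLE_DEVICES": 0}, "1": {"CUDA_VISIBLE_DEVICES": 1}}'
env_map = parse_worker_env_map(raw)
```

Here env_map maps the integer worker IDs 0 and 1 to dictionaries whose values are the strings "0" and "1", ready to be written into os.environ.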

parano · 2022-08-14T21:42:32Z

bentoml/_internal/runner/runner.py

-            worker_id,
-        )
+    @property
+    def scheduled_worker_env_map(self) -> dict[int, dict[str, t.Any]]:


Does Yatai need this information for scheduling runners?

bojiang · Aug 15, 2022

The worker concept is transparent to Yatai.

parano · Aug 15, 2022

@bojiang Got it. In the case of Yatai, it will just use the resources available from the system.
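
To make the shape of such a mapping concrete, here is a rough sketch of how a per-worker env map with the return type dict[int, dict[str, t.Any]] might be derived from the available resources. The function build_worker_env_map, its parameters, and the choice of variables are all assumptions for illustration; the actual scheduling strategy in BentoML may differ. Worker IDs start from 1, following the worker_id docstring change in this PR.

```python
from __future__ import annotations

import typing as t


def build_worker_env_map(
    gpu_ids: list[int], cpu_threads: int = 1
) -> dict[int, dict[str, t.Any]]:
    """Return one environment dict per worker, keyed by 1-based worker ID."""
    if gpu_ids:
        # Assumed policy: one worker per GPU, each pinned to a single device.
        return {
            worker_id: {"CUDA_VISIBLE_DEVICES": str(gpu)}
            for worker_id, gpu in enumerate(gpu_ids, start=1)
        }
    # Assumed CPU-only policy: a single worker with a capped thread count.
    return {1: {"OMP_NUM_THREADS": str(cpu_threads)}}


gpu_map = build_worker_env_map([0, 1])
cpu_map = build_worker_env_map([], cpu_threads=4)
```

Under these assumptions, gpu_map assigns GPU 0 to worker 1 and GPU 1 to worker 2, while cpu_map contains a single worker limited to four threads.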


bojiang requested review from ssheng, parano and a team as code owners August 11, 2022 16:02

bojiang requested review from jjmachan and removed request for a team August 11, 2022 16:02

bojiang mentioned this pull request Aug 11, 2022

fix(scheduling): raise an error for invalid resources #2894

Merged

parano reviewed Aug 14, 2022


bojiang force-pushed the fix-scheduling branch 2 times, most recently from 6a26765 to f0b7386 Compare August 18, 2022 04:54

bojiang added 11 commits August 18, 2022 12:54

fix(scheduling): numpy worker environs are not taking effect

1c1fd80

solution: * only allow scheduling_strategy to control the env variables * setting up environ before importing bentoml

style

cedf5d1

assert message

6d7dd4a

fix unittest

3f173c5

fix

37f2974

style

5cd921e

isort

3908b52

fix index

929a16c

comments

2e1314c

add doc

8909d85

fix unittest

7030027

bojiang force-pushed the fix-scheduling branch from f0b7386 to 7030027 Compare August 18, 2022 04:54

ssheng approved these changes Aug 18, 2022


ssheng merged commit ff7d608 into bentoml:main Aug 18, 2022


bojiang commented Aug 11, 2022 (edited)
codecov bot commented Aug 11, 2022 (edited)
parano Aug 14, 2022
parano Aug 14, 2022
bojiang Aug 15, 2022 (edited)
ssheng Aug 17, 2022 (edited)
bojiang Aug 17, 2022
parano Aug 14, 2022
bojiang Aug 15, 2022
parano Aug 15, 2022
bojiang Aug 17, 2022
