
Deprecate serve command line options in favor of yaml config file #146

Closed
xwu99 opened this issue Mar 15, 2024 · 10 comments · Fixed by #165

xwu99 (Contributor) commented Mar 15, 2024

We will put most of the complex configuration in the YAML file, keep only the necessary command line options, and remove the ones already present in the YAML file to avoid confusion.
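
For illustration, here is a rough sketch of how the options being moved out of the CLI might look inside a model's YAML config. The keys simply mirror the command line options discussed later in this thread; the actual key names and nesting in llm_on_ray/inference/models/*.yaml may differ.

```python
# Illustrative only: a guessed YAML shape for the options leaving the CLI.
# Key names mirror the flags discussed below; the real schema may differ.
import yaml  # pip install pyyaml

example = """
port: 8000
route_prefix: /llama-2-7b
num_replicas: 1
cpus_per_worker: 24
gpus_per_worker: 0
hpus_per_worker: 0
deepspeed: false
workers_per_group: 2
ipex: true
device: cpu
"""

config = yaml.safe_load(example)
print(config["device"])  # -> "cpu"
```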

carsonwang (Contributor) commented:

@Deegue Please list the options to keep and options to remove here.

Deegue (Contributor) commented Mar 27, 2024

As for serve.py, I'm going to remove:

--port
--route_prefix
--num_replicas
--cpus_per_worker
--gpus_per_worker
--hpus_per_worker
--deepspeed
--workers_per_group
--ipex
--device

while these options will still be kept on the command line:
--config_file
--model_id_or_path
--tokenizer_id_or_path
--models
--serve_local_only
--simple
--keep_serve_terminal
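
For reference, a minimal sketch of what the trimmed argument parser in serve.py might look like if only these options remain. This assumes argparse is used; option types and defaults are guesses, not the actual implementation.

```python
# Sketch only: a guess at the trimmed serve.py CLI after this change.
# Option types and defaults are assumptions, not the real implementation.
import argparse

parser = argparse.ArgumentParser(description="llm-on-ray serve (sketch)")
parser.add_argument("--config_file", type=str, default=None,
                    help="YAML inference config file.")
parser.add_argument("--model_id_or_path", type=str, default=None)
parser.add_argument("--tokenizer_id_or_path", type=str, default=None)
parser.add_argument("--models", nargs="*", default=None,
                    help="Subset of the predefined models to deploy.")
parser.add_argument("--serve_local_only", action="store_true")
parser.add_argument("--simple", action="store_true")
parser.add_argument("--keep_serve_terminal", action="store_true")

args = parser.parse_args()
print(args)
```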

carsonwang (Contributor) commented:

cc @xwu99 @jiafuzha @KepingYan

xwu99 (Contributor, Author) commented Apr 3, 2024

@Deegue thanks for the summary. I am OK with this. One exception: @KepingYan, please clarify the differences between the simple and OpenAI modes when using --route_prefix. Could we just put it in the yaml file?

KepingYan (Contributor) commented:

Now we support three modes of passing parameters:

  1. If --config_file is specified, all attribute values are determined by this file.
  2. --config_file is None and --model_id_or_path is specified. In this case, --tokenizer_id_or_path, --port, --route_prefix, --num_replicas, --cpus_per_worker, --gpus_per_worker, --hpus_per_worker, --deepspeed, --workers_per_group, --ipex and --device will take effect.
  3. Both --config_file and --model_id_or_path are None. Then all config files in the llm_on_ray/inference/models/ directory are used as the list of models to serve, and users can specify a sub-list of models to deploy via --models.

If we remove these parameters (--port, --route_prefix, --num_replicas, --cpus_per_worker, --gpus_per_worker, --hpus_per_worker, --deepspeed, --workers_per_group, --ipex, --device), what will determine the other attribute values when the user specifies --model_id_or_path? After the modification, which methods of passing parameters will we provide to users?
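
For illustration, the three modes above could be sketched roughly as follows. All function and helper names here are invented for the example and do not come from serve.py.

```python
# Rough sketch of the three parameter-passing modes described above.
# All names here are hypothetical; the real serve.py logic may differ.
import glob
import os

def build_config_from_cli_flags(args):
    # Hypothetical helper for mode 2: fold the per-model CLI flags
    # (--port, --route_prefix, --device, ...) into a single config dict.
    return {"model_id_or_path": args.model_id_or_path, "port": args.port,
            "route_prefix": args.route_prefix, "device": args.device}

def resolve_serve_configs(args):
    if args.config_file is not None:
        # Mode 1: all attribute values come from the given YAML file.
        return [args.config_file]
    if args.model_id_or_path is not None:
        # Mode 2 (proposed for removal): model given directly on the CLI.
        return [build_config_from_cli_flags(args)]
    # Mode 3: use every predefined config under inference/models/,
    # optionally narrowed down via --models.
    paths = sorted(glob.glob("llm_on_ray/inference/models/*.yaml"))
    if args.models:
        paths = [p for p in paths
                 if os.path.splitext(os.path.basename(p))[0] in args.models]
    return paths
```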

xwu99 (Contributor, Author) commented Apr 3, 2024

> Now we support three modes of passing parameters: [...] After the modification, which methods of passing parameters will we provide to users?

Can we keep 1 and 3 and remove 2?

@Deegue Keep in mind that we need to serve multiple models at the same time, so we also need to support multiple config files.
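
One simple way to support that on the command line (a sketch assuming argparse is used; not the merged change) would be to let --config_file accept several YAML paths:

```python
import argparse

# Sketch: accept one YAML config file per model so several models can be
# served at once. Assumes argparse; not the actual merged implementation.
parser = argparse.ArgumentParser()
parser.add_argument("--config_file", nargs="+", default=None,
                    help="One or more YAML inference config files.")

# Example (file names are made up):
args = parser.parse_args(["--config_file", "llama-2-7b.yaml", "gpt-j-6b.yaml"])
print(args.config_file)  # ['llama-2-7b.yaml', 'gpt-j-6b.yaml']
```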

Deegue (Contributor) commented Apr 3, 2024

> Can we keep 1 and 3 and remove 2?
>
> @Deegue Keep in mind that we need to serve multiple models at the same time, so we also need to support multiple config files.

Obviously, it will be clearer if we remove case 2.

xwu99 (Contributor, Author) commented Apr 8, 2024

> Obviously, it will be clearer if we remove case 2.

Yes, please go ahead and remove case 2, and update the code & docs.

xwu99 (Contributor, Author) commented Apr 8, 2024

Also rename --model_id_or_path to --model_id.

KepingYan (Contributor) commented:

> Also rename --model_id_or_path to --model_id.

If we remove case 2, can we also remove --model_id_or_path and --tokenizer_id_or_path? Just use --models to specify the list of models to deploy?

xwu99 closed this as completed May 11, 2024