Thank you for taking the time to submit an enhancement request to the LocalAI project.
Based on your description, it seems that adding the --tensor-parallel-size setting to the vllm backend configuration through a YAML file would indeed be a useful enhancement. This would allow users to leverage multiple GPUs for inference, which is an essential capability for handling larger workloads.
I will pass your suggestion along to the project maintainers. They will review it and decide whether or not to implement it. You may not receive a direct reply, but rest assured, your request has been noted and will be considered.
Remember, LocalAI is a community-driven project, so if you would like to contribute to its development, please feel free to join the community and help out.
Thank you once again for your feedback, and we look forward to seeing more contributions from you!
Is your feature request related to a problem? Please describe.
No similar issues were found.
Describe the solution you'd like
I would like it to be possible to set --tensor-parallel-size from the model YAML, similar to gpu_memory_utilization and trust_remote_code in the vllm backend.
https://docs.vllm.ai/en/latest/models/engine_args.html
The current settings only allow inference on a single GPU; this option is needed to shard a model across multiple GPUs.
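For illustration, a model config might look like the sketch below. The tensor_parallel_size field is the proposed addition; its exact name and placement are assumptions based on how the existing vllm options (gpu_memory_utilization, trust_remote_code) are exposed, and the model name is a placeholder.

```yaml
# Hypothetical LocalAI model config (e.g. models/my-model.yaml).
# tensor_parallel_size is the *proposed* field; gpu_memory_utilization
# and trust_remote_code are options the vllm backend already accepts.
name: my-model
backend: vllm
parameters:
  model: mistralai/Mistral-7B-Instruct-v0.2  # placeholder model
trust_remote_code: true
gpu_memory_utilization: 0.90
tensor_parallel_size: 2  # proposed: shard the model across 2 GPUs
```

With such a field, LocalAI could pass the value straight through to vLLM as its --tensor-parallel-size engine argument (documented at the link above), the same way the existing options map onto vLLM's engine args.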