I want to use 2 nodes (2 × 8 GPUs) to accelerate llama2 inference. How can I do this?

Replies: 1 comment

- You can launch llama2 independently on each node, then manage the instances with k8s.
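The suggested setup (one llama2 server per node, load-balanced by Kubernetes) could be sketched roughly as below. The container image, port, and replica spread are assumptions for illustration, not details from the thread:

```yaml
# Hypothetical manifest: one llama2 inference server per node
# (8 GPUs each), with a Service load-balancing across both nodes.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llama2
spec:
  replicas: 2                      # one replica per node
  selector:
    matchLabels:
      app: llama2
  template:
    metadata:
      labels:
        app: llama2
    spec:
      affinity:                    # force replicas onto different nodes
        podAntiAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            - labelSelector:
                matchLabels:
                  app: llama2
              topologyKey: kubernetes.io/hostname
      containers:
        - name: llama2
          image: my-registry/llama2-server:latest   # placeholder image
          ports:
            - containerPort: 8000
          resources:
            limits:
              nvidia.com/gpu: 8    # claim all 8 GPUs on the node
---
apiVersion: v1
kind: Service
metadata:
  name: llama2
spec:
  selector:
    app: llama2
  ports:
    - port: 8000
      targetPort: 8000
```

Requests sent to the `llama2` Service are spread across the two pods, so this scales aggregate throughput with node count; it does not reduce the latency of a single request, since each request is still served by one node.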