Replies: 2 comments
-
We currently shard the model via
-
But we are working on an internal sharding feature right now, so stay tuned!
-
I see that the `accelerate` library is in the dependencies. However, I cannot see any argument that enables model sharding for inference. I may be missing something, though.