
CPU limits #48

Closed
lystrata opened this issue Dec 7, 2023 · 1 comment
Labels: question (Further information is requested)

Comments

lystrata commented Dec 7, 2023

On an 8-core system, I see that onprem pegs 4 cores at 100% while the other 4 cores are relatively inactive. Is this an intentional design limit, or can we enable the code to make use of more cores?

amaiya added the question label Dec 8, 2023
amaiya (Owner) commented Dec 8, 2023

By default, llama-cpp-python uses half of your CPU cores. You can change this by supplying the n_threads parameter to LLM:

from onprem import LLM

# n_threads sets how many CPU threads llama-cpp-python uses (default: half)
llm = LLM(n_threads=8)  # e.g., use all 8 cores on an 8-core system
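
For example, to use every available core rather than a hard-coded count, the thread count can be derived at runtime (a minimal sketch: n_threads is the documented parameter from above, while using os.cpu_count() here is an illustrative assumption, not something prescribed by onprem):

import os

from onprem import LLM

# os.cpu_count() reports the number of logical CPUs on the machine;
# passing it as n_threads asks llama-cpp-python to use all of them.
llm = LLM(n_threads=os.cpu_count())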

amaiya closed this as completed Dec 8, 2023