Default value for the number of threads #89
Comments
It's a bit counterintuitive to me. Hyperthreading was created to fully utilize the CPU in memory-bound programs. If we are talking about limited CPU utilization on a VM, then in my opinion the library should not handle that case specially and should instead offer the most performant option for the base scenario. In general, performance should be tested on different systems and the best-performing default value chosen.
This was tested in the original llama.cpp. The C++ implementation already utilizes the full available compute of the physical cores; increasing the number of threads beyond that would only lower performance, for the reason mentioned above.
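To illustrate the distinction being argued here: `os.cpu_count()` reports *logical* CPUs, so on a hyperthreaded (SMT-2) machine, halving it approximates the physical core count. This is a sketch of that heuristic, not code from the library:

```python
import os

# os.cpu_count() counts *logical* CPUs, which on a hyperthreaded
# (SMT-2) machine is twice the number of physical cores. Halving it
# approximates the physical core count that compute-bound inference
# threads should target. On machines without SMT, this undercounts
# the physical cores by half -- the behavior the issue complains about.
logical_cpus = os.cpu_count() or 1
approx_physical = max(logical_cpus // 2, 1)
print(logical_cpus, approx_physical)
```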
Okay. If systems with hyperthreading see a regression, then this logic can stay for them, but why isn't the logic different for systems where all cores are physical?
Closing, as it is impossible to support every hardware configuration.
The current default value is cpu_count/2:
llama-cpp-python/llama_cpp/llama.py
Line 102 in b2a24bd
This value does not seem optimal for multicore systems. For example, a CPU with 8 physical cores will have 4 of them idle. Put simply, we get roughly a twofold slowdown (assuming there are no further nuances in model execution).
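For concreteness, the current default can be sketched as a small helper mirroring the expression at line 102 (the name `default_n_threads` is illustrative; the real code inlines the expression):

```python
import multiprocessing

def default_n_threads(cpu_count: int) -> int:
    # Sketch of the current default: half the logical CPU count,
    # floored, with a minimum of one thread.
    return max(cpu_count // 2, 1)

# On a machine reporting 8 CPUs the default is 4 threads, so if all
# 8 are physical cores, half of them sit idle during inference.
print(default_n_threads(8))   # 4
print(default_n_threads(16))  # 8
print(default_n_threads(1))   # 1

# An explicit override when constructing the model, e.g.
# Llama(model_path=..., n_threads=multiprocessing.cpu_count()),
# restores full utilization on machines without hyperthreading.
```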
Related issues: #71
In this discussion I would like to understand the motivation for this default value, as it does not seem obvious to most users.