lib/cpuinfo: Increase the file descriptors limit to handle more CPUs #263
Open: babumoger wants to merge 1 commit into intel:master from babumoger:AMD-Max_cores_fix-0.1
Please take a look at the code.
Yes, we are reviewing this code. Thanks
Hi Babu,
babumoger force-pushed the AMD-Max_cores_fix-0.1 branch from 9b41e2a to 244a252 on April 19, 2024 14:10
Hi Raghavan, I have implemented your changes. Please review. Thanks
rkanagar reviewed on Apr 24, 2024
rkanagar reviewed on Apr 25, 2024
babumoger force-pushed the AMD-Max_cores_fix-0.1 branch from 244a252 to c25bd6d on April 25, 2024 16:07
rkanagar reviewed on Apr 30, 2024
The pqos tool fails with the following errors on systems with 300 or more CPU cores.

$pqos
NOTE: Mixed use of MSR and kernel interfaces to manage
CAT or CMT & MBM may lead to unexpected behavior.
ERROR: Could not open /sys/fs/resctrl directory
ERROR: Failed to stop resctrl events
ERROR: Failed to start all selected OS monitoring events
Monitoring start error on core(s) 339, status 1

By default, the file descriptor limit is set to 1024 for a session. pqos monitor uses 3 descriptors for each CPU for perf monitoring. So, it runs out of limit(1024) on systems with 300 or more CPUs.

Fix the issue by detecting the number of CPUs in the system and increasing the descriptor limit using system call getrlimit and setrlimit respectively. Increase the limit to 4 times the number of CPUs to take care of open files limit.

Signed-off-by: Babu Moger <babu.moger@amd.com>
babumoger force-pushed the AMD-Max_cores_fix-0.1 branch from c25bd6d to 42475e2 on April 30, 2024 19:00
Description
By default, the open file descriptor limit for a session is 1024. pqos monitoring uses 3 descriptors per CPU for perf monitoring, so it exhausts that limit on systems with 300 or more CPUs.
Fix the issue by detecting the number of CPUs in the system, reading the current descriptor limit with the getrlimit system call, and raising it with setrlimit. The limit is raised to 4 times the number of CPUs so that the open files limit is not hit.
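
For readers unfamiliar with the two system calls, here is a minimal sketch of the approach described above. It is not the code from this PR: the helper names `wanted_fd_limit`, `raise_fd_limit` and `raise_fd_limit_for_system` are hypothetical, and using `sysconf(_SC_NPROCESSORS_CONF)` to count CPUs is an assumption rather than the library's actual detection method. As a rough check of the arithmetic, at 3 descriptors per CPU, 342 CPUs alone would need 1026 descriptors, which is already above 1024.

```c
#include <assert.h>
#include <sys/resource.h>
#include <unistd.h>

/* Hypothetical helper: descriptor budget of 4 per CPU, as in the description. */
static rlim_t wanted_fd_limit(long num_cpus)
{
        assert(num_cpus > 0);
        return (rlim_t)num_cpus * 4;
}

/*
 * Hypothetical helper: raise the soft RLIMIT_NOFILE to 4 * num_cpus.
 * Returns 0 on success (or if the limit is already high enough), -1 on error.
 */
static int raise_fd_limit(long num_cpus)
{
        struct rlimit rl;
        rlim_t wanted;

        if (num_cpus <= 0)
                return -1;

        wanted = wanted_fd_limit(num_cpus);

        if (getrlimit(RLIMIT_NOFILE, &rl) != 0)
                return -1;

        /* Nothing to do when the current soft limit already covers the budget. */
        if (rl.rlim_cur >= wanted)
                return 0;

        /* An unprivileged process cannot raise the soft limit past the hard limit. */
        if (rl.rlim_max != RLIM_INFINITY && wanted > rl.rlim_max)
                wanted = rl.rlim_max;

        rl.rlim_cur = wanted;
        return setrlimit(RLIMIT_NOFILE, &rl);
}

/* Assumption: the configured CPU count from sysconf() stands in for CPU detection. */
static int raise_fd_limit_for_system(void)
{
        return raise_fd_limit(sysconf(_SC_NPROCESSORS_CONF));
}
```

A caller would invoke `raise_fd_limit_for_system()` once during initialization, before any per-CPU perf descriptors are opened. Note that when the hard limit is lower than the requested value, this sketch settles for the hard limit, so monitoring could still run out of descriptors on very large systems without elevated privileges.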
Affected parts
Motivation and Context
#261
How Has This Been Tested?
Types of changes
Checklist: