Skip to content

#618: [Speedup] Disable ldconfig on nvidia driver configuration.#634

Merged
theyoprst merged 2 commits intodevfrom
issue-618/flock-ldconfig
Apr 3, 2025
Merged

#618: [Speedup] Disable ldconfig on nvidia driver configuration.#634
theyoprst merged 2 commits intodevfrom
issue-618/flock-ldconfig

Conversation

@theyoprst
Copy link
Collaborator

@theyoprst theyoprst commented Apr 2, 2025

nvidia-container-cli configure runs for 2-2.5s on average and it was run under flock only to prevent races during ldconfig calls (it failed which resulted in a pod restart).

Since ldconfig is run separately under other flock, it was removed from the configuration call with the flock altogether.
It should speed up the workers statefulset restart by 2-2.5s multiplied by the number of workers.

@theyoprst theyoprst force-pushed the issue-618/flock-ldconfig branch from c95bef2 to 5e29d41 Compare April 2, 2025 14:53
@theyoprst theyoprst added bug Something isn't working enhancement New feature or request and removed bug Something isn't working labels Apr 2, 2025
@theyoprst theyoprst marked this pull request as ready for review April 2, 2025 14:55
@theyoprst theyoprst changed the title #618: Disable ldconfig on nvidia driver configuration. #618: [Speedup] Disable ldconfig on nvidia driver configuration. Apr 2, 2025
asteny
asteny previously approved these changes Apr 2, 2025
@theyoprst theyoprst merged commit 5b4c98e into dev Apr 3, 2025
2 checks passed
@asteny asteny deleted the issue-618/flock-ldconfig branch June 23, 2025 13:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants