job-manager: get jobspec from KVS on restart #3396

Merged
merged 2 commits into from
Dec 4, 2020

Conversation

@garlick garlick commented Dec 3, 2020

Problem: recent changes to cache jobspec in job manager
neglected to restore jobspec when job manager is restarted.

Look up jobspec in parallel with eventlog when replaying
job eventlogs during job manager startup.
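
As a rough illustration of the approach described above (not the actual change, which lives in the C sources under src/modules/job-manager), the sketch below issues the eventlog and jobspec lookups for one job concurrently and restores the job only once both have completed. The in-memory `FAKE_KVS` dictionary, the key names, and the helpers `kvs_lookup`, `replay_job`, and `restore` are all hypothetical stand-ins chosen for the example.

```python
import asyncio

# Hypothetical in-memory stand-in for the KVS; key names are illustrative only.
FAKE_KVS = {
    "job.1.eventlog": '{"timestamp":1.0,"name":"submit"}\n',
    "job.1.jobspec": '{"version":1}',
}


async def kvs_lookup(key):
    """Simulate one asynchronous KVS lookup (hypothetical helper)."""
    await asyncio.sleep(0)  # yield to the event loop, as a real RPC would
    return FAKE_KVS[key]


async def replay_job(job_dir):
    """Fetch eventlog and jobspec concurrently, then build the job record."""
    eventlog, jobspec = await asyncio.gather(
        kvs_lookup(f"{job_dir}.eventlog"),
        kvs_lookup(f"{job_dir}.jobspec"),
    )
    # Both values are available here, so the cached jobspec is restored
    # together with the state replayed from the eventlog.
    return {"eventlog": eventlog, "jobspec": jobspec}


def restore(job_dir):
    """Synchronous entry point for the sketch."""
    return asyncio.run(replay_job(job_dir))
```

Calling `restore("job.1")` returns a dictionary holding both the eventlog text and the jobspec. If either lookup fails, `asyncio.gather` propagates the error, so a job is never restored with only one of the two.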

@chu11 chu11 left a comment

LGTM!

garlick commented Dec 4, 2020

Restarted centos 7 builder that had hung in the "hydra launches flux" test. That might be the hang we've been seeing on centos 7. Anyway, thanks for the review - I'll set MWP.

codecov bot commented Dec 4, 2020

Codecov Report

Merging #3396 (c0f02e8) into master (de460f6) will increase coverage by 0.04%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #3396      +/-   ##
==========================================
+ Coverage   81.93%   81.98%   +0.04%     
==========================================
  Files         301      301              
  Lines       46445    46462      +17     
==========================================
+ Hits        38055    38091      +36     
+ Misses       8390     8371      -19     
| Impacted Files | Coverage Δ |
|---|---|
| src/modules/job-manager/job.c | 96.49% <100.00%> (+0.57%) ⬆️ |
| src/modules/job-manager/restart.c | 86.60% <100.00%> (+3.11%) ⬆️ |
| src/broker/modservice.c | 70.67% <0.00%> (-0.76%) ⬇️ |
| src/common/libflux/message.c | 83.60% <0.00%> (-0.13%) ⬇️ |
| src/modules/job-info/guest_watch.c | 76.72% <0.00%> (+0.57%) ⬆️ |
| src/modules/job-archive/job-archive.c | 59.68% <0.00%> (+0.79%) ⬆️ |
| src/broker/module.c | 76.74% <0.00%> (+1.58%) ⬆️ |
| src/common/libpmi/pmi.c | 93.39% <0.00%> (+1.88%) ⬆️ |
| src/common/libpmi/simple_client.c | 89.61% <0.00%> (+3.27%) ⬆️ |

@mergify mergify bot merged commit 2bab67f into flux-framework:master Dec 4, 2020
@garlick garlick deleted the restart_jobspec branch February 8, 2021 13:46