Job exiting due to SCR_FINALIZE_CALLED #13

Closed
lwan86 opened this issue Oct 24, 2016 · 3 comments · Fixed by #37

lwan86 commented Oct 24, 2016

My code worked well with a previous version of SCR (I can't remember which version).
Last week I upgraded the SCR source code to the newest version in the GitHub repo and recompiled it. However, when I try to run my code, it always exits before SCR_Init() finishes. The debugging info says the job exits because of "SCR_FINALIZE_CALLED". Why is SCR_Finalize called during SCR_Init()? I traced the problem to the function "scr_bool_check_halt_and_decrement()", but I don't understand why it happens. I was wondering if someone could help me with this. Thanks!

@adammoody
Contributor

Hi @lwan86. You likely just need to remove the SCR_FINALIZE_CALLED condition from the halt file or just delete the halt file. You can do that by running the following command:

scr_halt --remove

This behavior is confusing at first, but it's actually by design. We use the call to SCR_Finalize() as the way for the application to signal that it has completed. Without such a signal, the SCR library and its associated scripting would assume the job ended unexpectedly and continue to restart it forever. SCR_Finalize acts as a break in that infinite loop. It does this by recording the fact that SCR_Finalize was called in the halt file. If the library detects this condition in the halt file, it will not restart the application.
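
As a rough illustration of the mechanism described above, here is a toy model in C. It does not call the SCR API and does not use SCR's real halt file format: the one-line file contents and the names toy_finalize, toy_should_halt, and toy_remove_halt are hypothetical stand-ins chosen for this sketch.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Assumed marker text; SCR's actual halt file layout is not modeled here. */
#define TOY_HALT_REASON "SCR_FINALIZE_CALLED"

/* Stand-in for SCR_Finalize: record the exit reason in the halt file.
 * Returns 0 on success, -1 if the file cannot be written. */
static int toy_finalize(const char *halt_path)
{
    assert(halt_path != NULL);
    FILE *f = fopen(halt_path, "w");
    if (f == NULL) {
        return -1;
    }
    fprintf(f, "%s\n", TOY_HALT_REASON);
    fclose(f);
    return 0;
}

/* Stand-in for the startup check: returns 1 if the halt file says the
 * previous run finalized (so the job should stop), 0 otherwise. */
static int toy_should_halt(const char *halt_path)
{
    assert(halt_path != NULL);
    char line[128];
    int halt = 0;
    FILE *f = fopen(halt_path, "r");
    if (f == NULL) {
        return 0; /* no halt file, so nothing blocks the run */
    }
    while (fgets(line, sizeof line, f) != NULL) {
        line[strcspn(line, "\r\n")] = '\0'; /* strip the line ending */
        if (strcmp(line, TOY_HALT_REASON) == 0) {
            halt = 1;
            break;
        }
    }
    fclose(f);
    return halt;
}

/* Stand-in for clearing the condition by deleting the halt file. */
static void toy_remove_halt(const char *halt_path)
{
    assert(halt_path != NULL);
    remove(halt_path);
}
```

In this model, a run that ends with toy_finalize leaves the marker behind, so the next toy_should_halt call on the same path reports that the job should stop. Calling toy_remove_halt in between clears the stale marker, which is why removing the halt file lets a rerun get past startup.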

Although this is by design, it does lead to lots of confusion. I'll keep this ticket open as a reminder to revisit this to improve things.

@lwan86
Author

lwan86 commented Oct 26, 2016

Thanks for your suggestion, @adammoody! I deleted the halt.scr file directly and it works now.

@adammoody
Contributor

Due to massive confusion over this "feature", we've disabled SCR_Init from bailing if SCR_FINALIZE_CALLED is set in the halt file. We'll assume the job was restarted intentionally in this case. The fix is available starting with commit bc040fe.
