restart notification #40

Merged
PaulJKathmann merged 7 commits into develop from pk/notify_restart on Oct 13, 2025

Conversation

@PaulJKathmann (Contributor) commented Jul 28, 2025

Before this PR

If the user code crashes, the forwarder assumes it is still running until it posts a result for the same jobId. After restarting, however, the user code knows nothing about the previously failed job, so it never reports on it. As a result, a node/module can remain blocked from receiving new requests after a restart.

After this PR

We inform the forwarder whenever a node starts up, so that it removes every existing job for that node that it still believes is running.
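To make the before/after behavior concrete, here is a minimal sketch of the bookkeeping described above. It is an illustrative model only, not the forwarder's actual implementation: the class `ForwarderJobTracker` and all of its method names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Set


class ForwarderJobTracker:
    """Hypothetical model of the forwarder's view of in-flight jobs."""

    def __init__(self) -> None:
        # node id -> jobIds the forwarder believes are still running
        self._running: Dict[str, Set[str]] = defaultdict(set)

    def job_started(self, node_id: str, job_id: str) -> None:
        self._running[node_id].add(job_id)

    def result_posted(self, node_id: str, job_id: str) -> None:
        # Normal path: the user code reports a result for the same jobId.
        self._running[node_id].discard(job_id)

    def node_restarted(self, node_id: str) -> Set[str]:
        # Restart path: drop every job still attributed to the node,
        # since the restarted user code will never report on them.
        return self._running.pop(node_id, set())

    def is_busy(self, node_id: str) -> bool:
        return bool(self._running.get(node_id))
```

In this model, a crash after `job_started` leaves `is_busy` true indefinitely unless `node_restarted` is called, which is the blocked state this PR is meant to avoid.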

Possible downsides?

Are Docs needed?

@PaulJKathmann marked this pull request as ready for review July 28, 2025 19:17
Comment thread README.md Outdated
Comment thread compute_modules/client/internal_query_client.py Outdated
call schemas before restart endpoint

change order in which we call restart and schemas endpoint

Add generated changelog entries

feature flag the restart notification

documentation on how to use feature flag

make default True

readme -> default True

Update pr-40.v2.yml
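
The feature-flag commits above (flag added, default switched to True) can be pictured with the following sketch. It assumes the flag is read from an environment variable, which the conversation does not confirm; the variable name `HYPOTHETICAL_NOTIFY_RESTART` and both function names are placeholders, not the library's API.

```python
import os
from typing import Callable, Mapping


def restart_notification_enabled(env: Mapping[str, str] = os.environ) -> bool:
    # HYPOTHETICAL_NOTIFY_RESTART is a placeholder name, not the real flag.
    # An unset flag counts as enabled, mirroring the "make default True" commit.
    raw = env.get("HYPOTHETICAL_NOTIFY_RESTART", "true")
    return raw.strip().lower() not in {"0", "false", "no", "off"}


def on_startup(
    notify_restart: Callable[[], None],
    env: Mapping[str, str] = os.environ,
) -> bool:
    # Call the notifier only when the flag allows it; report whether it ran.
    if restart_notification_enabled(env):
        notify_restart()
        return True
    return False
```

With a default of True, users get the restart notification unless they explicitly opt out.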
@palantir palantir deleted a comment from changelog-app Bot Sep 2, 2025
Comment thread compute_modules/client/internal_query_client.py Outdated
@scottkruyswyk (Contributor) left a comment

A few nits but otherwise looks good!

Comment thread README.md Outdated
Comment thread changelog/@unreleased/pr-40.v2.yml Outdated
Comment thread compute_modules/client/internal_query_client.py
PaulJKathmann and others added 2 commits October 10, 2025 17:09
Co-authored-by: Scott Kruyswyk <skruyswyk@palantir.com>
Co-authored-by: Scott Kruyswyk <skruyswyk@palantir.com>
@PaulJKathmann merged commit ac5c2ac into develop Oct 13, 2025
11 checks passed

3 participants