Skip to content

Conversation

@brusdev
Copy link
Member

@brusdev brusdev commented Jun 26, 2023

ARTEMIS-4332 Add management method to close stuck server sessions
In rare cases a store operation could silently fails or starves, blocking the related server session and all delivering messages. Those server sessions can be closed adding a management method that cleans their operation context before closing them.

This PR also includes an optional commit to track store operations that would help to analyze heap dumps with stuck server sessions: ARTEMIS-4332 Add store operation trackers

Copy link
Member

@gemmellr gemmellr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Adding JIRA ref in PR title would be good.

@brusdev brusdev changed the title Close stuck sessions ARTEMIS-4332 Add management method to close stuck server sessions Jun 28, 2023
@brusdev brusdev force-pushed the close_stuck_sessions branch from e79c199 to 6714bb7 Compare July 3, 2023 08:03
Copy link
Member

@gemmellr gemmellr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM other than the slight nitpick. I'd like someone else to review it too though.

@brusdev brusdev force-pushed the close_stuck_sessions branch from 6714bb7 to a3a3f24 Compare July 3, 2023 15:10
@brusdev
Copy link
Member Author

brusdev commented Jul 3, 2023

@gemmellr thanks for your review

@brusdev brusdev force-pushed the close_stuck_sessions branch from a3a3f24 to ce76651 Compare July 7, 2023 03:11
brusdev added 2 commits July 7, 2023 17:48
In rare cases a store operation could silently fails or starves, blocking the
related server session and all delivering messages. Those server sessions can
be closed adding a management method that cleans their operation context
before closing them.
@brusdev brusdev force-pushed the close_stuck_sessions branch from ce76651 to 9b598aa Compare July 7, 2023 15:48
@clebertsuconic
Copy link
Contributor

@brusdev only thing I could ask now is to verify if the new test over Core is failing in a loop or not? We had a few overCore tests failing in management because of noise from the Core itself (the testing framework itself is creating a connection and a session in the server's).

Other than that I think we should merge this.

@clebertsuconic clebertsuconic merged commit 451d03f into apache:main Jul 13, 2023
Copy link

@rvais rvais left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tests should be enough for this one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants