WFARQ-14 NPE in ServerSetupObserver.handleAfterUndeploy #85

rhusar · 2016-11-23T22:13:53Z

Jira
https://issues.jboss.org/browse/WFARQ-14

handleOnUndeploy not counting with the possibility of undeploy being fired by arq-core

rhusar · 2016-11-23T22:16:20Z

@rachmatowicz Could you please take this for a spin? This fixes the problem that QE reported with 2 test cases.

rachmatowicz · 2016-11-24T18:13:33Z

Hi Rado
Ran your fix against the testsuite. Same problem:

Tests in error:
ClusteredJPA2LCTestCase.org.jboss.as.test.clustering.cluster.jpa2lc.ClusteredJPA2LCTestCase » IndexOutOfBounds

rhusar · 2016-11-24T18:15:37Z

Ok, these are different issues. Can you try the other PR? If that one doesnt work either, lets open another Jira.

rachmatowicz · 2016-11-24T18:17:15Z

Which is the other PR?

rhusar · 2016-11-24T18:22:57Z

@rachmatowicz I sent you an email in the morning... the other pr is #86

rachmatowicz · 2016-11-24T19:02:00Z

Isn't the problem here that DistributableTestCase is not calling deploy/undeploy in pairs?

jamezp · 2016-12-14T15:51:03Z

I think this is correct, but before merging I'm attempting to see if I can figure out why it's being called twice on the same container.

rhusar · 2016-12-14T15:57:54Z

@jamezp I am thinking, can we discuss this maybe first? Since we have 4 different PR for the same class https://github.com/wildfly/wildfly-arquillian/pulls

One of the problems that I have is that there is no JavaDoc so the expected contract of the class is not clear to me.

jamezp · 2016-12-14T16:15:50Z

@rhusar Yes. A lot of what is happening is not quite clear. This one does seem okay because it's kind of just a "let's be safe". I do think we should try to figure out why it's being triggered twice for the same container though. To me that seems a bit odd.

As I understand this specific class is to handle the @ServerSetup. Before the deployment the setup is ran, after the undeploy the setup is torn down and the AfterClass is used to clean up any remaining setups.

I do kind of wonder if the observed events shouldn't be dealing with deployments in this case. It would be a bit of a change for manual mode tests though. Which now that I think about it, maybe that's why events are fired twice. I think these clustering tests are manual mode so maybe the undeploy is being fired twice.

jamezp · 2016-12-14T22:00:51Z

Okay I think I see this issue and this fix seems fine to me. What it looks like happens is the DistributableTestCase.testGracefulServeOnUndeploy undeploys the application, then after the test class the ClusterAbstractTestCase.afterTestMethod attempts to undeploy it again. I do think an undeploy should be safe to execute multiple times so I'd say this approach is fine.

You can work around this by checking the existence of the deployment before undeploying. I do think this is something we should add to the ArchiveDeployer, e.g. ArchiveDeployer.isDeployed(), as well as safely handle multiple undeploy's for the same deployment.

rhusar · 2016-12-16T17:40:31Z

@jamezp I think you are right.

TLDR; since this is affecting our day to day operation I would appreciate merging this, releasing a micro or Beta and update in WF/EAP. But this is still mostly a workaround.

I do think we should try to figure out why it's being triggered twice for the same container though. To me that seems a bit odd.

This is what we are doing every time actually for every test method, see:
https://github.com/wildfly/wildfly/blob/10.1.0.Final/testsuite/integration/clustering/src/test/java/org/jboss/as/test/clustering/cluster/ClusterAbstractTestCase.java#L99
This class doesn't care what was the state of the container (started,stopped,killed) it will always start it and make sure all deployments are undeployed.

Similarly before every test it deploys all the deployments and starts the containers so that ARQ injections work, see https://github.com/wildfly/wildfly/blob/10.1.0.Final/testsuite/integration/clustering/src/test/java/org/jboss/as/test/clustering/cluster/ClusterAbstractTestCase.java#L89

Note that this is not costly, as usually the servers are started so node start will return immediately.

As I understand this specific class is to handle the @serversetup.

Right, but the problem is that in https://github.com/wildfly/wildfly/blob/10.1.0.Final/testsuite/integration/clustering/src/test/java/org/jboss/as/test/clustering/cluster/web/DistributableTestCase.java there is no server setup! This is because the internal maps are leaking references, see fix somewhere along the lines https://github.com/wildfly/wildfly-arquillian/pull/86/files

I don't remember now what exactly are the steps to trigger the leasks, but the "deployed" map is never cleared (nulled) when things are undeployed and the after class doen't clear because setupTasksInForce is empty and afterTestClass() won't clear the maps. When I re-think about that not maybe a fix would be to fix that condition in afterTestClass.

The above PR cause tests like https://github.com/wildfly/wildfly/blob/10.1.0.Final/testsuite/integration/smoke/src/test/java/org/jboss/as/test/smoke/deployment/rar/tests/redeployment/ReDeploymentTestCase.java#L131 to fail, because these expect that the observer logic will be triggered only once on the first deployment -- is that the right contract of this ServerSetup API?

I do think an undeploy should be safe to execute multiple times so I'd say this approach is fine. You can work around this by checking the existence of the deployment before undeploying. I do think this is something we should add to the ArchiveDeployer, e.g. ArchiveDeployer.isDeployed(), as well as safely handle multiple undeploy's for the same deployment.

Right, that's part of the reason, we currently don't know what is deployed. I am not sure how this is implemented, but we would need to make sure that the deployer would be aware of deployments that were deployed before the container is stopped. Since this is in our test cases.

I have opened https://issues.jboss.org/browse/WFARQ-17 for that, see if this gets any traction.

jamezp · 2016-12-16T17:47:21Z

common/src/main/java/org/jboss/as/arquillian/container/ServerSetupObserver.java

+        if (count == null) {
+            // The deployment was already undeployed or never deployed
+            // AfterUnDeploy and BeforeUnDeploy events are fired by arquillian-core regardless of deployment status
+            return;


What do you think about adding some debug logging here just to indicate the container appears to have been removed and is being skipped?

I am hoping this won't be necessary if we can nail down the real cause (somewhere along the lines of #86 is attempting).

rhusar force-pushed the WFARQ-14 branch from f64e415 to b8c8c68 Compare November 23, 2016 22:27

jamezp reviewed Dec 16, 2016

View reviewed changes

jamezp force-pushed the WFARQ-14 branch from b8c8c68 to bea61ea Compare December 16, 2016 21:34

WFARQ-14 NPE in ServerSetupObserver.handleAfterUndeploy

70ce4c9

jamezp force-pushed the WFARQ-14 branch from bea61ea to 70ce4c9 Compare December 16, 2016 21:35

jamezp merged commit 65bcb46 into wildfly:master Dec 16, 2016

jamezp mentioned this pull request Dec 16, 2016

[WFARQ-14] Introduce sanity check for container deployment count. #84

Closed

rhusar deleted the WFARQ-14 branch December 19, 2016 15:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WFARQ-14 NPE in ServerSetupObserver.handleAfterUndeploy #85

WFARQ-14 NPE in ServerSetupObserver.handleAfterUndeploy #85

rhusar commented Nov 23, 2016 •

edited

rhusar commented Nov 23, 2016

rachmatowicz commented Nov 24, 2016 •

edited

rhusar commented Nov 24, 2016

rachmatowicz commented Nov 24, 2016

rhusar commented Nov 24, 2016 •

edited

rachmatowicz commented Nov 24, 2016

jamezp commented Dec 14, 2016

rhusar commented Dec 14, 2016

jamezp commented Dec 14, 2016

jamezp commented Dec 14, 2016

rhusar commented Dec 16, 2016 •

edited

jamezp Dec 16, 2016

rhusar Dec 19, 2016

WFARQ-14 NPE in ServerSetupObserver.handleAfterUndeploy #85

WFARQ-14 NPE in ServerSetupObserver.handleAfterUndeploy #85

Conversation

rhusar commented Nov 23, 2016 • edited

rhusar commented Nov 23, 2016

rachmatowicz commented Nov 24, 2016 • edited

rhusar commented Nov 24, 2016

rachmatowicz commented Nov 24, 2016

rhusar commented Nov 24, 2016 • edited

rachmatowicz commented Nov 24, 2016

jamezp commented Dec 14, 2016

rhusar commented Dec 14, 2016

jamezp commented Dec 14, 2016

jamezp commented Dec 14, 2016

rhusar commented Dec 16, 2016 • edited

jamezp Dec 16, 2016

Choose a reason for hiding this comment

rhusar Dec 19, 2016

Choose a reason for hiding this comment

rhusar commented Nov 23, 2016 •

edited

rachmatowicz commented Nov 24, 2016 •

edited

rhusar commented Nov 24, 2016 •

edited

rhusar commented Dec 16, 2016 •

edited