TAJO-527: Upgrade to Netty 4#311
Conversation
|
I did not catch up this failure when I ran test cases on my laptop. I will dig out this test failure. |
|
No problem :) |
|
Travis test has timed out and this issue led my test build failed. This issue may require more time to figure out what is wrong on my test build. |
|
@ykrips |
|
Thank you for nice work. It looks awesome. Since this work may affect an entire Tajo system, the review and test on real environments will take longer time. So, I think that it will be merged to next release instead 0.10. |
|
@hyunsik |
|
I ran test on my macbook. I got rpc hangs in TestAsyncRpc |
|
@jinossy , |
|
@ykrips |
|
@jinossy, |
|
Hello All, |
|
First build test passed, but second one did not. I will look through this error. |
|
Hello All, |
|
@ykrips, thanks for your great work! |
|
Hello @jihoonson , |
|
@jinossy, |
|
@ykrips |
|
Alright. It will be fixed up soon. |
|
The patch looks nice to me. In order to ensure its stability, it would be great if we carry out some experiments with some heavy queries on TB-sized data sets. Anyone can help this kind of experiment? |
|
@hyunsik, it would be a great thing that we can run some stress tests on multiple-node clusters. We need to find out test environment for this test. |
|
@ykrips
|
There was a problem hiding this comment.
Can you add file checking ?
if (PullServerUtil.isNativeIOPossible() && manageOsCache && count() > 0 && super.isOpen())
It will fix the "bad file descriptor"
2015-03-03 10:34:40,755 WARN org.apache.tajo.pullserver.PullServerUtil: Failed to manage OS cache for /data05/tajo/data/q_1425346386770_0001/output/1/hash-shuffle/3/263
java.lang.NullPointerException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.tajo.pullserver.PullServerUtil.posixFadviseIfPossible(PullServerUtil.java:56)
at org.apache.tajo.pullserver.FadvisedFileRegion.transferSuccessful(FadvisedFileRegion.java:163)
at org.apache.tajo.pullserver.FileCloseListener.operationComplete(FileCloseListener.java:46)
There was a problem hiding this comment.
@jinossy,
Thanks for posting test results on Netty4. I'll commit it soon.
There was a problem hiding this comment.
Interesting... Netty team added isOpen() api on DefaultFileRegion class in 4.0.25.final. It could be good to use netty api for checking if this fileregion is deallocated or not.
|
I’ve successfully tested by real data on my company cluster.
|
|
+1 |
This is a first try to upgrade netty. I did not optimize code yet. However, this is not easy to apply this change, and I want to hear any suggestions from anyone who has a interest on this patch.