This repository was archived by the owner on May 12, 2021. It is now read-only.

TAJO-527: Upgrade to Netty 4#311

Closed
ykrips wants to merge 67 commits into apache:master from ykrips:TAJO-527

Conversation

ykrips commented Dec 19, 2014

This is a first attempt at upgrading Netty. I have not optimized the code yet. However, applying this change is not easy, and I would like to hear suggestions from anyone who is interested in this patch.

ykrips (Author) commented Dec 20, 2014

I did not catch this failure when I ran the test cases on my laptop. I will dig into it.

hyunsik (Member) commented Dec 20, 2014

No problem :)

ykrips (Author) commented Dec 21, 2014

The Travis test timed out, and this caused my test build to fail. It may take more time to figure out what is wrong with the build.

Your test run exceeded 50 minutes.

jinossy (Member) commented Dec 22, 2014

@ykrips
Don't worry about it. I will also investigate the problem.

hyunsik (Member) commented Dec 22, 2014

Thank you for the nice work. It looks awesome.

Since this change may affect the entire Tajo system, review and testing in real environments will take a longer time. So I think it will be merged into the next release instead of 0.10.

jinossy (Member) commented Dec 22, 2014

@hyunsik
I agree with you. We need more review and testing.

jinossy (Member) commented Dec 22, 2014

I ran the tests on my MacBook and got an RPC hang in TestAsyncRpc.
@ykrips
Could you check the HashedWheelTimer in TaskRunnerManager?

"main" prio=5 tid=7fabff000800 nid=0x10ff62000 waiting on condition [10ff60000]
   java.lang.Thread.State: WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <7f3e47578> (a java.util.concurrent.Semaphore$NonfairSync)
        at java.util.concurrent.locks.LockSupport.park(LockSupport.java:156)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:811)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:969)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1281)
        at java.util.concurrent.Semaphore.acquire(Semaphore.java:286)
        at org.apache.tajo.rpc.CallFuture.get(CallFuture.java:70)
        at org.apache.tajo.rpc.TestAsyncRpc.testStubDisconnected(TestAsyncRpc.java:263)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
        at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
        at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
        at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
        at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
        at org.junit.rules.RunRules.evaluate(RunRules.java:20)
        at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
        at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
        at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
        at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
        at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
        at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
        at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
        at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
        at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
        at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:264)
        at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
        at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
        at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
        at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
        at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)

ykrips (Author) commented Dec 22, 2014

@jinossy,
Thanks for posting your test results.
I have run the test cases with JDK 1.6 and JDK 1.7, and I did not see the RPC hang in TestAsyncRpc. Can you provide details on your test environment?
By the way, I removed the HashedWheelTimer in TaskRunnerManager, because Netty 4 does not accept a Timer object when creating a ReadTimeoutHandler instance; Netty 4 uses an internal scheduler to catch timeout events.
Anyway, thank you again; I will look through these timeout handlers. In some cases, I found that Netty 4 does not fire the timeout event.
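The move away from a shared HashedWheelTimer can be pictured with a stdlib-only sketch of the schedule-on-the-event-loop pattern: arm a delayed task on the channel's own scheduler, and cancel and re-arm it whenever a read arrives. This is a conceptual illustration, not Netty's actual ReadTimeoutHandler code; the class and method names below are hypothetical.

```java
import java.util.concurrent.*;

// Conceptual sketch: instead of a caller-supplied timer, the timeout task
// lives on a single-threaded scheduler analogous to a channel's event loop.
public class ReadTimeoutSketch {
    private final ScheduledExecutorService loop = Executors.newSingleThreadScheduledExecutor();
    private final long timeoutMillis;
    private final Runnable onTimeout;
    private ScheduledFuture<?> pending;

    public ReadTimeoutSketch(long timeoutMillis, Runnable onTimeout) {
        this.timeoutMillis = timeoutMillis;
        this.onTimeout = onTimeout;
        arm();
    }

    // Called whenever data is read: cancel the old timeout and arm a new one.
    public synchronized void onRead() {
        pending.cancel(false);
        arm();
    }

    private synchronized void arm() {
        pending = loop.schedule(onTimeout, timeoutMillis, TimeUnit.MILLISECONDS);
    }

    public void shutdown() { loop.shutdownNow(); }

    public static void main(String[] args) throws Exception {
        CountDownLatch fired = new CountDownLatch(1);
        ReadTimeoutSketch t = new ReadTimeoutSketch(100, fired::countDown);
        t.onRead();                              // a read re-arms the timer
        boolean timedOut = fired.await(1, TimeUnit.SECONDS);
        System.out.println("timeout fired: " + timedOut);  // true: no further reads arrived
        t.shutdown();
    }
}
```

Because the task is owned by the loop itself, there is no external Timer object to pass around or shut down separately, which matches the constructor change described above.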

jinossy (Member) commented Dec 22, 2014

@ykrips
Thanks for the quick response.
I think we need to handle the event after the RPC stub is disconnected.
My environment is the following:
OS X: 10.9.5
JDK: 1.7.0_67-b01

ykrips (Author) commented Dec 23, 2014

@jinossy,
Thank you for posting your environment.
I quickly set up a test environment on Mac OS X Yosemite with JDK 1.6.0_65 and got the same problem. This test case works well on Ubuntu 14.04, so I think this error might come from different thread management. I will look into this issue and post patches once I find the root cause.

ykrips (Author) commented Dec 29, 2014

Hello all,
I was away from the computer for several days; sorry for that. I have added a tricky parameter to the testStubDisconnected function. Mac OS X uses a completely different thread-management policy than Linux, so the other thread did not run at all. These test cases now pass with JDK 1.6.0_65 and 1.7.0_71 on Mac OS X Yosemite, and from this investigation I feel we need to change some of the code that uses an EventLoopGroup.

ykrips (Author) commented Dec 30, 2014

The first build test passed, but the second one did not. I will look into this error.

ykrips (Author) commented Feb 16, 2015

Hello all,
It has taken a long time to bring the Netty 4 library into the Tajo project. Finally, performance with Netty 4 has reached an acceptable level, and the errors in the Travis CI build have been resolved. Now, I think, it is time to discuss any missing points or potential issues.

jihoonson (Contributor) commented

@ykrips, thanks for your great work!
I have one question, just out of curiosity: how did you evaluate the performance with Netty 4?

ykrips (Author) commented Feb 16, 2015

Hello @jihoonson,
I did several things. First, I disabled Nagle's algorithm where possible. Enabling Nagle's algorithm reduces resource use on the network infrastructure, but it delays network transmission. Also, the Netty 4 team recommends not calling the flush() function frequently, but batching flushes also delays transmission. Second, I set the send and receive buffer sizes of servers and clients as large as possible; small buffer sizes also hurt network performance, because producers and consumers must wait until the buffer drains. Finally, I merged and refactored the source code to use a shared EventLoopGroup. Creating an object that is tightly coupled to operating-system resources is an expensive operation, and creating such objects frequently may lead to starvation of native memory and network resources.
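The first two tunings can be illustrated with plain java.net sockets. This is a stdlib-only sketch; the patch itself would set the equivalent Netty ChannelOption values (TCP_NODELAY, SO_SNDBUF, SO_RCVBUF) on its Bootstrap, and the class name SocketTuning below is hypothetical.

```java
import java.io.IOException;
import java.net.ServerSocket;
import java.net.Socket;

public class SocketTuning {
    public static void main(String[] args) throws IOException {
        // A loopback server socket on an ephemeral port, just so the client can connect.
        try (ServerSocket server = new ServerSocket(0);
             Socket client = new Socket("127.0.0.1", server.getLocalPort())) {
            // Disable Nagle's algorithm: small writes go out immediately
            // instead of being coalesced, trading bandwidth for latency.
            client.setTcpNoDelay(true);
            // Request larger send/receive buffers so writers are not
            // throttled waiting for buffer space to drain. The OS may
            // adjust the actual sizes, so we do not assert on them.
            client.setSendBufferSize(256 * 1024);
            client.setReceiveBufferSize(256 * 1024);
            System.out.println("tcpNoDelay: " + client.getTcpNoDelay());
        }
    }
}
```

The same latency-versus-throughput trade-off applies either way: TCP_NODELAY favors many small RPC messages, while larger buffers keep bulk shuffle transfers from stalling.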

ykrips (Author) commented Feb 26, 2015

@jinossy,
I would like to append additional patches for the RPC code using Netty 4. Would you please check these patches?

jinossy (Member) commented Feb 26, 2015

@ykrips
Sure, I will review them soon.

ykrips (Author) commented Feb 27, 2015

Alright. It will be fixed up soon.

hyunsik (Member) commented Mar 1, 2015

The patch looks good to me. To ensure its stability, it would be great if we could carry out some experiments with heavy queries on TB-sized data sets. Can anyone help with this kind of experiment?

ykrips (Author) commented Mar 1, 2015

@hyunsik, it would be great if we could run some stress tests on multi-node clusters. We need to find a test environment for this.

jinossy (Member) commented Mar 2, 2015

@ykrips
Could you fix the following errors? I ran TPC-H Q3.

  • Error 1
2015-03-02 11:41:19,399 WARN org.apache.tajo.rpc.RpcConnectionPool: Try to reconnect : server1/xxx.xxx.xxx.xxx:28091
2015-03-02 11:41:19,405 ERROR org.apache.tajo.rpc.AsyncRpcClient: server2/xxx.xxx.xxx.xxx:28091,class org.apache.tajo.ipc.TajoWorkerProtocol,java.nio.channels.ClosedCh
annelException
com.google.protobuf.ServiceException: java.nio.channels.ClosedChannelException
        at org.apache.tajo.rpc.AsyncRpcClient$ProxyRpcChannel$1.operationComplete(AsyncRpcClient.java:147)
        at org.apache.tajo.rpc.AsyncRpcClient$ProxyRpcChannel$1.operationComplete(AsyncRpcClient.java:142)
        at io.netty.util.concurrent.DefaultPromise.notifyListener0(DefaultPromise.java:680)
        at io.netty.util.concurrent.DefaultPromise.notifyListeners(DefaultPromise.java:567)
        at io.netty.util.concurrent.DefaultPromise.tryFailure(DefaultPromise.java:424)
        at io.netty.channel.AbstractChannel$AbstractUnsafe.safeSetFailure(AbstractChannel.java:754)
        at io.netty.channel.AbstractChannel$AbstractUnsafe.write(AbstractChannel.java:655)
        at io.netty.channel.DefaultChannelPipeline$HeadContext.write(DefaultChannelPipeline.java:1113)
        at io.netty.channel.AbstractChannelHandlerContext.invokeWrite(AbstractChannelHandlerContext.java:633)
        at io.netty.channel.AbstractChannelHandlerContext.access$1900(AbstractChannelHandlerContext.java:32)
        at io.netty.channel.AbstractChannelHandlerContext$AbstractWriteTask.write(AbstractChannelHandlerContext.java:908)
        at io.netty.channel.AbstractChannelHandlerContext$WriteAndFlushTask.write(AbstractChannelHandlerContext.java:960)
        at io.netty.channel.AbstractChannelHandlerContext$AbstractWriteTask.run(AbstractChannelHandlerContext.java:893)
        at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:380)
        at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:357)
        at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:116)
        at java.lang.Thread.run(Thread.java:745)
Caused by: java.nio.channels.ClosedChannelException
2015-03-02 11:41:19,407 ERROR org.apache.tajo.master.ContainerProxy: Connect error to server3/xxx.xxx.xxx.xxx:28091 caused by
io.netty.channel.ConnectTimeoutException: Connect error to server3/xxx.xxx.xxx.xxx:28091 caused by
        at org.apache.tajo.rpc.NettyClientBase.handleConnectionInternally(NettyClientBase.java:93)
        at org.apache.tajo.rpc.NettyClientBase.connect(NettyClientBase.java:103)
        at org.apache.tajo.rpc.RpcConnectionPool.getConnection(RpcConnectionPool.java:96)
        at org.apache.tajo.master.TajoContainerProxy.assignExecutionBlock(TajoContainerProxy.java:105)
        at org.apache.tajo.master.TajoContainerProxy.launch(TajoContainerProxy.java:75)
        at org.apache.tajo.worker.TajoResourceAllocator$LaunchRunner.run(TajoResourceAllocator.java:210)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
  • Error 2
2015-03-02 11:41:19,406 ERROR org.apache.tajo.rpc.AsyncRpcClient: null,class org.apache.tajo.ipc.TajoWorkerProtocol,java.lang.UnsupportedOperationException: unsupported message type:
 RpcProtos$RpcRequest (expected: ByteBuf, FileRegion)
com.google.protobuf.ServiceException: java.lang.UnsupportedOperationException: unsupported message type: RpcProtos$RpcRequest (expected: ByteBuf, FileRegion)
        at org.apache.tajo.rpc.AsyncRpcClient$ProxyRpcChannel$1.operationComplete(AsyncRpcClient.java:147)
        at org.apache.tajo.rpc.AsyncRpcClient$ProxyRpcChannel$1.operationComplete(AsyncRpcClient.java:142)
        at io.netty.util.concurrent.DefaultPromise.notifyListener0(DefaultPromise.java:680)
        at io.netty.util.concurrent.DefaultPromise.notifyListeners(DefaultPromise.java:567)
        at io.netty.util.concurrent.DefaultPromise.tryFailure(DefaultPromise.java:424)
        at io.netty.channel.AbstractChannel$AbstractUnsafe.safeSetFailure(AbstractChannel.java:754)
        at io.netty.channel.AbstractChannel$AbstractUnsafe.write(AbstractChannel.java:669)
        at io.netty.channel.DefaultChannelPipeline$HeadContext.write(DefaultChannelPipeline.java:1113)
        at io.netty.channel.AbstractChannelHandlerContext.invokeWrite(AbstractChannelHandlerContext.java:633)
        at io.netty.channel.AbstractChannelHandlerContext.access$1900(AbstractChannelHandlerContext.java:32)
        at io.netty.channel.AbstractChannelHandlerContext$AbstractWriteTask.write(AbstractChannelHandlerContext.java:908)
        at io.netty.channel.AbstractChannelHandlerContext$WriteAndFlushTask.write(AbstractChannelHandlerContext.java:960)
        at io.netty.channel.AbstractChannelHandlerContext$AbstractWriteTask.run(AbstractChannelHandlerContext.java:893)
        at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:380)
        at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:357)
        at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:116)
        at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.UnsupportedOperationException: unsupported message type: RpcProtos$RpcRequest (expected: ByteBuf, FileRegion)
        at io.netty.channel.nio.AbstractNioByteChannel.filterOutboundMessage(AbstractNioByteChannel.java:280)
        at io.netty.channel.AbstractChannel$AbstractUnsafe.write(AbstractChannel.java:663)
        ... 10 more

Review comment (Member):

Can you add a file check?
if (PullServerUtil.isNativeIOPossible() && manageOsCache && count() > 0 && super.isOpen())

It will fix the "bad file descriptor" error:

2015-03-03 10:34:40,755 WARN org.apache.tajo.pullserver.PullServerUtil: Failed to manage OS cache for /data05/tajo/data/q_1425346386770_0001/output/1/hash-shuffle/3/263
java.lang.NullPointerException
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at org.apache.tajo.pullserver.PullServerUtil.posixFadviseIfPossible(PullServerUtil.java:56)
        at org.apache.tajo.pullserver.FadvisedFileRegion.transferSuccessful(FadvisedFileRegion.java:163)
        at org.apache.tajo.pullserver.FileCloseListener.operationComplete(FileCloseListener.java:46)

Review comment (Author):

@jinossy,
Thanks for posting test results on Netty 4. I'll commit the fix soon.

Review comment (Author):

Interesting... the Netty team added an isOpen() API to the DefaultFileRegion class in 4.0.25.Final. It would be good to use the Netty API to check whether this FileRegion has been deallocated.

jinossy (Member) commented Mar 3, 2015

I've successfully tested with real data on my company cluster.

  • ENV
    • 2 TajoMaster + 4 TajoWorker
    • JDK 1.7.0_67
    • 1G Network
JSON table
2 TB compressed with Snappy
7.3 TB actual bytes

select count(*) from (select id from table1 group by id) t1;
Progress: 100%, response time: 3546.781 sec
?count
-------------------------------
2802809536
(1 rows, 3546.781 sec, 11 B selected)


Parquet table
8.1 TB compressed with Snappy
select count(*) from table2
Progress: 100%, response time: 374.358 sec
?count
-------------------------------
16090817643
(1 rows, 374.358 sec, 12 B selected)

jinossy (Member) commented Mar 3, 2015

+1
I greatly appreciate your effort.
Thank you!

@asfgit asfgit closed this in 22876a8 Mar 3, 2015