New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[presto 0.206] Exception while running TPCH queries with 500GB data. #11232
Comments
Could you post explain of the query? |
I cannot increase the nodes. How to increase the task concurrency?
…On Fri 10 Aug, 2018, 2:13 PM Karol Sobczak, ***@***.***> wrote:
Could you post explain of the query?
It seems that there is too many row for each HashBuilderOperator operator
instance. You could try increasing number of nodes or increasing task
concurrency.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#11232 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AFndfDNkcnQlU3pGUDeUlTi2ptYSMjOsks5uPUfOgaJpZM4V1igw>
.
|
|
I used task.concurrency = 64. with this instead of failing in a minute, it failed after 3 minutes. And below is the explain result that you asked. presto:tpchcarbon_default> explain select n_name, sum(l_extendedprice * (1 - l_discount)) as revenue from customer, orders, lineitem, supplier, nation, region where c_custkey = o_custkey and l_orderkey = o_orderkey and l_suppkey = s_suppkey and c_nationkey = s_nationkey and s_nationkey = n_nationkey and n_regionkey = r_regionkey and r_name = 'ASIA' and o_orderdate >= date('1994-01-01') and o_orderdate < date('1995-01-01') group by n_name order by revenue desc;
|
I have attached the explain results. Even with task.concurrency=64. It
didn't work
…On Fri 10 Aug, 2018, 2:33 PM Karol Sobczak, ***@***.***> wrote:
set session task_concurrency=64 for instance
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#11232 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AFndfJ3dd8i7JuWb1Gw_Ynqxcy-QXX_8ks5uPUxygaJpZM4V1igw>
.
|
+1 |
Hi, I am running TPCH queries with 500GB data on 3 node cluster [each node has 150GB query memory with 48 core CPU].
I have my own carbondata connector with presto.
I am using presto [0.206]
I got below exception for 5 queries out of 22 queries.
Is anyone familiar with this exception call stack ? what is the workaround ?
java.lang.IllegalArgumentException: Too large (897278064 expected elements with load factor 0.75)
at it.unimi.dsi.fastutil.HashCommon.arraySize(HashCommon.java:160)
at com.facebook.presto.operator.PagesHash.(PagesHash.java:63)
at com.facebook.presto.operator.JoinHashSupplier.(JoinHashSupplier.java:70)
at com.facebook.presto.operator.PagesIndex.createLookupSourceSupplier(PagesIndex.java:512)
at com.facebook.presto.operator.HashBuilderOperator.buildLookupSource(HashBuilderOperator.java:589)
at com.facebook.presto.operator.HashBuilderOperator.finishInput(HashBuilderOperator.java:486)
at com.facebook.presto.operator.HashBuilderOperator.finish(HashBuilderOperator.java:442)
at com.facebook.presto.operator.Driver.processInternal(Driver.java:393)
at com.facebook.presto.operator.Driver.lambda$processFor$8(Driver.java:282)
at com.facebook.presto.operator.Driver.tryWithLock(Driver.java:672)
at com.facebook.presto.operator.Driver.processFor(Driver.java:276)
at com.facebook.presto.execution.SqlTaskExecution$DriverSplitRunner.processFor(SqlTaskExecution.java:973)
at com.facebook.presto.execution.executor.PrioritizedSplitRunner.process(PrioritizedSplitRunner.java:162)
at com.facebook.presto.execution.executor.TaskExecutor$TaskRunner.run(TaskExecutor.java:477)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
The text was updated successfully, but these errors were encountered: