Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When running NDS 2.0 on Dataproc, the Spark eventlog may have duplicate query IDs #33

Closed
viadea opened this issue Jun 25, 2022 · 5 comments

Comments

@viadea
Copy link
Collaborator

viadea commented Jun 25, 2022

When running NDS 2.0 on Dataproc, the Spark eventlog may have duplicate query IDs.

For example, the Spark eventlog may show as:
query66
query66
query23_part2
query23_part2

This is very confusing. I can reproduce each time on Dataproc.

@wjxiz1992
Copy link
Collaborator

Is this only happening in Dataproc? Is it possible to have a look at the Spark history page for it, I'm wondering if it's related to https://github.com/NVIDIA/spark-rapids-benchmarks/blob/branch-22.06/nds/nds_power.py#L208-L209

@wjxiz1992
Copy link
Collaborator

22/06/28 08:09:58 ERROR org.apache.spark.scheduler.AsyncEventQueue: Dropping event from queue shared. This likely means one of the listeners is too slow and cannot keep up with the rate at which tasks are being started by the scheduler.

I saw such error when running on Dataproc environment. Still not sure if it's the cause.

@wjxiz1992
Copy link
Collaborator

This should be related to #37

@viadea
Copy link
Collaborator Author

viadea commented Jul 1, 2022

So far i only saw this in dataproc. My standalone cluster looks good.

@wjxiz1992
Copy link
Collaborator

#41 resolves this issue according to my latest tests on Dataproc. It's due to the dropped event error mentioned above. I'm running more rounds tests to confirm, but it's pretty much like that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants