Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SUPPORT]RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...] #2723

Closed
liijiankang opened this issue Mar 26, 2021 · 11 comments
Assignees
Labels
priority:critical production down; pipelines stalled; Need help asap.

Comments

@liijiankang
Copy link

Describe the problem you faced
We use Structured Streaming to subscribe to the data in Kafka, and then write the data to the hoodie, the program will stop abnormally after running for a period of time.

Environment Description

  • Hudi version :

  • Spark version :

  • Hive version :

  • Hadoop version :

  • Storage (HDFS/S3/GCS..) :HDFS

  • Running on Docker? (yes/no) :no

Add the stacktrace of the error.

1
2
3

@liijiankang
Copy link
Author

Hudi version :0.6.0

Spark version :2.4.0+cdh6.2.0

Hive version :2.1.1+cdh6.2.0

Hadoop version :3.0.0+cdh6.2.0

@bvaradar
Copy link
Contributor

This does look like some exception coming from cleaner. Can you look around the logs to see if there are more stack traces related to this ?

Can you try setting hoodie.clean.async=False and see ?

@liijiankang
Copy link
Author

4
Thanks,I will test it .If this exception comes from a cleaner, do we have to set hoodie.clean.async = false?

@nsivabalan
Copy link
Contributor

@bvaradar : once you respond, can you please remove "awaiting-user-response" label for the issue. If possible add "awaiting-community-help" label.

@bvaradar
Copy link
Contributor

@liijiankang : No, My intention was to make the cleaner synchronous so that the spark job can fail immediately after encountering the exception. It is purely to debug the issue and not a solution.

@liijiankang
Copy link
Author

@bvaradar I set hoodie.clean.async to false and this exception did not occur again.

@n3nash n3nash added the priority:critical production down; pipelines stalled; Need help asap. label Apr 8, 2021
@bvaradar
Copy link
Contributor

@liijiankang : This exception is also seen when shutting down service which has async threads running. With async cleaning, there is a separate executor service setup to handle cleaning. Is it possible that you were doing ctrl-C and then saw this exception.

@liijiankang
Copy link
Author

@bvaradar Our spark streaming program is submitted to yarn to run in cluster mode, and ctrl-C is not executed during the operation.

@n3nash n3nash added this to In progress in GI Tracker Board Apr 22, 2021
@n3nash
Copy link
Contributor

n3nash commented Apr 27, 2021

@liijiankang Does this issue come back when you turn on async cleaning ? If yes, can you file a JIRA ticket and ping back the ticket here so we can look into this ?

@liijiankang
Copy link
Author

@nsivabalan
Copy link
Contributor

can you please update the jira w/ stack trace as well.

GI Tracker Board automation moved this from In progress to Done Jun 3, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
priority:critical production down; pipelines stalled; Need help asap.
Projects
Development

No branches or pull requests

4 participants