Do we want to consider more exactly what this means from the perspective of HudiIO (integration with Apache Beam, how the Spark session is managed for long-running jobs, what "workflow manager" means, etc.)?
@SemanticBeeng this is orthogonal to connection management, I believe. This ticket was about figuring out how to deploy hudi in a long-running mode. Some aspects, like giving up containers under dynamic allocation, could still be useful. Let me post this into HUDI-70.
Per http://spark.apache.org/docs/latest/job-scheduling.html, Spark can already schedule tasks internally (we already do this at scale), and Spark has APIs to request and relinquish executors:
https://spark.apache.org/docs/2.0.2/api/java/org/apache/spark/SparkContext.html#requestExecutors(int)
Even for batch pipelines, we would like hoodie pipelines to run efficiently, like Spark Streaming, except that we give up containers when they are not in use.
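For reference, a minimal sketch of the Spark settings that would let a long-running job release idle executors through dynamic allocation. The property names are standard Spark configuration keys, but the values are illustrative assumptions and are not taken from this ticket or from any existing hoodie deployment.

```
# spark-defaults.conf (illustrative values only; tune for the actual workload)

# Let Spark grow and shrink the executor pool with the pending task backlog.
spark.dynamicAllocation.enabled              true

# External shuffle service, so shuffle files survive executor removal.
spark.shuffle.service.enabled                true

# Assumed bounds: drop to zero executors when idle, cap the pool at 50.
spark.dynamicAllocation.minExecutors         0
spark.dynamicAllocation.maxExecutors         50

# Release an executor after it has been idle this long (assumed value).
spark.dynamicAllocation.executorIdleTimeout  60s

# Fair sharing between jobs submitted inside the same long-running application.
spark.scheduler.mode                         FAIR
```

With settings along these lines, the driver stays up between runs while idle executors are returned to the cluster manager, which is roughly the "give up containers when not in use" behavior described above.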
Blocker : #123