Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FLINK-19693][runtime] Downstream Failover for Approximate Local Recovery #13880

Closed
wants to merge 3 commits into from

Commits on Nov 8, 2020

  1. [FLINK-19693][runtime] Downstream Failover for Approximate Local Reco…

    …very
    
    Enables downstream failover for approximate local recovery. That says if a task fails,
    all its downstream tasks restart, including itself. This is achieved by reusing the existing
    RestartPipelinedRegionFailoverStrategy --- treat each individual task connected by
    ResultPartition.Pipelined_Approximate as a separate region.
    
    To achieve this, we introduced an attribute "reconnectable" in ResultPartitionType
    to indicate whether the partition is reconnectable. Notice that this is only a temporary
    solution for now. It will be removed after:
     - Approximate local recovery has its won failover strategy to restart the failed set of
       tasks instead of restarting downstream of failed tasks depending on
       {@code RestartPipelinedRegionFailoverStrategy}
     - FLINK-19895: Unify the life cycle of ResultPartitionType Pipelined Family.
       There is also a good discussion on this in FLINK-19632.
    curcur committed Nov 8, 2020
    Configuration menu
    Copy the full SHA
    c2d0a86 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    faa8ec8 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    46b3d4b View commit details
    Browse the repository at this point in the history