Skip to content

Change Checkpoint default to true. Mesos already did so, we might as well also#730

Merged
tpetr merged 1 commit into
HubSpot:masterfrom
stevenschlansker:checkpoint-by-default
Oct 13, 2015
Merged

Change Checkpoint default to true. Mesos already did so, we might as well also#730
tpetr merged 1 commit into
HubSpot:masterfrom
stevenschlansker:checkpoint-by-default

Conversation

@stevenschlansker
Copy link
Copy Markdown
Contributor

No description provided.

@tpetr
Copy link
Copy Markdown
Contributor

tpetr commented Oct 13, 2015

To enable this in a pre-existing cluster:

Changing FrameworkInfo (while keeping the FrameworkID) is not handled correctly by Mesos at the moment. This is what you currently need to do to propagate FrameworkInfo.checkpoint throughout the cluster.

--> Update FrameworkInfo inside your framework and re-register with master. (Old FrameworkInfo is still cached at master and slaves).
--> Failover the leading master. (New FrameworkInfo will be cached by new leading master).
--> Hard restart (kill slave and wipe meta data) your slave in batches.

The proper fix for this is tracked at: https://issues.apache.org/jira/browse/MESOS-703

tpetr pushed a commit that referenced this pull request Oct 13, 2015
Change Checkpoint default to true.  Mesos already did so, we might as well also
@tpetr tpetr merged commit c00c5e2 into HubSpot:master Oct 13, 2015
@tpetr tpetr removed hs_qa labels Oct 13, 2015
@tpetr tpetr added this to the 0.4.6 milestone Oct 13, 2015
@stevenschlansker stevenschlansker deleted the checkpoint-by-default branch October 13, 2015 18:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants