Permalink
Switch branches/tags
Commits on Sep 28, 2011
Commits on Sep 27, 2011
  1. Perform final sync once all handoff data has been sent.

    jonmeredith committed Sep 27, 2011
    The new cluster membership code switched to forwarding
    once handoff is complete.  Without this change the vnode
    starts forwarding while the new owner is still processing
    buffered TCP data.
Commits on Sep 26, 2011
  1. Fix bug with nodes leaving the cluster earlier than intended.

    jtuple committed Sep 26, 2011
    Change ring_ready to wait on exiting nodes in addition to valid and leaving
    nodes. This ensure the ring converges on a node's intent to leave before the
    node leaves the cluster.
    
    Change claimant from moving itself from exiting to invalid. Instead, after
    the claimant moves to exiting, a new claimant will emerge that will move the
    previous claimant to invalid and initiate shutdown.
Commits on Sep 23, 2011
  1. Fixed update_forwarding_mode return in deleted case.

    jonmeredith committed Sep 23, 2011
    The caller wraps the state with the next state information.
  2. Bump lager dependency version

    Jared Morrow
    Jared Morrow committed Sep 23, 2011
  3. Made Mod:delete happen before unregister.

    jonmeredith committed Sep 23, 2011
    Prevent a race with the master starting a new vnode.
    Changed coverage to run while in handoff - otherwise
    listkeys et al will bomb during partition transfer.
  4. Added infinity timeout on finish_handoff call.

    jonmeredith committed Sep 23, 2011
    On a very busy 6-node stagedevrel cluster was hitting.
    11:35:18.950 [error] gen_fsm <0.171.0> in state active terminated with reason: {timeout,{gen_server,call,[riak_core_gossip,{finish_handoff,45671926166590716193865151022383844364247891968,'dev1@127.0.0.1','dev3@127.0.0.1',riak_pipe_vnode}]}}
    
    The process is local and the call is monitored in case gossip dies.
  5. Changed vnode to unregister from master before cleaning up.

    jonmeredith committed Sep 23, 2011
    Fullsync repl was hanging because it delivered a fold message
    while finish_handoff was being called.  The message was never
    processed as the vnode immediately shut down rather than
    forwarding the messages in the queue.
    
    On completion of handoff, async unregister from the vnode master. The
    unregister call now passes the pid of the vnode unregistering
    and now the master sends an unregistered event once the vnode
    is removed from the master ETS table.
    
    While waiting for the acknowledgment of unregister the vnode goes
    into forwarding mode.
Commits on Sep 21, 2011
  1. Update new partition claim algorithm after review + bug fixes

    jtuple committed Sep 21, 2011
    Change claim_simulation.erl eunit test to run a simulation with both the
    new and old claim algorithm as suggested.
    
    Rename riak_core_new_claim:new_claim/2 to new_choose_claim/2 to match
    default_choose_claim/2.
    
    Fix two bugs in riak_core_new_claim.erl that are on code paths that cannot
    occur in 1.0 due to existing invariants, but should be fixed nevertheless:
    - Match error in prefilter_violations: change CNth to {CNth, _}.
    - Handle case where new_choose_claim fails to claim partitions by falling
      back to claim_rebalance_n.
Commits on Sep 20, 2011
  1. Add new partition claim function and claim simulator

    jtuple committed Sep 20, 2011
    Add riak_core_new_claim:new_wants_claim/2 and new_claim/2.
    Merge in claim simulation code provided by Greg Nelson (grourk@dropcam.com).
    Add pretty_print function to riak_core_ring.
    
    The new claim function is designed to reduce the number of partition transfers
    that occur when rebalancing the ring, aiming as close to possible for minimal
    consistent hashing.
  2. Update depend versions of lager and poolboy

    Jared Morrow
    Jared Morrow committed Sep 20, 2011
  3. Fix a variable conflict

    Vagabond committed Sep 20, 2011
  4. Fix bug with worker checkin tracking

    Vagabond committed Sep 15, 2011
    bz1188
  5. Initial attempt at clean vnode shutdown that waits for queued work

    Vagabond committed Sep 14, 2011
    bz1188
    
    This patch adds a patched supervisor module that supports graceful
    shutdown from a simple_one_for_one, so when a node stops gracefully, we
    can block shutdown long enough to process any queued work and do any
    other cleanups.
Commits on Sep 16, 2011
  1. Merge pull request #86 from basho/AZ721-louder-2i-errors

    rustyio committed Sep 16, 2011
    AZ721 - Fail Loudly on 2i Errors
  2. Fix subtle bug in riak_core_coverage_fsm.

    kellymclaughlin committed Sep 15, 2011
    Fixes: az726
    
    This change fixes a bug in riak_core_coverage_fsm where the updated
    state is not passed to the module implementing the behavior in the
    finish call. This can lead to incomplete results for operations that
    accumulate results in the state and do something with them in the
    finish function.
Commits on Sep 15, 2011
  1. Stop coverage fsm when there is an error.

    rustyio committed Sep 15, 2011
    AZ721
Commits on Sep 14, 2011