
Storm-166: Nimbus HA design doc and implementation. #354

Merged
merged 96 commits on Aug 24, 2015

Conversation

Parth-Brahmbhatt
Contributor

I have removed the BitTorrent implementation from this pull request because the only available BitTorrent library does not support trackerless torrents. Without trackerless torrents, a single tracker becomes a single point of failure, and a multi-tracker implementation requires that if a tracker host fails, the replacement host has the same DNS/network configuration.

Some manual tests I executed:

  • Start 3 nimbuses and verify that a simple word-count topology works. Try storm list/activate/deactivate/rebalance/kill from the UI and CLI.
  • Set the replication factor to 2 and run the first test again.
  • Bring up a new nimbus and ensure it catches up and competes for the leader lock.
  • With 3 nimbuses and 2 topologies, delete one topology's code from each non-leader nimbus. After killing the master nimbus, ensure one of them eventually becomes leader.

Midpoint Applications and others added 30 commits September 12, 2014 14:17
Conflicts:
	storm-core/test/clj/backtype/storm/supervisor_test.clj
…y way. All tests pass now and I was able to run wordcount and exclamation topologies.
…HDFSCodeDistributor. Working version of HDFSCodeDistributor.
… display list of nimbus hosts and current leader.
…ot have all the active topology code locally, keeps the lock if it can verify all active topology code exists locally.
…on to be achieved before the topology is activated.
@vesense
Member

vesense commented Mar 20, 2015

+1 looks good to me.

@ptgoetz
Member

ptgoetz commented Mar 20, 2015

+1

I'll proceed with creating the necessary branches.

Conflicts:
	storm-core/src/ui/public/index.html
	storm-core/src/ui/public/templates/index-page-template.html
	storm-core/src/ui/public/templates/topology-page-template.html
@ptgoetz
Member

ptgoetz commented Mar 20, 2015

Thanks @Parth-Brahmbhatt. I've merged this to 0.11.x-branch.

@ptgoetz
Member

ptgoetz commented Mar 21, 2015

NOTE: 0.11.x-branch has been renamed to 'nimbus-ha-branch'.

@longdafeng
Contributor

@ptgoetz @revans2 @Parth-Brahmbhatt,

Sorry for the late discussion of the HA design.
I strongly recommend using JStorm's Nimbus HA design. It is pretty stable; it has been released for a year and has proven stable in that time.

The logic is very simple, and the code is less than 1,000 lines.

  1. Every nimbus tries to own the znode /nimbus_master; the winner becomes the nimbus master. Slaves watch and periodically check the znode; once it disappears, they try to own it. While checking the znode, slaves sync binaries from the master.
  2. All client APIs first connect to ZK to find out which nimbus is the master.

(1) The core code in nimbus:
https://github.com/alibaba/jstorm/blob/master/jstorm-server/src/main/java/com/alibaba/jstorm/schedule/FollowerRunnable.java

(2) How to find the master of nimbus:
https://github.com/alibaba/jstorm/blob/master/jstorm-client/src/main/java/backtype/storm/security/auth/ThriftClient.java
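For readers who want to see the pattern described above in isolation, here is a minimal sketch of ephemeral-znode ownership with the plain ZooKeeper client. It is not JStorm's actual code (that is linked above); only the /nimbus_master path comes from the description, and the session timeout, class name, and payload are illustrative assumptions.

```java
import org.apache.zookeeper.*;
import org.apache.zookeeper.data.Stat;

// Minimal sketch of the znode-ownership pattern described above.
// Not JStorm's implementation; /nimbus_master is taken from the comment,
// the rest (payload, session timeout) is illustrative only.
public class ZnodeMasterElection implements Watcher {
    private static final String MASTER_PATH = "/nimbus_master";
    private final ZooKeeper zk;
    private final byte[] myHostPort;

    public ZnodeMasterElection(String zkConnect, String hostPort) throws Exception {
        this.zk = new ZooKeeper(zkConnect, 30_000, this);
        this.myHostPort = hostPort.getBytes("UTF-8");
    }

    /** Try to own /nimbus_master; the winner is the master, losers keep watching. */
    public void compete() throws Exception {
        try {
            // EPHEMERAL: the znode disappears if this nimbus's ZK session dies.
            zk.create(MASTER_PATH, myHostPort,
                      ZooDefs.Ids.OPEN_ACL_UNSAFE, CreateMode.EPHEMERAL);
            System.out.println("I am the master nimbus");
        } catch (KeeperException.NodeExistsException e) {
            // Someone else owns it; set a watch (and sync binaries from the master).
            Stat stat = zk.exists(MASTER_PATH, true);
            if (stat == null) {
                compete(); // it vanished between create() and exists(); retry
            }
        }
    }

    @Override
    public void process(WatchedEvent event) {
        if (event.getType() == Event.EventType.NodeDeleted
                && MASTER_PATH.equals(event.getPath())) {
            try {
                compete(); // master went away; slaves race to own the znode again
            } catch (Exception e) {
                throw new RuntimeException(e);
            }
        }
    }
}
```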

@Parth-Brahmbhatt
Contributor Author

Disclaimer: I have not looked at the code thoroughly, but I am commenting based on your design description of JStorm.

@longdafeng Did you have a chance to take a look at the current design? We are using Curator for leader election, which seems to be a very well tested library, and it is not far from what you have proposed for leader election.

As for the length of the code, I don't completely agree that it is a good metric for most things. Because we use an existing library, the actual leader-election code in the current PR is much smaller, 53 lines: https://github.com/Parth-Brahmbhatt/incubator-storm/blob/STORM-166/storm-core/src/clj/backtype/storm/zookeeper.clj#L250.
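For comparison, the 53 lines of Clojure linked above wrap Curator's LeaderLatch recipe. A rough Java equivalent of that usage (illustrative only; the latch path and participant id below are placeholders, not the values used in this PR) would be:

```java
import org.apache.curator.framework.CuratorFramework;
import org.apache.curator.framework.CuratorFrameworkFactory;
import org.apache.curator.framework.recipes.leader.LeaderLatch;
import org.apache.curator.framework.recipes.leader.LeaderLatchListener;
import org.apache.curator.retry.ExponentialBackoffRetry;

// Illustrative use of Curator's LeaderLatch recipe, roughly what the linked
// zookeeper.clj does. The path and id here are placeholders, not the PR's values.
public class NimbusLeaderElector {
    public static LeaderLatch startLatch(String zkConnect, String nimbusHostPort)
            throws Exception {
        CuratorFramework client = CuratorFrameworkFactory.newClient(
                zkConnect, new ExponentialBackoffRetry(1000, 3));
        client.start();

        // Each nimbus registers under the same latch path; Curator picks one leader
        // and re-elects automatically when the current leader's session goes away.
        LeaderLatch latch = new LeaderLatch(client, "/storm/leader-latch", nimbusHostPort);
        latch.addListener(new LeaderLatchListener() {
            @Override public void isLeader()  { System.out.println("Gained leadership"); }
            @Override public void notLeader() { System.out.println("Lost leadership"); }
        });
        latch.start();
        return latch;
    }
}
```

Callers can then check hasLeadership() on the returned latch, or use getLeader() to find the id of the current leader.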

On top of that, as part of this PR several of us had concerns about all clients connecting to ZK to identify the leader nimbus, as each new ZK connection is a write to ZK. We have partially addressed the issue by introducing thrift APIs for nimbus discovery, which should be more efficient than the original approach, and I plan to add caching at the nimbus layer, which should further improve performance.
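As a rough illustration of the discovery flow described here, a client-side lookup over thrift might look like the sketch below. The NimbusService/NimbusConnector types and the isLeader() method are hypothetical stand-ins, not the thrift API this PR actually adds.

```java
import java.util.List;

// Hypothetical sketch of leader discovery over thrift instead of every client
// hitting ZooKeeper directly. NimbusService, NimbusConnector and isLeader()
// are placeholders for whatever discovery API the PR actually exposes.
public class LeaderDiscovery {
    interface NimbusService {                  // stand-in for a generated thrift client
        boolean isLeader() throws Exception;
    }
    interface NimbusConnector {                // stand-in for thrift transport setup
        NimbusService connect(String host) throws Exception;
    }

    /** Walk the configured seed hosts and return a client talking to the leader. */
    public static NimbusService findLeader(List<String> seedHosts, NimbusConnector connector)
            throws Exception {
        for (String host : seedHosts) {
            try {
                NimbusService candidate = connector.connect(host);
                if (candidate.isLeader()) {
                    return candidate;          // only the leader accepts state-changing calls
                }
            } catch (Exception e) {
                // seed host down or unreachable; keep trying the remaining seeds
            }
        }
        throw new RuntimeException("No leader nimbus found among seeds: " + seedHosts);
    }
}
```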

As @ptgoetz mentioned in the JIRA, we do not want users' topologies to be lost once nimbus accepts them, and we also do not want to force all users to depend on a fully replicated storage layer like HDFS. In the current design, by adding a code-replication interface we guarantee that once a topology is in the active state it will be fully replicated, which seems to be a feature missing from your proposal. It is still a trade-off between availability and initial topology submission time, which users can choose based on their topology.replication.count config setting.
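To make the replication guarantee concrete, a pluggable code distributor of the kind described might expose an interface roughly like the one below. The names are illustrative assumptions, not the PR's actual interface; see the HDFSCodeDistributor mentioned in the commit log for a real implementation.

```java
import java.io.File;
import java.util.Map;

// Hypothetical shape of a pluggable code-replication interface as described above.
// Method and type names are illustrative, not the PR's actual API.
public interface CodeDistributor {
    /** Initialize with the storm configuration. */
    void prepare(Map<String, Object> conf) throws Exception;

    /** Publish a topology's jar/conf so other nimbus hosts can fetch it;
     *  returns a metadata file describing where the code lives. */
    File upload(String localDir, String topologyId) throws Exception;

    /** Fetch the topology code onto this host, given the metadata file. */
    void download(String topologyId, File metadata, String destDir) throws Exception;

    /** How many nimbus hosts currently hold a copy; the topology is only
     *  activated once this reaches the configured replication count. */
    int getReplicationCount(String topologyId) throws Exception;
}
```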

We also added a few more features: UI improvements, the nimbus summary being stored in ZK, thrift API modifications so users can see the replication factor of their topologies, and compatibility with the rolling-upgrade feature. In my opinion all of these are good admin tools, and this feature would be incomplete without them.

I appreciate any feedback you can provide based on your experience of running Nimbus HA in production for a year. Please take some time to review the current design and let us know if you have any concerns.

@longdafeng
Contributor

@Parth-Brahmbhatt, I don't suggest using "org.apache.curator.framework.recipes.leader".

In fact, we implemented HA with "org.apache.curator.framework.recipes.leader" in the first version of JStorm Nimbus HA, and it ran into a lot of problems when the ZooKeeper load was heavy. Later we used the low-level Curator API to implement HA instead; it is much more stable, even under heavy ZooKeeper load.

Right now, JStorm Nimbus HA is on its third version.
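For comparison with the LeaderLatch recipe, the "low-level Curator" approach described here boils down to creating an ephemeral znode directly and watching it. The sketch below is an assumption-laden illustration, not JStorm's code; the path and payload are placeholders.

```java
import org.apache.curator.framework.CuratorFramework;
import org.apache.zookeeper.CreateMode;
import org.apache.zookeeper.KeeperException;
import org.apache.zookeeper.Watcher;

// Sketch of the "low-level Curator" approach: no recipe, just an ephemeral znode
// plus a watch. Path and payload are illustrative.
public class LowLevelElection {
    public static boolean tryBecomeMaster(CuratorFramework client, String hostPort,
                                           Watcher onMasterGone) throws Exception {
        try {
            client.create()
                  .withMode(CreateMode.EPHEMERAL)
                  .forPath("/nimbus_master", hostPort.getBytes("UTF-8"));
            return true;                         // we own the znode, we are master
        } catch (KeeperException.NodeExistsException e) {
            // Lost the race: watch the znode so we can re-compete when it disappears.
            client.checkExists().usingWatcher(onMasterGone).forPath("/nimbus_master");
            return false;
        }
    }
}
```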

@revans2
Contributor

revans2 commented Mar 24, 2015

@longdafeng I am glad to see the JStorm community starting to help out here. I respect your experience with this, especially since this is your third iteration on the code. ZK load is a big issue and concern for me, so I am all for trying to adopt what JStorm has done in the area of HA. Because we are working on a separate feature branch for this, where we can break APIs if need be, perhaps we can check in the code as it is now and file a follow-on JIRA to adopt the model, configs, and ideally some of the code that JStorm is using. This would also make combining the two simpler in the future.

@Parth-Brahmbhatt @ptgoetz do either of you have an opinion on this?

@ptgoetz
Member

ptgoetz commented Mar 24, 2015

@revans2 @longdafeng @Parth-Brahmbhatt I haven't had a chance to fully review the JStorm Nimbus HA implementation yet, but I'm open to incorporating any of its concepts/implementation.

But to be clear, we have to complete IP clearance before any decisions are made regarding the JStorm code.

Conflicts:
	storm-core/src/jvm/backtype/storm/utils/NimbusClient.java
	storm-core/test/clj/backtype/storm/security/auth/nimbus_auth_test.clj
Conflicts:
	STORM-UI-REST-API.md
	conf/defaults.yaml
	storm-core/src/clj/backtype/storm/daemon/nimbus.clj
	storm-core/src/clj/backtype/storm/ui/core.clj
	storm-core/src/jvm/backtype/storm/Config.java
	storm-core/src/jvm/backtype/storm/generated/TopologySummary.java
Conflicts:
	storm-core/src/jvm/backtype/storm/utils/NimbusClient.java
…o ensure nimbus sets up the correct code-distributor entries on startup.
Conflicts:
	storm-core/src/jvm/backtype/storm/utils/NimbusClient.java
Conflicts:
	storm-core/src/clj/backtype/storm/ui/core.clj
	storm-core/src/jvm/backtype/storm/Config.java
	storm-core/src/jvm/backtype/storm/utils/NimbusClient.java

Conflicts:
	storm-core/src/clj/backtype/storm/ui/core.clj
	storm-core/src/jvm/backtype/storm/Config.java
	storm-core/src/jvm/backtype/storm/utils/NimbusClient.java
…eral entries getting deleted. Adding a sleep before the code-sync thread executes ls /code-distributor/topology-id to ensure it gets the correct id back, so users don't have to wait for up to 5 minutes to submit a topology.
…tributor path as zookeeper does not guarantee Simultaneously Consistent Cross-Client Views unless sync is called.
@Parth-Brahmbhatt
Contributor Author

@revans2 @ptgoetz @harshach I have merged with master one more time. We are still using Curator's LeaderLatch recipe. Nimbus discovery is done via an API, so clients don't have to connect to ZooKeeper.

@harshach
Contributor

I am +1 on merging into master.

@Parth-Brahmbhatt
Contributor Author

@revans2 Can you please review this PR when you have time? Given that you have been involved in the original PR, I don't want to commit this until I get confirmation from you.

@revans2
Contributor

revans2 commented Aug 24, 2015

Sorry I took so long on this. I am +1 on merging this in. I see a +1 from @harshach before the upmerge and a +1 from @ptgoetz from a long time ago. The upmerge looks like it was mostly trivial changes so I am just going to merge this into master.
