Skip to content

Latest commit

 

History

History
24 lines (18 loc) · 1.22 KB

standbynn_secondarynn.md

File metadata and controls

24 lines (18 loc) · 1.22 KB

What is the difference between a standby NameNodes and a secondary NameNode?

Secondary Namenode

In Hadoop 1.x and 2.x, the secondary namenode means the same. It does CPU intensive tasks for Namenode. In more details, it combines the Edit log and fs_image and returns the consolidated file to Namenode. Namenode then loads that file into RAM. But, secondary namenode doesn't provide failover capabilities. So, in case of Namenode failure, Hadoop admins have to manually recover the data from Secondary Namenode.

Standby Namenode

In Hadoop 2.0, with the introduction of HA, the Standby Namenode came into picture. The standby namenode is the node that removes the problem of SPOF (Single Point Of Failure) that was there in Hadoop 1.x. The standby namenode provides automatic failover in case Active Namenode (can be simply called 'Namenode' if HA is not enabled) fails.

Moreover, enabling HA is not mandatory. But, when it is enabled, you can't use Secondary Namenode. So, either Secondary Namenode is enabled OR Standby Namenode is enabled.

Reference