Skip to content

Latest commit

 

History

History
26 lines (21 loc) · 1.18 KB

hdcs-sh-hadoop-data-protection-and-netapp.adoc

File metadata and controls

26 lines (21 loc) · 1.18 KB
sidebar permalink keywords summary
sidebar
data-analytics/hdcs-sh-hadoop-data-protection-and-netapp.html
distcp, copy, backup workflow, hdfs, mapreduce
Hadoop DistCp is a native tool used for large intercluster and intracluster copying. The Hadoop DistCp basic process is a typical backup workflow using Hadoop native tools such as MapReduce to copy Hadoop data from an HDFS source to a corresponding target.

Hadoop data protection and NetApp

Hadoop DistCp is a native tool used for large intercluster and intracluster copying. The Hadoop DistCp basic process shown in the figure below is a typical backup workflow using Hadoop native tools such as MapReduce to copy Hadoop data from an HDFS source to a corresponding target.

The NetApp NFS direct access enables customers to set NFS as the target destination for the Hadoop DistCp tool to copy the data from HDFS source into an NFS share through MapReduce. The NetApp NFS direct access acts as an NFS driver for the DistCp tool.

hdcs sh image4