Skip to content

Hadoop Installation Steps on Windows as Single Node

Vijay edited this page Jun 30, 2019 · 9 revisions
  • Download hadoop-3.2.0.zip from apache hadoop site and extract into a new folder (recommended folder name: hadoop)

  • Replace C:\hadoop\hadoop-3.2.0\bin folder with binaries from https://github.com/s911415/apache-hadoop-3.1.0-winutils

  • Modify the c:\hadoop\hadoop-3.2.0\etc\hadoop\core-site.xml as (core-site.xml)[https://github.com/AMVijay/hadoop-learning/blob/master/hadoop-3.2.0/etc/hadoop/core-site.xml]

  • Modify the c:\hadoop\hadoop-3.2.0\etc\hadoop\mapred-site.xml as mapred-site.xml

  • Modify the c:\hadoop\hadoop-3.2.0\etc\hadoop\hdfs-site.xml as hdfs-site.xml

  • Modify the c:\hadoop\hadoop-3.2.0\etc\hadoop\yarn-site.xml as yarn-site.xml.

  • Append below config path to the c:\hadoop\hadoop-3.2.0\etc\hadoop\hadoop-env.cmd, as hadoop-env.cmd.

  • Start HDFS using command C:\hadoop\hadoop-3.2.0\sbin>start-dfs.cmd

    • To format the file system, use command C:\hadoop\hadoop-3.2.0\bin>hdfs.cmd namenode -format
  • Copied the jar file hadoop-yarn-server-timelineservice-3.2.0.jar from C:\hadoop\hadoop-3.2.0\share\hadoop\yarn\timelineservice\hadoop-yarn-server-timelineservice-3.2.0.jar to the path C:\hadoop\hadoop-3.2.0\share\hadoop\yarn\

  • Start YARN using command C:\hadoop\hadoop-3.2.0\sbin>start-yarn.cmd

  • Copy local configuration files into hadaoop HDFS using C:\hadoop\hadoop-3.2.0>hdfs dfs -put etc/hadoop input

Clone this wiki locally