ICP1
Hiresh Jakkala Bhaskar edited this page Aug 22, 2019
·
2 revisions
- Install Cloudera
- Load datasets into HDFS
- Append both files
- Visualize the result file with Hue
- Display first and last 5 lines of the result file
- Load new file and append data of all the 3 datasets
- Cloudera
- Hadoop
- Hue
- Creating new directory BDP:
- Copy the files to hadoop hdfs
appendToFile – copies files from local file system to a destination file system
Appending the files using Cat command and storing moving the output to HDFS using put command,
****View the first 5 lines of merged dataset using appropriate hdfs commands
****View the first 5 lines of merged dataset using appropriate hdfs commands
****Create a new text file and load it into hdfs and try to append all three datasets.
****Visualize file with Hue