Skip to content

rtdavis22/hdfs_spool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

Repository files navigation

This script uses inotifywait to listen for files added to /var/spool/to_hdfs/ and copies them to HDFS.

The components of the desired HDFS path should be separated by #s. For example, if the file name is logs#a#b.txt, it will be placed in /logs/a/b.txt in HDFS.

Note that a new VM is started up for each file copy so this isn't a terribly efficient ingestion mechanism.

Also make sure the user running the script has read/write permissions for the spool directory and the necessary HDFS locations.

About

An asynchronous mechanism for ingesting files into HDFS

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages