Allow leading zeros for NumberedFileInputSplit #420
What changes were proposed in this pull request?
With these changes, NumberedFileInputSplit permits two kinds of name formatting: unpadded ("prefix4.suffix") or zero-padded ("prefix0004.suffix"). The latter is more suitable for, e.g., Hadoop MapFiles spit out by a Spark ETL pipeline.
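
To illustrate the two naming styles, here is a minimal sketch that assumes the names come from `java.util.Formatter`-style patterns (`%d` for unpadded, `%04d` for zero-padded to four digits). `NumberedNames` and `expand` are hypothetical names used only for this illustration; they are not part of NumberedFileInputSplit.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration only; not the NumberedFileInputSplit API.
public class NumberedNames {

    // Expands a format pattern into one name per index in [minInclusive, maxInclusive].
    // Assumes the pattern holds exactly one integer specifier, e.g. %d or %04d.
    static List<String> expand(String pattern, int minInclusive, int maxInclusive) {
        List<String> names = new ArrayList<>();
        for (int i = minInclusive; i <= maxInclusive; i++) {
            // Locale.ROOT keeps the digits ASCII regardless of the default locale.
            names.add(String.format(Locale.ROOT, pattern, i));
        }
        return names;
    }

    public static void main(String[] args) {
        // Unpadded: prefix4.suffix, prefix5.suffix
        System.out.println(expand("prefix%d.suffix", 4, 5));
        // Zero-padded: prefix0004.suffix, prefix0005.suffix
        System.out.println(expand("prefix%04d.suffix", 4, 5));
    }
}
```

Zero-padded names sort lexicographically in the same order as their indices, which is one reason tools that write numbered part files tend to prefer them.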
How was this patch tested?
Added a NumberedFileInputSplitTests class with some unit tests to verify the name formatting works. I've also run it in live code, and it performed fine.