Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Make bamtools_split output a dataset collection #227
There was some discussion on the Hangout earlier about the different semantic modes of bamtools split. Turns out they are:
Modes 1 and 2 split into exactly 2 datasets. Modes 3 and 4 split into an undefined number of dataset, with mode 3 potentially most useful for parallel processing. There does not (except for, arguably, mode 3) be a "split into chunks" options. The discussion revolved around the differences between "split into chunks" and other split modes and the possible need to create different tools to address the different semantics of these modes. I'd like to see some comments on this issue before a merge.
Secondly, this change introduces a undocumented feature (output to dataset collection) as default behaviour in a devteam tool. This might prove confusing to users: should this merge wait on documentation?