Update to Hadoop-BAM 7.8.0 #1455

fnothaft · 2017-03-23T15:54:38Z

Bumps to latest version of HTSJDK. Should tack on to #1440. Will need to update SAM/BAM/CRAM queryname sorted loading code. As part of that fix, we should consolidate the BAM->Fragments path into a loadBamAsFragments method, which will make it easier to use downstream (e.g., in https://github.com/heuermh/cannoli/pull/15).

The text was updated successfully, but these errors were encountered:

heuermh · 2017-03-23T22:19:08Z

While we're in there I'd suggest taking out

} else if (filePath.endsWith(".reads.adam")) {
  log.info(s"Loading $filePath as ADAM AlignmentRecords and converting to Fragments.")
  loadAlignments(filePath).toFragments

And from loadAlignments

} else if (filePath.endsWith("contig.adam")) {
  log.info(s"Loading $filePath as Parquet of NucleotideContigFragment and converting to AlignmentRecords. Projection is ignored.")
  AlignmentRecordRDD.unaligned(loadParquetContigFragments(filePath).toReads)

Sorry, I'll create a new issue. See #1456.

Resolves bigdatagenomics#1455. Adds `org.bdgenomics.adam.rdd.read.RepairPartitions`, which works around removed functionality from Hadoop-BAM for keeping read pairs from a queryname sorted BAM file in a single partition.

fnothaft added this to the 0.22.0 milestone Mar 23, 2017

fnothaft mentioned this issue Mar 27, 2017

Dependency version bump + BroadcastRegionJoin fix #1440

Merged

heuermh closed this as completed in 4a74482 Mar 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update to Hadoop-BAM 7.8.0 #1455

Update to Hadoop-BAM 7.8.0 #1455

fnothaft commented Mar 23, 2017

heuermh commented Mar 23, 2017 •

edited

Loading

Update to Hadoop-BAM 7.8.0 #1455

Update to Hadoop-BAM 7.8.0 #1455

Comments

fnothaft commented Mar 23, 2017

heuermh commented Mar 23, 2017 • edited Loading

heuermh commented Mar 23, 2017 •

edited

Loading