Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CANNOLI-33] Use ADAM tab5 formatter for bowtie #42

Closed
wants to merge 5 commits into from

Conversation

heuermh
Copy link
Member

@heuermh heuermh commented May 17, 2017

Fixes #33
Depends on bigdatagenomics/adam#1522

No reads are processed however

$ ADAM_MAIN=org.bdgenomics.cannoli.Cannoli \
  adam-submit \
  --jars target/cannoli-spark2_2.11-0.1-SNAPSHOT.jar \
  -- \
  bowtie \
  -bowtie_index GCA_000001405.25_GRCh38.p10_genomic \
  -print_metrics -single \
  NIST7035_TAAGGCGA_L002.ifq \
  NIST7035_TAAGGCGA_L002.sam

Using ADAM_MAIN=org.bdgenomics.cannoli.Cannoli
Using SPARK_SUBMIT=/usr/local/bin/spark-submit
...
# reads processed: 0
# reads with at least one reported alignment: 0 (0.00%)
# reads that failed to align: 0 (0.00%)
No alignments
# reads processed: 0
# reads with at least one reported alignment: 0 (0.00%)
# reads that failed to align: 0 (0.00%)
No alignments

Unless I'm doing something wrong, --12 - doesn't seem to work

$ cat fragments.tab5 
H06HDADXX130110:2:2116:3345:91806	GTTAGGGTTAGGGTTGGGTTAGGGTTAGGGTTAGGGTTAGGGGTAGGGTTAGGGTTAGGGGTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGGTAGGGCTAGGGTTAAGGGTAGGGTTAGCGAAAGGGCTGGGGTTAGGGGTGCGGGTACGCGTAGCATTAGGGCTAGAAGTAGGATCTGCAGTGCCTGACCGCGTCTGCGCGGCGACTGCCCAAAGCCTGGGGCCGACTCCAGGCTGAAGCTCAT	>=<=???>?>???=??>>8<?>=2=<===1194<?;:?>>?#3==>###########################################################################################################################################################################################################	TAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTACCCCTAACCCTAACCCTAACCCTAACCCGTACCCTAAACCCAACCCTAACCACAAAGCAAATCCCAACCTTAACCGGAACCCGAAATCTCGCAGCAAATCTGCAGTAGAGACGCAGACTCAACCATGCGTCTATTAGTACGCATTATCATTGCCTCATGCTTCTTAAGTACAGAGAGATGAC	==;<?>@@@<>>@??<>>???<=>>?>:><@?4=:>7=5=>:=@;'@A?########################################################################################################################################################################################################
...

$ bowtie -S GCA_000001405.25_GRCh38.p10_genomic --12 fragments.tab5 
@HD	VN:1.0	SO:unsorted
@SQ	SN:CM000663.2	LN:248956422
...
@PG	ID:Bowtie	VN:1.2	CL:"bowtie-align --wrapper basic-0 -S GCA_000001405.25_GRCh38.p10_genomic --12 fragments.tab5"
H06HDADXX130110:2:2116:3345:91806	77	*	0	0	*	*	0	0	GTTAGGGTTAGGGTTGGGTTAGGGTTAGGGTTAGGGTTAGGGGTAGGGTTAGGGTTAGGGGTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGGTAGGGCTAGGGTTAAGGGTAGGGTTAGCGAAAGGGCTGGGGTTAGGGGTGCGGGTACGCGTAGCATTAGGGCTAGAAGTAGGATCTGCAGTGCCTGACCGCGTCTGCGCGGCGACTGCCCAAAGCCTGGGGCCGACTCCAGGCTGAAGCTCAT	>=<=???>?>???=??>>8<?><=2=<===1194<?;:?>>?#3==>###########################################################################################################################################################################################################	XM:i:0
H06HDADXX130110:2:2116:3345:91806	141	*	0	0	*	*	0	0	TAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTACCCCTAACCCTAACCCTAACCCTAACCCGTACCCTAAACCCAACCCTAACCACAAAGCAAATCCCAACCTTAACCGGAACCCGAAATCTCGCAGCAAATCTGCAGTAGAGACGCAGACTCAACCATGCGTCTATTAGTACGCATTATCATTGCCTCATGCTTCTTAAGTACAGAGAGATGAC	==;<?>@@@<>>@??<>>???<=>>?>:><@?4=:>7=5=>:<=@;'@A?########################################################################################################################################################################################################	XM:i:0
...

$ cat fragments.tab5 | bowtie -S GCA_000001405.25_GRCh38.p10_genomic --12 -
@HD	VN:1.0	SO:unsorted
@SQ	SN:CM000663.2	LN:248956422
...
@PG	ID:Bowtie	VN:1.2	CL:"bowtie-align --wrapper basic-0 -S GCA_000001405.25_GRCh38.p10_genomic --12 -"
# reads processed: 0
# reads with at least one reported alignment: 0 (0.00%)
# reads that failed to align: 0 (0.00%)
No alignments

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/cannoli-prb/25/

Build result: FAILURE

[...truncated 16 lines...] > /home/jenkins/git2/bin/git rev-parse origin/pr/42/merge^{commit} # timeout=10 > /home/jenkins/git2/bin/git branch -a --contains ee6e51f # timeout=10 > /home/jenkins/git2/bin/git rev-parse remotes/origin/pr/42/merge^{commit} # timeout=10Checking out Revision ee6e51f (origin/pr/42/merge) > /home/jenkins/git2/bin/git config core.sparsecheckout # timeout=10 > /home/jenkins/git2/bin/git checkout -f ee6e51fd572cb3622c51a62971e5789e6c95ae48First time build. Skipping changelog.Triggering cannoli-prb ? 2.6.0,2.11,2.0.0,centosTriggering cannoli-prb ? 2.3.0,2.10,2.0.0,centosTriggering cannoli-prb ? 2.6.0,2.10,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.11,1.6.1,centosTriggering cannoli-prb ? 2.3.0,2.11,1.6.1,centosTriggering cannoli-prb ? 2.3.0,2.11,2.0.0,centosTriggering cannoli-prb ? 2.3.0,2.10,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.10,2.0.0,centoscannoli-prb ? 2.6.0,2.11,2.0.0,centos completed with result FAILUREcannoli-prb ? 2.3.0,2.10,2.0.0,centos completed with result FAILUREcannoli-prb ? 2.6.0,2.10,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.6.0,2.11,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.3.0,2.11,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.3.0,2.11,2.0.0,centos completed with result FAILUREcannoli-prb ? 2.3.0,2.10,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.6.0,2.10,2.0.0,centos completed with result FAILURENotifying endpoint 'HTTP:https://webhooks.gitter.im/e/ac8bb6e9f53357bc8aa8'
Test FAILed.

@coveralls
Copy link

coveralls commented Jun 20, 2017

Coverage Status

Coverage increased (+33.0%) to 62.617% when pulling 236d570 on heuermh:bowtie-tab5 into d5e9577 on bigdatagenomics:master.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/cannoli-prb/26/

Build result: FAILURE

[...truncated 15 lines...] > /home/jenkins/git2/bin/git fetch --tags --progress https://github.com/bigdatagenomics/cannoli.git +refs/pull/:refs/remotes/origin/pr/ # timeout=15 > /home/jenkins/git2/bin/git rev-parse origin/pr/42/merge^{commit} # timeout=10 > /home/jenkins/git2/bin/git branch -a -v --no-abbrev --contains a65c216 # timeout=10Checking out Revision a65c216 (origin/pr/42/merge) > /home/jenkins/git2/bin/git config core.sparsecheckout # timeout=10 > /home/jenkins/git2/bin/git checkout -f a65c2168f6c9440ac4715021392bb171b37994d1First time build. Skipping changelog.Triggering cannoli-prb ? 2.6.0,2.10,1.6.1,centosTriggering cannoli-prb ? 2.3.0,2.11,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.11,2.0.0,centosTriggering cannoli-prb ? 2.3.0,2.10,2.0.0,centosTriggering cannoli-prb ? 2.6.0,2.11,1.6.1,centosTriggering cannoli-prb ? 2.3.0,2.10,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.10,2.0.0,centosTriggering cannoli-prb ? 2.3.0,2.11,2.0.0,centoscannoli-prb ? 2.6.0,2.10,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.3.0,2.11,1.6.1,centos completed with result SUCCESScannoli-prb ? 2.6.0,2.11,2.0.0,centos completed with result SUCCESScannoli-prb ? 2.3.0,2.10,2.0.0,centos completed with result SUCCESScannoli-prb ? 2.6.0,2.11,1.6.1,centos completed with result SUCCESScannoli-prb ? 2.3.0,2.10,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.6.0,2.10,2.0.0,centos completed with result SUCCESScannoli-prb ? 2.3.0,2.11,2.0.0,centos completed with result SUCCESSNotifying endpoint 'HTTP:https://webhooks.gitter.im/e/ac8bb6e9f53357bc8aa8'
Test FAILed.

@heuermh
Copy link
Member Author

heuermh commented Jun 22, 2017

Jenkins, retest this please

@coveralls
Copy link

Coverage Status

Coverage decreased (-0.8%) to 28.879% when pulling 236d570 on heuermh:bowtie-tab5 into d5e9577 on bigdatagenomics:master.

1 similar comment
@coveralls
Copy link

coveralls commented Jun 22, 2017

Coverage Status

Coverage decreased (-0.8%) to 28.879% when pulling 236d570 on heuermh:bowtie-tab5 into d5e9577 on bigdatagenomics:master.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/cannoli-prb/27/

Build result: FAILURE

[...truncated 15 lines...] > /home/jenkins/git2/bin/git fetch --tags --progress https://github.com/bigdatagenomics/cannoli.git +refs/pull/:refs/remotes/origin/pr/ # timeout=15 > /home/jenkins/git2/bin/git rev-parse origin/pr/42/merge^{commit} # timeout=10 > /home/jenkins/git2/bin/git branch -a -v --no-abbrev --contains a65c216 # timeout=10Checking out Revision a65c216 (origin/pr/42/merge) > /home/jenkins/git2/bin/git config core.sparsecheckout # timeout=10 > /home/jenkins/git2/bin/git checkout -f a65c2168f6c9440ac4715021392bb171b37994d1First time build. Skipping changelog.Triggering cannoli-prb ? 2.3.0,2.11,2.0.0,centosTriggering cannoli-prb ? 2.3.0,2.10,2.0.0,centosTriggering cannoli-prb ? 2.6.0,2.11,2.0.0,centosTriggering cannoli-prb ? 2.3.0,2.10,1.6.1,centosTriggering cannoli-prb ? 2.3.0,2.11,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.11,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.10,1.6.1,centosTriggering cannoli-prb ? 2.6.0,2.10,2.0.0,centoscannoli-prb ? 2.3.0,2.11,2.0.0,centos completed with result SUCCESScannoli-prb ? 2.3.0,2.10,2.0.0,centos completed with result SUCCESScannoli-prb ? 2.6.0,2.11,2.0.0,centos completed with result SUCCESScannoli-prb ? 2.3.0,2.10,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.3.0,2.11,1.6.1,centos completed with result SUCCESScannoli-prb ? 2.6.0,2.11,1.6.1,centos completed with result SUCCESScannoli-prb ? 2.6.0,2.10,1.6.1,centos completed with result FAILUREcannoli-prb ? 2.6.0,2.10,2.0.0,centos completed with result SUCCESSNotifying endpoint 'HTTP:https://webhooks.gitter.im/e/ac8bb6e9f53357bc8aa8'
Test FAILed.

Copy link
Member

@devin-petersohn devin-petersohn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, just a couple of comments.

@@ -0,0 +1 @@
#!/bin/bash
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this the extent of the script?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is intentionally a stub for later.

@@ -31,6 +31,7 @@ private[cannoli] object QuerynameGrouper extends Serializable {
def apply(rdd: AlignmentRecordRDD): FragmentRDD = {
new FragmentRDD(apply(rdd.rdd),
rdd.sequences,
rdd.recordGroups)
rdd.recordGroups,
optPartitionMap = None)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We lose some information here about whether or not the data is sorted. This should be fixed in bigdatagenomics/adam#1561. Under the new method, it does force a recompute on the optPartitionMap, but at least we would keep track of whether or not the data is sorted.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

By definition, the data is not coordinate sorted here.

@heuermh
Copy link
Member Author

heuermh commented Jul 1, 2017

I'm not sure how, I've git-borked the commits on this one and can't seem to unsuck them. The problem with bowtie itself has been fixed in version 1.2.1.1, and is available in homebrew-science. I'd like to close this unmerged and try again with separate pull requests.

@heuermh heuermh changed the title Use ADAM tab5 formatter for bowtie [CANNOLI-33] Use ADAM tab5 formatter for bowtie Jul 1, 2017
@fnothaft
Copy link
Member

fnothaft commented Jul 5, 2017

Closing as unmerged, @heuermh will reopen as separate PRs.

@fnothaft fnothaft closed this Jul 5, 2017
@heuermh heuermh deleted the bowtie-tab5 branch November 1, 2017 16:34
@heuermh heuermh added this to the 0.1.0 milestone Jan 24, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants