-
Notifications
You must be signed in to change notification settings - Fork 354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
oncofuse #237
oncofuse #237
Conversation
Hi Paul, Looks good-- you might want to look at bcbio.providance.do.run function instead of using subprocess directly; if you do that you will get all the cool logging of the command line stuff for free. |
thanks. I just followed the implementation of snpEff. if it's implemented appropriately, I can change snpEff's as well. now, I am trapped here chapmanb/cloudbiolinux#139. I need to install everything to my new machine to test the script |
…t via the shell instead of subprocess #237
Paul; In terms of installation, bcbio-nextgen is not well tested with MacOSX installation. We have some of the pieces in place but really haven't gone through a full installation. I expect this to be a bit of work due to making things compile. If you want to test on a Mac machine, it might be worth using a virtual machine. Rory put together instructions here: https://bcbio-nextgen.readthedocs.org/en/latest/contents/installation.html#on-a-virtual-machine We're currently moving towards a VM and docker container approach to try to avoid needing to get all this software cleanly compiling on multiple systems. Hope this helps with testing things out. |
Thanks Brad. Just that my linux machine is not ready yet, I will move back to centos soon. Appreciate all the helps. The docker container approach looks very nice. |
sorry that i cannot test till Monday |
@roryk, I just checked, tophat2+oncofuse should run ok now; I put fusion detection as part of transcript counts, not sure if it's ok with you |
Hi Paul, Thanks for the pull request-- do you have anything like a very small test dataset that we could add that should have a fusion event in it so I can add a unit test? We haven't really used this type of data much so we have nothing. |
@roryk , I personally don't. I was looking into the SRA, and we may use the following study as test. Let me know if it's ok, and I can prepare a template. We may add the sra toolkit such that the pipeline can start with .sra file |
Sure-- sounds good! Thanks Paul for taking the lead on this. |
Hi Paul, |
Thanks Miika! I wanted to find it at EBI but couldn't find it at first. Thanks for the link. Should you have a better dataset, please let me know, and you may test if I implemented oncofuse correctly. Thanks. |
There is also a simulated data set available from the authors of FusionMap, mentioned in this publication http://www.biomedcentral.com/content/pdf/1471-2105-14-S7-S2.pdf on page 9. |
There is some test data in the FusionMap bundle: |
Hi Paul and Miika, Thanks for all your work; let us know how the test data looks. Having a nice unit test and a good idea if it is working properly for this would be great before the merge. |
Sure Rory. I have been trying the simulated dataset in fusionmap, but there is an Paul Exception in thread "main" net.sf.picard.PicardException: In paired mode, at net.sf.picard.sam.FastqToSam.getBaseName(FastqToSam.java:291) at net.sf.picard.sam.FastqToSam.doPaired(FastqToSam.java:187) at net.sf.picard.sam.FastqToSam.doWork(FastqToSam.java:140) at at net.sf.picard.sam.FastqToSam.main(FastqToSam.java:118) On Sun, Jan 12, 2014 at 2:58 PM, Rory Kirchner notifications@github.comwrote:
|
Hi Paul, Looks like the :1 and :2 identifiers are throwing it off, they should either be /1 and /2 to denote which member of the pair it is, or the Illumina style 1:Y:18:ATCACG style: |
Thanks for the hint. It works, but oncofuse didn't report the right answer, Paul On Sun, Jan 12, 2014 at 3:58 PM, Rory Kirchner notifications@github.comwrote:
|
@roryk , the script runs ok (using the test dataset in fusionMap) on hg19 reference after changing the :1 and :2 identifiers to /1 and /2. It doesn't work on GRCh37 unless I take out 'chr' from refseq annotation in oncofuse package: oncofuse/common/refseq_full.txt what would you recommend in fixing these two issues? P.S. the other example takes too long as a unit test |
I copied my sample config here: https://gist.github.com/tanglingfung/8409280 |
Hi Paul, This is great! Thank you so much for all of your work. We're going to roll a release tomorrow morning and after that I'll work on testing this and merging it in as part of the next release. |
@roryk , no problem, I haven't tested with the latest updates, and star index is generated every time when i debug my code |
Hi Paul, I merged this in here: 1502ebd. I set you as the author since it was mostly your code, I'm sorry I'm not good enough at git to know how to add my commits on top of yours and also clean up the hhistory, so I merged them all together and set you to be the author. There is some work that needs to be done to get this production ready. Some help on these items would be awesome.
At any rate the unit test for this passes for me but I thought I'd commit this and merge it in so we have a point to jump off of to smooth this all out. |
@roryk , sorry for my late response. thanks for taking care of this. I am not very familiar with git either, let me try though
Thanks for everything |
let me close this for now, and create a new PR with a few bugs fixed |
haven't tested the script yet but want to check if the style is consistent
happy new year