New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
doing bcl->qseq->fastq->analysis->galaxy in one machine #20
Comments
Paul; |
Brad: Thanks. Yes, I think it's nice too. But it would be also nice if it comes as a option. We were thinking to amend the code a bit, but just wonder what's the best way to generalize the code. |
Paul; |
Thanks. We are getting a similar approach. On Tue, Apr 12, 2011 at 4:43 AM, chapmanb
|
Hi Brad, One question. Where can I find the details of get_flowcell_info() and get_get_fastq_dir()? (Sorry, I am kind of new to Python.) We have changed the illumina_finished_msg.py to transfer the data from the sequencing machine to the analysis machine in the bcl->qseq step, and extracted the copy_and_analysis in analyze_finish_sqn.py to initiate the automated_initial_analysis.py. However, I wasn't sure which directory I should put for the |
sorry, I found the files. But then, I got an error in nosetests that i wasn't sure where the problem is: do you have any idea? |
Paul; For your CollectGcBiasMetrics error, it sounds like Picard is having trouble finding the Rscript executable, which should come as part of R. Is R installed and on your path? Picard uses Rscript to generate plots. |
Brad, Well. It will be good to be able to specify the fastq and qseqs files Yes, I can call Rscript anywhere. I should be on my path too. Would it Thanks, On Sat, Apr 16, 2011 at 5:18 AM, chapmanb
|
Paul; For your CollectGcBias error, can you run this by hand from the commandline? That's the best way to debug it. The bait and target files are only necessary if you are doing targetted re-sequencing. That wouldn't have any effect on CollectGcBias. Thanks, |
Brad, That's a good idea. Let me try to put it up soon. As for the OLB problem, we find that it's a problem on our side. We Our approach now is to transfer the qseqs and fastq in a different In the future, we want to generalize it to handle the following cases:
I saw you had plans for case 1, right? On Sun, Apr 17, 2011 at 9:13 AM, chapmanb
|
Paul; For the fastq files, I would suggest transferring them somewhere outside of the Illumina dump tree. You'll probably have different backup strategies for these compared to the rest of the Illumina output, so this helps facilitate that. For your last question, I'm not sure what step you are referring to. The analysis works with fastq files now and if you only want to process certain lanes, you can pass in a custom run_info.yaml file specifying what to process to automated_initial_analysis.py: https://github.com/chapmanb/bcbb/blob/master/nextgen/config/run_info.yaml Hope this helps. |
Hi Brad, I'm not used to github yet, so, I posted our code with gist. Basically, we took away the messaging part in illumina_finished_msg.py, transfer the qseqs files to analysis machine, generate fastq files directly to the analysis dir and start the analysis there. I attached the code in the following URL: I hope it can be generalized and compatible with your scripts. We seems have the system set up on our server. The analysis pipeline is still running. Thanks, |
Paul; The only thing I did not add is changing where qseqs are dumped. I'd rather not mess with OLB directories and processes to keep this forward compatible as Illumina practices and software changes. Hopefully this works for what you wanted to do. Let me know if you run into any problems. |
Hi,
We have a different setting here where the drive with the bcl files is mounted to the analysis machine and we would do everything there. Do you recommend we keep the messaging system in the pipeline? Just want to get some advices.
Thanks,
Paul
The text was updated successfully, but these errors were encountered: