-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is there any protocol for whole genome sequencing data? #1
Comments
Not here, since the files would be large. But I think there's an example in bcbio-nextgen to run it on the 1000 Genomes NA12878 data. |
If I want to manipulate the data in na12878, which manual should I follow? I only found bed file in that folder. $make
......
Traceback (most recent call last):
File "/usr/local/bin/cnvkit.py", line 4, in <module>
__import__('pkg_resources').run_script('CNVkit==0.7.5.dev0', 'cnvkit.py')
File "/usr/local/lib/python2.7/dist-packages/pkg_resources/__init__.py", line 745, in run_script
self.require(requires)[0].run_script(script_name, ns)
File "/usr/local/lib/python2.7/dist-packages/pkg_resources/__init__.py", line 1677, in run_script
exec(script_code, namespace, namespace)
File "/usr/local/lib/python2.7/dist-packages/CNVkit-0.7.5.dev0-py2.7.egg/EGG-INFO/scripts/cnvkit.py", line 11, in <module>
File "build/bdist.linux-x86_64/egg/cnvlib/commands.py", line 576, in _cmd_fix
File "build/bdist.linux-x86_64/egg/cnvlib/commands.py", line 589, in do_fix
File "build/bdist.linux-x86_64/egg/cnvlib/fix.py", line 25, in load_adjust_coverages
File "build/bdist.linux-x86_64/egg/cnvlib/fix.py", line 89, in match_ref_to_probes
ValueError: Reference is missing 19402 bins found in CL_seq
make: *** [build/CL_seq_flat.cnr] Error 1 |
The files in the na12878 in this repo are work in progress, and it's going to be an example for whole exome sequencing, not whole genome. But WGS analysis is fairly straightforward, and you don't need any special data to do the analysis from scratch:
|
Sure, @charliecurnin, you can create a reference .cnn from a pool of normal
samples.
See:
- https://cnvkit.readthedocs.io/en/stable/pipeline.html#reference
- https://cnvkit.readthedocs.io/en/stable/nonhybrid.html
Using a pooled reference is not required, but generally it will improve
your results.
|
I wonder weather there is a example to apply cnvkit in whole genome sequencing data.
The text was updated successfully, but these errors were encountered: