-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[fetch_refseq:40] Error, cannot retrieve reference chr1:0-0. #3
Comments
Hi ttriche, Did you find any way to fix this error? |
Hi rahbz, are you seeing this? Could you elaborate on how to reproduce? I tried to reproduce this on my side but in vain. I must be missing something. |
Hi,
But I think I have figured out the error. If I change the reference genome modifying the header of fasta to just the Chromosome name instead of all the information I seemed to get no error. I dont know why that is the case. Thanks, |
it disappeared once I updated biscuit and sorted the BAMs (verified sort --t On Wed, May 25, 2016 at 6:12 AM, Wanding Zhou notifications@github.com
|
re: headers: I just used hg19 from UCSC, so that reason seems unlikely --t On Wed, May 25, 2016 at 6:33 AM, Rahul Bharadwaj notifications@github.com
|
Yeah even I thought the same, I have the same headers from Arabidopsis, tair10. Dont know really why that was the case. |
Hello, I have the exact same problem even after cloning the latest version of biscuit and sorting/indexing my bam files. Did someone figure out what would be the issue ? Thanks |
Hello jleluyer, |
Hello, They are just with scaffold names: i.e >sp_scaff00001. Is the way there are called a problem ? |
I got similar error here "[fetch_refcache:94] Error, cannot retrieve reference" Modification of fasta header to only keep the contig name did solve the problem |
Hello, I'm getting a similar error: Error: The reference fasta I'm using only contains the GRCh38 assembled chromosome 6. I changed the header from ">CM000668.2 Homo sapiens chromosome 6, GRCh38 reference primary assembly" to just ">chr6" and it still does not resolve the problem. EDIT: I've figured out the issue. I didn't index the reference fasta again after changing the header. |
Hi all: I'm having the same error. Error: The resulting output vcf has correct headers but no data. Following the advice above in the thread I renamed all the chromosomes to have simpler names: the reference fasta now has the following chromosomes:
I indexed the fasta and aligned my reads again and pileup still will not work. Any suggestions would be greatly appreciated. |
I have run into this issue quite a bit, oddly, it doesn't seem to be deterministic (sometimes it works, sometimes not). Say I have the following files in my working directory:
if I run
However, if i run Based on this I'm guessing that the built-in faidx generation in |
I don't recall biscuit generates faidx in any of its function. Yeah use samtools. |
Perhaps we should document that a faidx is necessary to run the command, even though it's not specified in the command... |
Thank you @njspix ! I hope this modification can be documented. |
After marking dupes, the following
biscuit pileup -q 16 -r hg19.fa -i WGBS_$SAMPLE.mkdups.hg19.bam -o WGBS_$SAMPLE.mkdups.hg19.vcf
fails on a number of samples (4/6, not all though). Is there a straightforward way to debug this? Or just to grep out the offending read[s]? The runs that fail are all from the same flow cell IIRC. I don't see this behavior elsewhere and can't seem to find anything on Google. Eventually I'll look into the source... :-/
The text was updated successfully, but these errors were encountered: