-
Notifications
You must be signed in to change notification settings - Fork 243
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fasterq-dump err #214
Comments
I just had this error today as well. |
@groverj3, was it on SRR316212 or a different accession? Did you prefetch the file or used it directly? Is it the same error every time you run? If the file is on your disk, try to run vdb-validate on it. |
Actually, at least two different accessions. I was getting all the runs from a couple different bioprojects and got errors with both fastq-dump and fasterq-dump. There was also an error with prefetch, but obviously a different one. I, unfortunately, don't have the exact SRR# or those other error messages handy at the moment. But if I get a spare moment I can do some digging. I was able to get them directly as .fastq.gz files from the ENA instead. Which was super slow. |
Error happened again with SRR6294776. I let it sit there until it exited, and no file was downloaded. I tried it again, by itself (other files downloaded without issue) and the exact error was:
Before I killed it. Trying prefetch now. |
prefetch also fails with:
|
A few more potentially helpful details: Navigating to https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR6294776 results in waiting a very long time, but it does eventually come up. Once there, downloading using wget with the https link https://sra-download.ncbi.nlm.nih.gov/sos/sra-pub-run-1/SRR6294776/SRR6294776.1 takes a long time to respond and is very slow, but it does eventually start. Meanwhile, navigating to https://www.ebi.ac.uk/ena/data/view/SRR6294776 works fine. Also, downloading from ENA'S ftp link to the SRA file is fast. Though, their direct .fastq.gz file link is slow. fasterq-dump works just fine on the manually downloaded SRA file. |
Thank you for the reporting, Jeffrey.
There have been a couple network switch updates today as well as some back-end storage updates... there are a lot of moving parts for us to investigate right now.
We are usually able to isolate such changes from public access effects, but apparently not today. We are trying to track down these issues as we head into the weekend.
Again, we appreciate such reporting as our own monitoring systems are not as sensitive to such issues.
…-Chris
From: "Jeffrey Grover" <notifications@github.com<mailto:notifications@github.com>>
Date: Friday, September 6, 2019 at 6:15:49 PM
To: "ncbi/sra-tools" <sra-tools@noreply.github.com<mailto:sra-tools@noreply.github.com>>
Cc: "Subscribed" <subscribed@noreply.github.com<mailto:subscribed@noreply.github.com>>
Subject: Re: [ncbi/sra-tools] fasterq-dump err (#214)
A few more potentially helpful details:
Navigating to https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR6294776 results in waiting a very long time, but it does eventually come up. Once there, downloading using wget with the https link https://sra-download.ncbi.nlm.nih.gov/sos/sra-pub-run-1/SRR6294776/SRR6294776.1 takes a long time to respond and is very slow, but it does eventually start.
Meanwhile, navigating to https://www.ebi.ac.uk/ena/data/view/SRR6294776 works fine. Also, downloading from ENA'S ftp link to the SRA file is fast. Though, their direct .fastq.gz file link is slow.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub<#214?email_source=notifications&email_token=AC66ETK5PSHDMTCSY4LGAI3QILJAHA5CNFSM4IKZU5G2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD6EGG2I#issuecomment-529032041>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AC66ETPOKSWEHF66763TU7DQILJAHANCNFSM4IKZU5GQ>.
|
Hello,
Version Error message after downloading 12 files that add up to 48GB in the TEMP directory:
|
Hi, Same error for me for sratoolkit.2.10.4-ubuntu64/bin/fasterq-dump SRR10572657 -e 32 -o data/sra -t data/tmp 2020-03-09T22:11:44 fasterq-dump.2.10.4 err: cmn_iter.c cmn_read_String( #48847424 ).VCursorCellDataDirect() -> RC(rcPS,rcCondition,rcWaiting,rcTimeout,rcExhausted) ---- 34.08%2020-03-09T22:12:59 fasterq-dump.2.10.4 err: cmn_iter.c cmn_read_uint8_array( #18261357 ).VCursorCellDataDirect() -> RC(rcPS,rcCondition,rcWaiting,rcTimeout,rcExhausted)
2020-03-09T22:12:59 fasterq-dump.2.10.4 err: row #18261357 : READ.len(300) != QUALITY.len(0) (F)
2020-03-09T22:12:59 fasterq-dump.2.10.4 fatal: SIGNAL - Segmentation fault |
My problem only occurred on very large files and here is how I solved it: Instead of using fastq-dump or fasterq-dump directly to download a sample which write the files as paired fastq files, I downloaded the sample in a non-readable .sra format as a single file which takes less space so download is faster, and possible to resume if it stops, and then converted that file to a pair of fastq files. Here is an example code: module load sratoolkit Above creates a folder and an sra file, I extract fastq files from it: fastq-dump SRR8371669/SRR8371669.sra --split-3 You need to define the --max-size because normally it is set to 20GB for some reason and it wouldn't work. |
Please try our latest 2.10.6 toolkit. |
I have a similar problem with sratoolkit-2.10.7 centos linux x64 for phs000260:
Will try to prefetch the failing reads |
Same issue here fasterq-dump.2.10.7 fasterq-dump --split-files SRR11848434 [fails] 2020-07-16T09:29:36 fasterq-dump.2.10.7 err: cmn_iter.c cmn_read_uint8_array( #4308993 ).VCursorCellDataDirect() -> RC(rcPS,rcCondition,rcWaiting,rcTimeout,rcExhausted) prefetch --max-size 100000000 SRR11848434 [works] |
Most people don't know in advance the size of what they will be downloading. Many people have high bandwidth connections, but this varies by installation and geography. |
Sorry to give a similar answer, but we'd like to ask you to try once again with 2.10.8. Most of the issues we've been chasing for some time have nothing to do with the SRA Toolkit itself, but are attempts to overcome issues with some backend networking infrastructure. 2.10.8 addresses an issue with the SRA Toolkit, where on some systems the stack sizes were too small for all runs (this is data dependent) and had been built with an inadequate stack guard. The latter problem led to cases where we could exhaust a thread's stack without detection, resulting in some data corruptions on output (e.g. NB - I have no proof that the error above was due to this issue, but it would be consistent with errors we've observed. |
Please try our new 2.10.9 release. Then do the following: First run Then run |
Hi,
there is the problem about fasterq-dump. It worked properly when running in other SRA file.
here is my error information.
The text was updated successfully, but these errors were encountered: