cannot create std::vector larger than max_size() #14

cnk113 · 2020-03-31T00:50:11Z

Hello,

I attempted to run clustering of reads on my fasta reads, but it terminates after a few minutes of starting.

terminate called after throwing an instance of 'std::length_error'
  what():  cannot create std::vector larger than max_size()
Aborted (core dumped)```

The text was updated successfully, but these errors were encountered:

ante-turudic · 2020-11-17T16:53:46Z

I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size.

dedenmatra · 2021-10-13T09:00:03Z

I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size.

How to produce "clean" input?

ante-turudic · 2021-10-13T09:08:27Z

I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size.

How to produce "clean" input?

Just remove sequences shorter than kmer size from input file. Lot of tools can do it. I am not sure, but I think used BBtools's reformat.sh script.

dedenmatra · 2021-10-13T09:26:46Z

I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size.

How to produce "clean" input?

Just remove sequences shorter than kmer size from input file. Lot of tools can do it. I am not sure, but I think used BBtools's reformat.sh script.

I just tried to remove sequences below 200 bp but any criteria for filtering? for example, trimming polyA?

ante-turudic · 2021-10-13T09:57:27Z

I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size.

How to produce "clean" input?

Just remove sequences shorter than kmer size from input file. Lot of tools can do it. I am not sure, but I think used BBtools's reformat.sh script.

I just tried to remove sequences below 200 bp but any criteria for filtering? for example, trimming polyA?

From my experience no.

EduEyras · 2021-10-13T10:13:22Z

Thanks for the questions and inputs We’ll add some info in the README to help with possible issues with the input Keeping polyA’s should be actually better for clustering and transcript reconstruction One thing we noticed though is internal adapters in ont cDNA sequencing that will lead to overclustering and need to removed, and the reads must be split E

On Wed, 13 Oct 2021 at 20:57, Ante Turudic ***@***.***> wrote: I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size. How to produce "clean" input? Just remove sequences shorter than kmer size from input file. Lot of tools can do it. I am not sure, but I think used BBtools's reformat.sh script. I just tried to remove sequences below 200 bp but any criteria for filtering? for example, trimming polyA? From my experience no. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#14 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADCZKBYQ7LERGUSK3YVDS3LUGVJZFANCNFSM4LXD6H6A> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

--

dedenmatra · 2021-10-13T10:22:19Z

Thanks for the questions and inputs We’ll add some info in the README to help with possible issues with the input Keeping polyA’s should be actually better for clustering and transcript reconstruction One thing we noticed though is internal adapters in ont cDNA sequencing that will lead to overclustering and need to removed, and the reads must be split E
On Wed, 13 Oct 2021 at 20:57, Ante Turudic @.***> wrote: I had similar issue, and solution was to 'clean' input file by removing sequences shorter than kmer size. How to produce "clean" input? Just remove sequences shorter than kmer size from input file. Lot of tools can do it. I am not sure, but I think used BBtools's reformat.sh script. I just tried to remove sequences below 200 bp but any criteria for filtering? for example, trimming polyA? From my experience no. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#14 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADCZKBYQ7LERGUSK3YVDS3LUGVJZFANCNFSM4LXD6H6A . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

well, thank for suggestions. I will keep PolyA

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cannot create std::vector larger than max_size() #14

cannot create std::vector larger than max_size() #14

cnk113 commented Mar 31, 2020

ante-turudic commented Nov 17, 2020

dedenmatra commented Oct 13, 2021

ante-turudic commented Oct 13, 2021

dedenmatra commented Oct 13, 2021

ante-turudic commented Oct 13, 2021

EduEyras commented Oct 13, 2021 via email

dedenmatra commented Oct 13, 2021

cannot create std::vector larger than max_size() #14

cannot create std::vector larger than max_size() #14

Comments

cnk113 commented Mar 31, 2020

ante-turudic commented Nov 17, 2020

dedenmatra commented Oct 13, 2021

ante-turudic commented Oct 13, 2021

dedenmatra commented Oct 13, 2021

ante-turudic commented Oct 13, 2021

EduEyras commented Oct 13, 2021 via email

dedenmatra commented Oct 13, 2021