Xargs: cat: terminated by signal 13 with kraken2-build --build. #156

Open
kdbchau opened this issue Sep 30, 2021 · 4 comments

Comments

kdbchau commented Sep 30, 2021

Hello,

I am using kraken2 version 2.1.2 and I am trying to build the nt database with the following command:

kraken2-build --build --db /scratch/chauk/kraken2/kraken_nt/nt --threads 48

It fails at step 2:

Creating sequence ID to taxonomy ID map (step 1)...
Found 81034768/81152031 targets, searched through 797981066 accession IDs, search complete.
lookup_accession_numbers: 117263/81152031 accession numbers remain unmapped, see unmapped.txt in DB directory
Sequence ID to taxonomy ID map complete. [1h12m58.303s]
Estimating required capacity (step 2)...
xargs: cat: terminated by signal 13

I tried again with the thread count set to 1:

Creating sequence ID to taxonomy ID map (step 1)...
Sequence ID to taxonomy ID map already present, skipping map creation.
Estimating required capacity (step 2)...
xargs: cat: terminated by signal 13

What could be the problem?
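
For anyone reading the log above: on Linux, signal 13 is SIGPIPE, which a process receives when it writes to a pipe whose reader has already exited. So `xargs: cat: terminated by signal 13` most likely means that whatever was consuming the output of `cat` in step 2 stopped first, and the `cat` message is a symptom rather than the cause. The sketch below is not taken from kraken2; it is a minimal bash illustration of the same mechanism, with `yes` and `head` standing in for the writer and the reader.

```shell
#!/usr/bin/env bash
# Look up the name of signal 13 (bash builtin `kill -l`).
sig_name=$(kill -l 13)
echo "signal 13 is SIG${sig_name}"

# Reproduce the mechanism: `head` exits after one line, so `yes`
# is still writing when the read end of the pipe closes.
yes | head -n 1 > /dev/null
writer_status=${PIPESTATUS[0]}   # exit status of `yes`, the writer

# Bash reports death by signal as 128 + signal number, so this is
# 141 when SIGPIPE has its default disposition in the calling shell.
echo "writer exit status: ${writer_status}"
```

The practical consequence is that the real error, if one was printed, belongs to the reading side of the pipe, so it is worth checking the build's stderr and the system log (for example for an out-of-memory kill) around the time step 2 stops.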

@SaraOliva

I have the same problem. Were you able to solve it?

kdbchau commented Feb 21, 2023

Nope, never solved. I just stopped bothering with kraken2. It seems too buggy and no one is helping.

mclaugsf commented Sep 7, 2023

I'm guessing you're running out of memory. I just submitted this ticket: jenniferlu717/Bracken#236

I was having real trouble with plant genomes, which are low complexity and highly enriched for repeats. I think building a database of plant genomes produces k-mers that are very frequent, which can use quite a bit of memory. I also think there's a memory leak in kraken2, so if you keep running it on larger sets of plant genomes it will use much more memory than you'd expect. I solved the issue by chopping the job up into smaller chunks, which I take as confirmation of a memory leak.
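
The comment above does not say how the chunking was done, so the following is only a rough sketch of one way to split a set of input genomes into fixed-size batches. The file names, the batch size, and the `genomes.txt` list are all hypothetical placeholders, and the per-batch processing step is left as a comment rather than run.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical inputs: a text file with one FASTA path per line,
# and the maximum number of files to handle per batch.
workdir=$(mktemp -d)
file_list="${workdir}/genomes.txt"
batch_size=2

# Stand-in list so the sketch runs on its own; replace with real paths.
printf '%s\n' a.fa b.fa c.fa d.fa e.fa > "${file_list}"

# Split the list into files of at most ${batch_size} lines each
# (batch_aa, batch_ab, ...).
split -l "${batch_size}" "${file_list}" "${workdir}/batch_"

batch_count=0
for batch in "${workdir}"/batch_*; do
    batch_count=$((batch_count + 1))
    echo "batch ${batch_count}: $(tr '\n' ' ' < "${batch}")"
    # Each batch would then be processed on its own, for example by adding
    # its files to a separate database directory with
    # `kraken2-build --add-to-library <file> --db <dir>` (not run here).
done

echo "total batches: ${batch_count}"
```

Smaller batches lower the peak memory of any single run, which helps whether the underlying cause is a leak or simply a working set that is too large for the machine.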

@marco315176

I downloaded the taxonomy directory with:

kraken2-build --download-taxonomy --db ${db}

and that solved it.
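
If a missing taxonomy download is the suspected cause, a quick check before rebuilding might look like the sketch below. It assumes the usual layout in which `--download-taxonomy` places `names.dmp` and `nodes.dmp` under a `taxonomy/` subdirectory of the database; the `check_taxonomy` helper and the default path are hypothetical and not part of kraken2.

```shell
#!/usr/bin/env bash
# Hypothetical database path; replace with your own --db value.
db="${db:-/path/to/kraken2_db}"

# Return 0 if both taxonomy dump files exist and are non-empty, 1 otherwise.
check_taxonomy() {
    local dir="$1/taxonomy" missing=0
    for f in names.dmp nodes.dmp; do
        if [ ! -s "${dir}/${f}" ]; then
            echo "missing or empty: ${dir}/${f}" >&2
            missing=1
        fi
    done
    return "${missing}"
}

if check_taxonomy "${db}"; then
    echo "taxonomy looks present in ${db}"
else
    echo "taxonomy incomplete; consider: kraken2-build --download-taxonomy --db ${db}"
fi
```

This only confirms that the files are there; it does not validate their contents or guarantee that the build will get past step 2.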
