Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data that used to work, now killed. #62

Open
jwasmuth opened this issue Mar 25, 2021 · 3 comments
Open

Data that used to work, now killed. #62

jwasmuth opened this issue Mar 25, 2021 · 3 comments

Comments

@jwasmuth
Copy link

Hi,
A few years ago I used idba successfully, now on a new server and every meaningful dataset I have (pe len=100bp) ends with the 'killed' statement. Troubleshooting this is difficult. Would someone mind sharing a dataset that they know works with idba v 1.1.3?
Many thanks
James

@jwasmuth jwasmuth changed the title In need for example data In need of example data Mar 25, 2021
@jwasmuth jwasmuth changed the title In need of example data Data that used to work, now killed. Mar 25, 2021
@jwasmuth
Copy link
Author

I found the data I used a couple of years ago. Sadly, it doesn't work with 1.1.3 on the new server. No idea why. 1m PE reads of 100 bp. Job killed while being read. Run on a 3TB RAM machine. I tracked the memory usage before the job is killed and it doesn't get close to the max. I welcome any ideas anyone may have.

@th-of
Copy link

th-of commented Mar 25, 2021

I found the data I used a couple of years ago. Sadly, it doesn't work with 1.1.3 on the new server. No idea why. 1m PE reads of 100 bp. Job killed while being read. Run on a 3TB RAM machine. I tracked the memory usage before the job is killed and it doesn't get close to the max. I welcome any ideas anyone may have.

Are you recompiling the source code on the system you are currently on? Or running an old binary? Remember that idba only supports an interleaved fasta file. Another explanation is a corrupted dataset, use a text editor like vim (or head and tail in bash) to see that sequences are matched and contain headers.

No problems on my side.

@jwasmuth
Copy link
Author

I have tried recompiling and, separately, installing from conda. Both give the same result. Extensive running with different number of input sequences shows that the larger the input file the quicker it gets killed. Though I don't know why.

I did get it working on the old server last night, so I know that the sequence file isn't corrupted. It may be something with the configuration of the new server, which I don't have any privileges for.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants