Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error: Running hmmsearch fail #170

Open
StickHu opened this issue Aug 4, 2024 · 2 comments
Open

Error: Running hmmsearch fail #170

StickHu opened this issue Aug 4, 2024 · 2 comments

Comments

@StickHu
Copy link

StickHu commented Aug 4, 2024

Hi, great job of binning.

Recently when I use Semibin2 to do the scaffolds binnning, a problem occurred in some samples. The first line is my command. Could you please help me to solve that? Thanks

`(semibin) [huwa@localhost: SRR21518485_spades_raw] SemiBin2 single_easy_bin --environment human_oral -i scaffolds.fasta -b SRR21518485_vamb/bam_path/SRR21518485_28131_sorted_mapped.bam -o SRR21518485_semibin_bam
2024-08-04 15:16:55 localhost.localdomain SemiBin[2188044] INFO Setting number of CPUs to 104
2024-08-04 15:16:55 localhost.localdomain SemiBin[2188044] INFO Binning for short_read
2024-08-04 15:16:58 localhost.localdomain SemiBin[2188044] INFO Did not detect GPU, using CPU.
2024-08-04 15:16:58 localhost.localdomain SemiBin[2188044] INFO Generating training data...
2024-08-04 15:16:59 localhost.localdomain SemiBin[2188044] INFO Calculating coverage for every sample.
2024-08-04 15:17:02 localhost.localdomain SemiBin[2188044] INFO Processed: SRR21518485_vamb/bam_path/SRR21518485_28131_sorted_mapped.bam
2024-08-04 15:17:02 localhost.localdomain SemiBin[2188044] INFO Start binning.
2024-08-04 15:17:03 localhost.localdomain SemiBin[2188044] INFO Number of bins prior to reclustering: 0
2024-08-04 15:17:04 localhost.localdomain SemiBin[2188044] INFO Running naive ORF finder

Error: Sequence file /tmp/tmpydq2w4yn/orfs.faa is empty or misformatted

Error: Running hmmsearch fail`

@luispedro
Copy link
Member

SemiBin should not have generated an error here (or at least a more informative error), but it found 0 bins in this sample (see the message Number of bins prior to reclustering: 0)

Either something is wrong with the files or could this have been a very shallow/empty sample?

@yazhinia
Copy link

I am having the same issue and also it says it can't find GPU despite I explicitly running job on gpu node.

024-08-13 21:51:46 agq013 SemiBin[86060] INFO Setting number of CPUs to 32
2024-08-13 21:51:46 agq013 SemiBin[86060] INFO Binning for short_read
2024-08-13 21:51:46 agq013 SemiBin[86060] INFO SemiBin will run in self supervised mode
2024-08-13 21:52:01 agq013 SemiBin[86060] INFO Did not detect GPU, using CPU.
2024-08-13 21:52:07 agq013 SemiBin[86060] INFO Generating training data...
2024-08-13 21:58:16 agq013 SemiBin[86060] INFO Calculating coverage for every sample.
2024-08-13 22:10:13 agq013 SemiBin[86060] INFO Processed: /bamfiles/0_sorted.bam
2024-08-13 22:10:21 agq013 SemiBin[86060] INFO Processed: /bamfiles/1_sorted.bam
2024-08-13 22:10:21 agq013 SemiBin[86060] INFO Processed: /bamfiles/2_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/3_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/4_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/5_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/6_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/7_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/8_sorted.bam
2024-08-13 22:10:28 agq013 SemiBin[86060] INFO Processed: /bamfiles/9_sorted.bam
2024-08-13 22:10:55 agq013 SemiBin[86060] INFO Start training from a single sample.
2024-08-13 22:11:10 agq013 SemiBin[86060] INFO Training model...
100%|██████████| 15/15 [1:11:22<00:00, 285.51s/it]
2024-08-13 23:22:48 agq013 SemiBin[86060] INFO Training finished.
2024-08-13 23:22:48 agq013 SemiBin[86060] INFO Start binning.
2024-08-13 23:24:52 agq013 SemiBin[86060] INFO Number of bins prior to reclustering: 0
2024-08-13 23:24:53 agq013 SemiBin[86060] INFO Running naive ORF finder

Error: Sequence file /semibin_tmp/tmpoh16qf07/orfs.faa is empty or misformatted

Error: Running hmmsearch fail
Command exited with non-zero status 1

Any help is much appreciated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants