I'm getting an error when running CheckM v1.0.7.
I've split up my genomes into batches of 250. Each batch is run on a computing cluster with multiple threads. Most batches complete without error. However, for some get an error at the step checkm analyze. All previous steps complete without error. Here's the error:
Finished processing 5 of 250 (2.00%) bins.^M [Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on yo
ur system path.
[Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on your system path.
Process Process-47:
Traceback (most recent call last):
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 114, in run
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 199, in __c
reateMSA
self.__alignMarker(markerId, markersWithMultipleHits[markerId], None, False, binAlignOutputDir, tempModelFile, bKeepUnmaskedAlign=False)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 296, in __alignMarker
self.__maskAlignment(alignSeqFile, makedSeqFile)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 355, in __maskAlignment
maskedSeq = ''.join([seq[i] for i in xrange(0, len(seq)) if mask[i] == 'x'])
UnboundLocalError: local variable 'mask' referenced before assignment
[Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on your system path.
[Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on your system path.
Process Process-49:
Traceback (most recent call last):
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 199, in __createMSA
self.__alignMarker(markerId, markersWithMultipleHits[markerId], None, False, binAlignOutputDir, tempModelFile, bKeepUnmaskedAlign=False)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 296, in __alignMarker
self.__maskAlignment(alignSeqFile, makedSeqFile)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 331, in __maskAlignment
for line in open(inputFile):
IOError: [Errno 2] No such file or directory: '/global/projectb/scratch/snayfach/projects/gut_mags/4_checkm/checkm_out/mags/217/checkm/storage/aai_qa/RheumatoidArthritis_D188.012/PF02367.12.aligned.faa'
Unexpected error: <type 'exceptions.AttributeError'>
Traceback (most recent call last):
File "/global/homes/s/snayfach/.conda/envs/checkm/bin/checkm", line 712, in <module>
checkmParser.parseOptions(args)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/main.py", line 1233, in parseOptions
self.analyze(options)
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/main.py", line 331, in analyze
os.path.join(options.out_folder, 'storage', 'aai_qa')
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 175, in makeAlignmentsOfMultipleHits
p.terminate()
File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 137, in terminate
self._popen.terminate()
AttributeError: 'NoneType' object has no attribute 'terminate'
I've run one of these failed batches of 250 genomes interactively and I get the same error.
I took the same failed batch, and ran each genome individually through CheckM (250 CheckM runs), each with a single thread, and did not get any error.
Any help/ideas would be appreciated.
Hi Donovan,
I'm getting an error when running CheckM v1.0.7.
I've split up my genomes into batches of 250. Each batch is run on a computing cluster with multiple threads. Most batches complete without error. However, for some get an error at the step
checkm analyze. All previous steps complete without error. Here's the error:I've run one of these failed batches of 250 genomes interactively and I get the same error.
I took the same failed batch, and ran each genome individually through CheckM (250 CheckM runs), each with a single thread, and did not get any error.
Any help/ideas would be appreciated.
Thanks,
Stephen