Skip to content

checkm analyze error #115

@snayfach

Description

@snayfach

Hi Donovan,

I'm getting an error when running CheckM v1.0.7.

I've split up my genomes into batches of 250. Each batch is run on a computing cluster with multiple threads. Most batches complete without error. However, for some get an error at the step checkm analyze. All previous steps complete without error. Here's the error:

Finished processing 5 of 250 (2.00%) bins.^M  [Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on yo
ur system path.
  [Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on your system path.
Process Process-47:
Traceback (most recent call last):
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
    self.run()
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 114, in run
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
    self.run()
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 114, in run
    self._target(*self._args, **self._kwargs)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 199, in __c
reateMSA
    self.__alignMarker(markerId, markersWithMultipleHits[markerId], None, False, binAlignOutputDir, tempModelFile, bKeepUnmaskedAlign=False)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 296, in __alignMarker
    self.__maskAlignment(alignSeqFile, makedSeqFile)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 355, in __maskAlignment
    maskedSeq = ''.join([seq[i] for i in xrange(0, len(seq)) if mask[i] == 'x'])
UnboundLocalError: local variable 'mask' referenced before assignment
  [Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on your system path.
  [Error] Make sure HMMER executables (e.g., hmmsearch, hmmfetch) are on your system path.
Process Process-49:
Traceback (most recent call last):
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
    self.run()
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 114, in run
    self._target(*self._args, **self._kwargs)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 199, in __createMSA
    self.__alignMarker(markerId, markersWithMultipleHits[markerId], None, False, binAlignOutputDir, tempModelFile, bKeepUnmaskedAlign=False)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 296, in __alignMarker
    self.__maskAlignment(alignSeqFile, makedSeqFile)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 331, in __maskAlignment
    for line in open(inputFile):
IOError: [Errno 2] No such file or directory: '/global/projectb/scratch/snayfach/projects/gut_mags/4_checkm/checkm_out/mags/217/checkm/storage/aai_qa/RheumatoidArthritis_D188.012/PF02367.12.aligned.faa'

Unexpected error: <type 'exceptions.AttributeError'>
Traceback (most recent call last):
  File "/global/homes/s/snayfach/.conda/envs/checkm/bin/checkm", line 712, in <module>
    checkmParser.parseOptions(args)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/main.py", line 1233, in parseOptions
    self.analyze(options)
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/main.py", line 331, in analyze
    os.path.join(options.out_folder, 'storage', 'aai_qa')
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/site-packages/checkm/hmmerAligner.py", line 175, in makeAlignmentsOfMultipleHits
    p.terminate()
  File "/global/homes/s/snayfach/.conda/envs/checkm/lib/python2.7/multiprocessing/process.py", line 137, in terminate
    self._popen.terminate()
AttributeError: 'NoneType' object has no attribute 'terminate'

I've run one of these failed batches of 250 genomes interactively and I get the same error.

I took the same failed batch, and ran each genome individually through CheckM (250 CheckM runs), each with a single thread, and did not get any error.

Any help/ideas would be appreciated.

Thanks,
Stephen

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions