This release adds the following bug fixes:
- The file with the cgMLST MSA was missing from the output directory created by the
AlleleCallEvaluatormodule since the release of theComputeMSAmodule (thanks to @victor5lm who reported this issue in #235). This change was unintended. The file with cgMLST MSA was re-added as one of the output files from theAlleleCallEvaluatormodule (protein_msa.fastafile). Simplified the conditions used to determine which steps to run. - Made the definition of the sequence headers for the adapted schemas created by the
PrepExternalSchemaless rigid to better deal with unexpected headers. Added tests to validate schema adaptation for multiple external schema formats (e.g., EnteroBase, PubMLST, and Ridom). - Fixed the parsing of the file with loci annotations in the
LoadSchemamodule. - Fixed the creation of the paths to intermediate FASTA files used by BLASTp to determine representative alleles during schema adaptation. The paths were not properly created when the
SyncSchemamodule called thePrepExternalSchemamodule using relative paths. - Changed the index value used by the
select_highest_scoresfunction to sort and select the highest scoring BLASTp matches per target from5to6. The results were being sorted based on the length of the target sequences instead of the alignment raw score. This issue would not allow to identify the best scoring alignment in some cases, potentially leading to some sequences not being classified if a lower scoring alignment was selected. Fixing this issue results in a slight increase in the accuracy of the allele calling (thanks to @andreaderuvo for reporting this issue in #234).