CTranslate2 1.7.0

guillaumekln released this 28 Feb 13:48

New features

  • Add translation option return_alternatives, which returns multiple choices at the first unconstrained decoding position. Combined with a target prefix, it can be used to provide alternative words and translations at a specific location in the target
  • Support Transformer models with different numbers of encoder and decoder layers
  • Allow compilation without OpenMP with -DOPENMP_RUNTIME=NONE
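
To make the idea behind return_alternatives concrete, here is a minimal, library-free sketch of "expand the first unconstrained position, then finish each choice". Everything in it is an assumption made for illustration: the toy score table, the `END` marker, and the helper names `next_token_scores`, `greedy_complete`, and `alternatives_after_prefix` are hypothetical and are not part of CTranslate2. Each alternative is completed greedily here, which may differ from what the real decoder does.

```python
from typing import Dict, List

END = "</s>"  # hypothetical end-of-sentence marker

# Hypothetical toy "model": next-token scores keyed by the tokens decoded so far.
TOY_TABLE: Dict[tuple, Dict[str, float]] = {
    ("Hello",): {"world": 0.6, "there": 0.3, "everyone": 0.1},
    ("Hello", "world"): {"!": 0.8, END: 0.2},
    ("Hello", "there"): {"!": 0.7, END: 0.3},
}


def next_token_scores(tokens: List[str]) -> Dict[str, float]:
    """Return next-token scores; unknown contexts simply end the sentence."""
    return TOY_TABLE.get(tuple(tokens), {END: 1.0})


def greedy_complete(tokens: List[str], max_steps: int = 10) -> List[str]:
    """Extend `tokens` by repeatedly taking the best-scoring next token."""
    tokens = list(tokens)
    for _ in range(max_steps):
        scores = next_token_scores(tokens)
        best = max(scores, key=scores.get)
        if best == END:
            break
        tokens.append(best)
    return tokens


def alternatives_after_prefix(target_prefix: List[str], num_alternatives: int) -> List[List[str]]:
    """Keep the top choices at the first position after the prefix,
    then complete each one independently."""
    scores = next_token_scores(target_prefix)
    top = sorted(scores, key=scores.get, reverse=True)[:num_alternatives]
    results = []
    for token in top:
        if token == END:
            # Ending right after the prefix is itself one alternative.
            results.append(list(target_prefix))
        else:
            results.append(greedy_complete(list(target_prefix) + [token]))
    return results


if __name__ == "__main__":
    for hypothesis in alternatives_after_prefix(["Hello"], num_alternatives=3):
        print(" ".join(hypothesis))
```

The target prefix pins down everything before the position of interest, so all returned hypotheses share that prefix and differ from the next token onward.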

Fixes and improvements

  • Fix SavedModel conversion when TensorFlow Addons 0.8 is installed
  • Fix error when releasing a translator/model that is placed on a GPU that is not GPU 0
  • Fix memory being allocated on GPU 0 even when the translator/model was placed on another GPU
  • Query GPU int8 support on the first model load, and then cache the result for future loads
  • Avoid creating an empty model directory on conversion errors
  • Parallelize some low-level transformations on CPU
  • Reduce memory usage when translating large files by limiting the work queue size
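
The last item relies on a general technique: when the reader is faster than the workers, an unbounded queue ends up holding most of the file in memory, while a bounded queue makes the reader wait. The sketch below illustrates that technique with Python's standard library only. It is not CTranslate2's implementation, and `process_stream`, `work`, and `max_queue_size` are hypothetical names chosen for this example.

```python
import queue
import threading
from typing import Callable, Iterable, List

_STOP = object()  # sentinel telling the worker that no more input will arrive


def process_stream(
    lines: Iterable[str],
    work: Callable[[str], str],
    max_queue_size: int = 4,
) -> List[str]:
    """Feed `lines` to a single worker thread through a bounded queue.

    At most `max_queue_size` pending items are held at any time, so the
    reader cannot run arbitrarily far ahead of the worker.
    """
    jobs: queue.Queue = queue.Queue(maxsize=max_queue_size)
    results: List[str] = []

    def worker() -> None:
        while True:
            item = jobs.get()
            if item is _STOP:
                break
            results.append(work(item))

    thread = threading.Thread(target=worker)
    thread.start()
    for line in lines:
        jobs.put(line)  # blocks while the queue is full
    jobs.put(_STOP)
    thread.join()
    return results


if __name__ == "__main__":
    # A generator stands in for a large file that is read lazily.
    fake_file = (f"line {i}" for i in range(10))
    print(process_stream(fake_file, str.upper, max_queue_size=2))
```

A single worker keeps the output in input order; with several workers the results would need to be reordered explicitly.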