CTranslate2 3.16.1

guillaumekln released this 03 Jul 19:02

Fixes and improvements

  • Fix repeated outputs in version 3.16.0 when using include_prompt_in_result=False with a batch input of variable lengths: a typo in the code caused min_length to be applied incorrectly
  • Update the Transformers converter to accept extra tokens for Falcon models
  • Release the Python GIL when loading the model
  • Initialize the rotary embeddings on the GPU instead of the CPU
  • Avoid a copy for the input features passed to the Whisper methods
  • Vectorize copy in the Tile CUDA operator
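
For readers unfamiliar with the option named in the first fix, the sketch below illustrates what include_prompt_in_result=False means for a batch whose prompts have different lengths: each result keeps only the tokens generated after that entry's own prompt. This is a conceptual illustration in plain Python, not CTranslate2's implementation; the helper name strip_prompts and the sample tokens are hypothetical.

```python
from typing import List, Sequence


def strip_prompts(
    sequences: Sequence[Sequence[str]],
    prompts: Sequence[Sequence[str]],
) -> List[List[str]]:
    """Drop each entry's own prompt from its full output sequence.

    Hypothetical helper: it only mimics the visible effect of
    include_prompt_in_result=False and says nothing about how the
    library handles min_length internally.
    """
    results = []
    for sequence, prompt in zip(sequences, prompts):
        prompt_length = len(prompt)
        # Each batch entry has its own prompt length, so the cut point
        # must be computed per entry rather than once for the batch.
        if list(sequence[:prompt_length]) != list(prompt):
            raise ValueError("sequence does not start with its prompt")
        results.append(list(sequence[prompt_length:]))
    return results


# Made-up tokens: two prompts of different lengths in one batch.
prompts = [["Hello", "world"], ["Hi"]]
full_outputs = [["Hello", "world", "!", "How"], ["Hi", "there"]]
generated_only = strip_prompts(full_outputs, prompts)
```

With an actual model, the corresponding call would be along the lines of generator.generate_batch(prompts, include_prompt_in_result=False) on a ctranslate2.Generator, which returns only the generated part directly.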