Skip to content

CTranslate2 1.20.0

Compare
Choose a tag to compare
@guillaumekln guillaumekln released this 20 Apr 13:19
· 943 commits to master since this release

Changes

  • Drop Python 3.5 support
  • Docker image tags suffixed with -gpu are no longer updated to prefer tags with an explicit CUDA version

Fixes and improvements

  • Fix int8 quantization for rows that only contain zeros
  • Fix type error when running the CUDA code path of the Multinomial operator
  • Add EOS score to the greedy search final score for consistency with the beam search output
  • Use third party library google/cpu_features to resolve CPU features at runtime
  • Small optimizations when manipulating tensor shapes and indices
  • Internal refactoring of Transformer layers