Skip to content

CTranslate2 3.16.0

Compare
Choose a tag to compare
@guillaumekln guillaumekln released this 15 Jun 15:01
· 162 commits to master since this release

New features

  • Update the Transformers converter to support more architectures:
    • Falcon-40B
    • XLM-RoBERTa
  • Add the generation option sampling_topp to enable top-p (nucleus) sampling
  • Save vocabulary files in the JSON format to better support tokens containing newlines or carriage returns

Fixes and improvements

  • Fix the application of min_length and max_length when using include_prompt_in_result=False and a batch input with variable lengths: the length constraint should only apply to the sequence after the prompt
  • Update oneDNN to 3.1.1