Skip to content

v0.9.0

Compare
Choose a tag to compare
@wenbingl wenbingl released this 21 Sep 18:28

What's Changed

  • New Python API gen_processing_models to export ONNX data processing model from Huggingface Tokenizers such as LLaMA , CLIP, XLM-Roberta, Falcon, BERT, etc.
  • New TrieTokenizer operator for RWKV-like LLM models, and other tokenizer operator enhancements.
  • New operators for Azure EP compatibility: AzureAudioToText, AzureTextToText, AzureTritonInvoker for Python and NuGet packages.
  • Processing operators have been migrated to the new Lite Custom Op API
  • New operator of string strip
  • Using the latest Ort header instead of minimum compatible headers
  • Support offset mapping in most tokenizers like BERT, CLIP, Roberta and etc.
  • Remove the deprecating std::codecvt_utf8 from code base
  • Document are uploaded to https://onnxruntime.ai/docs/extensions/

Contributions
Contributors to ONNX Runtime Extensions include members across teams at Microsoft, along with our community members: @aidanryan-msft @RandySheriffH @edgchen1 @kunal-vaishnavi @sayanshaw24 @skottmckay @snnn @VishalX @wenbingl @wejoncy