Skip to content

v0.7.1

Latest
Compare
Choose a tag to compare
@aciddelgado aciddelgado released this 22 Apr 02:20
efab081

Release Notes

  • Add AMD Quark Quantizer Support #1207
  • Added Gemma 3 to model builder #1359
  • Updated Phi-3 Python Q&A example to be consistent with C++ example #1392
  • Updated Microsoft.Extensions.AI.Abstractions to 9.4.0-preview.1.25207.5 #1388
  • Added OnnxRuntimeGenAIChatClient constructor with Config #1364
  • Improve and Fix TopKTopP #1363
  • Switch the order of softmax on CPU Top K #1354
  • Updated custom nuget packaging logic #1377
  • Updated pybind and fix rpath for macos and check for nullptr #1367
  • Convert tokens to list for concat to accommodate breaking API change in tokenizer #1358