Version 2.31.0

@bhushan23 released this 20 May 00:27
  • New Features

    • ONNX

      • Support Qwen3-VL in AdaScale ONNX (35d2440)
    • Torch

      • Add Gemma 3 support for AdaScale (a2da0de)
      • LoRA integration (0b90d8a)
  • Removed Features

  • Bug fixes and Improvements

    • ONNX

      • Fuse supergroups to ONNX function nodes in QuantSim init (441ac6d)
      • Enable ONNX initializer deduplication pass in torch>=2.12 (21dc8e0)
      • Detect post-writing norm incompatibility in ONNX SpinQuant (85bdbdb)
      • Remove incorrect entries from grid-preserving ops list (4007d7f)
      • Set self.session = None before rebuilding the session to avoid double memory allocation (18664a4)
      • Give fused supergroup nodes intuitive naming (a775b6e)
    • Torch

      • Raise ValueError for unsupported architectures in PyTorch SpinQuant (a962614)
  • Documentation

    • Add zero_point_shift to 1.0.0 encoding spec documentation (094dead)
    • Add float8/float4 encoding to 2.0.0 spec documentation (02e75aa)
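
The ONNX fix that sets self.session = None before a session rebuild (18664a4) relies on a general Python pattern: dropping the old reference first lets the old object be freed before the new one is allocated, so the two never coexist. The sketch below illustrates that pattern only. `FakeSession` and `SimWrapper` are hypothetical stand-ins and not part of the library's API, and the immediate release shown here assumes CPython's reference counting.

```python
class FakeSession:
    """Hypothetical stand-in for a heavyweight inference session."""

    live = 0  # number of sessions currently alive
    peak = 0  # highest number of sessions alive at the same time

    def __init__(self, size: int) -> None:
        FakeSession.live += 1
        FakeSession.peak = max(FakeSession.peak, FakeSession.live)
        self.buffer = bytearray(size)  # simulates the session's memory

    def __del__(self) -> None:
        FakeSession.live -= 1


class SimWrapper:
    """Hypothetical owner of a session that can be rebuilt."""

    def __init__(self, size: int) -> None:
        self.size = size
        self.session = FakeSession(size)

    def rebuild_naive(self) -> None:
        # The new session is constructed while the old one is still
        # referenced, so both are alive for a moment.
        self.session = FakeSession(self.size)

    def rebuild_release_first(self) -> None:
        # Drop the old reference first so the old session can be freed
        # before the new one is allocated.
        self.session = None
        self.session = FakeSession(self.size)
```

Calling `rebuild_release_first()` keeps at most one session alive at a time, while `rebuild_naive()` briefly holds two.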