Skip to content

Version 2.10.0

Choose a tag to compare

@aimetci aimetci released this 14 Jul 21:49

What's Changed

  • New Feature

    • Promote to_onnx_qdq to a public API (f333188). Note: This is currently a beta feature
  • Bug fixes and Improvements

    • Common
      • Added hover tooltip to plot per layer sensitivity. Changed x-axis to plot layer indices instead of names (c96894f)
    • PyTorch
      • Implement scaling factor in aimet-torch float QDQ (9b8c655)
      • Fix CustomSiLU bug (499df9f)
      • Added extra logic to isolate model outputs from connectedgraph (4ad0703)
      • Always instantiate quantizers with requires_grad=True (5aac9c5)
      • Add logic to place adascale quantizers into correct dtype (5e1e6f2)
    • ONNX
      • Allow AdaRound and SeqMSE to take uncalibrated sims(31ca7fd)
      • Modify bias quantizer setting based on weight quantizer (b47a97e)
      • Fix cnt overflow issue (70029c5)
      • Make memory saving optimization default in build_session and _infer_activation_dtypes (4b94ca9)
      • Implement two-phase AMP API (1603c17)
      • Work-around onnx version converter issue for models with external weights (22f0f23)
  • Documentation

    • Update SeqMSE feature guide (fefd504)
    • Fix links in example notebooks (fe66376)
    • Modify docs for CLE (f9d0d6c)
    • Edit automatic mixed precision feature guide (22b5c94)
    • Polish BQ user guide (f547a49)
    • Polish QAT user guide (339a225)
    • Update Quick start example, PTQ section (example) and notebook (03fdd64)
    • Add missing workflow images (010898a)
    • Add reference to mixed precision page in docs (188d401)