Skip to content

Releases: wejoncy/QLLM

v0.2.0

26 Aug 12:40
798198c
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.9.1...v0.2.0

v0.1.9.1

02 Jul 09:28
7ea4fb0
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.9...v0.1.9.1

v0.1.9

19 Jun 08:11
f07d07b
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.8...v0.1.9

v0.1.8

21 Mar 06:00
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.7.1...v0.1.8

v0.1.7.1

09 Mar 15:30
2023fa1
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.7...v0.1.7.1

v0.1.7

08 Mar 10:44
ff202aa
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.6...v0.1.7

v0.1.6

23 Jan 12:28
92190a4
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.5...v0.1.6

v0.1.5

03 Jan 03:22
9beef0b
Compare
Choose a tag to compare

What's Changed

  • works on windows, set dtype is importang by @wejoncy in #54
  • use_heuristic=false default for models having hard to predict unquantized layers like mixtral-8x7b by @wejoncy in #55
  • add mixtral in readme example by @wejoncy in #56
  • bugfix when export 16bit model by @wejoncy in #57
  • Fix build err, uint32_t is not defined. <stdint.h> by @wejoncy in #58
  • dp kernel support g_idx by @wejoncy in #59
  • [important] packing improve, faster by @wejoncy in #60
  • [improve packing]fix for awq unpack by @wejoncy in #61
  • 3bit support with g_idx in dq_kernel by @wejoncy in #63
  • 3bit fix by @wejoncy in #64
  • 0.1.5.dev by @wejoncy in #65
  • onnx support Act_order && some onnx fix by @wejoncy in #66
  • Support gemv with g_idx and some fix in exporter/dataloader by @wejoncy in #67
  • support mixtral in gptq/awq by @wejoncy in #68
  • minor fix for act_order detect by @wejoncy in #70
  • Bump version to 0.1.5 by @wejoncy in #69

Full Changelog: v0.1....v0.1.5

v0.1.4

19 Dec 02:32
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.3...v0.1.4

v0.1.3

14 Dec 11:41
c0be5ed
Compare
Choose a tag to compare

What's Changed

Full Changelog: https://github.com/wejoncy/QLLM/commits/v0.1.3