Releases: JuliaGPU/KernelAbstractions.jl

v0.9.35

10 Jun 19:15

KernelAbstractions v0.9.35

Diff since v0.9.34

Merged pull requests:

  • Implement a CPU backend using POCL (#556) (@vchuravy)
  • [0.10] Forbid divergent execution of work-group barriers (#558) (@vchuravy)
  • Bump julia-actions/setup-julia from 1 to 2 (#561) (@dependabot[bot])
  • Switch Format.yml to CUDA.jl style (#568) (@vchuravy)
  • Test pocl#main on CI (#569) (@vchuravy)
  • CompatHelper: add new compat entry for SPIRVIntrinsics at version 0.2, (keep existing compat) (#571) (@github-actions[bot])
  • CompatHelper: add new compat entry for GPUCompiler at version 1, (keep existing compat) (#572) (@github-actions[bot])
  • CompatHelper: add new compat entry for LLVM at version 9, (keep existing compat) (#573) (@github-actions[bot])
  • Check that malformed allocations throw and don't stackoverflow (#576) (@vchuravy)
  • Check that malformed allocations throw and don't stackoverflow (#576) (#577) (@vchuravy)
  • Avoid callgraph recursion due to exception branch in get_global_id (#579) (@vchuravy)
  • Remove CPU(static=true) test (#580) (@vchuravy)
  • Set SPIR-V to 1.2 (#582) (@vchuravy)
  • use POCL with fixes (#589) (@vchuravy)
  • use barrier with LOCAL_MEM_FENCE (#591) (@vchuravy)
  • Test correct backend in examples test (#597) (@christiangnrd)
  • Switch to pocl_jll@v7 (#599) (@vchuravy)
  • prevent get_backend from overflowing the stack (#602) (@nsajko)
  • [NFC] Ignore formatting PRs in blame (#604) (@christiangnrd)
  • Enable downstream CI for 0.10 (#608) (@vchuravy)
  • Disable Float16 on the CPU backend (#609) (@vchuravy)

Closed issues:

  • Exception when getting the index on the CPU backend (#542)
  • Undefined variable error in kernel after update to 0.9.34 (#575)
  • Utility for copying array to GPU (#581)
  • StackOverflowError on get_backend(::UnitRange) (#588)
  • KA reports undefined variables, when they are in fact defined (#596)

v0.9.34

14 Feb 10:57
1516dce

KernelAbstractions v0.9.34

Diff since v0.9.33

Merged pull requests:

v0.9.33

23 Jan 09:50
8a87f77

KernelAbstractions v0.9.33

Diff since v0.9.32

Merged pull requests:

v0.9.32

21 Jan 14:35
e2cc83b

KernelAbstractions v0.9.32

Diff since v0.9.31

  • Clarify the semantics of KernelAbstractions.copyto! and add KernelAbstractions.pagelock!
  • Add support for multiple devices per backend

Merged pull requests:

Closed issues:

  • Add Feature to Select Devices to Execute Kernels On (#458)

v0.9.31

10 Dec 07:05
d373ee0

KernelAbstractions v0.9.31

Diff since v0.9.30

Merged pull requests:

v0.9.30

01 Dec 21:10
34c3d96

KernelAbstractions v0.9.30

Diff since v0.9.29

Merged pull requests:

v0.9.29

25 Oct 19:24

KernelAbstractions v0.9.29

Diff since v0.9.28

Merged pull requests:

  • Allow return statements for GPU-only kernels (#538) (@pxl-th)

Closed issues:

  • Multi-GPU backend (#540)

v0.9.28

04 Oct 15:43
419481c

KernelAbstractions v0.9.28

Diff since v0.9.27

Merged pull requests:

v0.9.27

21 Sep 19:05
bc89f91

KernelAbstractions v0.9.27

Diff since v0.9.26

Merged pull requests:

v0.9.26

17 Sep 14:31
d9062a3

KernelAbstractions v0.9.26

Diff since v0.9.25

Merged pull requests:

Closed issues:

  • [EnzymeExt] tape_type error (#495)