v3.4.2
CUDA v3.4.2
Closed issues:
- Broadcasting a datatype does not work (#261)
- CUDA error: invalid argument during Zygote/Flux gradient computation (#1107)
- EXCEPTION_ACCESS_VIOLATION when using shared memory allocations. (#1116)
Merged pull requests:
- add symmetric support for mul (#217) (@Roger-luo)
- adds a device array type for CuSparseMatrixCSR to support using it in kernel functions (#1106) (@Roger-luo)
- Update manifest (#1108) (@github-actions[bot])
- Specialize Ref{<:Type} for GPU compatibility. (#1109) (@maleadt)
- Use the documented version of the enable_finalizers API. (#1111) (@maleadt)
- Don't embed the method table in the AST. (#1112) (@maleadt)
- Remove the hacky unique'ing of shmem GVs. (#1114) (@maleadt)
- Introduce a macro for marking multiple functions as device-only. (#1117) (@maleadt)
- Simplify library loading. (#1121) (@maleadt)
- Backports for 3.4.2 (#1122) (@maleadt)