v7.0.0rc1
Pre-release
Pre-release
This is the release note of v7.0.0rc1. See here for the complete list of solved issues and merged PRs.
Announcements
This time, we will keep the current branches for active development (master for v7.x, v6 for v6.x) after the RC. We will maintain v6.x series until Python2 EOL, so we do not cut the new development version for now to avoid increasing the number of branches to maintain. New features will be included directly into v7 for a while, and maintenance changes will be backported to v6.
Highlights
- Experimental support of AMD GPUs are added (#1094). See the installation guide for how to install CuPy with AMD support. Note that this feature is still experimental, and we do not guarantee the API stability.
Changes without compatibility
- Avoid casting inputs to
cupy.ndarrayincupy.pad(#2504)- From this release,
cupy.paddoes no longer convert the input tocupy.ndarrayautomatically. This is done under the design principle of not implicitly synchronizing the host and the device, which most of the other APIs are also following.
- From this release,
New Features
- Experimental support of AMD GPU via HIP (ROCm2.7.0+) (#1094)
- Adds
nvccas aRawKernelbackend (#1941, thanks @sjperkins and @leofang!) - Support cuTENSOR 0.2 (#2341)
- Implement
isinandin1d(#2388, thanks @UmashankarTriforce!) - Support
scipy.ndimagecompatibleconvolveandcorrelate(#2483) - Added
cupy.cuda.memory.get_allocatorinterface (#2489) - Handle PCI bus ID (#2531, thanks @jameshclrk!)
- Expand coverage of cuSolverSP APIs (#2539)
- Add cuSPARSE routines for preconditioners (#2542)
Enhancements
- Fix division by zero in mean/std/var functions for 0-length dimensions (#2201, thanks @pentschev!)
- Improve error message in
cupy.linalg.inv(#2342) - Replace
cupy.padwith a heavily refactored version from NumPy 1.17 (#2399, thanks @grlee77!) - Fix
cupy.repeaterror message aboutrepeatsargument type (#2400) - Ignore warning caused by
fastrlock(#2488) - Update
__cuda_array_interface__to protocol version 2 (#2491, thanks @leofang!) - Allow
axis=Nonein concatenate (#2496, thanks @liwt31!) - Fix
@testing.numpy_cupy_decorators for skips (#2498) - Avoid implicit cast inputs to
cupy.ndarrayincupy.pad(#2504) - Cholesky decomposition to support complex values (#2509)
- Enhance shuffle-test of
testing.for_dtypes_combination(#2511) - Allow to use
realandimagon CUDA kernels (#2520) - Support complex numbers in
cupy.linalg.qr()(#2526, thanks @leofang!) - Fix bug in CUB + Native support of complex numbers in CUB (#2538, thanks @leofang!)
- Support
cupyx.fallback_modeas an experimental feature (#2541) - Support stream in CUB (#2555, thanks @leofang!)
Performance Improvements
- Performance improvement for
cupy.varcomplex inputs (#2484) - Enable fast CUB-based reductions in more cases (
cupy.linalg.norm, etc.) (#2517, thanks @grlee77!)
Code Fixes
Documentation
- Fix dead links in NumPy docs in random functions (#2384)
- Update
install_rocm.rst(#2512) - Fix some typo (#2523, thanks @garanews!)