Skip to content

Latest commit

 

History

History
175 lines (134 loc) · 13.1 KB

CHANGELOG.md

File metadata and controls

175 lines (134 loc) · 13.1 KB

Change Log

2.9.00 (2019-06-24)

Full Changelog

Implemented enhancements:

  • KokkosBatched: Add specialization for float2, float4 and double4 #427
  • KokkosBatched: Reduce VectorLength (16 to 8) #432
  • KokkosBatched: Remove experimental name space for batched blas #371
  • Capability: Initial sparse triangular solve capability #435
  • Capability: Add support for MAGMA GESV TPL #409
  • cuBLAS: Add CudaUVMSpace specializations for GEMM #397

Fixed bugs:

  • Deprecated Code Fixes #411
  • BuildSystem: Compilation error on rzansel #401

2.8.00 (2019-02-05)

Full Changelog

Implemented enhancements:

  • Capability, Tests: C++14 Support and Testing #351
  • Capability: Batched getrs #332
  • More Kernel Labels for KokkosBlas #239
  • Name all parallel kernels and regions #124

Fixed bugs:

  • BLAS TPL: BLAS underscore mangling #369
  • BLAS TPL, Complex: Promotion 2.7.24 broke MV unit tests in Tpetra with complex types #360
  • GEMM: GEMM uses wrong function for computing shared memory allocation size #368
  • BuildSystem: BLAS TPL macro not properly enabled with MKL BLAS #347
  • BuildSystem: make clean - errors #353
  • Compiler Workaround: Internal compiler error in KokkosBatched::Experimental::TeamGemm #349
  • KokkosBlas: Some KokkosBlas kernels assume default execution space #14

2.7.24 (2018-11-04)

Full Changelog

Implemented enhancements:

  • Enhance test_all_sandia script to set scalar and ordinal types #315
  • Batched getri need #305
  • Deterministic Coloring #271
  • MKL - guard minor version for MKL v. 18 #268
  • TPL Support for all BLAS functions using CuBLAS #247
  • Add L1 variant to multithreaded Gauss-Seidel #240
  • Multithreaded Gauss-Seidel does not support damping #221
  • Guard 1-phase SpGEMM in Intel MKL #217
  • generate makefile with-spaces option #98
  • Add MKL version check #7

Fixed bugs:

  • Perf test failures w/ just CUDA enabled #257
  • Wrong signature for axpy blas functions #329
  • Failing unit tests with float - unit test error checking issue #322
  • cuda.graph_graph_color* COLORING_VBD test failures with cuda/9.2 + gcc/7.2 on White #317
  • KokkosBatched::Experimental::SIMD<T> does not build with T=complex<float> #316
  • simple test program fails using 3rdparty Eigen library #309
  • KokkosBlas::dot is broken for complex, due to incorrect assumptions about Fortran ABI #307
  • strides bug in kokkos tpl interface. #292
  • Failing spgemm unit test with MKL #289
  • Fix the block_pcg perf-test when offsets are size_t #287
  • spotcheck warnings from kokkos #284
  • Linking error in tpl things #282
  • Build failure with clang 3.9.0 #281
  • CMake modification for TPLs. #276
  • KokkosBatched warnings #259
  • KokkosBatched contraction length bug #258
  • Small error in KokkosBatched_Gemm_Serial_Imp.hpp with SerialGemm<Trans::Transpose,*,*> #147

2.7.00 (2018-05-24)

Full Changelog

Implemented enhancements:

  • Tests: add capability to build a unit test standalone #233
  • Make KokkosKernels work without KOKKOS_ENABLE_DEPRECATED_CODE #223
  • Replace KOKKOS_HAVE_* FLAGS with KOKKOS_ENABLE_* #219
  • Add team-based scal, mult, update, nrm2 #214
  • Add team based abs #209
  • Generated CPP files moving includes inside the ifdef's #199
  • Implement BlockCRS in Kokkoskernels #184
  • Spgemm hash promotion #171
  • Batched BLAS enhancement #170
  • Document & check CMAKE_CXX_USE_RESPONSE_FILE_FOR_OBJECTS=ON in CUDA build #148

Fixed bugs:

  • Update drivers in perf_tests/graph to use Kokkos::initialize() #200
  • unit tests failing/hanging on Volta #188
  • Inner TRSM: SIMD build error; manifests in Ifpack2 #183
  • d2_graph_color doesn't have a default coloring mechanism #168
  • Unit tests do not build with Serial backend #154

2.6.00 (2018-03-07)

Full Changelog

Implemented enhancements:

  • Spgemm hash promotion #171
  • Batched BLAS enhancement #170

Fixed bugs:

  • d2_graph_color doesn't have a default coloring mechanism #168
  • Build error when MKL TPL is enabled #135

2.5.00 (2017-12-15)

Full Changelog

Implemented enhancements:

  • KokkosBlas: Add GEMM interface #105
  • KokkosBlas: Add GEMM default Kernel #125
  • KokkosBlas: Add GEMV that wraps BLAS (and cuBLAS) #16
  • KokkosSparse: Make SPMV test not print GBs of output if something goes wrong. #111
  • KokkosSparse: ETI SpGEMM and Gauss Seidel and take it out of Experimental namespace #74
  • BuildSystem: Fix Makesystem to correctly build library after aborted install #104
  • BuildSystem: Add option ot generate_makefile.bash to define memoryspaces for instantiation #89
  • BuildSystem: generate makefile tpl option #66
  • BuildSystem: Add a simpler compilation script, README update etc #96

Fixed bugs:

  • Internal Compiler Error GCC in GEMM #129
  • Batched Team LU: bug for small team_size #110
  • Compiler BUG in IBM XL pragma unrolling #92
  • Fix Blas TPL enables build #77
  • Batched Gemm Failure #73
  • CUDA 7.5 (GCC 4.8.4) build errors #72
  • Cuda BLAS tests fail with UVM if CUDA_LAUNCH_BLOCKING=1 is not defined on Kepler #51
  • CrsMatrix: sumIntoValues and replaceValues incorrectly count the number of valid column indices. #11
  • findRelOffset test assumes UVM #32

0.10.03 (2017-09-11)

Implemented enhancements:

  • KokkosSparse: Fix unused variable warnings in spmv_impl_omp, spmv Test and graph color perf_test #63
  • KokkosBlas: dot: Add unit test #15
  • KokkosBlas: dot: Add special case for multivector * vector (or vector * multivector) #13
  • BuildSystem: Make KokkosKernels build independently of Trilinos #1
  • BuildSystem: Fix ETI System not to depend on Tpetra ETI #5
  • BuildSystem: Change CMake to work with new ETI system #19
  • BuildSystem: Fix TpetraKernels names to KokkosKernels #4
  • BuildSystem: Trilinos/KokkosKernels reports no ETI in almost any circumstance #29
  • General: Kokkos::ArithTraits<double>::nan() is very slow #35
  • General: Design and Define New UnitTest infrastructure #28
  • General: Move Tpetra::Details::OrdinalTraits to KokkosKernels #22
  • General: Rename files and NameSpace to KokkosKernels #12
  • General: PrepareStandalone: Get rid of Teuchos usage #2
  • General: Fix warning with char being either signed or unsigned in ArithTraits #60
  • Testing: Make all tests run with -Werror #68

Fixed bugs:

  • SPGEMM Test Fails for Cuda when compiled through Trilinos #49
  • Fix ArithTraits min for floating points #47
  • Pthread ETI error #25
  • Fix CMake Based ETI for Threads backend #46
  • KokkosKernels_ENABLE_EXPERIMENTAL causes build error #59
  • ArithTraits warnings in CUDA build #71
  • Graph coloring build warnings #3

* This Change Log was automatically generated by github_changelog_generator