2.8.00 (2019-02-05)
Implemented enhancements:
- Capability, Tests: C++14 Support and Testing #351
- Capability: Batched getrs #332
- More Kernel Labels for KokkosBlas #239
- Name all parallel kernels and regions #124
Fixed bugs:
- BLAS TPL: BLAS underscore mangling #369
- BLAS TPL, Complex: Promotion 2.7.24 broke MV unit tests in Tpetra with complex types #360
- GEMM: GEMM uses wrong function for computing shared memory allocation size #368
- BuildSystem: BLAS TPL macro not properly enabled with MKL BLAS #347
- BuildSystem: make clean - errors #353
- Compiler Workaround: Internal compiler error in KokkosBatched::Experimental::TeamGemm #349
- KokkosBlas: Some KokkosBlas kernels assume default execution space #14
2.7.24 (2018-11-04)
Implemented enhancements:
- Enhance test_all_sandia script to set scalar and ordinal types #315
- Batched getri need #305
- Deterministic Coloring #271
- MKL - guard minor version for MKL v. 18 #268
- TPL Support for all BLAS functions using CuBLAS #247
- Add L1 variant to multithreaded Gauss-Seidel #240
- Multithreaded Gauss-Seidel does not support damping #221
- Guard 1-phase SpGEMM in Intel MKL #217
- generate makefile with-spaces option #98
- Add MKL version check #7
Fixed bugs:
- Perf test failures w/ just CUDA enabled #257
- Wrong signature for axpy blas functions #329
- Failing unit tests with float - unit test error checking issue #322
- cuda.graph_graph_color* COLORING_VBD test failures with cuda/9.2 + gcc/7.2 on White #317
- KokkosBatched::Experimental::SIMD<T> does not build with T=complex<float> #316
- simple test program fails using 3rdparty Eigen library #309
- KokkosBlas::dot is broken for complex, due to incorrect assumptions about Fortran ABI #307
- strides bug in kokkos tpl interface. #292
- Failing spgemm unit test with MKL #289
- Fix the block_pcg perf-test when offsets are size_t #287
- spotcheck warnings from kokkos #284
- Linking error in tpl things #282
- Build failure with clang 3.9.0 #281
- CMake modification for TPLs. #276
- KokkosBatched warnings #259
- KokkosBatched contraction length bug #258
- Small error in KokkosBatched_Gemm_Serial_Imp.hpp with SerialGemm<Trans::Transpose,*,*> #147
2.7.00 (2018-05-24)
Implemented enhancements:
- Tests: add capability to build a unit test standalone #233
- Make KokkosKernels work without KOKKOS_ENABLE_DEPRECATED_CODE #223
- Replace KOKKOS_HAVE_* FLAGS with KOKKOS_ENABLE_* #219
- Add team-based scal, mult, update, nrm2 #214
- Add team based abs #209
- Generated CPP files moving includes inside the ifdef's #199
- Implement BlockCRS in KokkosKernels #184
- Spgemm hash promotion #171
- Batched BLAS enhancement #170
- Document & check CMAKE_CXX_USE_RESPONSE_FILE_FOR_OBJECTS=ON in CUDA build #148
Fixed bugs:
- Update drivers in perf_tests/graph to use Kokkos::initialize() #200
- unit tests failing/hanging on Volta #188
- Inner TRSM: SIMD build error; manifests in Ifpack2 #183
- d2_graph_color doesn't have a default coloring mechanism #168
- Unit tests do not build with Serial backend #154
2.6.00 (2018-03-07)
Implemented enhancements:
Fixed bugs:
- d2_graph_color doesn't have a default coloring mechanism #168
- Build error when MKL TPL is enabled #135
2.5.00 (2017-12-15)
Implemented enhancements:
- KokkosBlas: Add GEMM interface #105
- KokkosBlas: Add GEMM default Kernel #125
- KokkosBlas: Add GEMV that wraps BLAS (and cuBLAS) #16
- KokkosSparse: Make SPMV test not print GBs of output if something goes wrong. #111
- KokkosSparse: ETI SpGEMM and Gauss Seidel and take it out of Experimental namespace #74
- BuildSystem: Fix Makesystem to correctly build library after aborted install #104
- BuildSystem: Add option to generate_makefile.bash to define memoryspaces for instantiation #89
- BuildSystem: generate makefile tpl option #66
- BuildSystem: Add a simpler compilation script, README update etc #96
Fixed bugs:
- Internal Compiler Error GCC in GEMM #129
- Batched Team LU: bug for small team_size #110
- Compiler BUG in IBM XL pragma unrolling #92
- Fix Blas TPL enables build #77
- Batched Gemm Failure #73
- CUDA 7.5 (GCC 4.8.4) build errors #72
- Cuda BLAS tests fail with UVM if CUDA_LAUNCH_BLOCKING=1 is not defined on Kepler #51
- CrsMatrix: sumIntoValues and replaceValues incorrectly count the number of valid column indices. #11
- findRelOffset test assumes UVM #32
0.10.03 (2017-09-11)
Implemented enhancements:
- KokkosSparse: Fix unused variable warnings in spmv_impl_omp, spmv Test and graph color perf_test #63
- KokkosBlas: dot: Add unit test #15
- KokkosBlas: dot: Add special case for multivector * vector (or vector * multivector) #13
- BuildSystem: Make KokkosKernels build independently of Trilinos #1
- BuildSystem: Fix ETI System not to depend on Tpetra ETI #5
- BuildSystem: Change CMake to work with new ETI system #19
- BuildSystem: Fix TpetraKernels names to KokkosKernels #4
- BuildSystem: Trilinos/KokkosKernels reports no ETI in almost any circumstance #29
- General: Kokkos::ArithTraits<double>::nan() is very slow #35
- General: Design and Define New UnitTest infrastructure #28
- General: Move Tpetra::Details::OrdinalTraits to KokkosKernels #22
- General: Rename files and NameSpace to KokkosKernels #12
- General: PrepareStandalone: Get rid of Teuchos usage #2
- General: Fix warning with char being either signed or unsigned in ArithTraits #60
- Testing: Make all tests run with -Werror #68
Fixed bugs:
- SPGEMM Test Fails for Cuda when compiled through Trilinos #49
- Fix ArithTraits min for floating points #47
- Pthread ETI error #25
- Fix CMake Based ETI for Threads backend #46
- KokkosKernels_ENABLE_EXPERIMENTAL causes build error #59
- ArithTraits warnings in CUDA build #71
- Graph coloring build warnings #3
This Change Log was automatically generated by github_changelog_generator