Revisions of kokkos

buildservice-autocommit accepted request 1163728 from Vicente Adolfo Bolea Sanchez's avatar Vicente Adolfo Bolea Sanchez (vicentebolea) (revision 29)
baserev update by copy to link target
Vicente Adolfo Bolea Sanchez's avatar Vicente Adolfo Bolea Sanchez (vicentebolea) accepted request 1163725 from Sarah Kriesch's avatar Sarah Kriesch (AdaLovelace) (revision 28)
- Remove constraints file because of improved memory usage
- Edit shebang lines for nvcc_wrapper and kokkos_launch_compiler because of rpm errors 
buildservice-autocommit accepted request 1131471 from Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) (revision 27)
baserev update by copy to link target
Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) accepted request 1131462 from Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) (revision 26)
- Update to new upstream release 4.2.00:
  see https://github.com/kokkos/kokkos/blob/4.2.00/CHANGELOG.md
buildservice-autocommit accepted request 1097631 from Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) (revision 25)
baserev update by copy to link target
Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) accepted request 1097624 from Vicente Adolfo Bolea Sanchez's avatar Vicente Adolfo Bolea Sanchez (vicentebolea) (revision 24)
- Update to new upstream release 4.1.00:
  see https://github.com/kokkos/kokkos/blob/4.1.00/CHANGELOG.md
- Use a tarball as Source0 instead of an URL.
buildservice-autocommit accepted request 1085499 from Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) (revision 23)
baserev update by copy to link target
Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) accepted request 1084494 from Vicente Adolfo Bolea Sanchez's avatar Vicente Adolfo Bolea Sanchez (vicentebolea) (revision 22)
- Update to new upstream release 4.0.01:
  see https://github.com/kokkos/kokkos/blob/62d2b6c879b74b6ae7bd06eb3e5e80139c4708e6/CHANGELOG.md
- Passed the spec-cleaner and formatted according to
- Change package version naming scheme to follow the OBS guidelines for shlibs. 
- Disable two test that fails with this version update
- Replaced patch with sed onliner
- Use Ninja for building the project.
Atri Bhattacharya's avatar Atri Bhattacharya (badshah400) accepted request 1078814 from Cezary Skrzyński's avatar Cezary Skrzyński (cz4rs) (revision 19)
- Update to new upstream release 4.0.00:
  see https://github.com/kokkos/kokkos/blob/master/CHANGELOG.md#400-2023-02-21
- Increase memory limits for build to avoid OpenMP tests running out of memory
- Add patch fixing rpmlint complaint about `env` being used as interpreter
buildservice-autocommit accepted request 972964 from Christoph Junghans's avatar Christoph Junghans (cjunghans) (revision 18)
baserev update by copy to link target
Christoph Junghans's avatar Christoph Junghans (cjunghans) accepted request 972855 from Jan Engelhardt's avatar Jan Engelhardt (jengelh) (revision 17)
- Resolve rpmlint report "libkokkos3.x86_64: E:
  shlib-policy-name-error SONAME: libkokkoscontainers.so.3.3.0,
  expected package suffix: 3_3_0"
Stefan Brüns's avatar Stefan Brüns (StefanBruens) accepted request 924992 from Stefan Brüns's avatar Stefan Brüns (StefanBruens) (revision 16)
- Remove random constraint adjustments again, and use a global
  lower limit of 6 GB of memory (minimum for the OMP tests).
- Limit the number of parallel jobs and OMP threads in the unit
  tests, to keep the required memory bounded.
  This way, even architectures with many cores and comparatively
  low memory can build successfully.
Klaus Kämpf's avatar Klaus Kämpf (kwk) accepted request 923179 from Sarah Kriesch's avatar Sarah Kriesch (AdaLovelace) (revision 15)
- New entries in constraints file because of missing build nodes with 
  18GB of memory for s390x and PPC:
  - 2GB for memoryperjob at x86, riscv64 and s390x
  - aarch64 3GB for memoryperjob because of "Out of memory" with 2GB
  - PPC64 and PPC64le with 1GB for memoryperjob because of missing 
    node capacity
buildservice-autocommit accepted request 857139 from Matthias Mailänder's avatar Matthias Mailänder (Mailaender) (revision 14)
baserev update by copy to link target
Matthias Mailänder's avatar Matthias Mailänder (Mailaender) accepted request 857026 from Christoph Junghans's avatar Christoph Junghans (cjunghans) (revision 13)
- dropped 3308.patch - merged upstream 
- Version bump to 3.3.00:
  - Features:
    - Require C++14 as minimum C++ standard. C++17 and C++20 are
      supported too.
    - HIP backend is nearly feature complete. Kokkos Dynamic Task
      Graphs are missing.
    - Major update for OpenMPTarget: many capabilities now work.
      For details contact us.
    - Added DPC++/SYCL backend: primary capabilites are working.
    - Added Kokkos Graph API analogous to CUDA Graphs.
    - Added parallel_scan support with TeamThreadRange
      [gh#kokkos/kokkos#3536]
    - Added Logical Memory Spaces [gh#kokkos/kokkos#3546]
    - Added initial half precision support [gh#kokkos/kokkos#3439]
    - Experimental feature: control cuda occupancy
      [gh#kokkos/kokkos#3379]
  - Implemented enhancements Backends and Archs:
    - Add a64fx and fujitsu Compiler support
      [gh#kokkos/kokkos#3614]
    - Adding support for AMD gfx908 archictecture
      [gh#kokkos/kokkos#3375]
    - SYCL parallel_for MDRangePolicy [gh#kokkos/kokkos#3583]
    - SYCL add parallel_scan [gh#kokkos/kokkos#3577]
    - SYCL custom reductions [gh#kokkos/kokkos#3544]
    - SYCL Enable container unit tests [gh#kokkos/kokkos#3550]
    - SYCL feature level 5 [gh#kokkos/kokkos#3480]
    - SYCL Feature level 4 (parallel_for) [gh#kokkos/kokkos#3474]
    - SYCL feature level 3 [gh#kokkos/kokkos#3451]
    - SYCL feature level 2 [gh#kokkos/kokkos#3447]
    - OpenMPTarget: Hierarchial reduction for + operator on
      scalars [gh#kokkos/kokkos#3504]
    - OpenMPTarget hierarchical [gh#kokkos/kokkos#3411]
    - HIP Add Impl::atomic_[store,load] [gh#kokkos/kokkos#3440]
    - HIP enable global lock arrays [gh#kokkos/kokkos#3418]
    - HIP Implement multiple occupancy paths for various HIP
      kernel launchers [gh#kokkos/kokkos#3366]
  - Implemented enhancements Policies:
    - MDRangePolicy: Let it be semiregular [gh#kokkos/kokkos#3494]
    - MDRangePolicy: Check narrowing conversion in construction
      [gh#kokkos/kokkos#3527]
    - MDRangePolicy: CombinedReducers support
      [gh#kokkos/kokkos#3395]
    - Kokkos Graph: Interface and Default Implementation
      [gh#kokkos/kokkos#3362]
    - Kokkos Graph: add Cuda Graph implementation
      [gh#kokkos/kokkos#3369]
    - TeamPolicy: implemented autotuning of team sizes and vector
      lengths [gh#kokkos/kokkos#3206]
    - RangePolicy: Initialize all data members in default
      constructor [gh#kokkos/kokkos#3509]
  - Implemented enhancements BuildSystem:
    - Auto-generate core test files for all backends
      [gh#kokkos/kokkos#3488]
    - Avoid rewriting test files when calling cmake
      [gh#kokkos/kokkos#3548]
    - RULE_LAUNCH_COMPILE and RULE_LAUNCH_LINK system for
      nvcc_wrapper [gh#kokkos/kokkos#3136]
    - Adding -include as a known argument to nvcc_wrapper
      [gh#kokkos/kokkos#3434]
    - Install hpcbind script [gh#kokkos/kokkos#3402]
    - cmake/kokkos_tribits.cmake: add parsing for args
      [gh#kokkos/kokkos#3457]
  - Implemented enhancements Tools:
    - Changed namespacing of
      Kokkos::Tools::Impl::Impl::tune_policy
      [gh#kokkos/kokkos#3455]
    - Delegate to an impl allocate/deallocate method to allow
      specifying a SpaceHandle for MemorySpaces
      [gh#kokkos/kokkos#3530]
    - Use the Kokkos Profiling interface rather than the Impl
      interface [gh#kokkos/kokkos#3518]
    - Runtime option for tuning [gh#kokkos/kokkos#3459]
    - Dual View Tool Events [gh#kokkos/kokkos#3326]
  - Implemented enhancements Other:
    - Abort on errors instead of just printing
      [gh#kokkos/kokkos#3528]
    - Enable C++14 macros unconditionally [gh#kokkos/kokkos#3449]
    - Make ViewMapping trivially copyable [gh#kokkos/kokkos#3436]
    - Rename struct ViewMapping to class [gh#kokkos/kokkos#3435]
    - Replace enums in Kokkos_ViewMapping.hpp (removes -Wextra)
      [gh#kokkos/kokkos#3422]
    - Use bool for enums representing bools
      [gh#kokkos/kokkos#3416]
    - Fence active instead of default execution space instances
      [gh#kokkos/kokkos#3388]
    - Refactor parallel_reduce fence usage [gh#kokkos/kokkos#3359]
    - Moved Space EBO helpers to Kokkos_EBO
      [gh#kokkos/kokkos#3357]
    - Add remove_cvref type trait [gh#kokkos/kokkos#3340]
    - Adding identity type traits and update definition of
      identity_t alias [gh#kokkos/kokkos#3339]
    - Add is_specialization_of type trait [gh#kokkos/kokkos#3338]
    - Make ScratchMemorySpace semi-regular [gh#kokkos/kokkos#3309]
    - Optimize min/max atomics with early exit on no-op case
      [gh#kokkos/kokkos#3265]
    - Refactor Backend Development [gh#kokkos/kokkos#2941]
  - Fixed bugs:
    - Fixup MDRangePolicy construction from Kokkos arrays
      [gh#kokkos/kokkos#3591]
    - Add atomic functions for unsigned long long using gcc
      built-in [gh#kokkos/kokkos#3588]
    - Fixup silent pointless comparison with zero in
      checked_narrow_cast (compiler workaround)
      [gh#kokkos/kokkos#3566]
    - Fixes for ROCm 3.9 [gh#kokkos/kokkos#3565]
    - Fix windows build issues which crept in for the CUDA build
      [gh#kokkos/kokkos#3532]
    - HIP Fix atomics of large data types and clean up lock arrays
      [gh#kokkos/kokkos#3529]
    - Pthreads fix exception resulting from 0 grain size
      [gh#kokkos/kokkos#3510]
    - Fixup do not require atomic operation to be default
      constructible [gh#kokkos/kokkos#3503]
    - Fix race condition in HIP backend [gh#kokkos/kokkos#3467]
    - Replace KOKKOS_DEBUG with KOKKOS_ENABLE_DEBUG
      [gh#kokkos/kokkos#3458]
    - Fix multi-stream team scratch space definition for HIP
      [gh#kokkos/kokkos#3398]
    - HIP fix template deduction [gh#kokkos/kokkos#3393]
    - Fix compiling with HIP and C++17 [gh#kokkos/kokkos#3390]
    - Fix sigFPE in HIP blocksize deduction
      [gh#kokkos/kokkos#3378]
    - Type alias change: replace CS with CTS to avoid conflicts
      with NVSHMEM [gh#kokkos/kokkos#3348]
    - Clang compilation of CUDA backend on Windows
      [gh#kokkos/kokkos#3345]
    - Fix HBW support [gh#kokkos/kokkos#3343]
    - Added missing fences to unique token [gh#kokkos/kokkos#3260]
  - Incompatibilities:
    - Remove unused utilities (forward, move, and expand_variadic)
      from Kokkos::Impl [gh#kokkos/kokkos#3535]
    - Remove unused traits [gh#kokkos/kokkos#3534]
    - HIP: Remove old HCC code [gh#kokkos/kokkos#3301]
    - Prepare for deprecation of ViewAllocateWithoutInitializing
      [gh#kokkos/kokkos#3264]
    - Remove ROCm backend [gh#kokkos/kokkos#3148]
buildservice-autocommit accepted request 836168 from Christoph Junghans's avatar Christoph Junghans (cjunghans) (revision 12)
baserev update by copy to link target
Christoph Junghans's avatar Christoph Junghans (cjunghans) accepted request 835906 from Christoph Junghans's avatar Christoph Junghans (cjunghans) (revision 11)
- added 3308.patch to fix naming of printer-tool 
- Version bump to 3.2.00:
  - Implemented enhancements:
    - HIP:Enable stream in HIP [gh#kokkos/kokkos#3163]
    - HIP:Add support for shuffle reduction for the HIP backend
      [gh#kokkos/kokkos#3154]
    - HIP:Add implementations of missing HIPHostPinnedSpace
      methods for LAMMPS [gh#kokkos/kokkos#3137]
    - HIP:Require HIP 3.5.0 or higher [gh#kokkos/kokkos#3099]
    - HIP:WorkGraphPolicy for HIP [gh#kokkos/kokkos#3096]
    - OpenMPTarget: Significant update to the new experimental
      backend. Requires C++17, works on Intel GPUs, reference
      counting fixes. [gh#kokkos/kokkos#3169]
    - Windows Cuda support [gh#kokkos/kokkos#3018]
    - Pass -Wext-lambda-captures-this to NVCC when support for
      __host__ __device__ lambda is enabled from CUDA 11
      [gh#kokkos/kokkos#3241]
    - Use explicit staging buffer for constant memory kernel
      launches and cleanup host/device synchronization
      [gh#kokkos/kokkos#3234]
    - Various fixup to policies including making TeamPolicy
      default constructible and making RangePolicy and TeamPolicy
      assignable: [gh#kokkos/kokkos#3202], [gh#kokkos/kokkos#3203],
      [gh#kokkos/kokkos#3196]
    - Annotations for DefaultExectutionSpace and
      DefaultHostExectutionSpace to use in static analysis
      [gh#kokkos/kokkos#3189]
    - Add documentation on using Spack to install Kokkos and
      developing packages that depend on Kokkos
      [gh#kokkos/kokkos#3187]
    - Add OpenMPTarget backend flags for NVC++ compiler
      [gh#kokkos/kokkos#3185]
    - Move deep_copy/create_mirror_view on
      Experimental::OffsetView into Kokkos:: namespace
      [gh#kokkos/kokkos#3166]
    - Allow for larger block size in HIP [gh#kokkos/kokkos#3165]
    - View: Added names of Views to the different View
      initialize/free kernels [gh#kokkos/kokkos#3159]
    - Cuda: Caching cudaFunctorAttributes and whether L1/Shmem
      prefer was set [gh#kokkos/kokkos#3151]
    - BuildSystem: Improved performance in default configuration
      by defaulting to Release build [gh#kokkos/kokkos#3131]
    - Cuda: Update CUDA occupancy calculation
      [gh#kokkos/kokkos#3124]
    - Vector: Adding data() to Vector [gh#kokkos/kokkos#3123]
    - BuildSystem: Add CUDA Ampere configuration support
      [gh#kokkos/kokkos#3122]
    - General: Apply [[noreturn]] to Kokkos::abort when applicable
      [gh#kokkos/kokkos#3106]
    - TeamPolicy: Validate storage level argument passed to
      TeamPolicy::set_scratch_size() [gh#kokkos/kokkos#3098]
    - BuildSystem: Make kokkos_has_string() function in
      Makefile.kokkos case insensitive [gh#kokkos/kokkos#3091]
    - Modify KOKKOS_FUNCTION macro for clang-tidy analysis
      [gh#kokkos/kokkos#3087]
    - Move allocation profiling to allocate/deallocate calls
      [gh#kokkos/kokkos#3084]
    - BuildSystem: FATAL_ERROR when attempting in-source build
      [gh#kokkos/kokkos#3082]
    - Change enums in ScatterView to types [gh#kokkos/kokkos#3076]
    - HIP: Changes for new compiler/runtime
      [gh#kokkos/kokkos#3067]
    - Extract and use get_gpu [gh#kokkos/kokkos#3061] ,
      [gh#kokkos/kokkos#3048]
    - Add is_allocated to View-like containers
      [gh#kokkos/kokkos#3059]
    - Combined reducers for scalar references
      [gh#kokkos/kokkos#3052]
    - Add configurable capacity for UniqueToken
      [gh#kokkos/kokkos#3051]
    - Add installation testing [gh#kokkos/kokkos#3034]
    - HIP: Add UniqueToken [gh#kokkos/kokkos#3020]
    - Autodetect number of devices [gh#kokkos/kokkos#3013]
  - Fixed bugs:
    - Check error code from cudaStreamSynchronize in CUDA fences
      [gh#kokkos/kokkos#3255]
    - Fix issue with C++ standard flags when using nvcc\_wrapper
      with PGI [gh#kokkos/kokkos#3254]
    - Add missing threadfence in lock-based atomics
      [gh#kokkos/kokkos#3208]
    - Fix dedup of linker flags for shared lib on CMake <=3.12
      [gh#kokkos/kokkos#3176]
    - Fix memory leak with CUDA streams [gh#kokkos/kokkos#3170]
    - BuildSystem: Fix OpenMP Target flags for Cray
      [gh#kokkos/kokkos#3161]
    - ScatterView: fix for OpenmpTarget remove inheritance from
      reducers [gh#kokkos/kokkos#3162]
    - BuildSystem: Set OpenMP flags according to host compiler
      [gh#kokkos/kokkos#3127]
    - OpenMP: Fix logic for nested omp in partition_master bug
      [gh#kokkos/kokkos#3101]
    - nvcc_wrapper: send --cudart to nvcc instead of host compiler
      [gh#kokkos/kokkos#3092]
    - BuildSystem: Fixes for Cuda/11 and c++17
      [gh#kokkos/kokkos#3085]
    - HIP: Fix print_configuration [gh#kokkos/kokkos#3080]
    - Conditionally define get_gpu [gh#kokkos/kokkos#3072]
    - Fix bounds for ranges in random number generator
      [gh#kokkos/kokkos#3069]
    - Fix Cuda minor arch check [gh#kokkos/kokkos#3035]
    - BuildSystem: Add -expt-relaxed-constexpr flag to
      nvcc_wrapper [gh#kokkos/kokkos#3021]
  - Incompatibilities:
    - Remove ETI support [gh#kokkos/kokkos#3157]
    - Remove KOKKOS_INTERNAL_ENABLE_NON_CUDA_BACKEND
      [gh#kokkos/kokkos#3147]
    - Remove core/unit_test/config [gh#kokkos/kokkos#3146]
    - Removed the preprocessor branch for KOKKOS_ENABLE_PROFILING
      [gh#kokkos/kokkos#3115]
    - Disable profiling with MSVC [gh#kokkos/kokkos#3066]
  - Closed issues:
    - Silent error (Validate storage level arg to
      set_scratch_size) [gh#kokkos/kokkos#3097]
    - Remove KOKKKOS_ENABLE_PROFILING Option
      [gh#kokkos/kokkos#3095]
    - Cuda 11 -> allow C++17 [gh#kokkos/kokkos#3083]
    - In source build failure not explained
      [gh#kokkos/kokkos#3081]
    - Allow naming of Views for initialization kernel
      [gh#kokkos/kokkos#3070]
    - DefaultInit tests failing when using CTest resource
      allocation feature [gh#kokkos/kokkos#3040]
    - Add installation testing. [gh#kokkos/kokkos#3037]
    - nvcc_wrapper needs to handle -expt-relaxed-constexpr flag
      [gh#kokkos/kokkos#3017]
    - CPU core oversubscription warning on macOS with OpenMP
      backend [gh#kokkos/kokkos#2996]
    - Default behavior of KOKKOS_NUM_DEVICES to use all devices
      available [gh#kokkos/kokkos#2975]
    - Assert blocksize > 0 [gh#kokkos/kokkos#2974]
    - Add ability to assign kokkos profile function from
      executable [gh#kokkos/kokkos#2973]
    - ScatterView Support for the pre/post increment operator
      [gh#kokkos/kokkos#2967]
    - Compiler issue: Cuda build with clang 10 has errors with the
      atomic unit tests [gh#kokkos/kokkos#3237]
    - Incompatibility of flags for C++ standard with PGI v20.4 on
      Power9/NVIDIA V100 system [gh#kokkos/kokkos#3252]
    - Error configuring as subproject [gh#kokkos/kokkos#3140]
    - CMake fails with Nvidia compilers when the GPU architecture
      option is not supplied (Fix configure with OMPT and Cuda)
      [gh#kokkos/kokkos#3207]
    - PGI compiler being passed the gcc -fopenmp flag
      [gh#kokkos/kokkos#3125]
    - Cuda: Memory leak when using CUDA stream
      [gh#kokkos/kokkos#3167]
    - RangePolicy has an implicitly deleted assignment operator
      [gh#kokkos/kokkos#3192]
    - MemorySpace::allocate needs to have memory pool counting.
      [gh#kokkos/kokkos#3064]
    - Missing write fence for lock based atomics on CUDA
      [gh#kokkos/kokkos#3038]
    - CUDA compute capability version check problem
      [gh#kokkos/kokkos#3026]
    - Make DynRankView fencing consistent [gh#kokkos/kokkos#3014]
    - nvcc_wrapper cant handle -Xcompiler -o out.o
      [gh#kokkos/kokkos#2993]
    - Reductions of non-trivial types of size 4 fail in CUDA shfl
      operations [gh#kokkos/kokkos#2990]
    - complex_double misalignment in reduce, clang+CUDA
      [gh#kokkos/kokkos#2989]
    - Span of degenerated (zero-length) subviews is not zero in
      some special cases [gh#kokkos/kokkos#2979]
    - Rank 1 custom layouts dont work as expected.
      [gh#kokkos/kokkos#2840]
buildservice-autocommit accepted request 828273 from Christoph Junghans's avatar Christoph Junghans (cjunghans) (revision 10)
baserev update by copy to link target
Displaying revisions 1 - 20 of 29
openSUSE Build Service is sponsored by