Revisions of openblas
Dominique Leuenberger (dimstar_suse)
accepted
request 869479
from
Ismail Dönmez (namtrac)
(revision 40)
Dominique Leuenberger (dimstar_suse)
accepted
request 856635
from
Ismail Dönmez (namtrac)
(revision 39)
- Update to version 0.3.13
  common:
  * Added a generic bfloat16 SBGEMV kernel
  * Fixed a potentially severe memory leak after fork in OpenMP builds that was introduced in 0.3.12
  * Added detection of the Fujitsu Fortran compiler
  * Added detection of the (e)gfortran compiler on OpenBSD
  * Added support for overriding the default name of the library independently from symbol suffixing in the gmake builds (already supported in cmake)
  RISC V:
  * Added a RISC V port optimized for C910V
  POWER:
  * Added optimized POWER10 kernels for SAXPY, CAXPY, SDOT, DDOT and DGEMV_N
  * Improved DGEMM performance on POWER10
  * Improved STRSM and DTRSM performance on POWER9 and POWER10
  * Fixed segmentation faults in DYNAMIC_ARCH builds
  * Fixed compilation with the PGI compiler
  x86:
  * Fixed compilation of kernels that require SSE2 intrinsics since 0.3.12
  x86_64:
  * Added an optimized bfloat16 SBGEMV kernel for SkylakeX and Cooperlake
  * Improved the performance of SASUM and DASUM kernels through parallelization
  * Improved the performance of SROT and DROT kernels
  * Improved the performance of multithreaded xSYRK
  * Fixed OpenMP builds that use the LLVM Clang compiler together with GNU gfortran (where linking of both the LLVM libomp and GNU libgomp could lead to lockups or wrong results)
  * Fixed miscompilations by old gcc 4.6
  * Fixed misdetection of AVX2 capability in some Sandybridge CPUs
Dominique Leuenberger (dimstar_suse)
accepted
request 843798
from
Ismail Dönmez (namtrac)
(revision 38)
- Update to version 0.3.12
  common:
  * Fixed missing BLAS/LAPACK functions (inadvertently dropped during the build system restructuring to support selective compilation)
  * Fixed argument conversion macro in LAPACKE_zgesvdq (LAPACK #458)
  power:
  * Added optimized SCOPY/CCOPY kernels for POWER10
  * Increased and unified the default size of the GEMM buffer
  * Fixed building for POWER10 in DYNAMIC_ARCH mode
  * POWER10 compatibility test now checks binutils version as well
  * Cleaned up compiler warnings
  x86_64:
  * Corrected compiler version checks for AVX2 compatibility
  * Added compiler option -mavx2 for building with flang
  * Fixed direct SGEMM pathway for small matrix sizes (broken by the code refactoring in 0.3.11)
  * Fixed unhandled partial register clobbers in several kernels for AXPY, DOT, GEMV_N and GEMV_T flagged by gcc10 tree-vectorizer
  armv8:
  * Improved Apple Vortex support to include cross-compiling
- Drop fix-build.patch, merged upstream.
Dominique Leuenberger (dimstar_suse)
accepted
request 843166
from
Ismail Dönmez (namtrac)
(revision 37)
- Update _constraints to use 12GB RAM on x86_64
- Update to version 0.3.11
  common:
  * Reduced the default BLAS3_MEM_ALLOC_THRESHOLD (used as an upper limit for placing temporary arrays on the stack) to be compatible with a stack size of 1mb (as imposed by the JAVA runtime library)
  * Added mixed-precision dot function SBDOT and utility functions shstobf16, shdtobf16, sbf16tos and dbf16tod to convert between single or double precision float arrays and bfloat16 arrays
  * Fixed prototypes of LAPACK_?ggsvp and LAPACK_?ggsvd functions in lapack.h
  * Fixed underflow and rounding errors in LAPACK SLANV2 and DLANV2 (causing miscalculations in e.g. SHSEQR/DHSEQR, LAPACK issue #263)
  * Fixed workspace calculation in LAPACK ?GELQ (LAPACK issue #415)
  * Fixed several bugs in the LAPACK testsuite
  * Improved performance of TRMM and TRSM for certain problem sizes
  * Fixed infinite recursions and workspace miscalculations in ReLAPACK
  * CMAKE builds no longer require pkg-config for creating the .pc file
  * Makefile builds no longer misread NO_CBLAS=0 or NO_LAPACK=0 as enabling these options
  * Fixed detection of gfortran when invoked through an mpi wrapper
  * Improve thread reinitialization performance with OpenMP after a fork
  * Added support for building only the subset of the library required for a particular precision by specifying BUILD_SINGLE, BUILD_DOUBLE
  * Optional function name prefixes and suffixes are now correctly reflected in the generated cblas.h
  * Added CMAKE build support for the LAPACK and multithreading tests
  power:
  * Added optimized support for POWER10
Dominique Leuenberger (dimstar_suse)
accepted
request 839313
from
Egbert Eich (eeich)
(revision 36)
- Set DYNAMIC_ARCH everywhere, use a base CPU model for non-dynamic bits to have a reproducible base line:
  x86_64: CORE2
  aarch64: ARMV8
  ppc: POWER8
  s390: ZARCH_GENERIC
- Remove workaround for build failure on aarch64 (boo#1128794).
  (forwarded request 839300 from eeich)
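As a rough illustration of the revision 36 change, the per-architecture base CPU models listed there can be turned into OpenBLAS make arguments. This is only a sketch: `DYNAMIC_ARCH` and `TARGET` are OpenBLAS build variables, but the helper name `make_args`, the dictionary `BASE_TARGETS`, and the way the package's spec file actually passes these values are assumptions and are not shown on this page. The architecture keys are copied from the changelog entry as written and may not match the exact RPM architecture names.

```python
# Base CPU models per architecture, copied from the revision 36 changelog entry.
# The keys follow the changelog wording; real RPM arch names may differ.
BASE_TARGETS = {
    "x86_64": "CORE2",
    "aarch64": "ARMV8",
    "ppc": "POWER8",
    "s390": "ZARCH_GENERIC",
}


def make_args(arch):
    """Return hypothetical make arguments for a DYNAMIC_ARCH build on `arch`.

    DYNAMIC_ARCH=1 enables runtime kernel selection; TARGET fixes the CPU
    model used for the non-dynamic parts so the build is reproducible.
    """
    if arch not in BASE_TARGETS:
        raise ValueError(f"no base CPU model listed for architecture {arch!r}")
    return ["DYNAMIC_ARCH=1", f"TARGET={BASE_TARGETS[arch]}"]


if __name__ == "__main__":
    for arch in BASE_TARGETS:
        print(arch, "->", " ".join(["make"] + make_args(arch)))
```

Pinning `TARGET` to an older, widely supported model means the non-dynamic code does not depend on the CPU of whichever build host happens to run the job.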
Dominique Leuenberger (dimstar_suse)
accepted
request 837347
from
Ismail Dönmez (namtrac)
(revision 35)
Dominique Leuenberger (dimstar_suse)
accepted
request 833714
from
Egbert Eich (eeich)
(revision 34)
- Add build support for gcc10 to HPC build (bsc#1174439). (forwarded request 833599 from eeich)
Dominique Leuenberger (dimstar_suse)
accepted
request 825919
from
Ismail Dönmez (namtrac)
(revision 33)
Dominique Leuenberger (dimstar_suse)
accepted
request 814611
from
Ismail Dönmez (namtrac)
(revision 32)
Dominique Leuenberger (dimstar_suse)
accepted
request 783595
from
Ismail Dönmez (namtrac)
(revision 31)
Dominique Leuenberger (dimstar_suse)
accepted
request 779513
from
Egbert Eich (eeich)
(revision 30)
- Add support for gcc8/9 building (jsc#SLE-8604). (forwarded request 779439 from eeich)
Dominique Leuenberger (dimstar_suse)
accepted
request 756109
from
Dmitry Roshchin (Dmitry_R)
(revision 29)
Dominique Leuenberger (dimstar_suse)
accepted
request 734000
from
Ana Guerrero (anag)
(revision 28)
Add Jira feature ID
Dominique Leuenberger (dimstar_suse)
accepted
request 722787
from
Dmitry Roshchin (Dmitry_R)
(revision 27)
Dominique Leuenberger (dimstar_suse)
accepted
request 700583
from
Dmitry Roshchin (Dmitry_R)
(revision 26)
Dominique Leuenberger (dimstar_suse)
accepted
request 684454
from
Dmitry Roshchin (Dmitry_R)
(revision 25)
Dominique Leuenberger (dimstar_suse)
accepted
request 668899
from
Dmitry Roshchin (Dmitry_R)
(revision 23)
Dominique Leuenberger (dimstar_suse)
accepted
request 656137
from
Dmitry Roshchin (Dmitry_R)
(revision 22)
Dominique Leuenberger (dimstar_suse)
accepted
request 640893
from
Dmitry Roshchin (Dmitry_R)
(revision 21)
Displaying revisions 21 - 40 of 60