The GotoBLAS2 optimized BLAS library
GotoBLAS2 has been released by the Texas Advanced Computing Center as open source software under the BSD license. This product is no longer under active development by TACC, but it is being made available to the community to use, study, and extend. GotoBLAS2 uses new algorithms and memory techniques for optimal performance of the BLAS routines.
The changes in this final version target new architecture features in microprocessors and interprocessor communication techniques. In addition, NUMA controls improve multi-threaded execution of BLAS routines within a node. The library is tuned for the following platforms:
* Intel Nehalem and Atom systems
* AMD Shanghai and Istanbul
The library includes the following features:
* Configurations for a variety of hardware platforms
* Incorporation of features of many ISAs (Instruction Set Architectures)
* Implementation of NUMA controls to ensure best process affinity
and memory policy
* Dynamic detection of multiple architecture components, which can be
included in a single binary (for binary distributions)
This library has been built with dynamic hardware detection at runtime, so the best-performing code path is selected whatever the hardware is. Moreover, this package contains both the threaded and non-threaded versions of GotoBLAS2.
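The idea of runtime hardware detection can be illustrated with a minimal sketch. This is not GotoBLAS2's code: the kernel functions, the `KERNELS` table, and `select_kernel` are all hypothetical names, and the mapping from CPU name to kernel is arbitrary. It only shows the general pattern of shipping several kernels in one binary and choosing one at startup, with a portable fallback for unrecognized hardware.

```python
# Illustrative sketch of runtime kernel dispatch.
# Every name here is hypothetical; none of it is GotoBLAS2's actual code.

def dot_generic(x, y):
    """Portable fallback: plain accumulation loop (assumes len(x) == len(y))."""
    total = 0.0
    for a, b in zip(x, y):
        total += a * b
    return total


def dot_unrolled(x, y):
    """Stand-in for a tuned kernel: same dot product, loop unrolled by two."""
    total_even = 0.0
    total_odd = 0.0
    paired = len(x) - len(x) % 2  # largest even prefix length
    for i in range(0, paired, 2):
        total_even += x[i] * y[i]
        total_odd += x[i + 1] * y[i + 1]
    if len(x) % 2:  # handle the leftover element of an odd-length vector
        total_even += x[-1] * y[-1]
    return total_even + total_odd


# Dispatch table keyed by CPU family name. The assignments are arbitrary
# and exist only to show that several kernels live side by side.
KERNELS = {
    "nehalem": dot_unrolled,
    "atom": dot_generic,
    "shanghai": dot_unrolled,
    "istanbul": dot_unrolled,
}


def select_kernel(cpu_name):
    """Return the kernel registered for cpu_name, or the generic fallback."""
    return KERNELS.get(cpu_name.lower(), dot_generic)


if __name__ == "__main__":
    kernel = select_kernel("Nehalem")
    print(kernel.__name__, kernel([1.0, 2.0, 3.0], [4.0, 5.0, 6.0]))
```

In a real library the CPU name would come from querying the processor at load time rather than from a string argument; the table lookup with a generic fallback is the part this sketch is meant to convey.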
This library is no longer maintained and has been superseded by OpenBLAS.