AMD announces AMD Core Math Library (ACML) 4.0

From the press announcement:

ACML is a set of numerical routines tuned specifically for AMD64 processors designed to increase support for linear programming, an optimizing technique for mathematically intensive code, in multi-core processing environments. Enhancements to ACML announced today include an update to the Linear Algebra Package (LAPACK) and increased optimizations for Quad-Core AMD Processors, including Quad Core AMD Opteron™ Processors and the upcoming AMD Phenom™ quad-core processor. ACML is specifically designed to use key components of next-generation AMD processors, including Streaming SIMD Extensions (SSE), and to create highly threaded code that can help developers to better utilize multi-core processor technology. ACML consists of the following features:

  • A full implementation of Level 1, 2 and 3 Basic Linear Algebra Subroutines (BLAS), with key routines optimized for high performance on AMD Opteron processors.
  • A full suite of Linear Algebra (LAPACK) routines. As well as taking advantage of the highly-tuned BLAS kernels, a key set of LAPACK routines has been further optimized to achieve considerably higher performance than standard LAPACK implementations.
  • A comprehensive suite of Fast Fourier Transforms (FFTs) in single-, double-, single-complex and double-complex data types.
  • Fast scalar, vector, and array math transcendental library routines optimized for high performance on AMD Opteron processors.
  • Random Number Generators in both single- and double-precision.

ACML 4.0 is available for download in various versions for different platforms, compilers and bit widths.
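
For readers unfamiliar with the three BLAS levels named in the feature list, the sketch below shows what one representative operation from each level computes. It is a naive pure-Python illustration only: the function names mirror the conventional BLAS operation names (axpy, gemv, gemm), but the signatures are simplified assumptions and are not the ACML calling interface, and nothing here reflects how ACML implements or optimizes these routines.

```python
# Naive reference versions of one operation from each BLAS level.
# Illustration only: not ACML code and not the ACML calling interface.
# Matrices are plain lists of rows; vectors are plain lists.

def axpy(alpha, x, y):
    """Level 1 (vector-vector): return alpha*x + y."""
    return [alpha * xi + yi for xi, yi in zip(x, y)]

def gemv(alpha, a, x, beta, y):
    """Level 2 (matrix-vector): return alpha*A*x + beta*y."""
    return [alpha * sum(aij * xj for aij, xj in zip(row, x)) + beta * yi
            for row, yi in zip(a, y)]

def gemm(alpha, a, b, beta, c):
    """Level 3 (matrix-matrix): return alpha*A*B + beta*C."""
    cols_b = list(zip(*b))  # columns of B, so each entry is a row-by-column dot product
    return [[alpha * sum(aik * bkj for aik, bkj in zip(row, col)) + beta * cij
             for col, cij in zip(cols_b, c_row)]
            for row, c_row in zip(a, c)]

if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0]]
    b = [[5.0, 6.0], [7.0, 8.0]]
    zeros = [[0.0, 0.0], [0.0, 0.0]]
    print(axpy(2.0, [1.0, 2.0], [3.0, 4.0]))
    print(gemv(1.0, a, [1.0, 1.0], 0.0, [0.0, 0.0]))
    print(gemm(1.0, a, b, 0.0, zeros))
```

The work per element of data grows from Level 1 to Level 3, which is why tuned libraries tend to concentrate their optimization effort on the matrix-matrix routines and build higher-level packages such as LAPACK on top of them.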