MAGMA 1.6 features top performance and high accuracy LAPACK compliant routines for multicore CPUs enhanced with NVIDIA GPUs and includes more than 400 routines, covering one-sided dense matrix factorizations and solvers, two-sided factorizations and eigen/singular-value problem solvers, as well as a subset of highly optimized BLAS for GPUs. In 2014, the MAGMA Sparse and MAGMA Batched packages were added with the MAGMA 1.6 release, providing support for sparse iterative and batched linear algebra on a set of small matrices in parallel, respectively. MAGMA provides multiple precision arithmetic support (S/D/C/Z, including mixed-precision). Most of the algorithms are hybrid, using both multicore CPUs and GPUs, but starting with the 1.6 release, GPU-specific algorithms were added. MAGMA also supports AMD GPUs (clMAGMA 1.3) and Intel Xeon Phi coprocessors (MAGMA MIC 1.3)."
MAGMA Embedded: Towards a Dense Linear Algebra Library for Energy Efficient Extreme Computing - http://icl.eecs.utk.edu/magma/
Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems - http://superfri.org/superfri/article/view/90
Investigating Half Precision Arithmetic to Accelerate Dense Linear System Solvers - https://hgpu.org/?p=17870
Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems - http://superfri.org/superfri/article/view/90
Investigating Half Precision Arithmetic to Accelerate Dense Linear System Solvers - https://hgpu.org/?p=17870
No comments:
Post a Comment