"KBLAS
(KAUST-BLAS) is a small open-source library that optimizes critical
numerical kernels on CUDA-enabled GPUs. KBLAS provides a subset of
standard BLAS functions. It also proposes some function with BLAS-like
interface that target both single and multi- GPU systems.
The
ultimate goal for KBLAS is performance. KBLAS has a set of tuning
parameters that affect its performance according to the GPU
architecture, and the CUDA runtime version. While we cannot guarantee
optimal performance with the default tuning parameters, the user can
easily edit such parameters on his local system. KBLAS might be shipped
with autotuners in the future. The user can refer to the tuning chapter
in this document."
KBLAS: High Performance Level-2 BLAS on Multi-GPU Systems - http://on-demand.gputechconf.com/gtc/2014/poster/pdf/P4168_KBLAS_GPU_computing_optimization.pdf
No comments:
Post a Comment