spmvELLRT

spmvELLRT

A library for efficient sparse matrix vector product on GPUs

F. Vázquez (1), G. Ortega (1), J.J. Fernández (1,2), E.M. Garzón (1)

(1) Dept. Computer Architecture. Univ. Almeria. 04120 Almeria. Spain.

(2) Centro Nacional de Biotecnología (CSIC). Campus UAM. 28049 Madrid. Spain.

Description

Sparse matrices are involved in linear systems, eigensystems and partial differential equations from a wide spectrum of scientific and engineering disciplines. Hence, sparse matrix vector product (SpMV) is considered as key operation in engineering and scientific computing. For these applications the optimization of the sparse matrix vector product (SpMV) is very relevant. However, the irregular computation involved in SpMV prevents the optimum exploitation of computational architectures when the sparse matrices are very large. Graphics Processing Units (GPUs) have recently emerged as platforms that yield outstanding acceleration factors. SpMV implementations for GPUs have already appeared on the scene. Recently a new, efficient format for SpMV for GPUs has successfully been introduced and evaluated (ELLRT). This format is based on the format ELLPACKR, which also derives from the well known ELLPACK and allows storage of the sparse matrix in a regular manner. In general, the new format turns out to outperform other formats previously used in scientific computing.

The library spmvELLRT implements efficient sparse matrix vector product on GPUs based on the new approach described in:

Improving the performance of the sparse matrix vector product with GPUs.

F. Vazquez, G. Ortega, J.J. Fernandez, E.M. Garzon.

Procs. 10th IEEE Intl. Conf. Computer and Information Technology (CIT 2010), pp: 1146-1151, 2010.

[PDF]

A new approach for sparse matrix vector product on NVIDIA GPUs.

F. Vazquez, J.J. Fernandez, E.M. Garzon. 

Concurrency and Computation: Practice and Experience 23:815-826, 2011.

[PDF]

Please, cite these articles if you use  spmvELLRT  in your work.

Download

Current version: July 2010

The development of this package has been supported by the Spanish MEC and MCI, J.Andalucia and CSIC.

Copyright by the authors.