TY - GEN
T1 - A GPU framework for sparse matrix vector multiplication
AU - Neelima, B.
AU - Reddy, G. Ram Mohana
AU - Raghavendra, Prakash S.
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/9/16
Y1 - 2014/9/16
N2 - The hardware and software evolutions related to Graphics Processing Units (GPUs), for general purpose computations, have changed the way the parallel programming issues are addressed. Many applications are being ported onto GPU for achieving performance gain. The GPU execution time is continuously optimized by the GPU programmers while optimizing pre-GPU computation overheads attracted the research community in the recent past. While GPU executes the programs given by a CPU, pre-GPU computation overheads does exists and should be optimized for a better usage of GPUs. The GPU framework proposed in this paper improves the overall performance of the application by optimizing pre-GPU computation overheads along with GPU execution time. This paper proposes a sparse matrix format prediction tool to predict an optimal sparse matrix format to be used for a given input matrix by analyzing the input sparse matrix and considering pre-GPU computation overheads. The sparse matrix format predicted by the proposed method is compared against the best performing sparse matrix formats posted in the literature. The proposed model is based on the static data that is available from the input directly and hence the prediction overhead is very small. Compared to GPU specific sparse format prediction, the proposed model is more inclusive and precious in terms of increasing overall application's performance.
AB - The hardware and software evolutions related to Graphics Processing Units (GPUs), for general purpose computations, have changed the way the parallel programming issues are addressed. Many applications are being ported onto GPU for achieving performance gain. The GPU execution time is continuously optimized by the GPU programmers while optimizing pre-GPU computation overheads attracted the research community in the recent past. While GPU executes the programs given by a CPU, pre-GPU computation overheads does exists and should be optimized for a better usage of GPUs. The GPU framework proposed in this paper improves the overall performance of the application by optimizing pre-GPU computation overheads along with GPU execution time. This paper proposes a sparse matrix format prediction tool to predict an optimal sparse matrix format to be used for a given input matrix by analyzing the input sparse matrix and considering pre-GPU computation overheads. The sparse matrix format predicted by the proposed method is compared against the best performing sparse matrix formats posted in the literature. The proposed model is based on the static data that is available from the input directly and hence the prediction overhead is very small. Compared to GPU specific sparse format prediction, the proposed model is more inclusive and precious in terms of increasing overall application's performance.
UR - https://www.scopus.com/pages/publications/84908653542
UR - https://www.scopus.com/pages/publications/84908653542#tab=citedBy
U2 - 10.1109/ISPDC.2014.10
DO - 10.1109/ISPDC.2014.10
M3 - Conference contribution
AN - SCOPUS:84908653542
T3 - Proceedings - IEEE 13th International Symposium on Parallel and Distributed Computing, ISPDC 2014
SP - 51
EP - 58
BT - Proceedings - IEEE 13th International Symposium on Parallel and Distributed Computing, ISPDC 2014
A2 - Muntean, Traian
A2 - Rolland, Robert
A2 - Mugwaneza, Leon
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th IEEE International Symposium on Parallel and Distributed Computing, ISPDC 2014
Y2 - 24 June 2014 through 27 June 2014
ER -