메뉴 건너뛰기




Volumn 30, Issue 2, 2004, Pages 187-210

Architecture of an automatically tuned linear algebra library

Author keywords

Automatic tuning; Block methods; High performance computing; Linear algebra; Polylibraries

Indexed keywords

ALGORITHMS; COMPUTER SOFTWARE; EIGENVALUES AND EIGENFUNCTIONS; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS; TUNING;

EID: 1342306684     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2003.11.002     Document Type: Article
Times cited : (31)

References (35)
  • 4
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, high-performance, ansi C coding methodology
    • Bilmes J., Asanovic K., Chin C.W., Demmel J. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ansi C coding methodology. International Conference on Supercomputing. 3:1997;340-347.
    • (1997) International Conference on Supercomputing , vol.3 , pp. 340-347
    • Bilmes, J.1    Asanovic, K.2    Chin, C.W.3    Demmel, J.4
  • 9
    • 0242658775 scopus 로고    scopus 로고
    • Self adapting software for numerical linear algebra and LAPACK for clusters
    • Chen Z., Dongarra J., Luszczek P., Roche K. Self adapting software for numerical linear algebra and LAPACK for clusters. Parallel Computing. 29(11/12):2003;1723-1743.
    • (2003) Parallel Computing , vol.29 , Issue.11-12 , pp. 1723-1743
    • Chen, Z.1    Dongarra, J.2    Luszczek, P.3    Roche, K.4
  • 17
    • 0031636309 scopus 로고    scopus 로고
    • FFTW: An adaptive software architecture for the FFT
    • IEEE Press
    • Frigo M. FFTW: an adaptive software architecture for the FFT. Proceedings of the ICASSP Conference. vol. 3:1998;1381-1384 IEEE Press.
    • (1998) Proceedings of the ICASSP Conference , vol.3 , pp. 1381-1384
    • Frigo, M.1
  • 21
    • 0010578197 scopus 로고    scopus 로고
    • Heterogeneous distribution of computations while solving linear algebra problems on networks of heterogeneous computers
    • Kalinov A., Lastovetsky A. Heterogeneous distribution of computations while solving linear algebra problems on networks of heterogeneous computers. Journal of Parallel and Distributed Computing, Academic Press. 61(4):2001;520-535.
    • (2001) Journal of Parallel and Distributed Computing, Academic Press , vol.61 , Issue.4 , pp. 520-535
    • Kalinov, A.1    Lastovetsky, A.2
  • 23
    • 1342338519 scopus 로고    scopus 로고
    • Performance of automatically tuned parallel GMRES(m) method on distributed memory machines
    • Japan: University of Tokyo
    • Kuroda H., Katagiri T., Kanada Y. Performance of automatically tuned parallel GMRES(m) method on distributed memory machines. Proceedings of Hakken Kagaku Team. 1999;11-19 University of Tokyo, Japan.
    • (1999) Proceedings of Hakken Kagaku Team , pp. 11-19
    • Kuroda, H.1    Katagiri, T.2    Kanada, Y.3
  • 26
    • 0038368778 scopus 로고    scopus 로고
    • Deploying parallel numerical library routines to cluster computing in a self adapting fashion
    • J. Murli, & P. Vanneschi. London: Imperial College Press
    • Roche K.J., Dongarra J.J. Deploying parallel numerical library routines to cluster computing in a self adapting fashion. Murli J., Vanneschi P. Proceedings of ParCo2001, Parallel Computing: Advances and Current Issues. 2002;Imperial College Press, London.
    • (2002) Proceedings of ParCo2001, Parallel Computing: Advances and Current Issues
    • Roche, K.J.1    Dongarra, J.J.2
  • 28
    • 84960391710 scopus 로고    scopus 로고
    • Block size selection of parallel LU factorization
    • IEEE Computer Society
    • Zhang Y. Block size selection of parallel LU factorization. Proceedings of HPC-ASIA2000. vol. 1:2000;247-249 IEEE Computer Society.
    • (2000) Proceedings of HPC-ASIA2000 , vol.1 , pp. 247-249
    • Zhang, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.