Volume , Issue , 2009, Pages

Triangular matrix inversion on Graphics Processing Unit

Author keywords

CUDA; Dense matrix inversion; GPGPU; Triangular matrix

Indexed keywords

BASIC PROCEDURE; COMMERCIAL GRAPHICS; DENSE MATRICES; DIVIDE AND CONQUER; DOUBLE PRECISION; FACTORIZATION METHODS; GRAPHICS PROCESSING UNIT; LINEAR ALGEBRA ALGORITHMS; LU DECOMPOSITION; MATRIX; TRIANGULAR MATRICES;

EID: 74049149935     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1654059.1654069     Document Type: Conference Paper
Times cited : (30)

References (17)
  • 1
    • 33845468997
    • Galoppo, N., Govindaraju, N. K., Henson, M., and Manocha, D. 2005. "LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware". In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (November 12 - 18, 2005). Conference on High Performance Networking and Computing. IEEE Computer Society, Washington, DC, 3. DOI= http://dx.doi.org/10.1109/SC.2005.42
  • 3
    • 51849144655
    • Barrachina, S., Castillo, M., Igual, F. D., Mayo, R., and Quintana-Ortí, E. S. 2008. "Solving Dense Linear Systems on Graphics Processors". In Proceedings of the 14th International Euro-Par Conference on Parallel Processing (Las Palmas de Gran Canaria, Spain, August 26 - 29, 2008). E. Luque, T. Margalef, and D. Benítez, Eds. Lecture Notes in Computer Science, vol. 5168. Springer-Verlag, Berlin, Heidelberg, 739-748. DOI= http://dx.doi.org/10.1007/978-3-540-85451-7_79
  • 4
    • 74049153535
    • V. Volkov and J. W. Demmel, "LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs", EECS Department, University of California, Berkeley, Technical Report No. UCB/EECS-2008-49, May 13, 2008.
  • 5
    • 74049118588
    • NVIDIA 2008. NVIDIA CUDA Compute Unified Device Architecture, Programming Guide, v. 2.2.
  • 6
    • 34548458883
    • Liu, W., Schmidt, B., and Müller-Wittig, W. 2007. "Performance Analysis of General-Purpose Computation on Commodity Graphics Hardware: A Case Study Using Bioinformatics". J. VLSI Signal Process. Syst. 48, 3 (Sep. 2007), 209-221. DOI= http://dx.doi.org/10.1007/s11265-007-0064-7
  • 8
    • 78650082402
    • NVIDIA, "NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing", Tech. Rep., NVIDIA, 2008. URL: www.nvidia.com/docs/IO/55506/GeForce-GTX-200-GPU-Technical-Brief.pdf
  • 9
    • 0001712739
    • D. Heller, "A survey of parallel algorithms in numerical linear algebra", SIAM Review 20 (1978), 740-777.
  • 11
    • 0004354231
    • S.M. Balle, P.C. Hansen, "A Strassen-type matrix inversion algorithm", in: I. Dimov, O. Tanevk (Eds.), Advances in Parallel Algorithms, IOS Press, 1994, pp. 22-30.
  • 12
    • 0035546674
    • Nasri, W., Mahjoub, Z., "Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers", Parallel Computing, Volume 27, Issue 13, 1 December 2001, Pages 1767-1782, ISSN 0167-8191, DOI: 10.1016/S0167-8191(01)00111-9.
  • 13
    • 78651269052
    • Fatahalian, K., Sugerman, J., and Hanrahan, P. 2004. "Understanding the efficiency of GPU algorithms for matrix-matrix multiplication". In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Grenoble, France, August 29 - 30, 2004). HWWS '04. ACM, New York, NY, 133-137. DOI= http://doi.acm.org/10.1145/1058129.1058148
  • 14
    • 50949166640
    • Barrachina, S.; Castillo, M.; Igual, F.D.; Mayo, R.; Quintana-Orti, E.S., "Evaluation and tuning of the Level 3 CUBLAS for graphics processors," Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on Supercomputing, pp. 1-8, 14-18 April 2008. URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4536485&isnumber=4536075
  • 15
    • 1542698782
    • Drepper, U., Molnar, I. "The Native POSIX Thread Library for Linux". Technical report, Redhat, February 2003.
  • 16
    • 74049103184
    • Anderson, E., Bai, Z., Bischof, C., Blackford, L. S., Demmel, J., Dongarra, J. J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., and Sorensen, D. 1999. "LAPACK Users' Guide (Third Ed.)". Society for Industrial and Applied Mathematics.
  • 17
    • 0002806690
    • Dagum, L.; Menon, R., "OpenMP: an industry standard API for shared-memory programming", Computational Science & Engineering, IEEE, vol. 5, no. 1, pp. 46-55, Jan-Mar 1998. URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=660313&isnumber=14417


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.