SCOPUS 정보 검색 플랫폼

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09

Volumn , Issue , 2009, Pages

Triangular matrix inversion on Graphics Processing Unit

(4) Ries, Florian a De Marco, Tommaso a Zivieri, Matteo a Guerrieri, Roberto a

a UNIVERSITY OF BOLOGNA (Italy)

Author keywords

CUDA; Dense matrix inversion; GPGPU; Trianguar matrix

Indexed keywords

BASIC PROCEDURE; COMMERCIAL GRAPHICS; DENSE MATRICES; DIVIDE AND CONQUER; DOUBLE PRECISION; FACTORIZATION METHODS; GRAPHICS PROCESSING UNIT; LINEAR ALGEBRA ALGORITHMS; LU DECOMPOSITION; MATRIX; TRIANGULAR MATRICES;

COMPUTER GRAPHICS EQUIPMENT; IMAGE CODING; INTERCONNECTION NETWORKS; LEARNING ALGORITHMS; PROGRAM PROCESSORS;

MATRIX ALGEBRA;

EID: 74049149935 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1654059.1654069 Document Type: Conference Paper

Times cited : (30)

References (17)

1
- 33845468997
- Galoppo, N., Govindaraju, N. K., Henson, M., and Manocha, D. 2005. LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware. In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (November 12 - 18, 2005). Conference on High Performance Networking and Computing. IEEE Computer Society, Washington, DC, 3. DOI= http://dx.doi.org/10.1109/SC.2005.42
- Galoppo, N., Govindaraju, N. K., Henson, M., and Manocha, D. 2005. "LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware". In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (November 12 - 18, 2005). Conference on High Performance Networking and Computing. IEEE Computer Society, Washington, DC, 3. DOI= http://dx.doi.org/10.1109/SC.2005.42

2
- 70350771131
- Benchmarking GPUs to tune dense linear algebra
- Online, Available
- V. Volkov and J. W. Demmel, "Benchmarking GPUs to tune dense linear algebra," in SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing. Piscataway, NJ, USA: IEEE Press, 2008, pp. 1-11. [Online]. Available: http://dx.doi.org/10.1145/1413370.1413402
- (2008) SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing. Piscataway, NJ, USA: IEEE Press , pp. 1-11
- Volkov, V.¹ Demmel, J.W.²

3
- 51849144655
- Barrachina, S., Castillo, M., Igual, F. D., Mayo, R., and Quintana-Ortí, E. S. 2008. Solving Dense Linear Systems on Graphics Processors. In Proceedings of the 14th international Euro-Par Conference on Parallel Processing (Las Palmas de Gran Canaria, Spain, August 26 - 29, 2008). E. Luque, T. Margalef, and D. Benítez, Eds. Lecture Notes In Computer Science, 5168. Springer-Verlag, Berlin, Heidelberg, 739-748. DOI= http://dx.doi.org/10.1007/978-3-540-85451-7-79 C. J.
- Barrachina, S., Castillo, M., Igual, F. D., Mayo, R., and Quintana-Ortí, E. S. 2008. "Solving Dense Linear Systems on Graphics Processors". In Proceedings of the 14th international Euro-Par Conference on Parallel Processing (Las Palmas de Gran Canaria, Spain, August 26 - 29, 2008). E. Luque, T. Margalef, and D. Benítez, Eds. Lecture Notes In Computer Science, vol. 5168. Springer-Verlag, Berlin, Heidelberg, 739-748. DOI= http://dx.doi.org/10.1007/978-3-540-85451-7-79 C. J.

4
- 74049153535
- V. Volkov and J. W. Demmel, LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs, EECS Department University of California, Berkeley Technical Report No. UCB/EECS-2008-49 May 13, 2008
- V. Volkov and J. W. Demmel, "LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs", EECS Department University of California, Berkeley Technical Report No. UCB/EECS-2008-49 May 13, 2008

5
- 74049118588
- NVIDIA 2008. NVIDIA CUDA Compute Unified Device Architecture, Programming Guide, v. 2.2.
- NVIDIA 2008. NVIDIA CUDA Compute Unified Device Architecture, Programming Guide, v. 2.2.

6
- 34548458883
- Performance Analysis of General-Purpose Computation on Commodity Graphics Hardware: A Case Study Using Bioinformatics
- DOI= http://dx.doi.org/10.1007/s11265-007-0064-7, Sep
- Liu, W., Schmidt, B., and Müller-Wittig, W. 2007. "Performance Analysis of General-Purpose Computation on Commodity Graphics Hardware: A Case Study Using Bioinformatics". J. VLSI Signal Process. Syst. 48, 3 (Sep. 2007), 209-221. DOI= http://dx.doi.org/10.1007/s11265-007-0064-7
- (2007) J. VLSI Signal Process. Syst , vol.48 , Issue.3 , pp. 209-221
- Liu, W.¹ Schmidt, B.² Müller-Wittig, W.³

7
- 25844479498
- A Survey of General-Purpose Computation on Graphics Hardware
- J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. Lefohn and T. Purcell, "A Survey of General-Purpose Computation on Graphics Hardware," in Eurographics 2005, pp. 21-51.
- (2005) Eurographics , pp. 21-51
- Owens, J.¹ Luebke, D.² Govindaraju, N.³ Harris, M.⁴ Kruger, J.⁵ Lefohn, A.⁶ Purcell, T.⁷

8
- 78650082402
- NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing
- Tech. Rep, NVIDIA, 2008. URL
- NVIDIA, NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing, Tech. Rep., NVIDIA, 2008. URL: www.nvidia.com/docs/IO/55506/GeForce-GTX-200-GPU-Technical- Brief.pdf

9
- 0001712739
- A survey of parallel algorithms in numerical linear algebra
- D. Heller, "A survey of parallel algorithms in numerical linear algebra", SIAM Review 20, (1978) 740-777.
- (1978) SIAM Review , vol.20 , pp. 740-777
- Heller, D.¹

10
- 0003487687
- Halstead Press, New York
- Y. Robert, "The Impact of Vector and Parallel Architectures on the Gaussian Elimination", Halstead Press, New York, 1990
- (1990) The Impact of Vector and Parallel Architectures on the Gaussian Elimination
- Robert, Y.¹

11
- 0004354231
- A Strassen-type matrix inversion algorithm
- I. Dimov, O. TanevkEds, IOS Press
- S.M. Balle, P.C. Hansen, "A Strassen-type matrix inversion algorithm", in: I. Dimov, O. Tanevk(Eds.), Advances in Parallel Algorithms, IOS Press, 1994, pp. 22-30.
- (1994) Advances in Parallel Algorithms , pp. 22-30
- Balle, S.M.¹ Hansen, P.C.²

12
- 0035546674
- Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers
- 1 December, ISSN 0167-8191, DOI: 10.1016/S0167-8191(01)00111-9
- Nasri, W., Mahjoub, Z., "Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers", Parallel Computing, Volume 27, Issue 13, 1 December 2001, Pages 1767-1782, ISSN 0167-8191, DOI: 10.1016/S0167-8191(01)00111-9.
- (2001) Parallel Computing , vol.27 , Issue.13 , pp. 1767-1782
- Nasri, W.¹ Mahjoub, Z.²

13
- 78651269052
- Fatahalian, K., Sugerman, J., and Hanrahan, P. 2004. Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Grenoble, France, August 29 - 30, 2004). HWWS '04. ACM, New York, NY, 133-137. DOI= http://doi.acm.org/10.1145/1058129.1058148
- Fatahalian, K., Sugerman, J., and Hanrahan, P. 2004. "Understanding the efficiency of GPU algorithms for matrix-matrix multiplication". In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Grenoble, France, August 29 - 30, 2004). HWWS '04. ACM, New York, NY, 133-137. DOI= http://doi.acm.org/10.1145/1058129.1058148

14
- 50949166640
- Barrachina, S.; Castillo, M.; Igual, F.D.; Mayo, R.; Quintana-Orti, E.S., Evaluation and tuning of the Level 3 CUBLAS for graphics processors, Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on Supercomputing, no., pp.1-8, 14-18 April 2008 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber= 4536485&isnumber=4536075
- Barrachina, S.; Castillo, M.; Igual, F.D.; Mayo, R.; Quintana-Orti, E.S., "Evaluation and tuning of the Level 3 CUBLAS for graphics processors," Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on Supercomputing, vol., no., pp.1-8, 14-18 April 2008 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber= 4536485&isnumber=4536075

15
- 1542698782
- The Native POSIX Thread Library for Linux
- Technical report, Redhat, February 2003
- Drepper, U., Molnar, I. "The Native POSIX Thread Library for Linux". Technical report, Redhat, February 2003.
- Drepper, U.¹ Molnar, I.²

16
- 74049103184
- Anderson, E., Bai, Z., Bischof, C., Blackford, L. S., Demmel, J., Dongarra, J. J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., and Sorensen, D. 1999 LAPACK Users' Guide (Third Ed.). Society for Industrial and Applied Mathematics.
- Anderson, E., Bai, Z., Bischof, C., Blackford, L. S., Demmel, J., Dongarra, J. J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., and Sorensen, D. 1999 "LAPACK Users' Guide (Third Ed.)". Society for Industrial and Applied Mathematics.

17
- 0002806690
- OpenMP: An industry standard API for shared-memory programming
- IEEE, Jan-Mar 1998 URL
- Dagum, L.; Menon, R., "OpenMP: an industry standard API for shared-memory programming", Computational Science & Engineering, IEEE , vol.5, no.1, pp.46-55, Jan-Mar 1998 URL: http://ieeexplore.ieee.org/ stamp/stamp.jsp?arnumber= 660313&isnumber=14417
- Computational Science & Engineering , vol.5 , Issue.1 , pp. 46-55
- Dagum, L.¹ Menon, R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.