-
1
-
-
33845468997
-
-
Galoppo, N., Govindaraju, N. K., Henson, M., and Manocha, D. 2005. LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware. In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (November 12 - 18, 2005). Conference on High Performance Networking and Computing. IEEE Computer Society, Washington, DC, 3. DOI= http://dx.doi.org/10.1109/SC.2005.42
-
Galoppo, N., Govindaraju, N. K., Henson, M., and Manocha, D. 2005. "LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware". In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (November 12 - 18, 2005). Conference on High Performance Networking and Computing. IEEE Computer Society, Washington, DC, 3. DOI= http://dx.doi.org/10.1109/SC.2005.42
-
-
-
-
2
-
-
70350771131
-
Benchmarking GPUs to tune dense linear algebra
-
Online, Available
-
V. Volkov and J. W. Demmel, "Benchmarking GPUs to tune dense linear algebra," in SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing. Piscataway, NJ, USA: IEEE Press, 2008, pp. 1-11. [Online]. Available: http://dx.doi.org/10.1145/1413370.1413402
-
(2008)
SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing. Piscataway, NJ, USA: IEEE Press
, pp. 1-11
-
-
Volkov, V.1
Demmel, J.W.2
-
3
-
-
51849144655
-
-
Barrachina, S., Castillo, M., Igual, F. D., Mayo, R., and Quintana-Ortí, E. S. 2008. Solving Dense Linear Systems on Graphics Processors. In Proceedings of the 14th international Euro-Par Conference on Parallel Processing (Las Palmas de Gran Canaria, Spain, August 26 - 29, 2008). E. Luque, T. Margalef, and D. Benítez, Eds. Lecture Notes In Computer Science, 5168. Springer-Verlag, Berlin, Heidelberg, 739-748. DOI= http://dx.doi.org/10.1007/978-3-540-85451-7-79 C. J.
-
Barrachina, S., Castillo, M., Igual, F. D., Mayo, R., and Quintana-Ortí, E. S. 2008. "Solving Dense Linear Systems on Graphics Processors". In Proceedings of the 14th international Euro-Par Conference on Parallel Processing (Las Palmas de Gran Canaria, Spain, August 26 - 29, 2008). E. Luque, T. Margalef, and D. Benítez, Eds. Lecture Notes In Computer Science, vol. 5168. Springer-Verlag, Berlin, Heidelberg, 739-748. DOI= http://dx.doi.org/10.1007/978-3-540-85451-7-79 C. J.
-
-
-
-
4
-
-
74049153535
-
-
V. Volkov and J. W. Demmel, LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs, EECS Department University of California, Berkeley Technical Report No. UCB/EECS-2008-49 May 13, 2008
-
V. Volkov and J. W. Demmel, "LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs", EECS Department University of California, Berkeley Technical Report No. UCB/EECS-2008-49 May 13, 2008
-
-
-
-
5
-
-
74049118588
-
-
NVIDIA 2008. NVIDIA CUDA Compute Unified Device Architecture, Programming Guide, v. 2.2.
-
NVIDIA 2008. NVIDIA CUDA Compute Unified Device Architecture, Programming Guide, v. 2.2.
-
-
-
-
6
-
-
34548458883
-
Performance Analysis of General-Purpose Computation on Commodity Graphics Hardware: A Case Study Using Bioinformatics
-
DOI= http://dx.doi.org/10.1007/s11265-007-0064-7, Sep
-
Liu, W., Schmidt, B., and Müller-Wittig, W. 2007. "Performance Analysis of General-Purpose Computation on Commodity Graphics Hardware: A Case Study Using Bioinformatics". J. VLSI Signal Process. Syst. 48, 3 (Sep. 2007), 209-221. DOI= http://dx.doi.org/10.1007/s11265-007-0064-7
-
(2007)
J. VLSI Signal Process. Syst
, vol.48
, Issue.3
, pp. 209-221
-
-
Liu, W.1
Schmidt, B.2
Müller-Wittig, W.3
-
7
-
-
25844479498
-
A Survey of General-Purpose Computation on Graphics Hardware
-
J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. Lefohn and T. Purcell, "A Survey of General-Purpose Computation on Graphics Hardware," in Eurographics 2005, pp. 21-51.
-
(2005)
Eurographics
, pp. 21-51
-
-
Owens, J.1
Luebke, D.2
Govindaraju, N.3
Harris, M.4
Kruger, J.5
Lefohn, A.6
Purcell, T.7
-
8
-
-
78650082402
-
NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing
-
Tech. Rep, NVIDIA, 2008. URL
-
NVIDIA, NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing, Tech. Rep., NVIDIA, 2008. URL: www.nvidia.com/docs/IO/55506/GeForce-GTX-200-GPU-Technical- Brief.pdf
-
-
-
-
9
-
-
0001712739
-
A survey of parallel algorithms in numerical linear algebra
-
D. Heller, "A survey of parallel algorithms in numerical linear algebra", SIAM Review 20, (1978) 740-777.
-
(1978)
SIAM Review
, vol.20
, pp. 740-777
-
-
Heller, D.1
-
11
-
-
0004354231
-
A Strassen-type matrix inversion algorithm
-
I. Dimov, O. TanevkEds, IOS Press
-
S.M. Balle, P.C. Hansen, "A Strassen-type matrix inversion algorithm", in: I. Dimov, O. Tanevk(Eds.), Advances in Parallel Algorithms, IOS Press, 1994, pp. 22-30.
-
(1994)
Advances in Parallel Algorithms
, pp. 22-30
-
-
Balle, S.M.1
Hansen, P.C.2
-
12
-
-
0035546674
-
Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers
-
1 December, ISSN 0167-8191, DOI: 10.1016/S0167-8191(01)00111-9
-
Nasri, W., Mahjoub, Z., "Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers", Parallel Computing, Volume 27, Issue 13, 1 December 2001, Pages 1767-1782, ISSN 0167-8191, DOI: 10.1016/S0167-8191(01)00111-9.
-
(2001)
Parallel Computing
, vol.27
, Issue.13
, pp. 1767-1782
-
-
Nasri, W.1
Mahjoub, Z.2
-
13
-
-
78651269052
-
-
Fatahalian, K., Sugerman, J., and Hanrahan, P. 2004. Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Grenoble, France, August 29 - 30, 2004). HWWS '04. ACM, New York, NY, 133-137. DOI= http://doi.acm.org/10.1145/1058129.1058148
-
Fatahalian, K., Sugerman, J., and Hanrahan, P. 2004. "Understanding the efficiency of GPU algorithms for matrix-matrix multiplication". In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Grenoble, France, August 29 - 30, 2004). HWWS '04. ACM, New York, NY, 133-137. DOI= http://doi.acm.org/10.1145/1058129.1058148
-
-
-
-
14
-
-
50949166640
-
-
Barrachina, S.; Castillo, M.; Igual, F.D.; Mayo, R.; Quintana-Orti, E.S., Evaluation and tuning of the Level 3 CUBLAS for graphics processors, Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on Supercomputing, no., pp.1-8, 14-18 April 2008 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber= 4536485&isnumber=4536075
-
Barrachina, S.; Castillo, M.; Igual, F.D.; Mayo, R.; Quintana-Orti, E.S., "Evaluation and tuning of the Level 3 CUBLAS for graphics processors," Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on Supercomputing, vol., no., pp.1-8, 14-18 April 2008 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber= 4536485&isnumber=4536075
-
-
-
-
15
-
-
1542698782
-
The Native POSIX Thread Library for Linux
-
Technical report, Redhat, February 2003
-
Drepper, U., Molnar, I. "The Native POSIX Thread Library for Linux". Technical report, Redhat, February 2003.
-
-
-
Drepper, U.1
Molnar, I.2
-
16
-
-
74049103184
-
-
Anderson, E., Bai, Z., Bischof, C., Blackford, L. S., Demmel, J., Dongarra, J. J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., and Sorensen, D. 1999 LAPACK Users' Guide (Third Ed.). Society for Industrial and Applied Mathematics.
-
Anderson, E., Bai, Z., Bischof, C., Blackford, L. S., Demmel, J., Dongarra, J. J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., and Sorensen, D. 1999 "LAPACK Users' Guide (Third Ed.)". Society for Industrial and Applied Mathematics.
-
-
-
-
17
-
-
0002806690
-
OpenMP: An industry standard API for shared-memory programming
-
IEEE, Jan-Mar 1998 URL
-
Dagum, L.; Menon, R., "OpenMP: an industry standard API for shared-memory programming", Computational Science & Engineering, IEEE , vol.5, no.1, pp.46-55, Jan-Mar 1998 URL: http://ieeexplore.ieee.org/ stamp/stamp.jsp?arnumber= 660313&isnumber=14417
-
Computational Science & Engineering
, vol.5
, Issue.1
, pp. 46-55
-
-
Dagum, L.1
Menon, R.2
|