-
3
-
-
84877609547
-
Brook for gpus: Stream computing on graphics hardware
-
NewYork,NY, USA: ACM
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan, "Brook for gpus: stream computing on graphics hardware," in SIGGRAPH '04: ACM SIGGRAPH 2004 Papers. NewYork,NY, USA: ACM, 2004, pp. 777-786.
-
(2004)
SIGGRAPH '04: ACM SIGGRAPH 2004 Papers
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
4
-
-
78651550268
-
Scalable parallel programming with cuda
-
J. Nickolls, I. Buck, M. Garland, and K. Skadron, "Scalable parallel programming with cuda," Queue, vol.6, no.2, pp. 40-53, 2008.
-
(2008)
Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
5
-
-
49049088756
-
Gpu computing
-
May
-
J. Owens, M. Houston, D. Luebke, S. Green, J. Stone, and J. Phillips, "Gpu computing," Proceedings of the IEEE, vol.96, no.5, pp. 879-899, May 2008.
-
(2008)
Proceedings of the IEEE
, vol.96
, Issue.5
, pp. 879-899
-
-
Owens, J.1
Houston, M.2
Luebke, D.3
Green, S.4
Stone, J.5
Phillips, J.6
-
6
-
-
33947588048
-
A survey of general-purpose computation on graphics hardware
-
[Online]. Available
-
J. D. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Krger, A. E. Lefohn, and T. J. Purcell, "A survey of general-purpose computation on graphics hardware," Computer Graphics Forum, vol.26, no.1, pp. 80-113, 2007. [Online]. Available: http://www.blackwellsynergy.com/doi/pdf/10.1111/j. 1467-8659.2007.01012.x
-
(2007)
Computer Graphics Forum
, vol.26
, Issue.1
, pp. 80-113
-
-
Owens, J.D.1
Luebke, D.2
Govindaraju, N.3
Harris, M.4
Krger, J.5
Lefohn, A.E.6
Purcell, T.J.7
-
7
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan, "Larrabee: a many-core x86 architecture for visual computing," ACM Trans. Graph., vol.27, no.3, pp. 1-15, 2008.
-
(2008)
ACM Trans. Graph.
, vol.27
, Issue.3
, pp. 1-15
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
Lake, A.8
Sugerman, J.9
Cavin, R.10
Espasa, R.11
Grochowski, E.12
Juan, T.13
Hanrahan, P.14
-
8
-
-
72049110101
-
-
http://www.khronos.org/opencl/
-
-
-
-
9
-
-
25844503119
-
Introduction to the cell multiprocessor
-
J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer, and D. Shippy, "Introduction to the cell multiprocessor," IBM J. Res. Dev., vol. 49, no. 4/5, pp. 589-604, 2005.
-
(2005)
IBM J. Res. Dev.
, vol.49
, Issue.4-5
, pp. 589-604
-
-
Kahle, J.A.1
Day, M.N.2
Hofstee, H.P.3
Johns, C.R.4
Maeurer, T.R.5
Shippy, D.6
-
10
-
-
33846818766
-
Examining the viability of FPGA supercomputing
-
S. Craven and P. Athanas, "Examining the viability of FPGA supercomputing," EURASIP J. Embedded Syst., vol.2007, no.1, pp. 13-13, 2007.
-
(2007)
EURASIP J. Embedded Syst.
, vol.2007
, Issue.1
, pp. 13-13
-
-
Craven, S.1
Athanas, P.2
-
11
-
-
44849137198
-
Nvidia tesla: A unified graphics and computing architecture
-
E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, "Nvidia tesla: A unified graphics and computing architecture," IEEE Micro, vol.28, no.2, pp. 39-55, 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
13
-
-
0018515759
-
Basic linear algebra subprograms for fortran usage
-
C. L. Lawson, R. J. Hanson, D. R. Kincaid, and F. T. Krogh, "Basic linear algebra subprograms for fortran usage," ACM Trans. Math. Softw., vol.5, no.3, pp. 308-323, 1979.
-
(1979)
ACM Trans. Math. Softw.
, vol.5
, Issue.3
, pp. 308-323
-
-
Lawson, C.L.1
Hanson, R.J.2
Kincaid, D.R.3
Krogh, F.T.4
-
17
-
-
84999370993
-
The linpack benchmark : AAAn explanation
-
NewYork,NY, USA: Springer-Verlag New York, Inc.
-
J. J. Dongarra, "the linpack benchmark : an explanation," in Proceedings of the 1st International Conference on Supercomputing. NewYork,NY, USA: Springer-Verlag New York, Inc., 1988, pp. 456-474.
-
(1988)
Proceedings of the 1st International Conference on Supercomputing
, pp. 456-474
-
-
Dongarra, J.J.1
-
18
-
-
72049109841
-
-
Intel. [Online]. Available
-
Intel, thread Affinity Interface. [Online]. Available: http://software.intel.com/en-us/intel-compilers/
-
Thread Affinity Interface
-
-
-
19
-
-
23944462603
-
Gpu cluster for high performance computing
-
Washington, DC, USA: IEEE Computer Society
-
Z. Fan, F. Qiu, A. Kaufman, and S. Yoakum-Stover, "Gpu cluster for high performance computing," in SC '04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing. Washington, DC, USA: IEEE Computer Society, 2004, p. 47.
-
(2004)
SC '04: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing
, pp. 47
-
-
Fan, Z.1
Qiu, F.2
Kaufman, A.3
Yoakum-Stover, S.4
-
20
-
-
50949166640
-
Evaluation and tuning of the level 3 cublas for graphics processors
-
1-8, April
-
S. Barrachina, M. Castillo, F. Igual, R. Mayo, and E. Quintana-Orti, "Evaluation and tuning of the level 3 cublas for graphics processors," Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, pp. 1-8, April 2008.
-
(2008)
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
-
-
Barrachina, S.1
Castillo, M.2
Igual, F.3
Mayo, R.4
Quintana-Orti, E.5
-
21
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded gpu using cuda
-
New York, NY, USA: ACM
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu, "Optimization principles and application performance evaluation of a multithreaded gpu using cuda," in PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming. New York, NY, USA: ACM, 2008, pp. 73-82.
-
(2008)
PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 73-82
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Mei, W.6
Hwu, W.7
-
22
-
-
70350771131
-
Benchmarking gpus to tune dense linear algebra
-
V. Volkov and J. Demmel, "Benchmarking gpus to tune dense linear algebra," in SC, 2008, p. 31.
-
(2008)
SC
, pp. 31
-
-
Volkov, V.1
Demmel, J.2
|