-
1
-
-
0024082546
-
The input/output complexity of sorting and related problems
-
AGGARWAL, A., AND VITTER, J. S. 1988. The input/output complexity of sorting and related problems. Commun. ACM 31, 1116-1127.
-
(1988)
Commun. ACM
, vol.31
, pp. 1116-1127
-
-
AGGARWAL, A.1
VITTER, J.S.2
-
2
-
-
0003706460
-
-
SIAM, Philadelphia
-
ANDERSON, E., BAI, Z., BISCHOF, C., DEMMEL, J., DONGARRA, J., DU CROZ, J., GREENBAUM, A., HAMMARLING, S., AND SORENSEN, D. 1992. LAPACK User's Guide, Release 1.0. SIAM, Philadelphia.
-
(1992)
LAPACK User's Guide, Release 1.0
-
-
ANDERSON, E.1
BAI, Z.2
BISCHOF, C.3
DEMMEL, J.4
DONGARRA, J.5
DU CROZ, J.6
GREENBAUM, A.7
HAMMARLING, S.8
SORENSEN, D.9
-
3
-
-
34548217409
-
-
ARGE, L., BRODAL, G., AND FAGERBERG, R. 2004. Cache oblivious data structures. Handbook on Data Structures and Applications.
-
-
-
-
4
-
-
0028743437
-
Compiler transformations for high-performance computing
-
BACON, D. F., GRAHAM, S. L., AND SHARP, O. J. 1994. Compiler transformations for high-performance computing. ACM Comput. Surv. 26, 4, 345-420.
-
(1994)
ACM Comput. Surv
, vol.26
, Issue.4
, pp. 345-420
-
-
BACON, D.F.1
GRAHAM, S.L.2
SHARP, O.J.3
-
7
-
-
0242533311
-
Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
-
BOLZ, J., FARMER, I., GRINSPUN, E., AND SCHRÖDER, P. 2003. Sparse matrix solvers on the GPU: conjugate gradients and multigrid. ACM Trans. Graph. 22, 3, 917-924.
-
(2003)
ACM Trans. Graph
, vol.22
, Issue.3
, pp. 917-924
-
-
BOLZ, J.1
FARMER, I.2
GRINSPUN, E.3
SCHRÖDER, P.4
-
8
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
BUCK, I., FOLEY, T., HORN, D., SUGERMAN, J., FATAHALIAN, K., HOUSTON, M., AND HANRAHAN, P. 2004. Brook for GPUs: stream computing on graphics hardware. ACM Trans. Graph. 23, 3, 777-786.
-
(2004)
ACM Trans. Graph
, vol.23
, Issue.3
, pp. 777-786
-
-
BUCK, I.1
FOLEY, T.2
HORN, D.3
SUGERMAN, J.4
FATAHALIAN, K.5
HOUSTON, M.6
HANRAHAN, P.7
-
11
-
-
23944462603
-
GPU cluster for high performance computing
-
FAN, Z., QIU, F., KAUFMAN, A., AND YOAKUM-STOVER, S. 2004. GPU cluster for high performance computing. In ACM/IEEE Supercomputing Conference 2004.
-
(2004)
ACM/IEEE Supercomputing Conference 2004
-
-
FAN, Z.1
QIU, F.2
KAUFMAN, A.3
YOAKUM-STOVER, S.4
-
13
-
-
0033350255
-
Cache-oblivious algorithms
-
FRIGO, M., LEISERSON, C., PROKOP, H., AND RAMACHANDRAN, S. 1999. Cache-oblivious algorithms. Symposium on Foundations of Computer Science.
-
(1999)
Symposium on Foundations of Computer Science
-
-
FRIGO, M.1
LEISERSON, C.2
PROKOP, H.3
RAMACHANDRAN, S.4
-
14
-
-
33845468997
-
LUGPU: Efficient algorithms for solving dense linear systems on graphics hardware
-
GALOPPO, N., GOVINDARAJU, N., HENSON, M., AND MANOCHA, D. 2005. LUGPU: Efficient algorithms for solving dense linear systems on graphics hardware. In Proc. ACM/IEEE SuperComputing Conference.
-
(2005)
Proc. ACM/IEEE SuperComputing Conference
-
-
GALOPPO, N.1
GOVINDARAJU, N.2
HENSON, M.3
MANOCHA, D.4
-
15
-
-
33845440618
-
GPGPU performance tuning
-
Tech. rep., University of Dortmund, Germany
-
GÖDDEKE, D. 2005. GPGPU performance tuning. Tech. rep., University of Dortmund, Germany, http://www.mathematik.uni-dortmund.de/~goeddeke/gpgpu/.
-
(2005)
-
-
GÖDDEKE, D.1
-
16
-
-
3142739595
-
Fast computation of database operations using graphics processors
-
GOVINDARAJU, N., LLOYD, B., WANG, W., LIN, M., AND MANOCHA, D. 2004. Fast computation of database operations using graphics processors. Proc. of ACM SIGMOD.
-
(2004)
Proc. of ACM SIGMOD
-
-
GOVINDARAJU, N.1
LLOYD, B.2
WANG, W.3
LIN, M.4
MANOCHA, D.5
-
17
-
-
29844438097
-
Fast and approximate stream mining of quantiles and frequencies using graphics processors
-
GOVINDARAJU, N., RAGHUVANSHI, N., AND MANOCHA, D. 2005. Fast and approximate stream mining of quantiles and frequencies using graphics processors. Proc. of ACM SIGMOD.
-
(2005)
Proc. of ACM SIGMOD
-
-
GOVINDARAJU, N.1
RAGHUVANSHI, N.2
MANOCHA, D.3
-
18
-
-
33947607609
-
GPUTeraSort: High performance graphics coprocessor sorting for large database management
-
GOVINDARAJU, N., GRAY, J., KUMAR, R., AND MANOCHA, D. 2006. GPUTeraSort: High performance graphics coprocessor sorting for large database management. Proc. of ACM SIGMOD.
-
(2006)
Proc. of ACM SIGMOD
-
-
GOVINDARAJU, N.1
GRAY, J.2
KUMAR, R.3
MANOCHA, D.4
-
20
-
-
10644280791
-
Cache and bandwidth aware matrix multiplication on the GPU
-
Technical Report UIUCDCS-R-2003-2328, University of Illinois at Urbana-Champaign
-
HALL, J. D., CARR, N., AND HART, J. 2003. Cache and bandwidth aware matrix multiplication on the GPU. Technical Report UIUCDCS-R-2003-2328, University of Illinois at Urbana-Champaign.
-
(2003)
-
-
HALL, J.D.1
CARR, N.2
HART, J.3
-
21
-
-
78651284090
-
Simulation of cloud dynamics on graphics hardware
-
HARRIS, M., BAXTER, B., SCHEUERMANN, G., AND LASTRA, A. 2003. Simulation of cloud dynamics on graphics hardware. SIGGRAPH/Eurographics Workshop on Graphics Hardware.
-
(2003)
SIGGRAPH/Eurographics Workshop on Graphics Hardware
-
-
HARRIS, M.1
BAXTER, B.2
SCHEUERMANN, G.3
LASTRA, A.4
-
22
-
-
0024903997
-
Evaluating associativity in CPU caches
-
HILL, M. D., AND SMITH, A. J. 1989. Evaluating associativity in CPU caches. IEEE Transactions on Computers 38, 12, 1612-1630.
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.12
, pp. 1612-1630
-
-
HILL, M.D.1
SMITH, A.J.2
-
25
-
-
0347304618
-
Data-centric multi-level blocking
-
KODUKULA, I., AHMED, N., AND PINGALI, K. 1997. Data-centric multi-level blocking. Proc. of ACM SIGPLAN, 346-357.
-
(1997)
Proc. of ACM SIGPLAN
, pp. 346-357
-
-
KODUKULA, I.1
AHMED, N.2
PINGALI, K.3
-
26
-
-
77954024744
-
-
KRÜGER, J., AND WESTERMANN, R. 2003. Linear algebra operators for GPU implementation of numerical algorithms. ACM Trans. Graph. 22, 3, 908-916.
-
-
-
-
27
-
-
0026137116
-
The performance and optimization of blocked algorithms
-
LAM, M., ROTHBERG, E., AND WOLF, M. 1991. The performance and optimization of blocked algorithms. Proc. of 4th International conference on Architectural support for programming languages and operating systems, 63-74.
-
(1991)
Proc. of 4th International conference on Architectural support for programming languages and operating systems
, pp. 63-74
-
-
LAM, M.1
ROTHBERG, E.2
WOLF, M.3
-
30
-
-
0027694019
-
Access normalization: Loop restructuring for NUMA computers
-
LI, W., AND PINGALI, K. 1993. Access normalization: loop restructuring for NUMA computers. ACM Transactions on Computer Systems 11, 4, 353-375.
-
(1993)
ACM Transactions on Computer Systems
, vol.11
, Issue.4
, pp. 353-375
-
-
LI, W.1
PINGALI, K.2
-
31
-
-
10644238428
-
Shader algebra
-
MCCOOL, M., TOIT, S. D., POPA, T., CHAN, B., AND MOULE, K. 2004. Shader algebra. ACM Trans. Graph. 23, 3, 787-795.
-
(2004)
ACM Trans. Graph
, vol.23
, Issue.3
, pp. 787-795
-
-
MCCOOL, M.1
TOIT, S.D.2
POPA, T.3
CHAN, B.4
MOULE, K.5
-
32
-
-
34249003958
-
-
OWENS, J., LUEBKE, D., GOVINDARAJU, N., HARRIS, M., KRUGER, J., LEFOHN, A., AND PURCELL, T. 2005. A survey of general-purpose computation on graphics hardware.
-
(2005)
A survey of general-purpose computation on graphics hardware
-
-
OWENS, J.1
LUEBKE, D.2
GOVINDARAJU, N.3
HARRIS, M.4
KRUGER, J.5
LEFOHN, A.6
PURCELL, T.7
-
33
-
-
10444224900
-
Photon mapping on programmable graphics hardware
-
PURCELL, T., DONNER, C., CAMMARANO, M., JENSEN, H., AND HANRAHAN, P. 2003. Photon mapping on programmable graphics hardware. ACM SIGGRAPH/Eurographics Conference on Graphics Hardware, 41-50.
-
(2003)
ACM SIGGRAPH/Eurographics Conference on Graphics Hardware
, pp. 41-50
-
-
PURCELL, T.1
DONNER, C.2
CAMMARANO, M.3
JENSEN, H.4
HANRAHAN, P.5
-
35
-
-
4243187062
-
Towards a theory of cache-efficient algorithms
-
SEN, S., CHATTERJEE, S., AND DUMIR, N. 2002. Towards a theory of cache-efficient algorithms. Journal of the ACM 49, 828-858.
-
(2002)
Journal of the ACM
, vol.49
, pp. 828-858
-
-
SEN, S.1
CHATTERJEE, S.2
DUMIR, N.3
-
37
-
-
0001321490
-
External memory algorithms and data structures: Dealing with massive data
-
VITTER, J. 2001. External memory algorithms and data structures: Dealing with massive data. ACM Computing Surveys, 209-271.
-
(2001)
ACM Computing Surveys
, pp. 209-271
-
-
VITTER, J.1