메뉴 건너뛰기




Volumn , Issue , 2006, Pages

A memory model for scientific algorithms on graphics processors

Author keywords

Graphics processors; Memory model; Scientific algorithms

Indexed keywords

2D BLOCK REPRESENTATIONS; GRAPHICS PROCESSORS; MEMORY MODELS; SCIENTIFIC ALGORITHMS;

EID: 34548292052     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1188455.1188549     Document Type: Conference Paper
Times cited : (128)

References (39)
  • 1
    • 0024082546 scopus 로고
    • The iuput/output complexity of sorting and related problems
    • AGGARWAL, A., AND VITTER, J. S. .1988. The iuput/output complexity of sorting and related problems. Commun. ACM 31, 1116-1127.
    • (1988) Commun. ACM , vol.31 , pp. 1116-1127
    • AGGARWAL, A.1    VITTER, J.S.2
  • 3
    • 34548217409 scopus 로고    scopus 로고
    • AROE, L., B RODAL, G., AND FAOERBERO, R. 2004. Cache oblivious data structures. Handbook on Data Structures and Applications.
    • AROE, L., B RODAL, G., AND FAOERBERO, R. 2004. Cache oblivious data structures. Handbook on Data Structures and Applications.
  • 4
    • 0028743437 scopus 로고
    • Compiler transformations for high-performance computing
    • BACON, D. F., GRAHAM, S. L., AND SHARP, O. J. 1994. Compiler transformations for high-performance computing. ACM Comput. Surv. 26, 4, 345-420.
    • (1994) ACM Comput. Surv , vol.26 , Issue.4 , pp. 345-420
    • BACON, D.F.1    GRAHAM, S.L.2    SHARP, O.J.3
  • 7
    • 0242533311 scopus 로고    scopus 로고
    • Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
    • BOLZ, J., FARMER, I., GRINSPUN, E., AND SCHRÖDER, P. 2003. Sparse matrix solvers on the GPU: conjugate gradients and multigrid. ACM Trans. Graph. 22, 3, 917-924.
    • (2003) ACM Trans. Graph , vol.22 , Issue.3 , pp. 917-924
    • BOLZ, J.1    FARMER, I.2    GRINSPUN, E.3    SCHRÖDER, P.4
  • 15
    • 33845440618 scopus 로고    scopus 로고
    • GPGPU performance tuning
    • Tech. rep, University of Dortmund, Germany
    • GÖDDEKE, D. 2005. GPGPU performance tuning. Tech. rep., University of Dortmund, Germany, http://www.mathematik.uni-dortiimiid.de/ ~goedd8ke/ gpgpu/.
    • (2005)
    • GÖDDEKE, D.1
  • 17
    • 29844438097 scopus 로고    scopus 로고
    • Fast and approximate stream mining of quantites and frequencies using graphics processors
    • GOVINDARAJU, N., RAGHUVANSHI, N., AND MANOCHA, D. 2005. Fast and approximate stream mining of quantites and frequencies using graphics processors. Proc. of ACM SIGMOD.
    • (2005) Proc. of ACM SIGMOD
    • GOVINDARAJU, N.1    RAGHUVANSHI, N.2    MANOCHA, D.3
  • 18
    • 33947607609 scopus 로고    scopus 로고
    • GPUTeraSort: High performance graphics coprocessor sorting for large database management
    • GOVINDARAJU, N., GRAY, J., KUMAR, R., AND MANOCHA, D. 2006. GPUTeraSort: High performance graphics coprocessor sorting for large database management. Proc. of ACM SIGMOD.
    • (2006) Proc. of ACM SIGMOD
    • GOVINDARAJU, N.1    GRAY, J.2    KUMAR, R.3    MANOCHA, D.4
  • 20
    • 10644280791 scopus 로고    scopus 로고
    • Cache and bandwidth aware matrix multiplication on the GPU
    • Technical Report UIUCDCS-R-2003-2328, University of Illinois at Urbana-Champaign
    • HALL, J. D., CARS, N., AND HART, J. 2003. Cache and bandwidth aware matrix multiplication on the GPU. Technical Report UIUCDCS-R-2003-2328, University of Illinois at Urbana-Champaign.
    • (2003)
    • HALL, J.D.1    CARS, N.2    HART, J.3
  • 22
    • 0024903997 scopus 로고
    • Evaluating associativity in cpu caches
    • HILL, M. D., AND SMITH, A.J. 1989. Evaluating associativity in cpu caches. IEEE Transactions on Computers 38, 12, 1612-1630.
    • (1989) IEEE Transactions on Computers , vol.38 , Issue.12 , pp. 1612-1630
    • HILL, M.D.1    SMITH, A.J.2
  • 26
    • 77954024744 scopus 로고    scopus 로고
    • KRÜOER,. J., AND W.ESTERMANN, R. 2003. Linear algebra operators for GPU implementation of numerical algorithms. ACM Trans. Graph. 22, 3, 908-916.
    • KRÜOER,. J., AND W.ESTERMANN, R. 2003. Linear algebra operators for GPU implementation of numerical algorithms. ACM Trans. Graph. 22, 3, 908-916.
  • 30
    • 0027694019 scopus 로고
    • Access normalization: Loop restructuring for numa computers
    • LI, W., AND PINOALI, K. 1993. Access normalization: loop restructuring for numa computers. ACM Transactions on Computer Systems 11, 4, 353-375.
    • (1993) ACM Transactions on Computer Systems , vol.11 , Issue.4 , pp. 353-375
    • LI, W.1    PINOALI, K.2
  • 35
    • 4243187062 scopus 로고    scopus 로고
    • Towards a theory of cache-efficient algorithms
    • SEN, S., CHATTERJEE, S., AND DUMIR, N. 2002. Towards a theory of cache-efficient algorithms. Journal of the ACM 49, 828-858.
    • (2002) Journal of the ACM , vol.49 , pp. 828-858
    • SEN, S.1    CHATTERJEE, S.2    DUMIR, N.3
  • 37
    • 0001321490 scopus 로고    scopus 로고
    • External memory algorithms and data structures: Dealing with, massive data
    • VITTER, J. 2001. External memory algorithms and data structures: Dealing with, massive data. ACM Computing Surveys, 209-271.
    • (2001) ACM Computing Surveys , pp. 209-271
    • VITTER, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.