메뉴 건너뛰기




Volumn 2006, Issue , 2006, Pages 48-57

Programming for parallelism, and locality with hierarchically tiled arrays

Author keywords

Data parallel; Locality enhancement; Parallel programming; Tiling

Indexed keywords

ALGORITHMS; C (PROGRAMMING LANGUAGE); CODES (SYMBOLS); HIERARCHICAL SYSTEMS; PARALLEL PROCESSING SYSTEMS; SOFTWARE ENGINEERING;

EID: 33751022080     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1122971.1122981     Document Type: Conference Paper
Times cited : (96)

References (27)
  • 1
    • 84860018271 scopus 로고    scopus 로고
    • Intel Math Kernel Library, http://www.intel.com/cd/software/products/ asmona/eng/perflib/mkl/index.htm.
  • 13
    • 0031123769 scopus 로고    scopus 로고
    • SUMMA: Scalable universal matrix multiplication algorithm
    • Apr
    • R. A. V. D. Geijn and J. Watts. SUMMA: Scalable Universal Matrix Multiplication Algorithm. Concurrency: Practice and Experience, 9(4):255-274, Apr 1997.
    • (1997) Concurrency: Practice and Experience , vol.9 , Issue.4 , pp. 255-274
    • Geijn, R.A.V.D.1    Watts, J.2
  • 16
    • 84976813879 scopus 로고
    • Compiling fortran D for MlMD distributed-memory machines
    • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Compiling Fortran D for MlMD Distributed-memory Machines. Commun. ACM, 35(8):66-80, 1992.
    • (1992) Commun. ACM , vol.35 , Issue.8 , pp. 66-80
    • Hiranandani, S.1    Kennedy, K.2    Tseng, C.-W.3
  • 18
    • 26444503508 scopus 로고
    • An overview of high performance fortran
    • C. Koelbel and P. Mehrotra. An Overview of High Performance Fortran. SIGPLAN Fortran Forum, 11(4):9-16, 1992.
    • (1992) SIGPLAN Fortran Forum , vol.11 , Issue.4 , pp. 9-16
    • Koelbel, C.1    Mehrotra, P.2
  • 20
    • 0028732614 scopus 로고
    • Global arrays: A portable shared-memory programming model for distributed memory computers
    • pages 340-ff., Los Alamitos, CA, USA, IEEE Computer Society Press
    • J. Nieplocha, R. J. Harrison, and R. J. Littlefield. Global arrays: a portable shared-memory programming model for distributed memory computers. In Supercomputing '94: Proc. of the 1994 Conf. on Supercomputing, pages 340-ff., Los Alamitos, CA, USA, 1994. IEEE Computer Society Press.
    • (1994) Supercomputing '94: Proc. of the 1994 Conf. on Supercomputing
    • Nieplocha, J.1    Harrison, R.J.2    Littlefield, R.J.3
  • 21
    • 0002081678 scopus 로고    scopus 로고
    • Co-array fortran for parallel programming
    • R. W. Numrich and J. Reid. Co-array Fortran for Parallel Programming. SIGPLAN Fortran Forum, 17(2):1-31, 1998.
    • (1998) SIGPLAN Fortran Forum , vol.17 , Issue.2 , pp. 1-31
    • Numrich, R.W.1    Reid, J.2
  • 22
    • 27344435504 scopus 로고    scopus 로고
    • The design and implementation of a first-generation cell processor
    • February
    • D. Pham and et al. The Design and Implementation of a First-generation Cell Processor. In Procs. of the IEEE Solid-State Circuits Symposium, February 2005.
    • (2005) Procs. of the IEEE Solid-state Circuits Symposium
    • Pham, D.1
  • 25
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of sofware and the ATLAS project
    • R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Sofware and the ATLAS Project. Parallel Computing, 27(1-2):3-35, 2001.
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.1    Petitet, A.2    Dongarra, J.3
  • 26
    • 84976827033 scopus 로고
    • A data locality optimizing algorithm
    • ACM Press
    • M. E. Wolf and M. S. Lam. A Data Locality Optimizing Algorithm. In PLDI, pages 30-44. ACM Press, 1991.
    • (1991) PLDI , pp. 30-44
    • Wolf, M.E.1    Lam, M.S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.