Volume , Issue , 2008, Pages 111-122

Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories

Author keywords

Data movement; Graphics processor unit; Multi level tiling; Scratchpad memory

Indexed keywords

ARRAY ACCESS FUNCTIONS; AUTOMATIC DETERMINATION; CELL PROCESSOR; COMPUTATIONAL POWER; DATA MOVEMENTS; FAST MEMORY; GRAPHICS PROCESSOR UNITS; LOCAL MEMORIES; MULTI-LEVEL; MULTI-LEVEL TILING; MULTIPLE LEVELS; OFF-CHIP MEMORIES; ON CHIP MEMORY; ON CHIPS; SCRATCH PAD MEMORY; TILE SIZE;

EID: 79959456077     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (57)

References (40)
  • 1
    • 0032305438
    • Compiler optimizations for real time execution of loops on limited memory embedded systems
    • S. Anantharaman and S. Pande. Compiler optimizations for real time execution of loops on limited memory embedded systems. In IEEE Real-Time Systems Symposium, pages 154-164, 1998.
    • (1998) IEEE Real-Time Systems Symposium , pp. 154-164
    • Anantharaman, S.1    Pande, S.2
  • 4
    • 0003713964
    • 2nd Edition. Athena Scientific. ISBN 1-886529-00-0
    • D. P. Bertsekas. Nonlinear Programming: 2nd Edition. Athena Scientific. ISBN 1-886529-00-0.
    • Nonlinear Programming
    • Bertsekas, D.P.1
  • 7
    • 57349110181
    • Affine transformations for communication minimal parallelization and locality optimization of arbitrarily nested loop sequences
    • Ohio State University, May
    • U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Affine transformations for communication minimal parallelization and locality optimization of arbitrarily nested loop sequences. Technical Report OSU-CISRC5/07-TR43, Ohio State University, May 2007.
    • (2007) Technical Report OSU-CISRC5/07-TR43
    • Bondhugula, U.1    Baskaran, M.2    Krishnamoorthy, S.3    Ramanujam, J.4    Rountev, A.5    Sadayappan, P.6
  • 9
    • 0029717349
    • Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: Applications to analyze and transform scientific programs
    • P. Clauss. Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs. In ICS '96: Proceedings of the 10th international conference on Supercomputing, pages 278-285, 1996.
    • (1996) ICS '96: Proceedings of the 10th International Conference on Supercomputing , pp. 278-285
    • Clauss, P.1
  • 10
    • 79959483988
    • CLooG: The Chunky Loop Generator, http://www.cloog.org.
  • 11
    • 0031358458
    • Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs
    • Dec.
    • A. Darte and F. Vivien. Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs. IJPP, 25(6):447-496, Dec. 1997.
    • (1997) IJPP , vol.25 , Issue.6 , pp. 447-496
    • Darte, A.1    Vivien, F.2
  • 15
    • 0026109335
    • Dataflow analysis of array and scalar references
    • P. Feautrier. Dataflow analysis of array and scalar references. IJPP, 20(1):23-53, 1991.
    • (1991) IJPP , vol.20 , Issue.1 , pp. 23-53
    • Feautrier, P.1
  • 16
    • 0026933251
    • Some efficient solutions to the affine scheduling problem: I. one-dimensional time
    • P. Feautrier. Some efficient solutions to the affine scheduling problem: I. one-dimensional time. IJPP, 21(5):313-348, 1992.
    • (1992) IJPP , vol.21 , Issue.5 , pp. 313-348
    • Feautrier, P.1
  • 17
    • 0001448065
    • Some efficient solutions to the affine scheduling problem. Part II, multidimensional time
    • P. Feautrier. Some efficient solutions to the affine scheduling problem. Part II, multidimensional time. IJPP, 21(6):389-420, 1992.
    • (1992) IJPP , vol.21 , Issue.6 , pp. 389-420
    • Feautrier, P.1
  • 18
    • 84957027384
    • Automatic parallelization in the polytope model
    • P. Feautrier. Automatic parallelization in the polytope model. In The Data Parallel Programming Model, pages 79-103, 1996.
    • (1996) The Data Parallel Programming Model , pp. 79-103
    • Feautrier, P.1
  • 20
    • 84862940593
    • Strategies for cache and local memory management by global program transformation
    • New York, NY, USA, Springer-Verlag New York, Inc.
    • D. Gannon, W. Jalby, and K. Gallivan. Strategies for cache and local memory management by global program transformation. In Proceedings of the 1st International Conference on Supercomputing, pages 229-254, New York, NY, USA, 1988. Springer-Verlag New York, Inc.
    • (1988) Proceedings of the 1st International Conference on Supercomputing , pp. 229-254
    • Gannon, D.1    Jalby, W.2    Gallivan, K.3
  • 29
    • 0030645995
    • Maximizing parallelism and minimizing synchronization with affine transforms
    • A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In POPL '97, pages 201-214, 1997.
    • (1997) POPL '97 , pp. 201-214
    • Lim, A.W.1    Lam, M.S.2
  • 30
    • 79959401728
    • NVIDIA CUDA. http://developer.nvidia.com/object/cuda.html.
  • 33
    • 34547683700
    • Iterative optimization in the polyhedral model: Part I, one-dimensional time
    • L.-N. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache. Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time. In CGO '07, pages 144-156, 2007.
    • (2007) CGO '07 , pp. 144-156
    • Pouchet, L.-N.1    Bastoul, C.2    Cohen, A.3    Vasilache, N.4
  • 34
    • 84976676720
    • The omega test: A fast and practical integer programming algorithm for dependence analysis
    • Aug
    • W. Pugh. The omega test: a fast and practical integer programming algorithm for dependence analysis. Communications of the ACM, 8:102-114, Aug. 1992.
    • (1992) Communications of the ACM , vol.8 , pp. 102-114
    • Pugh, W.1
  • 37
  • 38
    • 78650907365
    • Near-Optimal allocation of local memory arrays
    • HP Laboratories Palo Alto
    • R. Schreiber and D. C. Cronquist. Near-Optimal Allocation of Local Memory Arrays. Technical Report HPL-2004-24, HP Laboratories Palo Alto, 2004.
    • (2004) Technical Report HPL-2004-24
    • Schreiber, R.1    Cronquist, D.C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.