Volume 32, Issue 10, 2006, Pages 711-732

Message-passing code generation for non-rectangular tiling transformations

Author keywords

Clusters; Code generation; Data parallel; Loop tiling; MPI

Indexed keywords

DATA STORAGE EQUIPMENT; ITERATIVE METHODS; OPTIMIZATION; PARALLEL ALGORITHMS; PARALLEL PROCESSING SYSTEMS;

EID: 33750629461     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2006.07.003     Document Type: Article
Times cited: 5

References (31)
  • 2
    • 0346032593
    • V. Adve, J. Mellor-Crummey, Advanced code generation for high performance Fortran, in: Compiler Optimizations for Scalable Parallel Systems: Languages, Compilation Techniques, and Run Time Systems, Lecture Notes in Computer Science Series, Springer-Verlag, 2001, pp. 553-596.
  • 3
    • 84976724523
    • S.P. Amarasinghe, M.S. Lam, Communication optimization and code generation for distributed memory machines, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Albuquerque, New Mexico, USA, June 1993.
  • 4
    • 33750611200
    • R. Andonov, P. Calland, S. Niar, S. Rajopadhye, N. Yanev, First steps towards optimal oblique tile sizing, in: Proceedings of the 8th International Workshop on Compilers for Parallel Computers, Aussois, January 2000, pp. 351-366.
  • 11
    • 33750624678
    • G. Fox, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, C. Tseng, M. Wu, Fortran-D language specification, Technical Report TR-91-170, Department of Computer Science, Rice University, December 1991.
  • 13
    • 84981274197
    • G. Goumas, A. Sotiropoulos, N. Koziris, Minimizing completion time for loop tiling with computation and communication overlapping, in: Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS'01), San Francisco, April 2001.
  • 16
    • 0030651937
    • K. Högstedt, L. Carter, J. Ferrante, Determining the idle time of a tiling, in: Proceedings of the 24th ACM Symposium on Principles of Programming Languages (POPL), January 1997, pp. 160-173.
  • 17
    • 0032642196
    • K. Högstedt, L. Carter, J. Ferrante, Selecting tile shape for minimal execution time, in: Proceedings of the ACM Symposium on Parallel Algorithms and Architectures, 1999, pp. 201-211.
  • 19
    • 84976859541
    • M. Lam, E. Rothberg, M. Wolf, The cache performance and optimizations of blocked algorithms, in: Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Santa Clara, California, USA, April 1991, pp. 63-74.
  • 21
    • 0022874874
    • D. Padua, M. Wolfe, Advanced compiler optimizations for supercomputers, Communications of the ACM 29 (12) (1986) 1184-1201.
  • 24
    • 0026821247
    • W. Shang, J.A.B. Fortes, Independent partitioning of algorithms with uniform dependencies, IEEE Transactions on Computers 41 (2) (1992) 190-206.
  • 25
    • 0029190371
    • E. Su, A. Lain, S. Ramaswamy, D.J. Palermo, E.W. Hodges, P. Banerjee, Advanced compilation techniques in the PARADIGM compiler for distributed memory multicomputers, in: Proceedings of the 9th ACM International Conference on Supercomputing (ICS), Madrid, Spain, July 1995, pp. 424-433.
  • 26
    • 0000778059
    • P. Tang, J. Xue, Generating efficient tiled code for distributed memory machines, Parallel Computing 26 (11) (2000) 1369-1410.
  • 28
    • 85013942562
    • M. Wolf, M. Lam, A data locality optimizing algorithm, in: Proceedings of the ACM SIGPLAN'91 Conference on Programming Language Design and Implementation (PLDI), Toronto, Ontario, Canada, June 1991, pp. 30-44.
  • 29
    • 0026232450
    • M. Wolf, M. Lam, A loop transformation theory and an algorithm to maximize parallelism, IEEE Transactions on Parallel and Distributed Systems 2 (4) (1991) 452-471.
  • 30
    • 0003125942
    • J. Xue, Communication-minimal tiling of uniform dependence loops, Journal of Parallel and Distributed Computing 42 (1) (1997) 42-59.
  • 31
    • 0036601528
    • J. Xue, W. Cai, Time-minimal tiling when rise is larger than zero, Parallel Computing 28 (6) (2002) 915-939.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.