메뉴 건너뛰기




Volumn 14, Issue 9, 2003, Pages 944-960

Optimal semi-oblique tiling

Author keywords

2D uniform recurrences; Biological sequence alignment; BSP model; Communication computation granularity; Distributed memory machines; Locality; Loop blocking; MPI; Perfect loop nests; SPMD

Indexed keywords

ALGORITHMS; ITERATIVE METHODS; MATHEMATICAL MODELS; PERFORMANCE; RESPONSE TIME (COMPUTER SYSTEMS); THEOREM PROVING; TWO DIMENSIONAL; VECTORS;

EID: 0142134964     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2003.1233716     Document Type: Article
Times cited : (25)

References (34)
  • 1
    • 0029373981 scopus 로고
    • Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessor
    • Sept.
    • A. Agarwal, D. Kranz, and V. Natarajan, "Automatic Partitioning of Parallel Loops and Data Arrays for Distributed Shared-Memory Multiprocessor," IEEE Trans. Parallel and Distributed Systems, vol. 6, no. 9, pp. 943-962, Sept. 1995.
    • (1995) IEEE Trans. Parallel and Distributed Systems , vol.6 , Issue.9 , pp. 943-962
    • Agarwal, A.1    Kranz, D.2    Natarajan, V.3
  • 8
    • 0029218339 scopus 로고
    • Precise tiling for uniform loop nests
    • P. Cappello, C. Mongenet, G.-R. Perrin, P. Quinton, and Y. Robert, eds.; July
    • P.-Y. Calland and T. Risset, "Precise Tiling for Uniform Loop Nests," Application Specific Array Processors, P. Cappello, C. Mongenet, G.-R. Perrin, P. Quinton, and Y. Robert, eds., pp. 330-337, July 1995.
    • (1995) Application Specific Array Processors , pp. 330-337
    • Calland, P.-Y.1    Risset, T.2
  • 11
    • 0021760002 scopus 로고
    • Fast optimal alignement
    • J. Fickett, "Fast Optimal Alignement," Nucleic Acids Research, vol. 12, no. 1, pp. 175-179, 1984.
    • (1984) Nucleic Acids Research , vol.12 , Issue.1 , pp. 175-179
    • Fickett, J.1
  • 13
    • 0032069399 scopus 로고    scopus 로고
    • On supernode transformation with minimized total running time
    • May
    • E. Hodzic and W. Shang, "On Supernode Transformation with Minimized Total Running Time," IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 5, pp. 417-428 May 1998.
    • (1998) IEEE Trans. Parallel and Distributed Systems , vol.9 , Issue.5 , pp. 417-428
    • Hodzic, E.1    Shang, W.2
  • 14
    • 0003638028 scopus 로고    scopus 로고
    • Predicting performance for tiled perfectly nested loops
    • PhD thesis, Dept. of Computer Science and Eng., Univ. of California, San Diego, Dec.
    • K. Högstedt, "Predicting Performance for Tiled Perfectly Nested Loops," PhD thesis, Dept. of Computer Science and Eng., Univ. of California, San Diego, Dec. 1999.
    • (1999)
    • Högstedt, K.1
  • 17
    • 4243832310 scopus 로고    scopus 로고
    • An analysis of the execution time of tiled loops
    • (journal submission), Mar.
    • K. Högstedt, L. Carter, and J. Ferrante, "An Analysis of the Execution Time of Tiled Loops," http://www-cse.ucsd.edu/ferrante/karjour.ps (journal submission), Mar. 2000.
    • (2000)
    • Högstedt, K.1    Carter, L.2    Ferrante, J.3
  • 19
    • 0001512318 scopus 로고
    • The organization of computations for uniform recurrence equations
    • July
    • R.M. Karp, R.E. Miller, and S. Winograd, "The Organization of Computations for Uniform Recurrence Equations," J. ACM, vol. 14, no. 3, pp. 563-590, July 1967.
    • (1967) J. ACM , vol.14 , Issue.3 , pp. 563-590
    • Karp, R.M.1    Miller, R.E.2    Winograd, S.3
  • 20
    • 0025502605 scopus 로고
    • Pipelined data-parallel algorithms: Part 1-Concept and modelling
    • Oct.
    • C.-T. King, W.-H. Chou, and L. Ni, "Pipelined Data-Parallel Algorithms: Part 1-Concept and Modelling," IEEE Trans. Parallel and Distributed Systems, vol. 1, no. 4, pp. 470-485 Oct. 1990.
    • (1990) IEEE Trans. Parallel and Distributed Systems , vol.1 , Issue.4 , pp. 470-485
    • King, C.-T.1    Chou, W.-H.2    Ni, L.3
  • 23
    • 0022482205 scopus 로고
    • Partitioning and mapping algorithms into fixed size systolic arrays
    • Jan.
    • D.I. Moldovan and J.A.B. Fortes, "Partitioning and Mapping Algorithms into Fixed Size Systolic Arrays," IEEE Trans. Computers, vol. 35, no. 1, pp. 1-12, Jan. 1986.
    • (1986) IEEE Trans. Computers , vol.35 , Issue.1 , pp. 1-12
    • Moldovan, D.I.1    Fortes, J.A.B.2
  • 25
    • 0029181784 scopus 로고
    • Optimal tile size adjustment in compiling general DOACROSS loop nests
    • July
    • H. Ohta, Y. Saito, M. Kainaga, and H. Ono, "Optimal Tile Size Adjustment in Compiling General DOACROSS Loop Nests," Proc. Int'l Conf. Supercomputing, pp. 270-279, July 1995.
    • (1995) Proc. Int'l Conf. Supercomputing , pp. 270-279
    • Ohta, H.1    Saito, Y.2    Kainaga, M.3    Ono, H.4
  • 27
    • 0026274706 scopus 로고
    • Tiling multidimensional iteration spaces for non shared-memory machines
    • J. Ramanujam and P. Sadayappan, "Tiling Multidimensional Iteration Spaces for Non Shared-Memory Machines," Supercomputing, pp. 111-120, 1991.
    • (1991) Supercomputing , pp. 111-120
    • Ramanujam, J.1    Sadayappan, P.2
  • 28
    • 0003929457 scopus 로고
    • Automatic blocking of nested loops
    • Technical Report 90.38, RIACS, NASA Ames Research Center, Aug.
    • R. Schreiber and J. Dongarra, "Automatic Blocking of Nested Loops," Technical Report 90.38, RIACS, NASA Ames Research Center, Aug. 1990.
    • (1990)
    • Schreiber, R.1    Dongarra, J.2
  • 30
    • 0025467711 scopus 로고
    • A bridging model for parallel computation
    • Aug.
    • L.G. Valiant, "A Bridging Model for Parallel Computation," Comm. ACM, vol. 33, no. 8, pp. 103-111, Aug. 1990.
    • (1990) Comm. ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 33
    • 4243740738 scopus 로고    scopus 로고
    • Time skewing for parallel computers
    • Technical Report TR-388, Dept. of Computer Science, Rutgers Univ., June
    • D. Wonnacott, "Time Skewing for Parallel Computers," Technical Report TR-388, Dept. of Computer Science, Rutgers Univ., June 1999.
    • (1999)
    • Wonnacott, D.1
  • 34
    • 0000703719 scopus 로고    scopus 로고
    • On tiling as a loop transformation
    • J. Xue, "On Tiling as a Loop Transformation," Parallel Processing Letters, vol. 7, no. 4, pp. 490-424, 1997.
    • (1997) Parallel Processing Letters , vol.7 , Issue.4 , pp. 490-424
    • Xue, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.