SCOPUS 정보 검색 플랫폼

Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM

Volumn , Issue , 2007, Pages

Towards optimal multi-level tiling for stencil computations

(4) Renganarayana, Lakshminarayanan a Harthikote Matha, Manjukumar a Dewri, Rinku a Rajopadhye, Sanjay a

a Department of Environmental and Radiological Health Sciences (United States)

Author keywords

[No Author keywords available]

Indexed keywords

OPTIMIZATION; PROGRAM PROCESSORS;

DESIGN-SPACE EXPLORATION; PROCESSOR ARCHITECTURE; STENCIL COMPUTATIONS;

COMPUTATION THEORY;

EID: 34548752231 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2007.370291 Document Type: Conference Paper

Times cited : (36)

References (32)

1
- 0142134964
- Optimal semi-oblique tiling
- R. Andonov, S. Balev, S. V. Rajopadhye, and N. Yanev. Optimal semi-oblique tiling. IEEE Trans. Parallel Distrib. Syst., 14(9):944-960, 2003.
- (2003) IEEE Trans. Parallel Distrib. Syst , vol.14 , Issue.9 , pp. 944-960
- Andonov, R.¹ Balev, S.² Rajopadhye, S.V.³ Yanev, N.⁴

2
- 0029717350
- Automatic optimization of communication in compiling out-ofcore stencil codes
- R. Bordawekar, A. Choudhary, and J. Ramanujam. Automatic optimization of communication in compiling out-ofcore stencil codes. In ICS '96: Proceedings of the 10th international conference on Supercomputing, 1996.
- (1996) ICS '96: Proceedings of the 10th international conference on Supercomputing
- Bordawekar, R.¹ Choudhary, A.² Ramanujam, J.³

3
- 0004055894
- Cambridge University Press
- S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
- (2004) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

4
- 29244443735
- M. Bromley, S. Heller, T. McNerney, and J. Guy L. Steele. Fortran at ten Gigaflops: the connection machine convolution compiler. In PLDI '91: Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation, 1991.
- M. Bromley, S. Heller, T. McNerney, and J. Guy L. Steele. Fortran at ten Gigaflops: the connection machine convolution compiler. In PLDI '91: Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation, 1991.

5
- 0000209582
- Regular partitioning for synthesizing fixed-size systolic arrays
- A. Darte. Regular partitioning for synthesizing fixed-size systolic arrays. Integration, The VLSI J., 12(3):293-304, 1991.
- (1991) Integration, The VLSI J , vol.12 , Issue.3 , pp. 293-304
- Darte, A.¹

6
- 4243166952
- Tight bounds on cache use for stencil operations on rectangular grids
- M. A. Frumkin and R. F. V. der Wijngaart. Tight bounds on cache use for stencil operations on rectangular grids. J. ACM, 49(3):434-453, 2002.
- (2002) J. ACM , vol.49 , Issue.3 , pp. 434-453
- Frumkin, M.A.¹ der Wijngaart, R.F.V.²

7
- 0023379612
- Solving pdes on loosely-coupled parallel processors
- W. D. Gropp. Solving pdes on loosely-coupled parallel processors. Parallel Computing, 5(1-2):165-173, 1987.
- (1987) Parallel Computing , vol.5 , Issue.1-2 , pp. 165-173
- Gropp, W.D.¹

8
- 0030651937
- Determining the idle time of a tiling
- K. Högstedt, L. Carter, and J. Ferrante. Determining the idle time of a tiling. In POPL, 1997.
- (1997) POPL
- Högstedt, K.¹ Carter, L.² Ferrante, J.³

9
- 0005875647
- Hpfbench: A high performance fortran benchmark suite
- Y. C. Hu, G. Jin, S. L. Johnsson, D. Kehagias, and N. Shalaby. Hpfbench: a high performance fortran benchmark suite. ACM Trans. Math. Softw., 26(1):99-149, 2000.
- (2000) ACM Trans. Math. Softw , vol.26 , Issue.1 , pp. 99-149
- Hu, Y.C.¹ Jin, G.² Johnsson, S.L.³ Kehagias, D.⁴ Shalaby, N.⁵

10
- 85026986651
- Supernode partitioning
- F. Irigoin and R. Triolet. Supernode partitioning. In POPL '88: Proceedings of the 15th ACM SICPLAN-SICACT symposium on Principles of programming languages, 1988.
- (1988) POPL '88: Proceedings of the 15th ACM SICPLAN-SICACT symposium on Principles of programming languages
- Irigoin, F.¹ Triolet, R.²

11
- 84958661690
- Impact of modern memory subsystems on cache optimizations for stencil computations
- S. Kamil, P. Husbands, L. Oliker, J. Shalf, and K. Yelick. Impact of modern memory subsystems on cache optimizations for stencil computations. In MSP '05: Proceedings of the 2005 workshop on Memory system performance, 2005.
- (2005) MSP '05: Proceedings of the 2005 workshop on Memory system performance
- Kamil, S.¹ Husbands, P.² Oliker, L.³ Shalf, J.⁴ Yelick, K.⁵

12
- 0001512318
- The organization of computations for uniform recurrence equations
- R. M. Karp, R. E. Miller, and S. Winograd. The organization of computations for uniform recurrence equations. J. ACM, 14(3):563-590, 1967.
- (1967) J. ACM , vol.14 , Issue.3 , pp. 563-590
- Karp, R.M.¹ Miller, R.E.² Winograd, S.³

13
- 0043048462
- An infeasible interiorpoint algorithm for solving primal and dual geometric programs
- K. O. Kortanek, X. Xu, and Y. Ye. An infeasible interiorpoint algorithm for solving primal and dual geometric programs. Math. Program., 76(1):155-181, 1997.
- (1997) Math. Program , vol.76 , Issue.1 , pp. 155-181
- Kortanek, K.O.¹ Xu, X.² Ye, Y.³

14
- 24644456455
- Automatic tiling of iterative stencil loops
- Z. Li and Y. Song. Automatic tiling of iterative stencil loops. ACM Trans. Program. Lang. Syst., 26(6):975-1028, 2004.
- (2004) ACM Trans. Program. Lang. Syst , vol.26 , Issue.6 , pp. 975-1028
- Li, Z.¹ Song, Y.²

15
- 20344396845
- YALMIP : A toolbox for modeling and optimization in MATLAB
- J. Löfberg. YALMIP : A toolbox for modeling and optimization in MATLAB. In Proceedings of the CACSD Conference, 2004.
- (2004) Proceedings of the CACSD Conference
- Löfberg, J.¹

16
- 0032308685
- Quantifying the multi-level nature of tiling interactions
- N. Mitchell, K. Högstedt, L. Carter, and J. Ferrante. Quantifying the multi-level nature of tiling interactions. International J. of Parallel Programming, 26(6):641-670, 1998.
- (1998) International J. of Parallel Programming , vol.26 , Issue.6 , pp. 641-670
- Mitchell, N.¹ Högstedt, K.² Carter, L.³ Ferrante, J.⁴

17
- 0022482205
- Partitioning and mapping algorithms into fixed size systolic arrays
- 351, 12
- D. I. Moldovan and J. A. B. Fortes. Partitioning and mapping algorithms into fixed size systolic arrays. IEEE Trans. Comput., 35(1)--12, 1986.
- (1986) IEEE Trans. Comput
- Moldovan, D.I.¹ Fortes, J.A.B.²

18
- 2442670256
- Available from
- NAS Parallel Benchmarks. Available from http://www.netlib.org/parkbench/.
- NAS Parallel Benchmarks

19
- 34548743372
- PARKBENCH:, Available from
- PARKBENCH: PARallel Kernels and BENCHmarks. Available from http://www.netlib.org/parkbench/.
- PARallel Kernels and BENCHmarks

20
- 51249173427
- The mapping of linear recurrence equations on regular arrays
- P. Quinton and V. Van Dongen. The mapping of linear recurrence equations on regular arrays. Journal of VLSI Signal Processing, 1(2):95-113, 1989.
- (1989) Journal of VLSI Signal Processing , vol.1 , Issue.2 , pp. 95-113
- Quinton, P.¹ Van Dongen, V.²

21
- 0025446495
- Synthesizing systolic arrays from recurrence equations
- June
- S. V. Rajopadhye and R. M. Fujimoto. Synthesizing systolic arrays from recurrence equations. Parallel Computing, 14:163-189, June 1990.
- (1990) Parallel Computing , vol.14 , pp. 163-189
- Rajopadhye, S.V.¹ Fujimoto, R.M.²

22
- 84934300040
- A geometric programming framework for optimal multi-level tiling
- L. Renganarayana and S. Rajopadhye. A geometric programming framework for optimal multi-level tiling. In SC '04: Proceedings of the ACM/IEEE conference on Supercomputing, 2004.
- (2004) SC '04: Proceedings of the ACM/IEEE conference on Supercomputing
- Renganarayana, L.¹ Rajopadhye, S.²

23
- 78649765479
- Tiling optimizations for 3d scientific computations
- G. Rivera and C.-W. Tseng. Tiling optimizations for 3d scientific computations. In Supercomputing '00: Proceedings of the ACM/IEEE conference on Supercomputing, 2000.
- (2000) Supercomputing '00: Proceedings of the ACM/IEEE conference on Supercomputing
- Rivera, G.¹ Tseng, C.-W.²

24
- 84900322610
- Compiling stencils in high performance fortran
- G. Roth, J. Mellor-Crummey, K. Kennedy, and R. G. Brickner. Compiling stencils in high performance fortran. In Supercomputing '97: Proceedings of the ACM/IEEE conference on Supercomputing, 1997.
- (1997) Supercomputing '97: Proceedings of the ACM/IEEE conference on Supercomputing
- Roth, G.¹ Mellor-Crummey, J.² Kennedy, K.³ Brickner, R.G.⁴

25
- 84873964280
- SPEC CPU2000 benchmark, http://www.spec.org.
- SPEC CPU2000 benchmark

26
- 84943297310
- Automatically tuned linear algebra software
- R. C. Whaley and J. J. Dongarra. Automatically tuned linear algebra software. In Supercomputing '98: Proceedings of the 1998 ACM/IEEE conference on Supercomputing (CDROM), 1998.
- (1998) Supercomputing '98: Proceedings of the 1998 ACM/IEEE conference on Supercomputing (CDROM)
- Whaley, R.C.¹ Dongarra, J.J.²

27
- 84976827033
- A data locality optimizing algorithm
- M. E. Wolf and M. S. Lam. A data locality optimizing algorithm. In PLDI '91: Proceedings of the ACM SICPlAN 1991 conference on Programming language design and implementation, 1991.
- (1991) PLDI '91: Proceedings of the ACM SICPlAN 1991 conference on Programming language design and implementation
- Wolf, M.E.¹ Lam, M.S.²

28
- 70749125366
- More iteration space tiling
- M. Wolfe. More iteration space tiling. In Supercomputing '89: Proceedings of the 1989 ACM/IEEE conference on Supercomputing, 1989.
- (1989) Supercomputing '89: Proceedings of the 1989 ACM/IEEE conference on Supercomputing
- Wolfe, M.¹

29
- 0033905336
- Using time skewing to eliminate idle time due to memory bandwidth and network limitations
- D. Wonnacott. Using time skewing to eliminate idle time due to memory bandwidth and network limitations. In IPDPS '00: Proceedings of the 14th International Symposium on Parallel and Distributed Processing, 2000.
- (2000) IPDPS '00: Proceedings of the 14th International Symposium on Parallel and Distributed Processing
- Wonnacott, D.¹

30
- 1542392248
- Achieving scalable locality with time skewing
- D. Wonnacott. Achieving scalable locality with time skewing. Int. J. Parallel Program., 30(3):181-221, 2002.
- (2002) Int. J. Parallel Program , vol.30 , Issue.3 , pp. 181-221
- Wonnacott, D.¹

31
- 0000703719
- On tiling as a loop transformation
- J. Xue. On tiling as a loop transformation. Parallel Processing Letters, 7(4):409-424, 1997.
- (1997) Parallel Processing Letters , vol.7 , Issue.4 , pp. 409-424
- Xue, J.¹

32
- 0442303278
- Kluwer Academic Publishers
- J. Xue. Loop tiling for parallelism. Kluwer Academic Publishers, 2000.
- (2000) Loop tiling for parallelism
- Xue, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.