SCOPUS Information Retrieval Platform

ACM Transactions on Programming Languages and Systems

Volume 26, Issue 6, 2004, Pages 975-1028

Automatic tiling of iterative stencil loops

(2) Li, Zhiyuan a,b Song, Yonghong a,b

a PURDUE UNIVERSITY (United States)

b Purdue University (United States)

Author keywords

Caches; Loop transformations; Optimizing compilers

Indexed keywords

CACHES; ITERATIVE STENCIL LOOPS; LOOP TRANSFORMATIONS; OPTIMIZING COMPILERS;

BUFFER STORAGE; COMPUTER SIMULATION; ITERATIVE METHODS; POLYNOMIALS;

COMPUTER PROGRAMMING;

EID: 24644456455 PISSN: 01640925 EISSN: None Source Type: Journal
DOI: 10.1145/1034774.1034777 Document Type: Article

Times cited : (56)

References (57)

1
- 24644447770
- ADAMS, J. C. 1999. MUDPACK: Multigrid Software for Elliptic Partial Differential Equations. Available on line at http://www.scd.ucar.edu/css/software/mudpack/.
- (1999) MUDPACK: Multigrid Software for Elliptic Partial Differential Equations
- Adams, J.C.¹

2
- 0033700781
- Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
- Santa Fe, NM
- AHMED, N., MATEEV, N., AND PINGALI, K. 2000. Synthesizing transformations for locality enhancement of imperfectly-nested loop nests. In Proceedings of the 2000 International Conference on Supercomputing (Santa Fe, NM). 141-152.
- (2000) Proceedings of the 2000 International Conference on Supercomputing , pp. 141-152
- Ahmed, N.¹ Mateev, N.² Pingali, K.³

3
- 0003515463
- Prentice-Hall Inc., Englewood Cliffs, NJ
- AHUJA, R., MAGNANTI, T., AND ORLIN, J. 1993. Network Flows: Theory, Algorithms, and Applications. Prentice-Hall Inc., Englewood Cliffs, NJ.
- (1993) Network Flows: Theory, Algorithms, and Applications
- Ahuja, R.¹ Magnanti, T.² Orlin, J.³

4
- 84976725287
- Software pipelining
- ALLAN, V., JONES, R., LEE, R., AND ALLAN, S. 1995. Software pipelining. ACM Comput. Surv. 27, 3 (Sept.), 367-432.
- (1995) ACM Comput. Surv. , vol.27 , Issue.3 SEPT. , pp. 367-432
- Allan, V.¹ Jones, R.² Lee, R.³ Allan, S.⁴

5
- 0023438847
- Automatic translation of FORTRAN programs to vector form
- ALLEN, J. R. AND KENNEDY, K. 1987. Automatic translation of FORTRAN programs to vector form. ACM Trans. Programm. Lang. Syst. 9, 4 (Oct.), 491-542.
- (1987) ACM Trans. Programm. Lang. Syst. , vol.9 , Issue.4 OCT. , pp. 491-542
- Allen, J.R.¹ Kennedy, K.²

6
- 84948647315
- Recursive formulation of some dense linear algebra algorithms
- San Antonio, TX
- ANDERSEN, B. S., GUSTAVSON, F. G., WASNIEWSKI, J., AND YALAMOV, P. Y. 1999. Recursive formulation of some dense linear algebra algorithms. In Proceedings of the SIAM Conference on Parallel Processing for Scientific Computing (San Antonio, TX).
- (1999) Proceedings of the SIAM Conference on Parallel Processing for Scientific Computing
- Andersen, B.S.¹ Gustavson, F.G.² Wasniewski, J.³ Yalamov, P.Y.⁴

7
- 0029181140
- Data and computation transformation for multiprocessors
- Santa Barbara, CA
- ANDERSON, J. M., AMARASINGHE, S. P., AND LAM, M. S. 1995. Data and computation transformation for multiprocessors. In Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Santa Barbara, CA). 166-178.
- (1995) Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 166-178
- Anderson, J.M.¹ Amarasinghe, S.P.² Lam, M.S.³

8
- 1242313972
- A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness
- Toronto, Ont., Canada
- BACON, D., CHOW, J.-H., JU, D., MUTHUKUMAR, K., AND SARKAR, V. 1994. A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness. In Proceedings of CASCON'94 (Toronto, Ont., Canada).
- (1994) Proceedings of CASCON'94
- Bacon, D.¹ Chow, J.-H.² Ju, D.³ Muthukumar, K.⁴ Sarkar, V.⁵

9
- 0032313172
- Non-linear and symbolic data dependence testing
- BLUME, W. AND EIGENMANN, R. 1998. Non-linear and symbolic data dependence testing. IEEE Trans. Parall. Distrib. Syst. 9, 12 (Dec.), 1180-1194.
- (1998) IEEE Trans. Parall. Distrib. Syst. , vol.9 , Issue.12 DEC. , pp. 1180-1194
- Blume, W.¹ Eigenmann, R.²

10
- 0032648736
- Static tiling for heterogeneous computing platforms
- BOULET, P., DONGARRA, J., ROBERT, Y., AND VIVIEN, F. 1999. Static tiling for heterogeneous computing platforms. Parall. Comput. 25, 547-568.
- (1999) Parall. Comput. , vol.25 , pp. 547-568
- Boulet, P.¹ Dongarra, J.² Robert, Y.³ Vivien, F.⁴

11
- 0024701521
- Coloring heuristics for register allocation
- BRIGGS, P., COOPER, K., KENNEDY, K., AND TORCZON, L. 1989. Coloring heuristics for register allocation. In Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation. 275-284.
- (1989) Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 275-284
- Briggs, P.¹ Cooper, K.² Kennedy, K.³ Torczon, L.⁴

12
- 0029666646
- Memory bandwidth limitations of future microprocessors
- Philadelphia, PA
- BURGER, D. C., GOODMAN, J. R., AND KÄGI, A. 1996. Memory bandwidth limitations of future microprocessors. In Proceedings of the 23rd International Symposium on Computer Architecture (Philadelphia, PA). 78-89.
- (1996) Proceedings of the 23rd International Symposium on Computer Architecture , pp. 78-89
- Burger, D.C.¹ Goodman, J.R.² Kägi, A.³

13
- 0032676178
- A tile selection algorithm for data locality and cache interference
- Rhodes, Greece
- CHAME, J. AND MOON, S. 1999. A tile selection algorithm for data locality and cache interference. In Proceedings of the Thirteenth ACM International Conference on Supercomputing (Rhodes, Greece). 492-499.
- (1999) Proceedings of the Thirteenth ACM International Conference on Supercomputing , pp. 492-499
- Chame, J.¹ Moon, S.²

14
- 0032652980
- Nonlinear array layouts for hierarchical memory systems
- Rhodes, Greece
- CHATTERJEE, S., JAIN, V., LEBECK, A., MUNDHRA, S., AND THOTTETHODI, M. 1999a. Nonlinear array layouts for hierarchical memory systems. In Proceedings of the Thirteenth ACM International Conference on Supercomputing (Rhodes, Greece). 444-453.
- (1999) Proceedings of the Thirteenth ACM International Conference on Supercomputing , pp. 444-453
- Chatterjee, S.¹ Jain, V.² Lebeck, A.³ Mundhra, S.⁴ Thottethodi, M.⁵

15
- 0032659795
- Recursive array layouts and fast parallel matrix multiplication
- Saint Malo, France
- CHATTERJEE, S., LEBECK, A., PATNALA, P. K., AND THOTTETHODI, M. 1999b. Recursive array layouts and fast parallel matrix multiplication. In Proceedings of the 11th ACM Symposium on Parallel Algorithms and Architectures (Saint Malo, France).
- (1999) Proceedings of the 11th ACM Symposium on Parallel Algorithms and Architectures
- Chatterjee, S.¹ Lebeck, A.² Patnala, P.K.³ Thottethodi, M.⁴

16
- 0034836237
- Loop optimization for a class of memory-constrained computations
- Naples, Italy
- COCIORVA, D., WILKINS, J. W., LAM, C., BAUMGARTNER, G., RAMANUJAM, J., AND SADAYAPPAN, P. 2001. Loop optimization for a class of memory-constrained computations. In Proceedings of the 15th ACM International Conference on Supercomputing (Naples, Italy).
- (2001) Proceedings of the 15th ACM International Conference on Supercomputing
- Cociorva, D.¹ Wilkins, J.W.² Lam, C.³ Baumgartner, G.⁴ Ramanujam, J.⁵ Sadayappan, P.⁶

17
- 84976745804
- Tile size selection using cache organization and data layout
- La Jolla, CA
- COLEMAN, S. AND MCKINLEY, K. S. 1995. Tile size selection using cache organization and data layout. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (La Jolla, CA). 279-290.
- (1995) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 279-290
- Coleman, S.¹ McKinley, K.S.²

18
- 0028562179
- Space-time transformation of while-loops using speculative execution
- Knoxville, TN
- COLLARD, J.-F. 1994. Space-time transformation of while-loops using speculative execution. In Proceedings of the Scalable High Performance Computing Conference (Knoxville, TN). 429-436.
- (1994) Proceedings of the Scalable High Performance Computing Conference , pp. 429-436
- Collard, J.-F.¹

19
- 0004116989
- MIT Press, Cambridge, MA, and McGraw-Hill Book Company, New York, NY
- CORMEN, T., LEISERSON, C., AND RIVEST, R. 1990. Introduction to Algorithms. MIT Press, Cambridge, MA, and McGraw-Hill Book Company, New York, NY.
- (1990) Introduction to Algorithms
- Cormen, T.¹ Leiserson, C.² Rivest, R.³

20
- 84981274540
- Reducing effective bandwidth through compiler enhancement of global cache reuse
- DING, C. AND KENNEDY, K. 2001. Reducing effective bandwidth through compiler enhancement of global cache reuse. In Proceedings of the International Parallel and Distributed Processing Symposium.
- (2001) Proceedings of the International Parallel and Distributed Processing Symposium
- Ding, C.¹ Kennedy, K.²

21
- 85015240805
- On estimating and enhancing cache effectiveness
- Lecture Notes in Computer Science, Springer-Verlag, Berlin, Germany. August 1991
- FERRANTE, J., SARKAR, V., AND THRASH, W. 1991. On estimating and enhancing cache effectiveness. In Proceedings of the Fourth International Workshop on Languages and Compilers for Parallel Computing. Lecture Notes in Computer Science, vol. 1863. Springer-Verlag, Berlin, Germany, 328-341. August 1991.
- (1991) Proceedings of the Fourth International Workshop on Languages and Compilers for Parallel Computing , vol.1863 , pp. 328-341
- Ferrante, J.¹ Sarkar, V.² Thrash, W.³

22
- 0003603813
- W. H. Freeman and Company, New York, NY
- GAREY, M. R. AND JOHNSON, D. S. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company, New York, NY.
- (1979) Computers and Intractability: A Guide to the Theory of NP-completeness
- Garey, M.R.¹ Johnson, D.S.²

23
- 0031611719
- Precise miss analysis for program transformations with caches of arbitrary associativity
- San Jose, CA
- GHOSH, S., MARTONOSI, M., AND MALIK, S. 1998. Precise miss analysis for program transformations with caches of arbitrary associativity. In Proceedings of the Eighth ACM Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, CA). 228-239.
- (1998) Proceedings of the Eighth ACM Conference on Architectural Support for Programming Languages and Operating Systems , pp. 228-239
- Ghosh, S.¹ Martonosi, M.² Malik, S.³

24
- 0030678732
- Experience with efficient array data flow analysis for array privatization
- Las Vegas, NV
- GU, J., LI, Z., AND LEE, G. 1997. Experience with efficient array data flow analysis for array privatization. In Proceedings of the Sixth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Las Vegas, NV). 157-167.
- (1997) Proceedings of the Sixth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 157-167
- Gu, J.¹ Li, Z.² Lee, G.³

25
- 0008690122
- Ph.D. dissertation. Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
- HAGHIGHAT, M. R. 1990. Symbolic dependence analysis for high performance parallelizing compilers. Ph.D. dissertation. Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL.
- (1990) Symbolic Dependence Analysis for High Performance Parallelizing Compilers
- Haghighat, M.R.¹

26
- 0004302191
- Morgan Kaufmann Publishers, San Francisco, CA
- HENNESSY, J. AND PATTERSON, D. 1996. Computer Architecture: A Quantitative Approach. Morgan Kaufmann Publishers, San Francisco, CA.
- (1996) Computer Architecture: A Quantitative Approach
- Hennessy, J.¹ Patterson, D.²

27
- 84858693885
- Increasing temporal locality with skewing and recursive blocking
- Denver, CO
- JIN, G., MELLOR-CRUMMEY, J., AND FOWLER, R. 2001. Increasing temporal locality with skewing and recursive blocking. In Proceedings of IEEE/ACM SC 2001 (Denver, CO).
- (2001) Proceedings of IEEE/ACM SC 2001
- Jin, G.¹ Mellor-Crummey, J.² Fowler, R.³

28
- 0037722074
- A matrix-based approach to the global locality optimization problem
- PACT'98, Paris, France
- KANDEMIR, M., CHOUDHARY, A., RAMANUJAM, J., AND BANERJEE, P. 1998. A matrix-based approach to the global locality optimization problem. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT'98, Paris, France).
- (1998) Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
- Kandemir, M.¹ Choudhary, A.² Ramanujam, J.³ Banerjee, P.⁴

29
- 0033703285
- Fast greedy weighted fusion
- Santa Fe, NM
- KENNEDY, K. 2000. Fast greedy weighted fusion. In Proceedings of the 2000 International Conference on Supercomputing (Santa Fe, NM).
- (2000) Proceedings of the 2000 International Conference on Supercomputing
- Kennedy, K.¹

30
- 0001465739
- Maximizing loop parallelism and improving data locality via loop fusion and distribution
- Portland, OR, Aug. 1993. Lecture Notes in Computer Science, Springer-Verlag, Berlin, Germany
- KENNEDY, K. AND MCKINLEY, K. S. 1993. Maximizing loop parallelism and improving data locality via loop fusion and distribution. In Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing (Portland, OR, Aug. 1993). Lecture Notes in Computer Science, vol. 768, Springer-Verlag, Berlin, Germany.
- (1993) Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing , vol.768
- Kennedy, K.¹ McKinley, K.S.²

31
- 0347304618
- Data-centric multi-level blocking
- Las Vegas, NV
- KODUKULA, I., AHMED, N., AND PINGALI, K. 1997. Data-centric multi-level blocking. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (Las Vegas, NV). 346-357.
- (1997) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 346-357
- Kodukula, I.¹ Ahmed, N.² Pingali, K.³

32
- 84855816103
- Transformations of imperfectly nested loops
- KODUKULA, I. AND PINGALI, K. 1996. Transformations of imperfectly nested loops. In Proceedings of Supercomputing.
- (1996) Proceedings of Supercomputing
- Kodukula, I.¹ Pingali, K.²

33
- 0026137116
- The cache performance and optimizations of blocked algorithms
- Santa Clara, CA
- LAM, M. S., ROTHBERG, E. E., AND WOLF, M. E. 1991. The cache performance and optimizations of blocked algorithms. In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (Santa Clara, CA). 63-74.
- (1991) Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 63-74
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

34
- 0031075726
- Fusion of loops for parallelism and locality
- MANJIKIAN, N. AND ABDELRAHMAN, T. 1997. Fusion of loops for parallelism and locality. IEEE Trans. Parall. and Distribut. Syst. 8, 2 (Feb.), 193-209.
- (1997) IEEE Trans. Parall. and Distribut. Syst. , vol.8 , Issue.2 FEB. , pp. 193-209
- Manjikian, N.¹ Abdelrahman, T.²

35
- 3142754802
- Smallest-last ordering and clustering and graph coloring algorithms
- Department of Computer Science and Engineering, Southern Methodist University, Dallas, TX
- MATULA, D. AND BECK, L. 1981. Smallest-last ordering and clustering and graph coloring algorithms. Tech. rep. TR CSE 8104. Department of Computer Science and Engineering, Southern Methodist University, Dallas, TX.
- (1981) Tech. Rep. , vol.TR CSE 8104
- Matula, D.¹ Beck, L.²

36
- 0032308685
- Quantifying the multi-level nature of tiling interactions
- MITCHELL, N., HÖGSTEDT, K., CARTER, L., AND FERRANTE, J. 1998. Quantifying the multi-level nature of tiling interactions. Int. J. Parall. Programm. 26, 6 (Dec.), 641-670.
- (1998) Int. J. Parall. Programm. , vol.26 , Issue.6 DEC. , pp. 641-670
- Mitchell, N.¹ Högstedt, K.² Carter, L.³ Ferrante, J.⁴

37
- 0032064896
- Interprocedural analysis for loop scheduling and data allocation
- NGUYEN, T. AND LI, Z. 1998. Interprocedural analysis for loop scheduling and data allocation. Parall. Comput. 24, 3, 477-504.
- (1998) Parall. Comput. , vol.24 , Issue.3 , pp. 477-504
- Nguyen, T.¹ Li, Z.²

38
- 84861254160
- OBJECT-ORIENTED SCIENTIFIC COMPUTING. 2001. Blitz++. Object-Oriented Scientific Computing, Available online at http://www.oonumerics.org/blitz. benchmarks/.
- (2001) Blitz++. Object-oriented Scientific Computing

39
- 0030711410
- Non-singular data transformations: Definition, validity and applications
- Vienna, Austria
- O'BOYLE, M. AND KNIJNENBURG, P. 1997. Non-singular data transformations: Definition, validity and applications. In Proceedings of the ACM International Conference on Supercomputing (Vienna, Austria). 309-316.
- (1997) Proceedings of the ACM International Conference on Supercomputing , pp. 309-316
- O'Boyle, M.¹ Knijnenburg, P.²

40
- 0033076195
- Augmenting loop tiling with data alignment for improved cache performance
- PANDA, P., NAKAMURA, H., DUTT, N., AND NICOLAU, A. 1999. Augmenting loop tiling with data alignment for improved cache performance. IEEE Trans. Comput. 48, 2 (Feb.), 142-149.
- (1999) IEEE Trans. Comput. , vol.48 , Issue.2 FEB. , pp. 142-149
- Panda, P.¹ Nakamura, H.² Dutt, N.³ Nicolau, A.⁴

41
- 24644482622
- Analysis of memory hierarchy performance of block data layout
- Vancouver, B.C., Canada
- PARK, N., HONG, B., AND PRASANNA, V. K. 2002. Analysis of memory hierarchy performance of block data layout. In Proceedings of the International Conference on Parallel Processing (Vancouver, B.C., Canada). 34-44.
- (2002) Proceedings of the International Conference on Parallel Processing , pp. 34-44
- Park, N.¹ Hong, B.² Prasanna, V.K.³

42
- 84976676720
- A practical algorithm for exact array dependence analysis
- PUGH, W. 1992. A practical algorithm for exact array dependence analysis. Commun. ACM 35, 8 (Aug.), 102-114.
- (1992) Commun. ACM , vol.35 , Issue.8 AUG. , pp. 102-114
- Pugh, W.¹

43
- 10844250736
- Iteration space slicing for locality
- San Diego, CA
- PUGH, W. AND ROSSER, E. 1999. Iteration space slicing for locality. In Proceedings of the Twelfth International Workshop on Languages and Compilers for Parallel Computing (San Diego, CA).
- (1999) Proceedings of the Twelfth International Workshop on Languages and Compilers for Parallel Computing
- Pugh, W.¹ Rosser, E.²

44
- 17244382508
- Exploiting monotone convergence functions in parallel programs
- University of Maryland, College Park, MD
- PUGH, W., ROSSER, E., AND SHPEISMAN, T. 1996. Exploiting monotone convergence functions in parallel programs. Tech. rep. CS-TR-3636. University of Maryland, College Park, MD.
- (1996) Tech. Rep. , vol.CS-TR-3636
- Pugh, W.¹ Rosser, E.² Shpeisman, T.³

45
- 0002193401
- A comparison of compiler tiling algorithms
- Amsterdam, The Netherlands
- RIVERA, G. AND TSENG, C.-W. 1999. A comparison of compiler tiling algorithms. In Proceedings of the Eighth International Conference on Compiler Construction (Amsterdam, The Netherlands).
- (1999) Proceedings of the Eighth International Conference on Compiler Construction
- Rivera, G.¹ Tseng, C.-W.²

46
- 78649765479
- Tiling optimizations for 3D scientific computations
- RIVERA, G. AND TSENG, C.-W. 2000. Tiling optimizations for 3D scientific computations. In Proceedings of the IEEE/ACM SC 2000.
- (2000) Proceedings of the IEEE/ACM SC 2000
- Rivera, G.¹ Tseng, C.-W.²

47
- 0005045396
- Ph.D. dissertation. Department of Computer Science, University of Maryland at College Park, MD
- ROSSER, E. 1998. Fine-grained analysis of array computations. Ph.D. dissertation. Department of Computer Science, University of Maryland at College Park, MD.
- (1998) Fine-grained Analysis of Array Computations
- Rosser, E.¹

48
- 24644498982
- Loop transformations for hierarchical parallelism and locality
- Pittsburgh, PA
- SARKAR, V. 1998. Loop transformations for hierarchical parallelism and locality. In Proceedings of the Fourth Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers (Pittsburgh, PA).
- (1998) Proceedings of the Fourth Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers
- Sarkar, V.¹

49
- 17244374581
- New tiling techniques to improve cache temporal locality
- Atlanta, GA
- SONG, Y. AND LI, Z. 1999. New tiling techniques to improve cache temporal locality. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (Atlanta, GA). 215-228.
- (1999) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 215-228
- Song, Y.¹ Li, Z.²

50
- 0034825667
- Data locality enhancement by memory reduction
- Naples, Italy
- SONG, Y., XU, R., WANG, C., AND LI, Z. 2001. Data locality enhancement by memory reduction. In Proceedings of the 15th ACM International Conference on Supercomputing (Naples, Italy).
- (2001) Proceedings of the 15th ACM International Conference on Supercomputing
- Song, Y.¹ Xu, R.² Wang, C.³ Li, Z.⁴

51
- 0031612767
- Schedule-independent storage mapping for loops
- San Jose, CA
- STROUT, M., CARTER, L., FERRANTE, J., AND SIMON, B. 1998. Schedule-independent storage mapping for loops. In Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, CA). 24-33.
- (1998) Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 24-33
- Strout, M.¹ Carter, L.² Ferrante, J.³ Simon, B.⁴

52
- 0028429842
- Cache interference phenomena
- Nashville, TN
- TEMAM, O., FRICKER, C., AND JALBY, W. 1994. Cache interference phenomena. In Proceedings of the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (Nashville, TN). 261-271.
- (1994) Proceedings of the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems , pp. 261-271
- Temam, O.¹ Fricker, C.² Jalby, W.³

53
- 0034819362
- Language support for Morton-order matrices
- Snowbird, UT
- WISE, D. S., ALEXANDER, G. A., FRENS, J. D., AND GU, Y. 2001. Language support for Morton-order matrices. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Snowbird, UT).
- (2001) Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
- Wise, D.S.¹ Alexander, G.A.² Frens, J.D.³ Gu, Y.⁴

54
- 0003553286
- Ph.D. dissertation. Department of Computer Science, Stanford University, Stanford, CA
- WOLF, M. 1992. Improving locality and parallelism in nested loops. Ph.D. dissertation. Department of Computer Science, Stanford University, Stanford, CA.
- (1992) Improving Locality and Parallelism in Nested Loops
- Wolf, M.¹

55
- 0003927035
- Addison-Wesley Publishing Company, Reading, MA
- WOLFE, M. 1995. High Performance Compilers for Parallel Computing. Addison-Wesley Publishing Company, Reading, MA.
- (1995) High Performance Compilers for Parallel Computing
- Wolfe, M.¹

56
- 1542392248
- Achieving scalable locality with time skewing
- WONNACOTT, D. 2002. Achieving scalable locality with time skewing. Int. J. Parall. Programm. 30, 3 (June), 181-221.
- (2002) Int. J. Parall. Programm. , vol.30 , Issue.3 JUNE , pp. 181-221
- Wonnacott, D.¹

57
- 0442303278
- Kluwer Academic Publishers, Dordrecht, The Netherlands
- XUE, J. 2000. Loop Tiling for Parallelism. Kluwer Academic Publishers, Dordrecht, The Netherlands.
- (2000) Loop Tiling for Parallelism
- Xue, J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.