SCOPUS 정보 검색 플랫폼

Proceedings of the International Conference on Supercomputing

Volumn 2002-November, Issue , 2002, Pages

Better tiling and array contraction for compiling scientific programs

(2) Pike, Geoff a Hilfinger, Paul N a

a UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DIGITAL STORAGE;

ARRAY CONTRACTION; IMPROVE PERFORMANCE; INTERLEAVINGS; LOOP FUSION; LOOP TILING; MULTIPLE LOOPS; OPTIMISATIONS; SCIENTIFIC PROGRAMS; STORAGE OPTIMIZATION; TILE SIZE;

PROGRAM COMPILERS;

EID: 27844503782 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SC.2002.10040 Document Type: Conference Paper

Times cited : (7)

References (30)

1
- 0030661485
- Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
- J. Bilmes et al. Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology. In Proc. ICS'97, pages 340-347, 1997.
- (1997) Proc. ICS'97 , pp. 340-347
- Bilmes, J.¹

2
- 0042697458
- A multigrid tutorial
- W. L. Briggs. A Multigrid Tutorial. SIAM, 1987.
- (1987) SIAM
- Briggs, W.L.¹

3
- 0003008455
- Quantifying memory bandwidth limitations of current and future microprocessors
- Doug Burger, James R. Goodman, and Alain Kägi. Quantifying memory bandwidth limitations of current and future microprocessors. In Proceedings of the 23rd International Symposium on Computer Architecture, 1996.
- (1996) Proceedings of the 23rd International Symposium on Computer Architecture
- Burger, D.¹ Goodman, J.R.² Kägi, A.³

4
- 0002741087
- UCSD Technical Report November
- Larry Carter, Jeanne Ferrante, Susan Flynn Hummel, Bowen Alpern, Kang-Su Gatlin. Hierarchical Tiling: A Methodology for High Performance. UCSD Technical Report CS96-508, November 1996.
- (1996) Hierarchical Tiling: A Methodology for High Performance
- Carter, L.¹ Ferrante, J.² Hummel, S.F.³ Alpern, B.⁴ Gatlin, K.-S.⁵

5
- 84981274540
- Improving effective bandwidth through compiler enhancement of global cache reuse
- San Francisco, CA
- Chen Ding and Ken Kennedy. Improving Effective Bandwidth through Compiler Enhancement of Global Cache Reuse. In Proc. IPDPS 2001, San Francisco, CA, 2001.
- (2001) Proc. IPDPS 2001
- Ding, C.¹ Kennedy, K.²

6
- 33745205180
- Maximizing cache memory usage for multigrid algorithms
- Z. Chen, R. E. Ewing and Z.-C. Shi, editors, Springer-Verlag, Lecture Notes in Physics, Berlin
- C. C. Douglas et al. Maximizing Cache Memory Usage for Multigrid Algorithms. In Z. Chen, R. E. Ewing and Z.-C. Shi, editors, Multiphase Flows and Transport in Porous Media: State of the Art, Springer-Verlag, Lecture Notes in Physics, Berlin, 2000.
- (2000) Multiphase Flows and Transport in Porous Media: State of the Art
- Douglas, C.C.¹

7
- 85117191258
- FFTW. http://www.fftw.org/.

8
- 1142307058
- Technical Report Computer Science Division, University of California, Berkeley
- P. N. Hilfinger et al. Titanium Language Reference Manual. Technical Report CSD-01-1163, Computer Science Division, University of California, Berkeley, 2001.
- (2001) Titanium Language Reference Manual
- Hilfinger, P.N.¹

9
- 84875636475
- Load balancing and data locality via fractiling: An experimental study
- Boleslaw K. Szymanski and Balaram Sinharoy, editors, Kluwer Academic Publishers, Boston, MA
- S. Flynn Hummel, I. Banicescu, C. Wang, and J. Wein. Load Balancing and Data Locality via Fractiling: An Experimental Study. In Boleslaw K. Szymanski and Balaram Sinharoy, editors, Proc. Third Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, pages 85-89. Kluwer Academic Publishers, Boston, MA, 1995.
- (1995) Proc. Third Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers , pp. 85-89
- Hummel, S.F.¹ Banicescu, I.² Wang, C.³ Wein, J.⁴

10
- 0004972603
- Ph.D. dissertation, University of California, Berkeley
- Eun-Jin Im. Optimizing the Performance of Sparse Matrix-Vector Multiplication. Ph.D. dissertation, University of California, Berkeley, 2000.
- (2000) Optimizing the Performance of Sparse Matrix-Vector Multiplication
- Im, E.-J.¹

11
- 0037545250
- Optimizing sparse matrix computations for register reuse in sparsity
- Eun-Jin Im and Katherine Yelick. Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY. International Conference on Computational Science, 2001.
- (2001) International Conference on Computational Science
- Im, E.-J.¹ Yelick, K.²

12
- 34547524504
- Increasing temporal locality with skewing and recursive blocking
- Denver, Colorado, November
- G. Jin, J. Mellor-Crummey, and R. Fowler. Increasing Temporal Locality with Skewing and Recursive Blocking. In Proc. SC2001, Denver, Colorado, November 2001.
- (2001) Proc. SC2001
- Jin, G.¹ Mellor-Crummey, J.² Fowler, R.³

13
- 0003904906
- Technical Report Dept. of Computer Science, University of Maryland, College Park, March
- Wayne Kelly, Vadim Maslov, William Pugh, Evan Rosser, Tatiana Shpeisman, and David Wonnacott. The Omega Library interface guide. Technical Report CS-TR-3445, Dept. of Computer Science, University of Maryland, College Park, March 1995.
- (1995) The Omega Library Interface Guide
- Kelly, W.¹ Maslov, V.² Pugh, W.³ Rosser, E.⁴ Shpeisman, T.⁵ Wonnacott, D.⁶

14
- 0013103243
- The effect of cache models on iterative compilation for combined tiling and unrolling
- T. Kisuki, P. M. W. Knijnenburg, K. Gallivan, and M. F. P. O'Boyle. The Effect of Cache Models on Iterative Compilation for Combined Tiling and Unrolling. In Proc. FDDO-3, pages 31-40, 2000.
- (2000) Proc. FDDO-3 , pp. 31-40
- Kisuki, T.¹ Knijnenburg, P.M.W.² Gallivan, K.³ O'Boyle, M.F.P.⁴

15
- 77952396100
- Technical Report LIACS, Leiden University
- T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle. Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation. Technical Report 2000-07, LIACS, Leiden University, 2000.
- (2000) Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation
- Kisuki, T.¹ Knijnenburg, P.M.W.² O'Boyle, M.F.P.³

16
- 84949235179
- Iterative compilation
- P. M. W. Knijnenburg, T. Kisuki, and M. F. P. O'Boyle. Iterative Compilation. In Embedded Processor Design Challenges-System Architecture, Modeling and Simulation (SAMOS), Springer Lecture Notes in Computer Science vol. 2268, pages 171-187, 2002.
- (2002) Embedded Processor Design Challenges-System Architecture, Modeling and Simulation (SAMOS), Springer Lecture Notes in Computer Science , vol.2268 , pp. 171-187
- Knijnenburg, P.M.W.¹ Kisuki, T.² O'Boyle, M.F.P.³

17
- 0347304618
- Data-centric Multi-level Blocking
- June
- Induprakas Kodukula, Nawaaz Ahmed, and Keshav Pingali. Data-centric Multi-level Blocking. In SIGPLAN 1997 conference on Programming Language Design and Implementation, June 1997.
- (1997) SIGPLAN 1997 Conference on Programming Language Design and Implementation
- Kodukula, I.¹ Ahmed, N.² Pingali, K.³

18
- 0026137116
- The cache performance and optimizations of blocked algorithms
- M. S. Lam, E. E. Rothberg, and M. E. Wolf. The cache performance and optimizations of blocked algorithms. In Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems, 1991.
- (1991) Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

19
- 0032067773
- Maximizing parallelism and minimizing synchronization with affine partitions
- Amy W. Lim and Monica S. Lam. Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing, 24:445-475, 1998.
- (1998) Parallel Computing , vol.24 , pp. 445-475
- Lim, A.W.¹ Lam, M.S.²

20
- 17644395320
- Blocking and array contraction across arbitrarily nested loops using affine partitioning
- Amy W. Lim, Shih-Wei Liao, and Monica S. Lam. Blocking and Array Contraction Across Arbitrarily Nested Loops Using Affine Partitioning. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2001.
- (2001) ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
- Lim, A.W.¹ Liao, S.-W.² Lam, M.S.³

21
- 85117257346
- Preprint
- M. F. P. O'Boyle, P. M. W. Knijnenburg, and G. G. Fursin. Feedback Assisted Iterative Compilation. Preprint, 2000.
- (2000) Feedback Assisted Iterative Compilation
- O'Boyle, M.F.P.¹ Knijnenburg, P.M.W.² Fursin, G.G.³

22
- 8344251400
- Ph.D. dissertation, University of California, Berkeley, January
- Geoffrey Pike. Reordering and Storage Optimizations for Scientific Programs. Ph.D. dissertation, University of California, Berkeley, January 2002.
- (2002) Reordering and Storage Optimizations for Scientific Programs
- Pike, G.¹

23
- 35248876385
- Parallel 3D adaptive mesh refinement in titanium
- San Antonio, TX, March
- G. Pike, L. Semenzato, P. Colella, P. Hilfinger. Parallel 3D Adaptive Mesh Refinement in Titanium. In Proceedings of the SIAM Conference on Parallel Processing for Scientific Computing, San Antonio, TX, March 1999.
- (1999) Proceedings of the SIAM Conference on Parallel Processing for Scientific Computing
- Pike, G.¹ Semenzato, L.² Colella, P.³ Hilfinger, P.⁴

24
- 85117198971
- Cache-efficient multigrid algorithms
- San Francisco, CA, May
- Sriram Sellappa and Siddhartha Chatterjee. Cache-Efficient Multigrid Algorithms. In Proceedings of the 2001 International Conference on Computational Science (ICCS 2001), San Francisco, CA, May 2001.
- (2001) Proceedings of the 2001 International Conference on Computational Science (ICCS 2001)
- Sellappa, S.¹ Chatterjee, S.²

25
- 0034825667
- Data locality enhancement by memory reduction
- June
- Y. Song, R. Xu, C. Wang, and Z. Li. Data Locality Enhancement by Memory Reduction. 15th ACM International Conference on Supercomputing, June 2001.
- (2001) 15th ACM International Conference on Supercomputing
- Song, Y.¹ Xu, R.² Wang, C.³ Li, Z.⁴

26
- 0031612767
- Schedule-independent storage mapping for loops
- October
- Michelle Mills Strout, Larry Carter, Jeanne Ferrante, and Beth Simon. Schedule-independent storage mapping for loops. International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), October 1998.
- (1998) International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
- Strout, M.M.¹ Carter, L.² Ferrante, J.³ Simon, B.⁴

27
- 18844383699
- A unified framework for schedule and storage optimization
- William Thies, Frédéric Vivien, Jeffrey Sheldon, and Saman Amarasinghe. A Unified Framework for Schedule and Storage Optimization. In Proceedings of the 2001 SIGPLAN Conference on Programming Language Design and Implementation.
- Proceedings of the 2001 SIGPLAN Conference on Programming Language Design and Implementation
- Thies, W.¹ Vivien, F.² Sheldon, J.³ Amarasinghe, S.⁴

28
- 1842843480
- Statistical models for automatic performance tuning
- San Francisco, CA, May
- R. Vuduc, J. Demmel, and J. Bilmes. Statistical Models for Automatic Performance Tuning. In Proceedings of the 2001 International Conference on Computational Science (ICCS 2001), San Francisco, CA, May 2001.
- (2001) Proceedings of the 2001 International Conference on Computational Science (ICCS 2001)
- Vuduc, R.¹ Demmel, J.² Bilmes, J.³

29
- 0003418094
- Technical Report LAPACK Working Note 131, University of Tennessee
- R. Whaley and J. Dongarra. Automatically Tuned Linear Algebra Software. Technical Report UT CS-97-366, LAPACK Working Note No. 131, University of Tennessee, 1997.
- (1997) Automatically Tuned Linear Algebra Software
- Whaley, R.¹ Dongarra, J.²

30
- 85013942562
- A data locality optimizing algorithm
- Michael E. Wolf and Monica S. Lam. A data locality optimizing algorithm. In ACM SIGPLAN'91 Conference on Programming Language Design and Implementation, 1991.
- (1991) ACM SIGPLAN'91 Conference on Programming Language Design and Implementation
- Wolf, M.E.¹ Lam, M.S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.