SCOPUS Information Retrieval Platform

Volume 51, Issue 3, 2002, Pages 327-345

Efficient representation scheme for multidimensional array operations

Authors (3): Lin, Chun Yuan; Liu, Jen Shiuh; Chung, Yeh Ching

Author keywords

Array operations; Data structure; Extended Karnaugh map representation; Multidimensional arrays; Traditional matrix representation

Indexed keywords

ALGORITHMS; DATA STRUCTURES; FORTRAN (PROGRAMMING LANGUAGE); LOGIC DESIGN; MATRIX ALGEBRA; PROGRAM COMPILERS;

ARRAY OPERATIONS; EXTENDED KARNAUGH MAP REPRESENTATION; MATRIX TRANSFORMATION METHOD; MULTIDIMENSIONAL ARRAYS; TRADITIONAL MATRIX REPRESENTATION;

PARALLEL PROCESSING SYSTEMS;

EID: 0036523329 PISSN: 00189340 EISSN: None Source Type: Journal
DOI: 10.1109/12.990130 Document Type: Article

Times cited : (40)

References (35)

1
- 0003429675
- Intertext Publications/McGraw-Hill Inc.
- J.C. Adams, W.S. Brainerd, J.T. Martin, B.T. Smith, and J.L. Wagener, Fortran 90 Handbook. Intertext Publications/McGraw-Hill Inc. 1992.
- (1992) Fortran 90 Handbook
- Adams, J.C.¹ Brainerd, W.S.² Martin, J.T.³ Smith, B.T.⁴ Wagener, J.L.⁵

2
- 0029429935
- Balancing processor loads and exploiting data locality in N-body simulations
- I. Banicescu and S.F. Hummel, "Balancing Processor Loads and Exploiting Data Locality in N-Body Simulations," Proc. 1995 ACM/IEEE Supercomputing Conf., Dec. 1995.
- Proc. 1995 ACM/IEEE Supercomputing Conf., Dec. 1995
- Banicescu, I.¹ Hummel, S.F.²

3
- 0025447908
- Improving register allocation for subscripted variables
- June
- D. Callahan, S. Carr, and K. Kennedy, "Improving Register Allocation for Subscripted Variables," Proc. ACM SIGPLAN 1990 Conf. Programming Language Design and Implementation, pp. 53-65, June 1990.
- (1990) Proc. ACM SIGPLAN 1990 Conf. Programming Language Design and Implementation , pp. 53-65
- Callahan, D.¹ Carr, S.² Kennedy, K.³

4
- 84976831704
- Compiler optimizations for improving data locality
- Oct.
- S. Carr, K.S. McKinley, and C.-W. Tseng, "Compiler Optimizations for Improving Data Locality," Proc. Sixth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 252-262, Oct. 1994.
- (1994) Proc. Sixth Int'l Conf. Architectural Support for Programming Languages and Operating Systems , pp. 252-262
- Carr, S.¹ McKinley, K.S.² Tseng, C.-W.³

5
- 0029235623
- Hierarchical tiling for improved superscalar performance
- Apr.
- L. Carter, J. Ferrante, and S.F. Hummel, "Hierarchical Tiling for Improved Superscalar Performance," Proc. Ninth Int'l Symp. Parallel Processing, pp. 239-245, Apr. 1995.
- (1995) Proc. Ninth Int'l Symp. Parallel Processing , pp. 239-245
- Carter, L.¹ Ferrante, J.² Hummel, S.F.³

6
- 0032659795
- Recursive array layouts and fast parallel matrix multiplication
- June
- S. Chatterjee, A.R. Lebeck, P.K. Patnala, and M. Thottethodi, "Recursive Array Layouts and Fast Parallel Matrix Multiplication," Proc. Eleventh Ann. ACM Symp. Parallel Algorithms and Architectures, pp. 222-231, June 1999.
- (1999) Proc. Eleventh Ann. ACM Symp. Parallel Algorithms and Architectures , pp. 222-231
- Chatterjee, S.¹ Lebeck, A.R.² Patnala, P.K.³ Thottethodi, M.⁴

7
- 0032652980
- Nonlinear array layouts for hierarchical memory systems
- June
- S. Chatterjee, V.V. Jain, A.R. Lebeck, S. Mundhra, and M. Thottethodi, "Nonlinear Array Layouts for Hierarchical Memory Systems," Proc. 1999 ACM Int'l Conf. Supercomputing, pp. 444-453, June 1999.
- (1999) Proc. 1999 ACM Int'l Conf. Supercomputing , pp. 444-453
- Chatterjee, S.¹ Jain, V.V.² Lebeck, A.R.³ Mundhra, S.⁴ Thottethodi, M.⁵

8
- 0003795618
- Unifying data and control transformations for distributed shared memory machines
- Technical Report TR-542, Dept. of Computer-Science, Univ. of Rochester, Nov.
- M. Cierniak and W. Li, "Unifying Data and Control Transformations for Distributed Shared Memory Machines," Technical Report TR-542, Dept. of Computer-Science, Univ. of Rochester, Nov. 1994.
- (1994)
- Cierniak, M.¹ Li, W.²

9
- 84976745804
- Tile size selection using cache organization and data layout
- June
- S. Coleman and K.S. McKinley, "Tile Size Selection Using Cache Organization and Data Layout," Proc. ACM SIGPLAN '95 Conf. Programming Language Design and Implementation, pp. 279-290, June 1995.
- (1995) Proc. ACM SIGPLAN '95 Conf. Programming Language Design and Implementation , pp. 279-290
- Coleman, S.¹ McKinley, K.S.²

10
- 0003462314
- Boston, Mass.: Birkhauser
- J.K. Cullum and R.A. Willoughby, Algorithms for Large Symmetric Eigenvalue Computations, vol. 1. Boston, Mass.: Birkhauser, 1985.
- (1985) Algorithms for Large Symmetric Eigenvalue Computations , vol.1
- Cullum, J.K.¹ Willoughby, R.A.²

11
- 84882715552
- Cache misses prediction for high performance sparse algorithms
- Sept.
- B.B. Fraguela, R. Doallo, and E.L. Zapata, "Cache Misses Prediction for High Performance Sparse Algorithms," Proc. Fourth Int'l Euro-Par Conf. (Euro-Par '98), pp. 224-233, Sept. 1998.
- (1998) Proc. Fourth Int'l Euro-Par Conf. (Euro-Par '98) , pp. 224-233
- Fraguela, B.B.¹ Doallo, R.² Zapata, E.L.³

12
- 0011916940
- Cache probabilistic modeling for basic sparse algebra kernels involving matrices with a non-uniform distribution
- June
- B.B. Fraguela, R. Doallo, and E.L. Zapata, "Cache Probabilistic Modeling for Basic Sparse Algebra Kernels Involving Matrices with a Non-Uniform Distribution," Proc. 24th IEEE Euromicro Conf., pp. 345-348, June 1998.
- (1998) Proc. 24th IEEE Euromicro Conf. , pp. 345-348
- Fraguela, B.B.¹ Doallo, R.² Zapata, E.L.³

13
- 0032089580
- Modeling set associative caches behaviour for irregular computations
- June
- B.B. Fraguela, R. Doallo, and E.L. Zapata, "Modeling Set Associative Caches Behaviour for Irregular Computations," ACM Int'l Conf. Measurement and Modeling of Computer Systems (SIGMETRICS '98), pp. 192-201, June 1998.
- (1998) ACM Int'l Conf. Measurement and Modeling of Computer Systems (SIGMETRICS '98) , pp. 192-201
- Fraguela, B.B.¹ Doallo, R.² Zapata, E.L.³

14
- 0033358624
- Automatic analytical modeling for the estimation of cache misses
- B.B. Fraguela, R. Doallo, and E.L. Zapata, "Automatic Analytical Modeling for the Estimation of Cache Misses," Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '99), Oct. 1999.
- Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '99), Oct. 1999
- Fraguela, B.B.¹ Doallo, R.² Zapata, E.L.³

15
- 0030688479
- Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code
- J.D. Frens and D.S. Wise, "Auto-Blocking Matrix-Multiplication or Tracking BLAS3 Performance from Source Code," Proc. Sixth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, June 1997.
- Proc. Sixth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, June 1997
- Frens, J.D.¹ Wise, D.S.²

16
- 0004236492
- Baltimore, Md: Johns Hopkins Univ. Press
- G.H. Golub and C.F. Van Loan, Matrix Computations, Second ed. Baltimore, Md: Johns Hopkins Univ. Press, 1989.
- (1989) Matrix Computations, Second Ed.
- Golub, G.H.¹ Van Loan, C.F.²

17
- 0033075413
- Improving cache locality by a combination of loop and data transformations
- Feb.
- M. Kandemir, J. Ramanujam, and A. Choudhary, "Improving Cache Locality by a Combination of Loop and Data Transformations," IEEE Trans. Computers, Feb. 1999.
- (1999) IEEE Trans. Computers
- Kandemir, M.¹ Ramanujam, J.² Choudhary, A.³

18
- 0030662867
- A compiler algorithm for optimizing locality in loop nests
- July
- M. Kandemir, J. Ramanujam, and A. Choudhary, "A Compiler Algorithm for Optimizing Locality in Loop Nests," Proc. 1997 ACM Int'l Conf. Supercomputing, pp. 269-276, July 1997.
- (1997) Proc. 1997 ACM Int'l Conf. Supercomputing , pp. 269-276
- Kandemir, M.¹ Ramanujam, J.² Choudhary, A.³

19
- 84863051745
- The SPARAMAT approach to automatic comprehension of sparse matrix computations
- C.W. Kessler and C.H. Smith, "The SPARAMAT Approach to Automatic Comprehension of Sparse Matrix Computations," Proc. Seventh Int'l Workshop Program Comprehension, pp. 200-207, 1999.
- (1999) Proc. Seventh Int'l Workshop Program Comprehension , pp. 200-207
- Kessler, C.W.¹ Smith, C.H.²

20
- 12444274571
- Compiling parallel sparse code for user-defined data structures
- V. Kotlyar, K. Pingali, and P. Stodghill, "Compiling Parallel Sparse Code for User-Defined Data Structures," Proc. Eighth SIAM Conf. Parallel Processing for Scientific Computing, Mar. 1997.
- Proc. Eighth SIAM Conf. Parallel Processing for Scientific Computing, Mar. 1997
- Kotlyar, V.¹ Pingali, K.² Stodghill, P.³

21
- 10844257146
- A relation approach to the compilation of sparse matrix programs
- Aug.
- V. Kotlyar, K. Pingali, and P. Stodghill, "A Relation Approach to the Compilation of Sparse Matrix Programs," Euro Par, Aug. 1997.
- (1997) Euro Par
- Kotlyar, V.¹ Pingali, K.² Stodghill, P.³

22
- 84900322507
- Compiling parallel code for sparse matrix applications
- V. Kotlyar, K. Pingali, and P. Stodghill, "Compiling Parallel Code for Sparse Matrix Applications," Proc. Supercomputing Conf., Aug. 1997.
- Proc. Supercomputing Conf., Aug. 1997
- Kotlyar, V.¹ Pingali, K.² Stodghill, P.³

23
- 84989868541
- A tensor product formulation of Strassen's matrix multiplication algorithm with memory reduction
- Apr.
- B. Kumar, C-H. Huang, R.W. Johnson, and P. Sadayappan, "A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction," Proc. Seventh Int'l Parallel Processing Symp., pp. 582-588, Apr. 1993.
- (1993) Proc. Seventh Int'l Parallel Processing Symp. , pp. 582-588
- Kumar, B.¹ Huang, C.-H.² Johnson, R.W.³ Sadayappan, P.⁴

24
- 0026137116
- The cache performance and optimizations of blocked algorithms
- Apr.
- M.S. Lam, E.E. Rothberg, and M.E. Wolf, "The Cache Performance and Optimizations of Blocked Algorithms," Proc. Fourth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 63-74, Apr. 1991.
- (1991) Proc. Fourth Int'l Conf. Architectural Support for Programming Languages and Operating Systems , pp. 63-74
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

25
- 0003378935
- A singular loop transformation framework based on non-singular matrices
- W. Li and K. Pingali, "A Singular Loop Transformation Framework Based on Non-Singular Matrices," Proc. Fifth Workshop Languages and Compilers for Parallel Computers, pp. 249-260, 1992.
- (1992) Proc. Fifth Workshop Languages and Compilers for Parallel Computers , pp. 249-260
- Li, W.¹ Pingali, K.²

26
- 0011977499
- Efficient representation for multi-dimensional matrix operations
- Mar.
- J.-S. Liu, J.-Y. Lin, and Y.-C. Chung, "Efficient Representation for Multi-Dimensional Matrix Operations," Proc. Workshop Compiler Techniques for High Performance Computing (CTHPC), pp. 133-142, Mar. 2000.
- (2000) Proc. Workshop Compiler Techniques for High Performance Computing (CTHPC) , pp. 133-142
- Liu, J.-S.¹ Lin, J.-Y.² Chung, Y.-C.³

27
- 84907042187
- Efficient parallel algorithms for multi-dimensional matrix operations
- Dec.
- J.-S. Liu, J.-Y. Lin, and Y.-C. Chung, "Efficient Parallel Algorithms for Multi-Dimensional Matrix Operations," Proc. IEEE Int'l Symp. Parallel Architectures, Algorithms and Networks (I-SPAN), pp. 224-229, Dec. 2000.
- (2000) Proc. IEEE Int'l Symp. Parallel Architectures, Algorithms and Networks (I-SPAN) , pp. 224-229
- Liu, J.-S.¹ Lin, J.-Y.² Chung, Y.-C.³

28
- 0030190854
- Improving data locality with loop transformations
- July
- K.S. McKinley, S. Carr, and C.-W. Tseng, "Improving Data Locality with Loop Transformations," ACM Trans. Programming Languages and Systems, July 1996.
- (1996) ACM Trans. Programming Languages and Systems
- McKinley, K.S.¹ Carr, S.² Tseng, C.-W.³

29
- 84968921118
- Integrating loop and data transformations for global optimization
- Oct.
- M.F.P. O'Boyle and P.M.W. Knijnenburg, "Integrating Loop and Data Transformations for Global Optimization," Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '98), pp. 12-19, Oct. 1998.
- (1998) Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '98) , pp. 12-19
- O'Boyle, M.F.P.¹ Knijnenburg, P.M.W.²

30
- 0004161838
- Cambridge Univ. Press
- W.H. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery, Numerical Recipes in Fortran 90: The Art of Parallel Scientific Computing. Cambridge Univ. Press, 1996.
- (1996) Numerical Recipes in Fortran 90: The Art of Parallel Scientific Computing
- Press, W.H.¹ Teukolsky, S.A.² Vetterling, W.T.³ Flannery, B.P.⁴

31
- 0031698424
- Caching efficient multithreaded fast multiplication of sparse matrices
- P.D. Sulatycke and K. Ghose, "Caching Efficient Multithreaded Fast Multiplication of Sparse Matrices," Proc. First Merged Int'l Parallel Processing Symp. and Symp. Parallel and Distributed Processing, pp. 117-123, 1998.
- (1998) Proc. First Merged Int'l Parallel Processing Symp. and Symp. Parallel and Distributed Processing , pp. 117-123
- Sulatycke, P.D.¹ Ghose, K.²

32
- 0011916941
- Tuning Strassen's matrix multiplication for memory efficiency
- M. Thottethodi, S. Chatterjee, and A.R. Lebeck, "Tuning Strassen's Matrix Multiplication for Memory Efficiency," Proc. ACM/IEEE SC98 Conf. High Performance Networking and Computing, Nov. 1998.
- Proc. ACM/IEEE SC98 Conf. High Performance Networking and Computing, Nov. 1998
- Thottethodi, M.¹ Chatterjee, S.² Lebeck, A.R.³

33
- 0030295713
- Parallelization techniques for sparse matrix applications
- M. Ujaldon, E.L. Zapata, S.D. Sharma, and J. Saltz, "Parallelization Techniques for Sparse Matrix Applications," J. Parallel and Distributed Computing, 1996.
- (1996) J. Parallel and Distributed Computing
- Ujaldon, M.¹ Zapata, E.L.² Sharma, S.D.³ Saltz, J.⁴

34
- 84976827033
- A data locality optimizing algorithm
- June
- M. Wolf and M. Lam, "A Data Locality Optimizing Algorithm," Proc. ACM SIGPLAN '91 Conf. Programming Language Design and Implementation, pp. 30-44, June 1991.
- (1991) Proc. ACM SIGPLAN '91 Conf. Programming Language Design and Implementation , pp. 30-44
- Wolf, M.¹ Lam, M.²

35
- 85028864961
- Run-time optimization of sparse matrix-vector multiplication on SIMD machines
- July
- L.H. Ziantz, C.C. Ozturan, and B.K. Szymanski, "Run-Time Optimization of Sparse Matrix-Vector Multiplication on SIMD Machines," Proc. Int'l Conf. Parallel Architectures and Languages, pp. 313-322, July 1994.
- (1994) Proc. Int'l Conf. Parallel Architectures and Languages , pp. 313-322
- Ziantz, L.H.¹ Ozturan, C.C.² Szymanski, B.K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.