SCOPUS Information Retrieval Platform

IEEE Transactions on Computers

Volume 53, Issue 5, 2004, Pages 547-566

Efficient and accurate analytical modeling of whole-program data cache behavior

(2) Xue, Jingling (a); Vera, Xavier (b)

a UNIVERSITY OF NEW SOUTH WALES (Australia)

b Institutionen för Datateknik Mälardalens Högskola (Sweden)

Author keywords

Analytical modeling; Cache memories; Data locality; Modeling techniques; Performance evaluation

Indexed keywords

COMPUTER HARDWARE; COMPUTER SIMULATION; HEURISTIC METHODS; OPTIMIZATION; PROGRAM COMPILERS; SAMPLING;

ANALYTICAL MODELING; DATA CACHE; DATA LOCALITY; PERFORMANCE EVALUATION;

BUFFER STORAGE;

EID: 3042664555 PISSN: 00189340 EISSN: None Source Type: Journal
DOI: 10.1109/TC.2004.1275296 Document Type: Article

Times cited: (23)

References (60)

1
- 84955578911
- Eliminating virtual function calls in C++ programs
- G. Aigner and U. Hölzle, "Eliminating Virtual Function Calls in C++ Programs," Proc. 10th European Conf. Object-Oriented Programming (ECOOP '96), pp. 142-166, 1996.
- (1996) Proc. 10th European Conf. Object-Oriented Programming (ECOOP '96) , pp. 142-166
- Aigner, G.¹ Hölzle, U.²

2
- 0030645124
- Exploiting hardware performance counters with flow and context sensitive profiling
- G. Ammons, T. Ball, and J. Larus, "Exploiting Hardware Performance Counters with Flow and Context Sensitive Profiling," Proc. ACM SIGPLAN '97 Conf. Programming Language Design and Implementation (PLDI '97), pp. 85-96, 1997.
- (1997) Proc. ACM SIGPLAN '97 Conf. Programming Language Design and Implementation (PLDI '97) , pp. 85-96
- Ammons, G.¹ Ball, T.² Larus, J.³

3
- 3242744876
- Ictineo: A tool for research on ILP
- E. Ayguadé, C. Barrado, A. González, J. Labarta, J. Llosa, D. López, S. Moreno, D. Padua, F. Reig, Q. Riera, and M. Valero, "Ictineo: A Tool for Research on ILP," Proc. Supercomputing '96, 1996.
- (1996) Proc. Supercomputing '96
- Ayguadé, E.¹ Barrado, C.² González, A.³ Labarta, J.⁴ Llosa, J.⁵ López, D.⁶ Moreno, S.⁷ Padua, D.⁸ Reig, F.⁹ Riera, Q.¹⁰ Valero, M.¹¹

4
- 1242313972
- A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness
- Nov.
- D. F. Bacon, J.-H. Chow, D.-C.R. Ju, K. Muthukumar, and V. Sarkar, "A Compiler Framework for Restructuring Data Declarations to Enhance Cache and TLB Effectiveness," Proc. IBM Centers for Advanced Studies Conf. (CASCON '94), pp. 270-282, Nov. 1994.
- (1994) Proc. IBM Centers for Advanced Studies Conf. (CASCON '94) , pp. 270-282
- Bacon, D.F.¹ Chow, J.-H.² Ju, D.-C.R.³ Muthukumar, K.⁴ Sarkar, V.⁵

5
- 84899747534
- An efficient solver for cache miss equations
- N. Bermudo, X. Vera, A. González, and J. Llosa, "An Efficient Solver for Cache Miss Equations," Proc. IEEE Int'l Symp. Performance Analysis of Systems and Software (ISPASS'00), 2000.
- Proc. IEEE Int'l Symp. Performance Analysis of Systems and Software (ISPASS'00), 2000
- Bermudo, N.¹ Vera, X.² González, A.³ Llosa, J.⁴

6
- 0026866013
- Profile-guided automatic inline expansion for C programs
- P.P. Chang, S.A. Mahlke, W.Y. Chen, and W.W. Hwu, "Profile-Guided Automatic Inline Expansion for C Programs," Software - Practice and Experience, vol. 25, pp. 249-369, 1992.
- (1992) Software - Practice and Experience , vol.25 , pp. 249-369
- Chang, P.P.¹ Mahlke, S.A.² Chen, W.Y.³ Hwu, W.W.⁴

7
- 0034832018
- Exact analysis of the cache behavior of nested loops
- S. Chatterjee, E. Parker, P.J. Hanlon, and A.R. Lebeck, "Exact Analysis of the Cache Behavior of Nested Loops," Proc. ACM SIGPLAN '01 Conf. Programming Language Design and Implementation (PLDI '01), pp. 286-297, 2001.
- (2001) Proc. ACM SIGPLAN '01 Conf. Programming Language Design and Implementation (PLDI '01) , pp. 286-297
- Chatterjee, S.¹ Parker, E.² Hanlon, P.J.³ Lebeck, A.R.⁴

8
- 0034836613
- Efficient representations and abstractions for quantifying and exploiting data reference locality
- T.M. Chilimbi, "Efficient Representations and Abstractions for Quantifying and Exploiting Data Reference Locality," Proc. ACM SIGPLAN '01 Conf. Programming Language Design and Implementation (PLDI '01), pp. 191-202, 2001.
- (2001) Proc. ACM SIGPLAN '01 Conf. Programming Language Design and Implementation (PLDI '01) , pp. 191-202
- Chilimbi, T.M.¹

9
- 0029717349
- Counting solutions to linear and non-linear constraints through Ehrhart polynomials
- P. Clauss, "Counting Solutions to Linear and Non-Linear Constraints through Ehrhart Polynomials," Proc. ACM Int'l Conf. Supercomputing (ICS '96), pp. 278-285, 1996.
- (1996) Proc. ACM Int'l Conf. Supercomputing (ICS '96) , pp. 278-285
- Clauss, P.¹

10
- 84976745804
- Tile size selection using cache organization and data layout
- June
- S. Coleman and K. S. McKinley, "Tile Size Selection Using Cache Organization and Data Layout," Proc. ACM SIGPLAN '95 Conf. Programming Language Design and Implementation (PLDI '95), pp. 279-290, June 1995.
- (1995) Proc. ACM SIGPLAN '95 Conf. Programming Language Design and Implementation (PLDI '95) , pp. 279-290
- Coleman, S.¹ McKinley, K.S.²

11
- 0004234657
- Addison-Wesley
- M. DeGroot, Probability and Statistics. Addison-Wesley, 1998.
- (1998) Probability and Statistics
- DeGroot, M.¹

12
- 0004007719
- Improving effective bandwidth through compiler enhancement of global dynamic cache reuse
- PhD thesis, Rice Univ.
- C. Ding, "Improving Effective Bandwidth through Compiler Enhancement of Global Dynamic Cache Reuse," PhD thesis, Rice Univ., 2000.
- (2000)
- Ding, C.¹

13
- 84981274540
- Improving effective bandwidth through compiler enhancement of global cache reuse
- C. Ding and K. Kennedy, "Improving Effective Bandwidth through Compiler Enhancement of Global Cache Reuse," Proc. 2001 Int'l Parallel and Distributed Processing Symp. (IPDPS '01), Apr. 2001.
- Proc. 2001 Int'l Parallel and Distributed Processing Symp. (IPDPS '01), Apr. 2001
- Ding, C.¹ Kennedy, K.²

14
- 0028530861
- The Polaris internal representation
- Oct.
- K.A. Faigin, J.P. Hoeflinger, D.A. Padua, P.M. Petersen, and S.A. Weatherford, "The Polaris Internal Representation," Int'l J. Parallel Programming, vol. 22, no. 5, pp. 553-586, Oct. 1994.
- (1994) Int'l J. Parallel Programming , vol.22 , Issue.5 , pp. 553-586
- Faigin, K.A.¹ Hoeflinger, J.P.² Padua, D.A.³ Petersen, P.M.⁴ Weatherford, S.A.⁵

15
- 0001023389
- Parametric integer programming
- P. Feautrier, "Parametric Integer Programming," Operations Research, vol. 22, pp. 243-268, 1988.
- (1988) Operations Research , vol.22 , pp. 243-268
- Feautrier, P.¹

16
- 84957027384
- Automatic parallelization in the polytope model
- G.R. Perrin and A. Darte, eds.; Springer Verlag
- P. Feautrier, "Automatic Parallelization in the Polytope Model," The Data Parallel Programming Model, G.R. Perrin and A. Darte, eds., pp. 79-103, Springer Verlag, 1996.
- (1996) The Data Parallel Programming Model , pp. 79-103
- Feautrier, P.¹

17
- 0002461724
- Applying compiler techniques to cache behavior prediction
- C. Ferdinand, F. Martin, and R. Wilhelm, "Applying Compiler Techniques to Cache Behavior Prediction," Proc. ACM SIGPLAN Workshop Languages, Compilers, and Tools for Real-Time System (LCTRTS '97), pp. 37-46, 1997.
- (1997) Proc. ACM SIGPLAN Workshop Languages, Compilers, and Tools for Real-Time System (LCTRTS '97) , pp. 37-46
- Ferdinand, C.¹ Martin, F.² Wilhelm, R.³

18
- 85015240805
- On estimating and enhancing cache effectiveness
- J. Ferrante, V. Sarkar, and W. Thrash, "On Estimating and Enhancing Cache Effectiveness," Proc. Fourth Workshop Compilers for Parallel Computers, pp. 328-343, 1991.
- (1991) Proc. Fourth Workshop Compilers for Parallel Computers , pp. 328-343
- Ferrante, J.¹ Sarkar, V.² Thrash, W.³

19
- 0032089580
- Modeling set associative caches behavior for irregular computations
- June
- B.B. Fraguela, R. Doallo, and E.L. Zapata, "Modeling Set Associative Caches Behavior for Irregular Computations," ACM Performance Evaluation Rev., vol. 26, no. 1, pp. 192-201, June 1998.
- (1998) ACM Performance Evaluation Rev. , vol.26 , Issue.1 , pp. 192-201
- Fraguela, B.B.¹ Doallo, R.² Zapata, E.L.³

20
- 0033358624
- Automatic analytical modeling for the estimation of cache misses
- B.B. Fraguela, R. Doallo, and E.L. Zapata, "Automatic Analytical Modeling for the Estimation of Cache Misses," Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '99), 1999.
- Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '99), 1999
- Fraguela, B.B.¹ Doallo, R.² Zapata, E.L.³

21
- 0001366267
- Strategies for cache and local memory management by global program transformations
- D. Gannon, W. Jalby, and K. Gallivan, "Strategies for Cache and Local Memory Management by Global Program Transformations," J. Parallel and Distributed Computing, vol. 5, pp. 587-616, 1988.
- (1988) J. Parallel and Distributed Computing , vol.5 , pp. 587-616
- Gannon, D.¹ Jalby, W.² Gallivan, K.³

22
- 0001714824
- Cache miss equations: A compiler framework for analyzing and tuning memory behavior
- S. Ghosh, M. Martonosi, and S. Malik, "Cache Miss Equations: A Compiler Framework for Analyzing and Tuning Memory Behavior," ACM Trans. Programming Languages and Systems, vol. 21, no. 4, pp. 703-746, 1999.
- (1999) ACM Trans. Programming Languages and Systems , vol.21 , Issue.4 , pp. 703-746
- Ghosh, S.¹ Martonosi, M.² Malik, S.³

23
- 0005329615
- Procedure placement using temporal-ordering information
- N. Gloy and M.D. Smith, "Procedure Placement Using Temporal-Ordering Information," ACM Trans. Programming Languages and Systems, vol. 21, no. 5, pp. 1028-1075, 1999.
- (1999) ACM Trans. Programming Languages and Systems , vol.21 , Issue.5 , pp. 1028-1075
- Gloy, N.¹ Smith, M.D.²

24
- 0003630067
- A comparison of locality transformations for irregular codes
- H. Han and C.-W. Tseng, "A Comparison of Locality Transformations for Irregular Codes," Proc. Fifth Workshop Languages, Compilers, and Run-Time Systems for Scalable Computers, May 2000.
- Proc. Fifth Workshop Languages, Compilers, and Run-Time Systems for Scalable Computers, May 2000
- Han, H.¹ Tseng, C.-W.²

25
- 0033204190
- Analytical modeling of set-associative caches
- Oct.
- J.S. Harper, D.J. Kerbyson, and G.R. Nudd, "Analytical Modeling of Set-Associative Caches," IEEE Trans. Computers, vol. 48, no. 10, pp. 1009-1024, Oct. 1999.
- (1999) IEEE Trans. Computers , vol.48 , Issue.10 , pp. 1009-1024
- Harper, J.S.¹ Kerbyson, D.J.² Nudd, G.R.³

26
- 0004302191
- first ed. Morgan Kaufmann
- J.L. Hennessy and D.A. Patterson, Computer Architecture: A Quantitative Approach, first ed. Morgan Kaufmann, 1996.
- (1996) Computer Architecture: A Quantitative Approach
- Hennessy, J.L.¹ Patterson, D.A.²

27
- 12344315233
- DineroIII: A uniprocessor cache simulator
- M. Hill, "DineroIII: A Uniprocessor Cache Simulator," http://www.cs.wisc.edu/~larus/warts.html, 2004.
- (2004)
- Hill, M.¹

28
- 0032652980
- Nonlinear array layout for hierarchical memory systems
- June
- S. Chatterjee, V.V. Jain, A.R. Lebeck, S. Mundhra, and M. Thottethodi, "Nonlinear Array Layout for Hierarchical Memory Systems," Proc. ACM Int'l Conf. Supercomputing (ICS '99), pp. 444-453, June 1999.
- (1999) Proc. ACM Int'l Conf. Supercomputing (ICS '99) , pp. 444-453
- Chatterjee, S.¹ Jain, V.V.² Lebeck, A.R.³ Mundhra, S.⁴ Thottethodi, M.⁵

29
- 0033077834
- A linear algebra framework for automatic determination of optimal data layouts
- Feb.
- M. Kandemir, A. Choudhary, P. Banerjee, and J. Ramanujam, "A Linear Algebra Framework for Automatic Determination of Optimal Data Layouts," IEEE Trans. Parallel and Distributed Systems, vol. 10, no. 2, pp. 115-135, Feb. 1999.
- (1999) IEEE Trans. Parallel and Distributed Systems , vol.10 , Issue.2 , pp. 115-135
- Kandemir, M.¹ Choudhary, A.² Banerjee, P.³ Ramanujam, J.⁴

30
- 84976736383
- Page placement algorithms for large real-index caches
- R.E. Kessler and M.D. Hill, "Page Placement Algorithms for Large Real-Index Caches," ACM Trans. Computer Systems, vol. 10, no. 4, pp. 338-359, 1992.
- (1992) ACM Trans. Computer Systems , vol.10 , Issue.4 , pp. 338-359
- Kessler, R.E.¹ Hill, M.D.²

31
- 0030685988
- Data-centric multi-level blocking
- I. Kodukula, N. Ahmed, and K. Pingali, "Data-Centric Multi-Level Blocking," Proc. ACM SIGPLAN '97 Conf. Programming Language Design and Implementation (PLDI '97), pp. 346-357, 1997.
- (1997) Proc. ACM SIGPLAN '97 Conf. Programming Language Design and Implementation (PLDI '97) , pp. 346-357
- Kodukula, I.¹ Ahmed, N.² Pingali, K.³

32
- 0026137116
- The cache performance and optimizations of blocked algorithms
- Apr.
- M.S. Lam, E.E. Rothberg, and M.E. Wolf, "The Cache Performance and Optimizations of Blocked Algorithms," Proc. Fourth Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS '91), pp. 63-74, Apr. 1991.
- (1991) Proc. Fourth Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS '91) , pp. 63-74
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

33
- 85029516676
- Loop parallelization in the polytope model
- E. Best, ed.
- C. Lengauer, "Loop Parallelization in the Polytope Model," Proc. Int'l Conf. Concurrency Theory (CONCUR '93), E. Best, ed., pp. 398-416, 1993.
- (1993) Proc. Int'l Conf. Concurrency Theory (CONCUR '93) , pp. 398-416
- Lengauer, C.¹

34
- 84978485471
- MemSpy: Analyzing memory system bottlenecks in programs
- M. Martonosi, A. Gupta, and T. Anderson, "MemSpy: Analyzing Memory System Bottlenecks in Programs," Proc. ACM SIGMETRICS '92 Conf. Measurement and Modeling of Computer Systems, pp. 1-12, 1992.
- (1992) Proc. ACM SIGMETRICS '92 Conf. Measurement and Modeling of Computer Systems , pp. 1-12
- Martonosi, M.¹ Gupta, A.² Anderson, T.³

35
- 3042676705
- Solving systems of affine (In)equalities: PIP's user's guide
- The PIP System, "Solving Systems of Affine (In)Equalities: PIP's User's Guide," http://www.prism.uvsq.fr/~paf, 2002.
- (2002)

36
- 3042532547
- SUIF: An infrastructure for research on parallelizing and optimizing compilers
- The SUIF Compiler Group, "SUIF: An Infrastructure for Research on Parallelizing and Optimizing Compilers," http://suif.stanford.edu, 2004.
- (2004)

37
- 0024859772
- Program optimization for instruction caches
- Apr.
- S. McFarling, "Program Optimization for Instruction Caches," Proc. Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS '89), pp. 183-191, Apr. 1989.
- (1989) Proc. Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS '89) , pp. 183-191
- McFarling, S.¹

38
- 0030190854
- Improving data locality with loop transformations
- July
- K. McKinley, S. Carr, and C.-W. Tseng, "Improving Data Locality with Loop Transformations," ACM Trans. Programming Languages and Systems, vol. 18, no. 4, pp. 424-453, July 1996.
- (1996) ACM Trans. Programming Languages and Systems , vol.18 , Issue.4 , pp. 424-453
- McKinley, K.¹ Carr, S.² Tseng, C.-W.³

39
- 0003665539
- Quantifying loop nest locality using SPEC '95 and the perfect benchmarks
- Sept.
- K.S. McKinley and O. Temam, "Quantifying Loop Nest Locality Using SPEC '95 and the Perfect Benchmarks," ACM Trans. Computer Systems, vol. 17, no. 4, pp. 288-336, Sept. 1999.
- (1999) ACM Trans. Computer Systems , vol.17 , Issue.4 , pp. 288-336
- McKinley, K.S.¹ Temam, O.²

40
- 1542601822
- Improving memory hierarchy performance for irregular applications using data and computation reorderings
- J.M. Mellor-Crummey, D.B. Whalley, and K. Kennedy, "Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings," Int'l J. Parallel Programming, vol. 29, no. 3, pp. 217-247, 2001.
- (2001) Int'l J. Parallel Programming , vol.29 , Issue.3 , pp. 217-247
- Mellor-Crummey, J.M.¹ Whalley, D.B.² Kennedy, K.³

41
- 0003690936
- Software methods for improvements of cache performance on supercomputer applications
- PhD thesis, Dept. of Computer Science, Rice Univ., May
- A.K. Porterfield, "Software Methods for Improvements of Cache Performance on Supercomputer Applications," PhD thesis, Dept. of Computer Science, Rice Univ., May 1989.
- (1989)
- Porterfield, A.K.¹

42
- 84976676720
- The omega test: A fast and practical integer programming algorithm for dependence analysis
- Aug.
- W. Pugh, "The Omega Test: A Fast and Practical Integer Programming Algorithm for Dependence Analysis," Comm. ACM, vol. 35, no. 8, pp. 102-114, Aug. 1992.
- (1992) Comm. ACM , vol.35 , Issue.8 , pp. 102-114
- Pugh, W.¹

43
- 0028132512
- Computing solutions to presburger formulas: How and why
- W. Pugh, "Computing Solutions to Presburger Formulas: How and Why," Proc. ACM SIGPLAN '94 Conf. Programming Language Design and Implementation (PLDI '94), pp. 121-134, 1994.
- (1994) Proc. ACM SIGPLAN '94 Conf. Programming Language Design and Implementation (PLDI '94) , pp. 121-134
- Pugh, W.¹

44
- 0031622954
- Data transformations for eliminating conflict misses
- G. Rivera and C.-W. Tseng, "Data Transformations for Eliminating Conflict Misses," Proc. ACM SIGPLAN '98 Conf. Programming Language Design and Implementation (PLDI '98), pp. 38-49, 1998.
- (1998) Proc. ACM SIGPLAN '98 Conf. Programming Language Design and Implementation (PLDI '98) , pp. 38-49
- Rivera, G.¹ Tseng, C.-W.²

45
- 0032635362
- New tiling techniques to improve cache temporal locality
- May
- Y. Song and Z. Li, "New Tiling Techniques to Improve Cache Temporal Locality," Proc. ACM SIGPLAN '99 Conf. Programming Language Design and Implementation (PLDI '99), pp. 215-228, May 1999.
- (1999) Proc. ACM SIGPLAN '99 Conf. Programming Language Design and Implementation (PLDI '99) , pp. 215-228
- Song, Y.¹ Li, Z.²

46
- 85093111577
- An empirical study of method inlining for a Java just-in-time compiler
- T. Suganuma, T. Yasue, and T. Nakatani, "An Empirical Study of Method Inlining for a Java Just-in-Time Compiler," Proc. Second Java Virtual Machine Research and Technology Symp., 2002.
- Proc. Second Java Virtual Machine Research and Technology Symp., 2002
- Suganuma, T.¹ Yasue, T.² Nakatani, T.³

47
- 0028429842
- Cache interference phenomena
- O. Temam, C. Fricker, and W. Jalby, "Cache Interference Phenomena," Proc. ACM SIGMETRICS '94 Conf. Measurement and Modeling of Computer Systems, pp. 261-271, 1994.
- (1994) Proc. ACM SIGMETRICS '94 Conf. Measurement and Modeling of Computer Systems , pp. 261-271
- Temam, O.¹ Fricker, C.² Jalby, W.³

48
- 0027764718
- To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
- O. Temam, E. Granston, and W. Jalby, "To Copy or Not to Copy: A Compile-Time Technique for Assessing when Data Copying Should Be Used to Eliminate Cache Conflicts," Proc. Supercomputing '93, pp. 410-419, 1993.
- (1993) Proc. Supercomputing '93 , pp. 410-419
- Temam, O.¹ Granston, E.² Jalby, W.³

49
- 85031661900
- Characterizing the behavior of sparse algorithms on caches
- O. Temam and W. Jalby, "Characterizing the Behavior of Sparse Algorithms on Caches," Proc. Supercomputing '92, pp. 578-587, 1992.
- (1992) Proc. Supercomputing '92 , pp. 578-587
- Temam, O.¹ Jalby, W.²

50
- 0032304622
- Optimizing the instruction cache performance of the operating system
- J. Torrellas, C. Xia, and R.L. Daigle, "Optimizing the Instruction Cache Performance of the Operating System," IEEE Trans. Computers, vol. 47, no. 12, pp. 1363-1381, 1998.
- (1998) IEEE Trans. Computers , vol.47 , Issue.12 , pp. 1363-1381
- Torrellas, J.¹ Xia, C.² Daigle, R.L.³

51
- 0031153459
- Trace-driven memory simulation: A survey
- Sept.
- R.A. Uhlig and T.N. Mudge, "Trace-Driven Memory Simulation: A Survey," ACM Computing Surveys, vol. 29, no. 3, pp. 128-170, Sept. 1997.
- (1997) ACM Computing Surveys , vol.29 , Issue.3 , pp. 128-170
- Uhlig, R.A.¹ Mudge, T.N.²

52
- 33847139399
- Near-optimal padding for removing conflict misses
- X. Vera, J. Llosa, and A. González, "Near-Optimal Padding for Removing Conflict Misses," Proc. 15th Workshop Languages and Compilers for Parallel Computers (LCPC '02), July 2002.
- Proc. 15th Workshop Languages and Compilers for Parallel Computers (LCPC '02), July 2002
- Vera, X.¹ Llosa, J.² González, A.³

53
- 33646187750
- A fast and accurate approach to analyze cache memory behavior
- X. Vera, J. Llosa, A. González, and N. Bermudo, "A Fast and Accurate Approach to Analyze Cache Memory Behavior," Proc. European Conf. Parallel Computing (Europar '00), 2000.
- Proc. European Conf. Parallel Computing (Europar '00), 2000
- Vera, X.¹ Llosa, J.² González, A.³ Bermudo, N.⁴

54
- 84949805844
- Let's study whole-program cache behaviour analytically
- Feb.
- X. Vera and J. Xue, "Let's Study Whole-Program Cache Behaviour Analytically," Proc. Int'l Symp. High Performance Computer Architecture (HPCA-8), pp. 175-186, Feb. 2002.
- (2002) Proc. Int'l Symp. High Performance Computer Architecture (HPCA-8) , pp. 175-186
- Vera, X.¹ Xue, J.²

55
- 0031369396
- Timing analysis of data caches and set-associative caches
- R. White, F. Mueller, C. Healy, D. Whalley, and M.G. Harmon, "Timing Analysis of Data Caches and Set-Associative Caches," Proc. Third IEEE Real-Time Technology and Applications Symp. (RTAS '97), June 1997.
- Proc. Third IEEE Real-Time Technology and Applications Symp. (RTAS '97), June 1997
- White, R.¹ Mueller, F.² Healy, C.³ Whalley, D.⁴ Harmon, M.G.⁵

56
- 0004005802
- A library for doing polyhedral operations
- Technical Report 785, Oregon State Univ.
- D. Wilde, "A Library for Doing Polyhedral Operations," Technical Report 785, Oregon State Univ., 1993.
- (1993)
- Wilde, D.¹

57
- 84976827033
- A data locality optimizing algorithm
- June
- M.E. Wolf and M.S. Lam, "A Data Locality Optimizing Algorithm," Proc. ACM SIGPLAN '91 Conf. Programming Language Design and Implementation (PLDI '91), pp. 30-44, June 1991.
- (1991) Proc. ACM SIGPLAN '91 Conf. Programming Language Design and Implementation (PLDI '91) , pp. 30-44
- Wolf, M.E.¹ Lam, M.S.²

58
- 0031079360
- Unimodular transformations of non-perfectly nested loops
- J. Xue, "Unimodular Transformations of Non-Perfectly Nested Loops," Parallel Computing, vol. 22, no. 12, pp. 1621-1645, 1997.
- (1997) Parallel Computing , vol.22 , Issue.12 , pp. 1621-1645
- Xue, J.¹

59
- 0442303278
- Boston: Kluwer Academic, Aug.
- J. Xue, Loop Tiling for Parallelism. Boston: Kluwer Academic, Aug. 2000.
- (2000) Loop Tiling for Parallelism
- Xue, J.¹

60
- 0032315190
- Reuse-driven tiling for improving data locality
- J. Xue and C.-H. Huang, "Reuse-Driven Tiling for Improving Data Locality," Int'l J. Parallel Programming, vol. 26, no. 6, pp. 671-696, 1998.
- (1998) Int'l J. Parallel Programming , vol.26 , Issue.6 , pp. 671-696
- Xue, J.¹ Huang, C.-H.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.