-
1
-
-
33846012075
-
-
A. V. Aho, R. Sethi, and J. D. Ullman. Compilers:, Addison-Wesley
-
A. V. Aho, R. Sethi, and J. D. Ullman. Compilers: Principles, Techniques and Tools. Addison-Wesley, 1986.
-
(1986)
Principles, Techniques and Tools
-
-
-
2
-
-
0030201485
-
Pc software performance tuning
-
M. Atkins and R. Subramaniam. Pc software performance tuning. IEEE Computer, 29:47-54, 1996.
-
(1996)
IEEE Computer
, vol.29
, pp. 47-54
-
-
Atkins, M.1
Subramaniam, R.2
-
3
-
-
35248818778
-
Improving data locality by chunking
-
Compiler Construction
-
C. Bastoul and P. Feautrier. Improving data locality by chunking. In Compiler Construction, LNCS 2622, pages 320-335, 2003.
-
(2003)
LNCS
, vol.2622
, pp. 320-335
-
-
Bastoul, C.1
Feautrier, P.2
-
4
-
-
25144486545
-
SIP: Performance tuning through source code interdependence
-
E. Berg and E. Hagersten. SIP: Performance tuning through source code interdependence. In Euro-Par, LNCS 2400, pages 177-186, 2002.
-
(2002)
Euro-Par, LNCS 2400
, pp. 177-186
-
-
Berg, E.1
Hagersten, E.2
-
5
-
-
33244462442
-
Fast data-locality profiling of native execution
-
E. Berg and E. Hagersten. Fast data-locality profiling of native execution. In SIGMETRICS, pages 169-180, 2005.
-
(2005)
SIGMETRICS
, pp. 169-180
-
-
Berg, E.1
Hagersten, E.2
-
6
-
-
14944380098
-
Generating cache hints for improved program efficiency
-
K. Beyls and E. H. D'Hollander. Generating cache hints for improved program efficiency. J. of Systems Architecture, 51(4):223-250, 2005.
-
(2005)
J. of Systems Architecture
, vol.51
, Issue.4
, pp. 223-250
-
-
Beyls, K.1
D'Hollander, E.H.2
-
7
-
-
25144442689
-
RDVIS: A tool that visualizes the causes of low locality and hints program optimizations
-
K. Beyls, E. H. D'Hollander, and F. Vandeputte. RDVIS: A tool that visualizes the causes of low locality and hints program optimizations. In ICCS, volume 3515 of LNCS, pages 166-173, 2005.
-
(2005)
ICCS, volume 3515 of LNCS
, pp. 166-173
-
-
Beyls, K.1
D'Hollander, E.H.2
Vandeputte, F.3
-
8
-
-
0002957550
-
Rivet: A flexible environment for computer systems visualization
-
R. Bosch and C. S. et al. Rivet: A flexible environment for computer systems visualization. Computer Graphics-US, 34:68-73, 2000.
-
(2000)
Computer Graphics-US
, vol.34
, pp. 68-73
-
-
Bosch, R.1
S., C.2
-
9
-
-
17244376579
-
Cache-conscious structure definition
-
T. M. Chilimbi, B. Davidson, and J. R. Larus. Cache-conscious structure definition. In PLDI, SIGPLAN Notices, pages 13-24, 1999.
-
(1999)
PLDI, SIGPLAN Notices
, pp. 13-24
-
-
Chilimbi, T.M.1
Davidson, B.2
Larus, J.R.3
-
10
-
-
32844473507
-
Facilitating the search for compositions of program transformations
-
June
-
A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam, and N. Vasilache. Facilitating the search for compositions of program transformations. In ACM Int. Conf. on Supercomputing (ICS'05), Boston, Massachusetts., June 2005.
-
(2005)
ACM Int. Conf. on Supercomputing (ICS'05), Boston, Massachusetts
-
-
Cohen, A.1
Girbal, S.2
Parello, D.3
Sigler, M.4
Temam, O.5
Vasilache, N.6
-
11
-
-
1642502420
-
Improving effective bandwidth through compiler enhancement of global cache reuse
-
C. Ding and K. Kennedy. Improving effective bandwidth through compiler enhancement of global cache reuse. Journal of Parallel and Distributed Computing, 64(1):108-134, 2004.
-
(2004)
Journal of Parallel and Distributed Computing
, vol.64
, Issue.1
, pp. 108-134
-
-
Ding, C.1
Kennedy, K.2
-
12
-
-
33746684270
-
Instruction based memory distance analysis and its application to optimization
-
C. Fang, S. Carr, S. Onder, and Z. Wang. Instruction based memory distance analysis and its application to optimization. In PACT, 2005.
-
(2005)
PACT
-
-
Fang, C.1
Carr, S.2
Onder, S.3
Wang, Z.4
-
13
-
-
0027242764
-
Mtool: An integrated system for performance debugging shared memory multiprocessor applications
-
A. Goldberg and J. Hennessy. Mtool: An integrated system for performance debugging shared memory multiprocessor applications. IEEE Trans. on Parallel and Distributed Systems, 4(1):28-40, 1993.
-
(1993)
IEEE Trans. on Parallel and Distributed Systems
, vol.4
, Issue.1
, pp. 28-40
-
-
Goldberg, A.1
Hennessy, J.2
-
15
-
-
51049089199
-
Improving locality for adaptive irregular scientific codes
-
H. Han and C.-W. Tseng. Improving locality for adaptive irregular scientific codes. In LCPC, pages 173-188, 2000.
-
(2000)
LCPC
, pp. 173-188
-
-
Han, H.1
Tseng, C.-W.2
-
16
-
-
85015260319
-
Data space-oriented tiling for enhancing locality
-
388-414
-
I. Kadayif and M. Kandemir. Data space-oriented tiling for enhancing locality. Trans. on Embedded Computing Sys., 4(2):388-414, 2005.
-
(2005)
Trans. on Embedded Computing Sys
, vol.4
, Issue.2
-
-
Kadayif, I.1
Kandemir, M.2
-
17
-
-
0032680282
-
An integer linear programming approach for optimizing cache locality
-
M. Kandemir, P. Banerjee, A. Choudhary, J. Ramanujam, and E. Ayguade. An integer linear programming approach for optimizing cache locality. In Supercomputing, pages 500-509, 1999.
-
(1999)
Supercomputing
, pp. 500-509
-
-
Kandemir, M.1
Banerjee, P.2
Choudhary, A.3
Ramanujam, J.4
Ayguade, E.5
-
18
-
-
0346865818
-
Data-centric transformations for locality enhancement
-
I. Kodukula and K. Pingali. Data-centric transformations for locality enhancement. Int. J. Parallel Program., 29(3):319-364, 2001.
-
(2001)
Int. J. Parallel Program
, vol.29
, Issue.3
, pp. 319-364
-
-
Kodukula, I.1
Pingali, K.2
-
19
-
-
31844446709
-
Automatic Pool Allocation: Improving Performance by Controlling Data Structure Layout in the Heap
-
C. Lattner and V. Adve. Automatic Pool Allocation: Improving Performance by Controlling Data Structure Layout in the Heap. In PLDI, SIGPLAN Notices, 2005.
-
(2005)
PLDI, SIGPLAN Notices
-
-
Lattner, C.1
Adve, V.2
-
20
-
-
0028517833
-
Cache profiling and the SPEC benchmarks: A case study
-
A. R. Lebeck and D. A. Wood. Cache profiling and the SPEC benchmarks: A case study. IEEE Computer, 27(10):15-26, 1994.
-
(1994)
IEEE Computer
, vol.27
, Issue.10
, pp. 15-26
-
-
Lebeck, A.R.1
Wood, D.A.2
-
21
-
-
0029290849
-
Tuning memory performance of sequential and parallel programs
-
April
-
M. Martonosi, A. Gupta, and T. Anderson. Tuning memory performance of sequential and parallel programs. IEEE Computer, April 1995.
-
(1995)
IEEE Computer
-
-
Martonosi, M.1
Gupta, A.2
Anderson, T.3
-
22
-
-
0030190854
-
Improving data locality with loop transformations
-
K. S. McKinley, S. Carr, and C.-W. Tseng. Improving data locality with loop transformations. ACM Transactions on Programming Languages and Systems, 18(4):424-453, 1996.
-
(1996)
ACM Transactions on Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
23
-
-
0036679608
-
HPCView: A tool for top-down analysis of node performance
-
J. Mellor-Crummey and R. F. et al. HPCView: a tool for top-down analysis of node performance. J. of Supercomputing, 23:81-104, 2002.
-
(2002)
J. of Supercomputing
, vol.23
, pp. 81-104
-
-
Mellor-Crummey, J.1
F., R.2
-
24
-
-
1542601822
-
Improving memory hierarchy performance for irregular applications using data and computation reorderings
-
June
-
J. Mellor-Crummey, D. Whalley, and K. Kennedy. Improving memory hierarchy performance for irregular applications using data and computation reorderings. International Journal of Parallel Programming, 29(3):217-247, June 2001.
-
(2001)
International Journal of Parallel Programming
, vol.29
, Issue.3
, pp. 217-247
-
-
Mellor-Crummey, J.1
Whalley, D.2
Kennedy, K.3
-
25
-
-
33646391495
-
-
B. Quaing, J. Tao, and W. Karl. Yaco: A user conducted visualization tool for supporting cache optimization. In Proceedings of HPCC 2005, 3726 of Lecture Notes in Computer Science, pages 694-703, 2005.
-
B. Quaing, J. Tao, and W. Karl. Yaco: A user conducted visualization tool for supporting cache optimization. In Proceedings of HPCC 2005, volume 3726 of Lecture Notes in Computer Science, pages 694-703, 2005.
-
-
-
-
26
-
-
0032635362
-
-
Y. Song and Z. Li. New Tiling Techniques to Improve Cache Temporal Locality. In PLDI, SIGPLAN Not., pages 215-228, 1999.
-
Y. Song and Z. Li. New Tiling Techniques to Improve Cache Temporal Locality. In PLDI, SIGPLAN Not., pages 215-228, 1999.
-
-
-
-
27
-
-
1542710739
-
Sparse Tiling for Stationary Iterative Methods
-
M. M. Strout, L. Carter, J. Ferrante, and B. Kreaseck. Sparse Tiling for Stationary Iterative Methods. International Journal of High Performance Computing Applications, 18(1):95-113, 2004.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 95-113
-
-
Strout, M.M.1
Carter, L.2
Ferrante, J.3
Kreaseck, B.4
-
28
-
-
0031186412
-
The cache visualization tool
-
E. Vanderdeijl, O. Temam, E. Granston, and G. Kanbier. The cache visualization tool. IEEE Computer, 30(7):71, 1997.
-
(1997)
IEEE Computer
, vol.30
, Issue.7
, pp. 71
-
-
Vanderdeijl, E.1
Temam, O.2
Granston, E.3
Kanbier, G.4
-
29
-
-
25144519532
-
A tool suite for simulation based analysis of memory access behavior
-
J. Weidendorfer, M. Kowarschik, and C. Trinitis. A tool suite for simulation based analysis of memory access behavior. In ICCS, volume 3038 of LNCS, pages 440-447, 2004.
-
(2004)
ICCS, volume 3038 of LNCS
, pp. 440-447
-
-
Weidendorfer, J.1
Kowarschik, M.2
Trinitis, C.3
-
30
-
-
84976827033
-
A data locality optimizing algorithm
-
M. E. Wolf and M. S. Lam. A data locality optimizing algorithm. In PLDI, SIGPLAN Notices, pages 30-44, 1991.
-
(1991)
PLDI, SIGPLAN Notices
, pp. 30-44
-
-
Wolf, M.E.1
Lam, M.S.2
-
31
-
-
1542392248
-
Achieving scalable locality with time skewing
-
D. Wonnacott. Achieving scalable locality with time skewing. Int. J. Parallel Program., 30(3):181-221, 2002.
-
(2002)
Int. J. Parallel Program
, vol.30
, Issue.3
, pp. 181-221
-
-
Wonnacott, D.1
-
33
-
-
8344272049
-
Array regrouping and structure splitting using whole-program reference affinity
-
Y. Zhong, M. Orlovich, X. Shen, and C. Ding. Array regrouping and structure splitting using whole-program reference affinity. In PLDI, SIGPLAN Notices, 2004.
-
(2004)
PLDI, SIGPLAN Notices
-
-
Zhong, Y.1
Orlovich, M.2
Shen, X.3
Ding, C.4
|