-
1
-
-
0031270220
-
Continuous profiling: Where have all the cycles gone?
-
J. M. Anderson, L. M. Berc, J. Dean, S. Ghemawat, M. R. Henzinger, S.-T. A. Leung, R. L. Sites, M. T. Vandevoorde, C. A. Waldspurger, and W. E. Weihl, "Continuous profiling: where have all the cycles gone?" ACM Transactions on Computer Systems, vol. 15, no. 4, pp. 357-390, 1997.
-
(1997)
ACM Transactions on Computer Systems
, vol.15
, Issue.4
, pp. 357-390
-
-
Anderson, J.M.1
Berc, L.M.2
Dean, J.3
Ghemawat, S.4
Henzinger, M.R.5
Leung, S.-T.A.6
Sites, R.L.7
Vandevoorde, M.T.8
Waldspurger, C.A.9
Weihl, W.E.10
-
2
-
-
0003605996
-
-
NASA, Tech. Rep. NAS-95-020
-
D. Bailey, T. Harris, W. Saphir, R. van der Wijingaart, A. Woo, and M. Yarrow, "The NAS parallel benchmarks 2.0," NASA, Tech. Rep. NAS-95-020, 1995.
-
(1995)
The NAS parallel benchmarks 2.0
-
-
Bailey, D.1
Harris, T.2
Saphir, W.3
van der Wijingaart, R.4
Woo, A.5
Yarrow, M.6
-
4
-
-
0030260286
-
Compiler-directed page coloring for multiprocessors
-
E. Bugnion, J. M. Anderson, T. C. Mowry, M. Rosenblum, and M. S. Lam, "Compiler-directed page coloring for multiprocessors," in 7th Intl. Conf. on Arch. Support for Programming Languages and Operating Systems (ASPLOS), 1996, pp. 244-255.
-
(1996)
7th Intl. Conf. on Arch. Support for Programming Languages and Operating Systems (ASPLOS)
, pp. 244-255
-
-
Bugnion, E.1
Anderson, J.M.2
Mowry, T.C.3
Rosenblum, M.4
Lam, M.S.5
-
5
-
-
21244474546
-
Predicting inter-thread cache contention on a chip multi-processor architecture
-
D. Chandra, E. Guo, S. Kim, and Y. Solihin, "Predicting inter-thread cache contention on a chip multi-processor architecture," in 11th Intl. Symp. on High-Performance Computer Architecture (HPCA), 2005, pp. 340-351.
-
(2005)
11th Intl. Symp. on High-Performance Computer Architecture (HPCA)
, pp. 340-351
-
-
Chandra, D.1
Guo, E.2
Kim, S.3
Solihin, Y.4
-
6
-
-
0024906840
-
Improving cache performance by selective cache bypass
-
Architecture Track
-
C.-H. Chi and H. Dietz, "Improving cache performance by selective cache bypass," in Twenty-Second Annual Hawaii International Conference on System Sciences, vol. 1, Architecture Track, 1989, pp. 277-285.
-
(1989)
Twenty-Second Annual Hawaii International Conference on System Sciences
, vol.1
, pp. 277-285
-
-
Chi, C.-H.1
Dietz, H.2
-
7
-
-
40349095122
-
Managing distributed, shared L2 caches through OS-level page allocation
-
S. Cho and L. Jin, "Managing distributed, shared L2 caches through OS-level page allocation," in 39th Intl. Symp. on Microarchitecture (MICRO), 2006, pp. 455-468.
-
(2006)
39th Intl. Symp. on Microarchitecture (MICRO)
, pp. 455-468
-
-
Cho, S.1
Jin, L.2
-
8
-
-
0031340339
-
ProfileMe: Hardware support for instruction-level profiling on out-of-order processors
-
J. Dean, J. E. Hicks, C. A. Waldspurger, W. E. Weihl, and G. Chrysos, "ProfileMe: hardware support for instruction-level profiling on out-of-order processors," in 30th Intl. Symp. on Microarchitecture (MICRO), 1997, pp. 292-302.
-
(1997)
30th Intl. Symp. on Microarchitecture (MICRO)
, pp. 292-302
-
-
Dean, J.1
Hicks, J.E.2
Waldspurger, C.A.3
Weihl, W.E.4
Chrysos, G.5
-
9
-
-
33845215381
-
Enhancing last-level cache performance by block bypassing and early miss determination
-
H. Dybdahl and P. Stenström, "Enhancing last-level cache performance by block bypassing and early miss determination," in Asia-Pacific Computer Systems Arch. Conf, 2006, pp. 52-66.
-
(2006)
Asia-Pacific Computer Systems Arch. Conf
, pp. 52-66
-
-
Dybdahl, H.1
Stenström, P.2
-
10
-
-
0029204095
-
A data cache with multiple caching strategies tuned to different types of locality
-
A. González, C. Aliagas, and M. Valero, "A data cache with multiple caching strategies tuned to different types of locality," in Intl. Conf. in Supercomputing (ICS), 1995, pp. 338-347.
-
(1995)
Intl. Conf. in Supercomputing (ICS)
, pp. 338-347
-
-
González, A.1
Aliagas, C.2
Valero, M.3
-
11
-
-
0033319665
-
-
T. L. Johnson, D. A. Connors, M. C. Merten, and W. mei W. Hwu, Run-time cache bypassing, IEEE Transactions on Computers, 48, no. 12, pp. 1338-1354, 1999.
-
T. L. Johnson, D. A. Connors, M. C. Merten, and W. mei W. Hwu, "Run-time cache bypassing," IEEE Transactions on Computers, vol. 48, no. 12, pp. 1338-1354, 1999.
-
-
-
-
12
-
-
84976736383
-
Page placement algorithms for large real-indexed caches
-
R. E. Kessler and M. D. Hill, "Page placement algorithms for large real-indexed caches," ACM Transactions on Computer Systems, vol. 10, no. 4, pp. 338-359, 1992.
-
(1992)
ACM Transactions on Computer Systems
, vol.10
, Issue.4
, pp. 338-359
-
-
Kessler, R.E.1
Hill, M.D.2
-
13
-
-
41149104074
-
Counter-based cache replacement and bypassing algorithms
-
M. Kharbutli and Y. Solihin, "Counter-based cache replacement and bypassing algorithms," IEEE Transactions on Computers, vol. 57, no. 4, pp. 433-447, 2008.
-
(2008)
IEEE Transactions on Computers
, vol.57
, Issue.4
, pp. 433-447
-
-
Kharbutli, M.1
Solihin, Y.2
-
14
-
-
0031348717
-
OS-controlled cache predictability for real-time systems
-
J. Liedtke, H. Hartig, and M. Hohmuth, "OS-controlled cache predictability for real-time systems," in Real-Time Technology and Applications Symposium, 1997, pp. 213-227.
-
(1997)
Real-Time Technology and Applications Symposium
, pp. 213-227
-
-
Liedtke, J.1
Hartig, H.2
Hohmuth, M.3
-
15
-
-
57749186047
-
Gaining insights into multi-core cache partitioning: Bridging the gap between simulation and real systems
-
J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan, "Gaining insights into multi-core cache partitioning: Bridging the gap between simulation and real systems," in 14th Intl. Symp. on High-Performance Comp. Arch. (HPCA), 2008, pp. 367-378.
-
(2008)
14th Intl. Symp. on High-Performance Comp. Arch. (HPCA)
, pp. 367-378
-
-
Lin, J.1
Lu, Q.2
Ding, X.3
Zhang, Z.4
Zhang, X.5
Sadayappan, P.6
-
17
-
-
0026976725
-
The effect of page allocation on caches
-
W. L. Lynch, B. K. Bray, and M. J. Flynn, "The effect of page allocation on caches," in 25th Intl. Symp. on Microarchitecture (MICRO), 1992, pp. 222-225.
-
(1992)
25th Intl. Symp. on Microarchitecture (MICRO)
, pp. 222-225
-
-
Lynch, W.L.1
Bray, B.K.2
Flynn, M.J.3
-
18
-
-
84976714711
-
Compiler support for software-based cache partitioning
-
E. Mueller, "Compiler support for software-based cache partitioning," in Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), 1995, pp. 125-133.
-
(1995)
Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES)
, pp. 125-133
-
-
Mueller, E.1
-
19
-
-
48449102571
-
Modeling cache sharing on chip multiprocessor architectures
-
P. Petoumenos, G. Keramidas, H. Zeffer, S. Kaxiras, and E. Hagersten, "Modeling cache sharing on chip multiprocessor architectures," in Intl. Symp. on Workload Characterization (IISWC), 2006, pp. 160-171.
-
(2006)
Intl. Symp. on Workload Characterization (IISWC)
, pp. 160-171
-
-
Petoumenos, P.1
Keramidas, G.2
Zeffer, H.3
Kaxiras, S.4
Hagersten, E.5
-
20
-
-
38049084889
-
Exploiting single-usage for effective memory management
-
T. Piquet, O. Rochecouste, and A. Seznec, "Exploiting single-usage for effective memory management," in Asia-Pacific Computer Systems Architecture Conference, 2007, pp. 90-101.
-
(2007)
Asia-Pacific Computer Systems Architecture Conference
, pp. 90-101
-
-
Piquet, T.1
Rochecouste, O.2
Seznec, A.3
-
22
-
-
35348920021
-
Adaptive insertion policies for high performance caching
-
M. K. Qureshi, A. Jaleel, Y. N. Patt, S. C. Steely, and J. Emer, "Adaptive insertion policies for high performance caching," in Intl. Symp. on Comp. Arch. (ISCA), 2007, pp. 381-391.
-
(2007)
Intl. Symp. on Comp. Arch. (ISCA)
, pp. 381-391
-
-
Qureshi, M.K.1
Jaleel, A.2
Patt, Y.N.3
Steely, S.C.4
Emer, J.5
-
23
-
-
0032644674
-
Reducing cache misses using hardware and software page placement
-
T. Sherwood, B. Calder, and J. Emer, "Reducing cache misses using hardware and software page placement," in International Conference on Supercomputing (ICS), 1999, pp. 155-164.
-
(1999)
International Conference on Supercomputing (ICS)
, pp. 155-164
-
-
Sherwood, T.1
Calder, B.2
Emer, J.3
-
25
-
-
1642371317
-
Dynamic partitioning of shared cache memory
-
G. E. Suh, L. Rudolph, and S. Devadas, "Dynamic partitioning of shared cache memory," Journal of Supercomputing, vol. 28, no. 1, pp. 7-26, 2004.
-
(2004)
Journal of Supercomputing
, vol.28
, Issue.1
, pp. 7-26
-
-
Suh, G.E.1
Rudolph, L.2
Devadas, S.3
-
26
-
-
70349109319
-
Managing shared L2 caches on multicore systems in software
-
D. Tarn, R. Azimi, L. Soares, and M. Stumm, "Managing shared L2 caches on multicore systems in software," in Workshop on the Interaction between Operating Systems and Computer Architecture (WIOSCA), 2007.
-
(2007)
Workshop on the Interaction between Operating Systems and Computer Architecture (WIOSCA)
-
-
Tarn, D.1
Azimi, R.2
Soares, L.3
Stumm, M.4
-
27
-
-
0029508817
-
A modified approach to data cache management
-
G. Tyson, M. Farrens, J. Matthews, and A. R. Pleszkun, "A modified approach to data cache management," in 28th Intl. Symp. on Microarchitecture (MICRO), 1995, pp. 93-103.
-
(1995)
28th Intl. Symp. on Microarchitecture (MICRO)
, pp. 93-103
-
-
Tyson, G.1
Farrens, M.2
Matthews, J.3
Pleszkun, A.R.4
-
28
-
-
0346935130
-
Data caches in multitasking hard real-time systems
-
X. Vera, B. Lisper, and J. Xue, "Data caches in multitasking hard real-time systems," in 24th IEEE International Real-Time Systems Symposium (RTSS), 2003, pp. 154-165.
-
(2003)
24th IEEE International Real-Time Systems Symposium (RTSS)
, pp. 154-165
-
-
Vera, X.1
Lisper, B.2
Xue, J.3
-
29
-
-
3042560075
-
Software-based cache partitioning for real-time applications
-
A. Wolfe, "Software-based cache partitioning for real-time applications," Journal of Computer and Software Engineering, vol. 2, no. 3, pp. 315-327, 1994.
-
(1994)
Journal of Computer and Software Engineering
, vol.2
, Issue.3
, pp. 315-327
-
-
Wolfe, A.1
-
31
-
-
84948958301
-
Compiler managed micro-cache bypassing for high performance EPIC processors
-
Y. Wu, R. Rakvic, L.-L. Chen, C.-C. Miao, G. Chrysos, and J. Fang, "Compiler managed micro-cache bypassing for high performance EPIC processors," in Intl. Symp. on Microarchitecture (MICRO), 2002, pp. 134-145.
-
(2002)
Intl. Symp. on Microarchitecture (MICRO)
, pp. 134-145
-
-
Wu, Y.1
Rakvic, R.2
Chen, L.-L.3
Miao, C.-C.4
Chrysos, G.5
Fang, J.6
-
32
-
-
47349115313
-
A framework for coarse-grain optimizations in the on-chip memory hierarchy
-
J. Zebchuk, E. Safl, and A. Moshovos, "A framework for coarse-grain optimizations in the on-chip memory hierarchy," in 40th Intl. Symp. on Microarchitecture (MICRO), 2007, pp. 314-327.
-
(2007)
40th Intl. Symp. on Microarchitecture (MICRO)
, pp. 314-327
-
-
Zebchuk, J.1
Safl, E.2
Moshovos, A.3
|