-
2
-
-
84944730283
-
Scheduling algorithms with bus bandwidth considerations for SMPs
-
C. Antonopoulos, D. Nikolopoulos, and T. Papatheodorou. Scheduling algorithms with bus bandwidth considerations for SMPs. In ICPP, pages 547.554, 2003.
-
(2003)
ICPP
, pp. 547-554
-
-
Antonopoulos, C.1
Nikolopoulos, D.2
Papatheodorou, T.3
-
3
-
-
42149141725
-
PATH: Page access tracking to improve memory management
-
R. Azimi, L. Soares, M. Stumm, T. Walsh, and A. Demke Brown. PATH: page access tracking to improve memory management. In ISMM, pages 31.42, 2007.
-
(2007)
ISMM
, pp. 31-42
-
-
Azimi, R.1
Soares, L.2
Stumm, M.3
Walsh, T.4
Brown, A.D.5
-
4
-
-
32844456410
-
Online performance analysis by statistical sampling of microprocessor performance counters
-
DOI 10.1145/1088149.1088163, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
-
R. Azimi, M. Stumm, and R. Wisniewski. Online performance analysis by statistical sampling of microprocessor performance counters. In ICS, pages 101. 110, 2005. (Pubitemid 43251314)
-
(2005)
Proceedings of the International Conference on Supercomputing
, pp. 101-110
-
-
Azimi, R.1
Stumm, M.2
Wisniewski, R.W.3
-
5
-
-
0034461413
-
Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures
-
R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures. In MICRO, pages 245.257, 2000. (Pubitemid 32255847)
-
(2000)
Proceedings of the Annual International Symposium on Microarchitecture
, pp. 245-257
-
-
Balasubramonian, R.1
Albonesi, D.2
Buyuktosunoglu, A.3
Dwarkadas, S.4
-
6
-
-
2642534571
-
StatCache: A probabilistic approach to efficient and accurate data locality analysis
-
E. Berg and E. Hagersten. StatCache: A probabilistic approach to efficient and accurate data locality analysis. In ISPASS, pages 20.27, 2004.
-
(2004)
ISPASS
, pp. 20-27
-
-
Berg, E.1
Hagersten, E.2
-
7
-
-
33244462442
-
Fast data-locality profiling of native execution
-
SIGMETRICS 2005: International Conference on Measurement and Modeling of Computer Systems - Proceedings
-
E. Berg and E. Hagersten. Fast data-locality profiling of native execution. In SIGMETRICS, pages 169.180, 2005. (Pubitemid 43275419)
-
(2005)
Performance Evaluation Review
, vol.33
, Issue.1
, pp. 169-180
-
-
Berg, E.1
Hagersten, E.2
-
8
-
-
33750837706
-
A statistical multiprocessor cache model
-
1620793, ISPASS 2006: IEEE International Symposium on Performance Analysis of Systems and Software, 2006
-
E. Berg, H. Zeffer, and E. Hagersten. A statistical multiprocessor cache model. In ISPASS, pages 89.99, 2006. (Pubitemid 44711113)
-
(2006)
ISPASS 2006: IEEE International Symposium on Performance Analysis of Systems and Software, 2006
, vol.2006
, pp. 89-99
-
-
Berg, E.1
Zeffer, H.2
Hagersten, E.3
-
9
-
-
16644378094
-
Design and implementation of a dynamic optimization framework for Windows
-
D. Bruening, E. Duesterwald, and S. Amarasinghe. Design and implementation of a dynamic optimization framework for Windows. In FDDO, 2001.
-
(2001)
FDDO
-
-
Bruening, D.1
Duesterwald, E.2
Amarasinghe, S.3
-
11
-
-
21244474546
-
Predicting inter-thread cache contention on a chip multi-processor architecture
-
D. Chandra, F. Guo, S. Kim, and Y. Solihin. Predicting inter-thread cache contention on a chip multi-processor architecture. In HPCA, pages 340.351, 2005.
-
(2005)
HPCA
, pp. 340-351
-
-
Chandra, D.1
Guo, F.2
Kim, S.3
Solihin, Y.4
-
12
-
-
40349095122
-
Managing distributed, shared L2 caches through OS-level page allocation
-
DOI 10.1109/MICRO.2006.31, 4041868, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-39
-
S. Cho and L Jin. Managing distributed, shared L2 caches through OS-level page allocation. In MICRO, pages 455.468, 2006. (Pubitemid 351337018)
-
(2006)
Proceedings of the Annual International Symposium on Microarchitecture, MICRO
, pp. 455-465
-
-
Cho, S.1
Jin, L.2
-
14
-
-
33749602130
-
Performance of multithreaded chip multiprocessors and implications for operating system design
-
A. Fedorova, M. Seltzer, C. Small, and D. Nussbaum. Performance of multithreaded chip multiprocessors and implications for operating system design. In USENIX ATC, pages 26.26, 2005.
-
(2005)
USENIX ATC
, pp. 26-26
-
-
Fedorova, A.1
Seltzer, M.2
Small, C.3
Nussbaum, D.4
-
15
-
-
33750363816
-
An analytical model for cache replacement policy performance
-
DOI 10.1145/1140103.1140304, SIGMETRICS 2006/Performance 2006 - Joint International Conference on Measurement and Modeling of Computer Systems, Proceedings
-
F. Guo and Y. Solihin. An analytical model for cache replacement policy performance. In SIGMETRICS, pages 228.239, 2006. (Pubitemid 44619094)
-
(2006)
Performance Evaluation Review
, vol.34
, Issue.1
, pp. 228-239
-
-
Guo, F.1
Solihin, Y.2
-
16
-
-
8344246922
-
CQoS: A framework for enabling QoS in shared caches of CMP platforms
-
R. Iyer. CQoS: a framework for enabling QoS in shared caches of CMP platforms. In ICS, pages 257.266, 2004.
-
(2004)
ICS
, pp. 257-266
-
-
Iyer, R.1
-
17
-
-
36349002905
-
QoS policies and architecture for cache/memory in CMP platforms
-
DOI 10.1145/1269899.1254886, SIGMETRICS'07 - Proceedings of the 2007 International Conference on Measurement and Modeling of Computer Systems
-
R. Iyer, L. Zhao, F. Guo, R. Illikkal, D. Newell, Y. Solihin, L. Hsu, and S. Reinhardt. QoS policies and architecture for cache/memory in CMP platforms. In SIGMETRICS, pages 25.36, 2007. (Pubitemid 350158070)
-
(2007)
Performance Evaluation Review
, vol.35
, Issue.1
, pp. 25-36
-
-
Iyer, R.1
Zhao, L.2
Guo, F.3
Illikkal, R.4
Makineni, S.5
Newell, D.6
Solihin, Y.7
Hsu, L.8
Reinhardt, S.9
-
18
-
-
84894240497
-
A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references
-
J. Kim, J. Choi, J. Kim, S. Noh, S. Min, Y. Cho, and C. Kim. A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references. In OSDI, pages 119.34, 2000.
-
(2000)
OSDI
, pp. 119-134
-
-
Kim, J.1
Choi, J.2
Kim, J.3
Noh, S.4
Min, S.5
Cho, Y.6
Kim, C.7
-
19
-
-
10444238444
-
Fair cache sharing and partitioning in a chip multiprocessor architecture
-
S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. In PACT, pages 111.122, 2004.
-
(2004)
PACT
, pp. 111-122
-
-
Kim, S.1
Chandra, D.2
Solihin, Y.3
-
20
-
-
80053025940
-
Implementing stack simulation for highlyassociative memories
-
Y. Kim, M. Hill, and D. Wood. Implementing stack simulation for highlyassociative memories. In SIGMETRICS, pages 212.213, 1991.
-
(1991)
SIGMETRICS
, pp. 212-213
-
-
Kim, Y.1
Hill, M.2
Wood, D.3
-
21
-
-
0031348717
-
OS-controlled cache predictability for real-time systems
-
J. Liedtke, H. Härtig, and M. Hohmuth. OS-controlled cache predictability for real-time systems. In RTAS, pages 213.227, 1997.
-
(1997)
RTAS
, pp. 213-227
-
-
Liedtke, J.1
Härtig, H.2
Hohmuth, M.3
-
22
-
-
57749186047
-
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
-
J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In HPCA, pages 367.378, 2008.
-
(2008)
HPCA
, pp. 367-378
-
-
Lin, J.1
Lu, Q.2
Ding, X.3
Zhang, Z.4
Zhang, X.5
Sadayappan, P.6
-
23
-
-
2342468635
-
Organizing the last line of defense before hitting the memory wall for CMPs
-
C. Liu, A. Sivasubramaniam, and M. Kandemir. Organizing the last line of defense before hitting the memory wall for CMPs. In HPCA, pages 176.185, 2004.
-
(2004)
HPCA
, pp. 176-185
-
-
Liu, C.1
Sivasubramaniam, A.2
Kandemir, M.3
-
24
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 05
-
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. Reddi, and K. Hazelwood. Pin: building customized program analysis tools with dynamic instrumentation. In PLDI, pages 190.200, 2005. (Pubitemid 43185951)
-
(2005)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
, pp. 190-200
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
25
-
-
0014701246
-
Evaluation techniques and storage hierarchies
-
R. Mattson, J. Gecsei, D. Slutz, and I. Traiger. Evaluation techniques and storage hierarchies. IBM Systems J., 9(2):78.117, 1970.
-
(1970)
IBM Systems J.
, vol.9
, Issue.2
, pp. 78-117
-
-
Mattson, R.1
Gecsei, J.2
Slutz, D.3
Traiger, I.4
-
26
-
-
63549102138
-
Multi-optimization power management for chip multiprocessors
-
K. Meng, R. Joseph, R. Dick, and L. Shang. Multi-optimization power management for chip multiprocessors. In PACT, pages 177.186, 2008.
-
(2008)
PACT
, pp. 177-186
-
-
Meng, K.1
Joseph, R.2
Dick, R.3
Shang, L.4
-
27
-
-
34548033967
-
JIT instrumentation: A novel approach to dynamically instrument operating systems
-
DOI 10.1145/1272996.1273000, Operating Systems Review - Proceedings of the 2007 EuroSys Conference
-
M. Olszewski, K. Mierle, A. Czajkowski, and A. Demke Brown. JIT instrumentation: a novel approach to dynamically instrument operating systems. In EuroSys, pages 3.16, 2007. (Pubitemid 47281571)
-
(2007)
Operating Systems Review (ACM)
, pp. 3-16
-
-
Olszewski, M.1
Mierle, K.2
Czajkowski, A.3
Brown, A.D.4
-
28
-
-
84883502375
-
Informed prefetching and caching
-
R. Patterson, G. Gibson, E. Ginting, D. Stodolsky, and J. Zelenka. Informed prefetching and caching. In SOSP, pages 79.95, 1995.
-
(1995)
SOSP
, pp. 79-95
-
-
Patterson, R.1
Gibson, G.2
Ginting, E.3
Stodolsky, D.4
Zelenka, J.5
-
29
-
-
34548042910
-
Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
-
DOI 10.1109/MICRO.2006.49, 4041865, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-39
-
M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, highperformance, runtime mechanism to partition shared caches. In MICRO, pages 423.432, 2006. (Pubitemid 351337015)
-
(2006)
Proceedings of the Annual International Symposium on Microarchitecture, MICRO
, pp. 423-432
-
-
Qureshi, M.K.1
Patt, Y.N.2
-
30
-
-
34247108325
-
Architectural support for operating system-driven CMP cache management
-
DOI 10.1145/1152154.1152160, PACT 2006 - Proceedings of the Fifteenth International Conference on Parallel Architectures and Compilation Techniques
-
N. Rafique, W. Lim, and M. Thottethodi. Architectural support for operating system-driven CMP cache management. In PACT, pages 2.12, 2006. (Pubitemid 46601076)
-
(2006)
Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
, vol.2006
, pp. 2-12
-
-
Rafique, N.1
Lim, W.-T.2
Thottethodi, M.3
-
31
-
-
0031367150
-
A resource allocation model for QoS management
-
R. Rajkumar, C. Lee, J. Lehoczky, and D. Siewiorek. A resource allocation model for QoS management. In RTSS, pages 298.308, 1997.
-
(1997)
RTSS
, pp. 298-308
-
-
Rajkumar, R.1
Lee, C.2
Lehoczky, J.3
Siewiorek, D.4
-
33
-
-
77953988297
-
Using Valgrind to detect undefined value errors with bit-precision
-
J. Seward and N. Nethercote. Using Valgrind to detect undefined value errors with bit-precision. In USENIX ATC, pages 17.30, 2005.
-
(2005)
USENIX ATC
, pp. 17-30
-
-
Seward, J.1
Nethercote, N.2
-
34
-
-
33846527415
-
Locality approximation using time
-
DOI 10.1145/1190216.1190227, Conference Record of POPL 2007: The 34th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages - Papers Presented at the Symposium
-
X. Shen, J. Shaw, B. Meeker, and C. Ding. Locality approximation using time. In POPL, pages 55.61, 2007. (Pubitemid 47326532)
-
(2007)
Conference Record of the Annual ACM Symposium on Principles of Programming Languages
, pp. 55-61
-
-
Shen, X.1
Shaw, J.2
Meeker, B.3
Ding, C.4
-
35
-
-
0038345698
-
Phase tracking and prediction
-
T. Sherwood, S. Sair, and B. Calder. Phase tracking and prediction. In ISCA, pages 336.349, 2003.
-
(2003)
ISCA
, pp. 336-349
-
-
Sherwood, T.1
Sair, S.2
Calder, B.3
-
37
-
-
66749168716
-
Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer
-
L. Soares, D. Tam, and M. Stumm. Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer. In MICRO, 2008.
-
(2008)
MICRO
-
-
Soares, L.1
Tam, D.2
Stumm, M.3
-
38
-
-
67650033179
-
Dynamic partitioning of the cache hierarchy in shared data centers
-
G. Soundararajan, J. Chen, M. Sharaf, and C. Amza. Dynamic partitioning of the cache hierarchy in shared data centers. In VLDB, pages 635.646, 2008.
-
(2008)
VLDB
, pp. 635-646
-
-
Soundararajan, G.1
Chen, J.2
Sharaf, M.3
Amza, C.4
-
41
-
-
1642371317
-
Dynamic partitioning of shared cache memory
-
Apr.
-
E. Suh, L Rudolph, and S. Devadas. Dynamic partitioning of shared cache memory. The J. of Supercomputing, 28(1):7.26, Apr. 2004.
-
(2004)
J. of Supercomputing
, vol.28
, Issue.1
, pp. 7-26
-
-
Suh, E.1
Rudolph, L.2
Devadas, S.3
-
42
-
-
57749176037
-
Managing shared L2 caches on multicore systems in software
-
D. Tam, R. Azimi, L. Soares, and M. Stumm. Managing shared L2 caches on multicore systems in software. In WIOSCA, pages 26.33, 2007.
-
(2007)
WIOSCA
, pp. 26-33
-
-
Tam, D.1
Azimi, R.2
Soares, L.3
Stumm, M.4
-
43
-
-
34548030923
-
Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors
-
DOI 10.1145/1272996.1273004, Operating Systems Review - Proceedings of the 2007 EuroSys Conference
-
D. Tam, R. Azimi, and M. Stumm. Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors. In EuroSys, pages 47.58, 2007. (Pubitemid 47281574)
-
(2007)
Operating Systems Review (ACM)
, pp. 47-58
-
-
Tam, D.1
Azimi, R.2
Stumm, M.3
-
44
-
-
0026881152
-
Improving disk cache hit-ratios through cache partitioning
-
D. Thiebaut, H. Stone, and J.Wolf. Improving disk cache hit-ratios through cache partitioning. IEEE TOC, 41(6):665.676, 1992.
-
(1992)
IEEE TOC
, vol.41
, Issue.6
, pp. 665-676
-
-
Thiebaut, D.1
Stone, H.2
Wolf, J.3
-
45
-
-
85076757258
-
CRAMM: Virtual memory support for garbage-collected applications
-
T. Yang, E. Berger, S. Kaplan, and J. Moss. CRAMM: virtual memory support for garbage-collected applications. In OSDI, pages 103.116, 2006.
-
(2006)
OSDI
, pp. 103-116
-
-
Yang, T.1
Berger, E.2
Kaplan, S.3
Moss, J.4
-
46
-
-
34547686261
-
Ubiquitous memory introspection
-
DOI 10.1109/CGO.2007.12, 4145123, International Symposium on Code Generation and Optimization, CGO 2007
-
Q. Zhao, R. Rabbah, S. Amarasinghe, L. Rudolph, and W.-F. Wong. Ubiquitous memory introspection. In CGO, pages 299.311, 2007. (Pubitemid 47214317)
-
(2007)
International Symposium on Code Generation and Optimization, CGO 2007
, pp. 299-311
-
-
Zhao, Q.1
Rabbah, R.2
Amarasinghe, S.3
Rudolph, L.4
Wong, W.-F.5
-
47
-
-
12844271066
-
Dynamic tracking of page miss ratio curve for memory management
-
11th International Conference on Architectural Support for Programming, Languages and Operating Systems, ASPLOS XI
-
P. Zhou, V. Pandey, J. Sundaresan, A. Raghuraman, Y. Zhou, and S. Kumar. Dynamic tracking of page miss ratio curve for memory management. In ASPLOS, pages 177.188, 2004. (Pubitemid 40163698)
-
(2004)
11th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XI
, pp. 177-188
-
-
Zhou, P.1
Pandey, V.2
Sundaresan, J.3
Raghuraman, A.4
Zhou, Y.5
Kumar, S.6
-
48
-
-
84893513203
-
The multi-queue replacement algorithm for second level buffer caches
-
Y. Zhou, J. Philbin, and K. Li. The multi-queue replacement algorithm for second level buffer caches. In USENIX ATC, pages 91?104, 2001
-
(2001)
USENIX ATC
, pp. 91-104
-
-
Zhou, Y.1
Philbin, J.2
Li, K.3
|