-
1
-
-
0036469652
-
SimpleScalar: An Infrastructure for Computer System Modeling
-
February
-
T. Austin, E. Larson, and D. Ernst. SimpleScalar: An Infrastructure for Computer System Modeling. IEEE Micro Magazine, pages 59-67, February 2002.
-
(2002)
IEEE Micro Magazine
, pp. 59-67
-
-
Austin, T.1
Larson, E.2
Ernst, D.3
-
2
-
-
33746749348
-
Trace Cache Sampling Filter
-
St. Louis, MO, USA, September
-
M. Behar, A. Mendelson, and A. Kolodny. Trace Cache Sampling Filter. In Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques, pages 255-266, St. Louis, MO, USA, September 2005.
-
(2005)
Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
, pp. 255-266
-
-
Behar, M.1
Mendelson, A.2
Kolodny, A.3
-
3
-
-
40349090128
-
Die-Stacking (3D) Microarchitecture
-
Orlando, FL, December
-
B. Black, M. M. Annavaram, E. Brekelbaum, J. DeVale, L. Jiang, G. H. Loh, D. McCauley, P. Morrow, D. W. Nelson, D. Pantuso, P. Reed, J. Rupley, S. Shankar, J. P. Shen, and C. Webb. Die-Stacking (3D) Microarchitecture. In Proceedings of the 39th International Symposium on Microarchitecture, Orlando, FL, December 2006.
-
(2006)
Proceedings of the 39th International Symposium on Microarchitecture
-
-
Black, B.1
Annavaram, M.M.2
Brekelbaum, E.3
DeVale, J.4
Jiang, L.5
Loh, G.H.6
McCauley, D.7
Morrow, P.8
Nelson, D.W.9
Pantuso, D.10
Reed, P.11
Rupley, J.12
Shankar, S.13
Shen, J.P.14
Webb, C.15
-
4
-
-
0010232351
-
The Declining Effectiveness of Dynamic Caching for General-Purpose Microprocessors
-
Technical Report 1261, University of Wisconsin, January
-
D. Burger, J. R. Goodman, and A. Kägi. The Declining Effectiveness of Dynamic Caching for General-Purpose Microprocessors. Technical Report 1261, University of Wisconsin, January 1995.
-
(1995)
-
-
Burger, D.1
Goodman, J.R.2
Kägi, A.3
-
5
-
-
21244474546
-
Predicting Inter-Thread Cache Contenton on a Chip Multi-Processor Architecture
-
San Francisco, CA, USA, February
-
D. Chandra, F. Guo, S. Kim, and Y. Solihin. Predicting Inter-Thread Cache Contenton on a Chip Multi-Processor Architecture. In Proceedings of the 11th International Symposium on High Performance Computer Architecture, pages 340-351, San Francisco, CA, USA, February 2005.
-
(2005)
Proceedings of the 11th International Symposium on High Performance Computer Architecture
, pp. 340-351
-
-
Chandra, D.1
Guo, F.2
Kim, S.3
Solihin, Y.4
-
7
-
-
33748631273
-
An Automated Design Flow for 3D Microarchitecture Evaluation
-
Yokohama, Japan, January
-
J. Cong, A. Jagannathan, Y. Ma, G. Reinman, J. Wei, and Y. Zhang. An Automated Design Flow for 3D Microarchitecture Evaluation. In Proceedings of the 11th Asia South Pacific Design Automation Conference, pages 384-389, Yokohama, Japan, January 2006.
-
(2006)
Proceedings of the 11th Asia South Pacific Design Automation Conference
, pp. 384-389
-
-
Cong, J.1
Jagannathan, A.2
Ma, Y.3
Reinman, G.4
Wei, J.5
Zhang, Y.6
-
8
-
-
56349130887
-
Inside Intel Core Microarchitecture and Smart Memory Access
-
J. Doweck. Inside Intel Core Microarchitecture and Smart Memory Access. White paper, Intel Corporation, 2006. http://download.intel.com/technology/ architecture/sma.pdf.
-
(2006)
White paper
-
-
Doweck, J.1
-
9
-
-
47349120126
-
Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs
-
Chicago, IL, December
-
M. Ghosh and H.-H. S. Lee. Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs. In Proceedings of the 40th International Symposium on Microarchitecture, Chicago, IL, December 2007.
-
(2007)
Proceedings of the 40th International Symposium on Microarchitecture
-
-
Ghosh, M.1
Lee, H.-H.S.2
-
11
-
-
33748874422
-
SimPoint 3.0: Faster and More Flexible Program Analysis
-
Madison, WI, USA, June
-
G. Hamerly, E. Perelman, J. Lau, and B. Calder. SimPoint 3.0: Faster and More Flexible Program Analysis. In Proceedings of the Workshop on Modeling, Benchmarking and Simulation, Madison, WI, USA, June 2005.
-
(2005)
Proceedings of the Workshop on Modeling, Benchmarking and Simulation
-
-
Hamerly, G.1
Perelman, E.2
Lau, J.3
Calder, B.4
-
12
-
-
34247143442
-
Communist, Utilitarian, and Capitalist Cache Policies on CMPs: Caches as a Shared Resource
-
Seattle, WA, USA, September
-
L. R. Hsu, S. K. Reinhardt, R. R. Iyer, and S. Makineni. Communist, Utilitarian, and Capitalist Cache Policies on CMPs: Caches as a Shared Resource. In Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques, pages 13-22, Seattle, WA, USA, September 2006.
-
(2006)
Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques
, pp. 13-22
-
-
Hsu, L.R.1
Reinhardt, S.K.2
Iyer, R.R.3
Makineni, S.4
-
14
-
-
76749097528
-
-
A. Jaleel, W. Hasenplaugh, M. Qureshi, J. Sebot, S. S. Jr., and J. Emer. Adaptive Insertion Policies for Managing Shared Caches. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, Brasov, Romania, September 2007.
-
A. Jaleel, W. Hasenplaugh, M. Qureshi, J. Sebot, S. S. Jr., and J. Emer. Adaptive Insertion Policies for Managing Shared Caches. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, Brasov, Romania, September 2007.
-
-
-
-
15
-
-
34547476643
-
PicoServer: Using 3D Stacking Technology to Enable a Compact Energy Efficient Chip Multiprocessor
-
San Jose, CA, USA, October
-
T. H. Kgil, S. D'Souza, A. G. Saidi, N. Binkert, R. Dreslinski, S. Reinhardt, K. Flautner, and T. Mudge. PicoServer: Using 3D Stacking Technology to Enable a Compact Energy Efficient Chip Multiprocessor. In Proceedings of the 12th Symposium on Architectural Support for Programming Languages and Operating Systems, pages 117-128, San Jose, CA, USA, October 2006.
-
(2006)
Proceedings of the 12th Symposium on Architectural Support for Programming Languages and Operating Systems
, pp. 117-128
-
-
Kgil, T.H.1
D'Souza, S.2
Saidi, A.G.3
Binkert, N.4
Dreslinski, R.5
Reinhardt, S.6
Flautner, K.7
Mudge, T.8
-
16
-
-
35348908288
-
A Novel Dimensionally-Decomposed Router for On-Chip Communication in 3D Architectures
-
San Diego, CA, USA, June
-
J. Kim, C. Nicopoulos, D. Park, R. Das, Y. Xie, N. Vijaykrishnan, M. S. Yousif, and C. R. Das. A Novel Dimensionally-Decomposed Router for On-Chip Communication in 3D Architectures. In Proceedings of the 34th International Symposium on Computer Architecture, San Diego, CA, USA, June 2007.
-
(2007)
Proceedings of the 34th International Symposium on Computer Architecture
-
-
Kim, J.1
Nicopoulos, C.2
Park, D.3
Das, R.4
Xie, Y.5
Vijaykrishnan, N.6
Yousif, M.S.7
Das, C.R.8
-
17
-
-
10444238444
-
Fair Cache Sharing and Partitioning in a Chip Multiprocessor Architecture
-
Antibes Juan-les-Pins, France, September
-
S. Kim, D. Chandra, and Y. Solihin. Fair Cache Sharing and Partitioning in a Chip Multiprocessor Architecture. In Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques, pages 111-122, Antibes Juan-les-Pins, France, September 2004.
-
(2004)
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques
, pp. 111-122
-
-
Kim, S.1
Chandra, D.2
Solihin, Y.3
-
18
-
-
33845914023
-
Design and Management of 3D Chip Multiprocessors Using Network-in-Memory
-
Boston, MA, USA, June
-
F. Li, C. Nicopoulos, T. Richardson, Y. Xie, V. Narayanan, and M. Kandemir. Design and Management of 3D Chip Multiprocessors Using Network-in-Memory. In Proceedings of the 33rd International Symposium on Computer Architecture, pages 130-141, Boston, MA, USA, June 2006.
-
(2006)
Proceedings of the 33rd International Symposium on Computer Architecture
, pp. 130-141
-
-
Li, F.1
Nicopoulos, C.2
Richardson, T.3
Xie, Y.4
Narayanan, V.5
Kandemir, M.6
-
19
-
-
57749186047
-
Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems
-
Salt Lake City, UT, USA, February
-
J. Lin, Q. Lu, X. Ding, Z. Zhang, and P. Sadayappan. Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems. In Proceedings of the 14th International Symposium on High Performance Computer Architecture, pages 367-378, Salt Lake City, UT, USA, February 2008.
-
(2008)
Proceedings of the 14th International Symposium on High Performance Computer Architecture
, pp. 367-378
-
-
Lin, J.1
Lu, Q.2
Ding, X.3
Zhang, Z.4
Sadayappan, P.5
-
20
-
-
28344453642
-
Bridging the Processor-Memory Performance Gap with 3D IC Technology
-
November-December
-
C. C. Liu, I. Ganusov, M. Burtscher, and S. Tiwari. Bridging the Processor-Memory Performance Gap with 3D IC Technology. IEEE Design and Test of Computers, 22(6):556-564, November-December 2005.
-
(2005)
IEEE Design and Test of Computers
, vol.22
, Issue.6
, pp. 556-564
-
-
Liu, C.C.1
Ganusov, I.2
Burtscher, M.3
Tiwari, S.4
-
21
-
-
66749155879
-
Cache Bursts: A New Approach for Eliminating Dead Blocks and Increasing Cache Efficiency
-
Lake Como, Italy, November
-
H. Liu, M. Ferdman, J. Huh, and D. Burger. Cache Bursts: A New Approach for Eliminating Dead Blocks and Increasing Cache Efficiency. In Proceedings of the 41st International Symposium on Microarchitecture, pages 222-233, Lake Como, Italy, November 2008.
-
(2008)
Proceedings of the 41st International Symposium on Microarchitecture
, pp. 222-233
-
-
Liu, H.1
Ferdman, M.2
Huh, J.3
Burger, D.4
-
23
-
-
70349190964
-
Zesto: A Cycle-Level Simulator for Highly Detailed Microarchitecture Exploration
-
Boston, MA, USA, April
-
G. H. Loh, S. Subramaniam, and Y. Xie. Zesto: A Cycle-Level Simulator for Highly Detailed Microarchitecture Exploration. In Proceedings of the International Symposium on Performance Analysis of Systems and Software, Boston, MA, USA, April 2009.
-
(2009)
Proceedings of the International Symposium on Performance Analysis of Systems and Software
-
-
Loh, G.H.1
Subramaniam, S.2
Xie, Y.3
-
24
-
-
34548359365
-
Processor Design in 3D Die-Stacking Technologies
-
May-June
-
G. H. Loh, Y. Xie, and B. Black. Processor Design in 3D Die-Stacking Technologies. IEEE Micro Magazine, 27(3), May-June 2007.
-
(2007)
IEEE Micro Magazine
, vol.27
, Issue.3
-
-
Loh, G.H.1
Xie, Y.2
Black, B.3
-
25
-
-
34547204691
-
A Thermally-Aware Performance Analysis of Vertically Integrated (3-D) Processor-Memory Hierarchy
-
San Francisco, CA, USA, July
-
G. L. Loi, B. Agarwal, N. Srivastava, S.-C. Lin, and T. Sherwood. A Thermally-Aware Performance Analysis of Vertically Integrated (3-D) Processor-Memory Hierarchy. In Proceedings of the 43rd Design Automation Conference, pages 991-996, San Francisco, CA, USA, July 2006.
-
(2006)
Proceedings of the 43rd Design Automation Conference
, pp. 991-996
-
-
Loi, G.L.1
Agarwal, B.2
Srivastava, N.3
Lin, S.-C.4
Sherwood, T.5
-
26
-
-
84962144701
-
Balancing Throughput and Fairness in SMT Processors
-
Tucson, AZ, USA, November
-
K. Luo, J. Gummaraju, and M. Franklin. Balancing Throughput and Fairness in SMT Processors. In Proceedings of the 2001 International Symposium on Performance Analysis of Systems and Software, pages 164-171, Tucson, AZ, USA, November 2001.
-
(2001)
Proceedings of the 2001 International Symposium on Performance Analysis of Systems and Software
, pp. 164-171
-
-
Luo, K.1
Gummaraju, J.2
Franklin, M.3
-
28
-
-
64949203821
-
Optimizing Communication and Capacity in a 3D Stacked Reconfigurable Cache Hierarchy
-
Raleigh, NC, USA, February
-
N. Madan, L. Zhao, N. Muralimanohar, A. Udipi, R. Balasubramonian, R. Iyer, S. Makineni, and D. Newell. Optimizing Communication and Capacity in a 3D Stacked Reconfigurable Cache Hierarchy. In Proceedings of the 15th International Symposium on High Performance Computer Architecture, pages 262-274, Raleigh, NC, USA, February 2009.
-
(2009)
Proceedings of the 15th International Symposium on High Performance Computer Architecture
, pp. 262-274
-
-
Madan, N.1
Zhao, L.2
Muralimanohar, N.3
Udipi, A.4
Balasubramonian, R.5
Iyer, R.6
Makineni, S.7
Newell, D.8
-
29
-
-
0003506711
-
-
TN, Compaq Computer Corporation Western Research Laboratory, June
-
S. McFarling. Combining Branch Predictors. TN 36, Compaq Computer Corporation Western Research Laboratory, June 1993.
-
(1993)
Combining Branch Predictors
, pp. 36
-
-
McFarling, S.1
-
30
-
-
52649135185
-
MIRA: A Multi-Layered On-Chip Interconnect Router Architecture
-
Beijing, China, June
-
D. Park, S. Eachempati, R. Das, A. K. Mishra, Y. Xie, N. Vijaykrishnan, and C. R. Das. MIRA: A Multi-Layered On-Chip Interconnect Router Architecture. In Proceedings of the 35th International Symposium on Computer Architecture, pages 251-261, Beijing, China, June 2008.
-
(2008)
Proceedings of the 35th International Symposium on Computer Architecture
, pp. 251-261
-
-
Park, D.1
Eachempati, S.2
Das, R.3
Mishra, A.K.4
Xie, Y.5
Vijaykrishnan, N.6
Das, C.R.7
-
32
-
-
33845874613
-
A Case for MLP-Aware Cache Replacement
-
Boston, MA, USA, June
-
M. K. Qureshi, , D. Lynch, O. Mutlu, and Y. N. Patt. A Case for MLP-Aware Cache Replacement. In Proceedings of the 33rd International Symposium on Computer Architecture, pages 167-178, Boston, MA, USA, June 2006.
-
(2006)
Proceedings of the 33rd International Symposium on Computer Architecture
, pp. 167-178
-
-
Qureshi, M.K.1
Lynch, D.2
Mutlu, O.3
Patt, Y.N.4
-
33
-
-
64949187933
-
Dynamic Spill-Accept for Scalable High-Performance Caching in CMPs
-
Raleigh, NC, USA, February
-
M. K. Qureshi. Dynamic Spill-Accept for Scalable High-Performance Caching in CMPs. In Proceedings of the 15th International Symposium on High Performance Computer Architecture, pages 45-54, Raleigh, NC, USA, February 2009.
-
(2009)
Proceedings of the 15th International Symposium on High Performance Computer Architecture
, pp. 45-54
-
-
Qureshi, M.K.1
-
34
-
-
34548042910
-
Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches
-
Orlando, FL, December
-
M. K. Qureshi and Y. N. Patt. Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches. In Proceedings of the 39th International Symposium on Microarchitecture, pages 423-432, Orlando, FL, December 2006.
-
(2006)
Proceedings of the 39th International Symposium on Microarchitecture
, pp. 423-432
-
-
Qureshi, M.K.1
Patt, Y.N.2
-
35
-
-
34247108325
-
Architectural Support for Operating System-Driven CMP Cache Management
-
Seattle, WA, USA, September
-
N. Rafique, W.-T. Lin, and M. Thottethodi. Architectural Support for Operating System-Driven CMP Cache Management. In Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques, pages 2-12, Seattle, WA, USA, September 2006.
-
(2006)
Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques
, pp. 2-12
-
-
Rafique, N.1
Lin, W.-T.2
Thottethodi, M.3
-
36
-
-
25844489123
-
Design Aspects of a Microprocessor Data Cache using 3D Die Interconnect Technology
-
Austin, TX, USA, May
-
P. Reed, G. Yeung, and B. Black. Design Aspects of a Microprocessor Data Cache using 3D Die Interconnect Technology. In Proceedings of the International Conference on Integrated Circuit Design and Technology, pages 15-18, Austin, TX, USA, May 2005.
-
(2005)
Proceedings of the International Conference on Integrated Circuit Design and Technology
, pp. 15-18
-
-
Reed, P.1
Yeung, G.2
Black, B.3
-
37
-
-
0034443570
-
Symbiotic Jobscheduling for a Simultaneous Multithreading Machine
-
Cambridge, MA, USA, November
-
A. E. Snavely and D. Tullsen. Symbiotic Jobscheduling for a Simultaneous Multithreading Machine. In Proceedings of the 9th Symposium on Architectural Support for Programming Languages and Operating Systems, pages 234-244, Cambridge, MA, USA, November 2000.
-
(2000)
Proceedings of the 9th Symposium on Architectural Support for Programming Languages and Operating Systems
, pp. 234-244
-
-
Snavely, A.E.1
Tullsen, D.2
-
38
-
-
0026925878
-
Optimal Paritioning of Cache Memory
-
September
-
H. S. Stone, J. Tuerk, and J. L. Wolf. Optimal Paritioning of Cache Memory. IEEE Transactions on Computers, 41(9):1054-1068, September 1992.
-
(1992)
IEEE Transactions on Computers
, vol.41
, Issue.9
, pp. 1054-1068
-
-
Stone, H.S.1
Tuerk, J.2
Wolf, J.L.3
-
39
-
-
1642371317
-
Dynamic Partitioning of Shared Cache Memory
-
G. E. Suh, L. Rudolph, and S. Devadas. Dynamic Partitioning of Shared Cache Memory. Journal of Supercomputing, 28(1):7-26, 2004.
-
(2004)
Journal of Supercomputing
, vol.28
, Issue.1
, pp. 7-26
-
-
Suh, G.E.1
Rudolph, L.2
Devadas, S.3
-
41
-
-
76749117477
-
-
S. Thoziyoor, N. Muralimanohar, J. H. Ahn, and N. P. Jouppi. CACTI 5.1. HPL 2008/20, HP Labs, April 2008.
-
S. Thoziyoor, N. Muralimanohar, J. H. Ahn, and N. P. Jouppi. CACTI 5.1. HPL 2008/20, HP Labs, April 2008.
-
-
-
-
42
-
-
33746603614
-
Three-Dimensional Cache Design Using 3DCacti
-
San Jose, CA, USA, October
-
Y.-F. Tsai, Y. Xie, N. Vijaykrishnan, and M. J. Irwin. Three-Dimensional Cache Design Using 3DCacti. In Proceedings of the International Conference on Computer Design, San Jose, CA, USA, October 2005.
-
(2005)
Proceedings of the International Conference on Computer Design
-
-
Tsai, Y.-F.1
Xie, Y.2
Vijaykrishnan, N.3
Irwin, M.J.4
-
43
-
-
65349146389
-
A Low-Radix and Low-Diameter 3D Interconnection Network Design
-
Raleigh, NC, USA, February
-
Y. Xu, Y. Du, B. Zhao, X. Zhou, Y. Zhang, and J. Yang. A Low-Radix and Low-Diameter 3D Interconnection Network Design. In Proceedings of the 15th International Symposium on High Performance Computer Architecture, pages 30-42, Raleigh, NC, USA, February 2009.
-
(2009)
Proceedings of the 15th International Symposium on High Performance Computer Architecture
, pp. 30-42
-
-
Xu, Y.1
Du, Y.2
Zhao, B.3
Zhou, X.4
Zhang, Y.5
Yang, J.6
|