-
1
-
-
0034825713
-
Performance of hardware compressed main memory
-
B. Abali, H. Franke, X. Shen, D.E. Poff, and T.B. Smith, "Performance of Hardware Compressed Main Memory," Proc. Seventh Int'l Symp. High-Performance Computer Architecture, pp. 73-81, 2001.
-
(2001)
Proc. Seventh Int'l Symp. High-Performance Computer Architecture
, pp. 73-81
-
-
Abali, B.1
Franke, H.2
Shen, X.3
Poff, D.E.4
Smith, T.B.5
-
2
-
-
0034844454
-
Data prefetching by dependence graph precomputation
-
M.M. Annavaram, J.M. Patel, and E.S. Davidson, "Data Prefetching by Dependence Graph Precomputation," Proc. 28th Ann. Int'l Symp. Computer Architecture, pp. 52-61, 2001.
-
(2001)
Proc. 28th Ann. Int'l Symp. Computer Architecture
, pp. 52-61
-
-
Annavaram, M.M.1
Patel, J.M.2
Davidson, E.S.3
-
4
-
-
0034856729
-
Dynamically allocating processor resources between nearby and distant ILP
-
R. Balasubramonian, S. Dwarkadas, and D.H. Albonesi, "Dynamically Allocating Processor Resources between Nearby and Distant ILP," Proc. 28th Ann. Int'l Symp. Computer Architecture, pp. 26-37, 2001.
-
(2001)
Proc. 28th Ann. Int'l Symp. Computer Architecture
, pp. 26-37
-
-
Balasubramonian, R.1
Dwarkadas, S.2
Albonesi, D.H.3
-
6
-
-
0031594024
-
Multi-Level texture caching for 3D graphics hardware
-
M. Cox, N. Bhandari, and M. Shantz, "Multi-Level Texture Caching for 3D Graphics Hardware," Proc. 25th Ann. Int'l Symp. Computer Architecture, pp. 86-97, 1998.
-
(1998)
Proc. 25th Ann. Int'l Symp. Computer Architecture
, pp. 86-97
-
-
Cox, M.1
Bhandari, N.2
Shantz, M.3
-
8
-
-
0032687058
-
A performance comparison of contemporary DRAM architectures
-
V. Cuppu, B. Jacob, B. Davis, and T. Mudge, "A Performance Comparison of Contemporary DRAM Architectures," Proc. 26th Ann. Int'l Symp. Computer Architecture, pp. 222-233, 1999.
-
(1999)
Proc. 26th Ann. Int'l Symp. Computer Architecture
, pp. 222-233
-
-
Cuppu, V.1
Jacob, B.2
Davis, B.3
Mudge, T.4
-
9
-
-
0030348712
-
AlphaServer 4100 performance characterization
-
Z. Cvetanovic and D.D. Donaldson, "AlphaServer 4100 Performance Characterization," Digital Technical J., vol. 8, no. 4, pp. 3-20, 1996.
-
(1996)
Digital Technical J.
, vol.8
, Issue.4
, pp. 3-20
-
-
Cvetanovic, Z.1
Donaldson, D.D.2
-
10
-
-
0036374270
-
The architecture of the DIVA processing-in-memory chip
-
J. Draper, J. Chame, M. Hall, C. Steele, T. Barrett, J. LaCoss, J. Granacki, J. Shin, C. Chen, C.W. Kang, I. Kim, and G. Daglikoca, "The Architecture of the DIVA Processing-in-Memory Chip," Proc. 16th Int'l Conf. Supercomputing, pp. 14-25, 2002.
-
(2002)
Proc. 16th Int'l Conf. Supercomputing
, pp. 14-25
-
-
Draper, J.1
Chame, J.2
Hall, M.3
Steele, C.4
Barrett, T.5
LaCoss, J.6
Granacki, J.7
Shin, J.8
Chen, C.9
Kang, C.W.10
Kim, I.11
Daglikoca, G.12
-
12
-
-
78650753155
-
Memory-Intensive benchmarks: IRAM vs. cache-based machines
-
B. Gaeke, P. Husbands, X. Li, L. Oliker, K. Yelick, and R. Biswas, "Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines," Proc. 16th Int'l Parallel and Distributed Processing Symp., pp. 30-30, 2002.
-
(2002)
Proc. 16th Int'l Parallel and Distributed Processing Symp.
, pp. 30-30
-
-
Gaeke, B.1
Husbands, P.2
Li, X.3
Oliker, L.4
Yelick, K.5
Biswas, R.6
-
14
-
-
0003997750
-
CDRAM in a unified memory architecture
-
C.A. Hart, "CDRAM in a Unified Memory Architecture," Proc. CompCon '94, pp. 261-266, 1994.
-
(1994)
Proc. CompCon '94
, pp. 261-266
-
-
Hart, C.A.1
-
15
-
-
0025419834
-
The cache DRAM architecture: A DRAM with an on-chip cache memory
-
Apr
-
H. Hidaka, Y. Matsuda, M. Asakura, and K. Fujishima, "The Cache DRAM Architecture: A DRAM with an On-Chip Cache Memory," IEEE Micro, vol. 10, no. 2, pp. 14-25, Apr. 1990.
-
(1990)
IEEE Micro
, vol.10
, Issue.2
, pp. 14-25
-
-
Hidaka, H.1
Matsuda, Y.2
Asakura, M.3
Fujishima, K.4
-
17
-
-
0029666640
-
DCD-Disk caching disk: A new approach for boosting I/O performance
-
Y. Hu and Q. Yang, "DCD-Disk Caching Disk: A New Approach for Boosting I/O Performance," Proc. 23rd Ann. Int'l Symp. Computer Architecture, pp. 169-178, 1996.
-
(1996)
Proc. 23rd Ann. Int'l Symp. Computer Architecture
, pp. 169-178
-
-
Hu, Y.1
Yang, Q.2
-
19
-
-
0026938770
-
A new Era of fast dynamic RAMs
-
Oct
-
F. Jones et al., "A New Era of Fast Dynamic RAMs," IEEE Spectrum, pp. 43-49, Oct. 1992.
-
(1992)
IEEE Spectrum
, pp. 43-49
-
-
Jones, F.1
-
20
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
N.P. Jouppi, "Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers," Proc. 17th Ann. Int'l Symp. Computer Architecture, pp. 364-373, 1990.
-
(1990)
Proc. 17th Ann. Int'l Symp. Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
21
-
-
84862448424
-
-
Technical Report HPL-2000-53, HP Laboratories, Palo Alto, Calif., Apr
-
P. Keltcher, S. Richardson, and S. Siu, "An Equal Area Comparison of Embedded DRAM and SRAM Memory Architectures for a Chip Multiprocessor," Technical Report HPL-2000-53, HP Laboratories, Palo Alto, Calif., Apr. 2000.
-
(2000)
an Equal Area Comparison of Embedded DRAM and SRAM Memory Architectures for a Chip Multiprocessor
-
-
Keltcher, P.1
Richardson, S.2
Siu, S.3
-
25
-
-
0005503448
-
Vector IRAM: A media-enhanced vector processor with embedded DRAM
-
C. Kozyrakis, J. Gebis, D. Martin, S. Williams, I. Mavroidis, S. Pope, D. Jones, and D. Patterson, "Vector IRAM: A Media-Enhanced Vector Processor with Embedded DRAM," Proc. Hot Chips 12, 2000.
-
Proc. Hot Chips
, vol.12
, pp. 2000
-
-
Kozyrakis, C.1
Gebis, J.2
Martin, D.3
Williams, S.4
Mavroidis, I.5
Pope, S.6
Jones, D.7
Patterson, D.8
-
26
-
-
0034461711
-
Eager writeback - A technique for improving bandwidth utilization
-
H.-H. Lee, G. Tyson, and M. Farrens, "Eager Writeback - A Technique for Improving Bandwidth Utilization," Proc 33rd IEEE/ACM Int'l Symp. Microarchitecture, pp. 11-21, 2000.
-
(2000)
Proc 33rd IEEE/ACM Int'l Symp. Microarchitecture
, pp. 11-21
-
-
Lee, H.-H.1
Tyson, G.2
Farrens, M.3
-
27
-
-
0034818343
-
Reducing DRAM latencies with an integrated memory hierarchy design
-
W. Lin, S. Reinhardt, and D. Burger, "Reducing DRAM Latencies with an Integrated Memory Hierarchy Design," Proc. Seventh Int'l Symp. High-Performance Computer Architecture, pp. 301-312, 2001.
-
(2001)
Proc. Seventh Int'l Symp. High-Performance Computer Architecture
, pp. 301-312
-
-
Lin, W.1
Reinhardt, S.2
Burger, D.3
-
28
-
-
0034839064
-
Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors
-
C.-K. Luk, "Tolerating Memory Latency through Software-Controlled Pre-Execution in Simultaneous Multithreading Processors," Proc. 28th Ann. Int'l Symp. Computer Architecture, pp. 40-51, 2001.
-
(2001)
Proc. 28th Ann. Int'l Symp. Computer Architecture
, pp. 40-51
-
-
Luk, C.-K.1
-
30
-
-
0031096193
-
A case for intelligent RAM
-
Mar./Apr
-
D. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton, C. Kozyrakis, R. Thomas, and K. Yelick, "A Case for Intelligent RAM," IEEE Micro, pp. 34-44, Mar./Apr. 1997.
-
(1997)
IEEE Micro
, pp. 34-44
-
-
Patterson, D.1
Anderson, T.2
Cardwell, N.3
Fromm, R.4
Keeton, K.5
Kozyrakis, C.6
Thomas, R.7
Yelick, K.8
-
31
-
-
0033075272
-
Functional implementation techniques for CPU cache memories
-
Feb
-
J.-K. Peir, W.W. Hsu, and A.J. Smith, "Functional Implementation Techniques for CPU Cache Memories," IEEE Trans. Computers, vol. 48, no. 2, pp. 100-110, Feb. 1999.
-
(1999)
IEEE Trans. Computers
, vol.48
, Issue.2
, pp. 100-110
-
-
Peir, J.-K.1
Hsu, W.W.2
Smith, A.J.3
-
32
-
-
0036375949
-
Bloom filtering cache misses for accurate data speculation and prefetching
-
J.-K. Peir, S.-C. Lai, S.-L. Lu, J. Stark, and K. Lai, "Bloom Filtering Cache Misses for Accurate Data Speculation and Prefetching," Proc. 16th Int'l Conf. Supercomputing (ICS-02), pp. 189-198, 2002.
-
(2002)
Proc. 16th Int'l Conf. Supercomputing (ICS-02)
, pp. 189-198
-
-
Peir, J.-K.1
Lai, S.-C.2
Lu, S.-L.3
Stark, J.4
K. Lai5
-
33
-
-
0035101241
-
The IA-64 itanium processor cartridge
-
Jan./Feb
-
W.A. Samaras, N. Cherukuri, and S. Venkataraman, "The IA-64 Itanium Processor Cartridge," IEEE Micro, vol. 21, no. 1, pp. 82-89, Jan./Feb. 2001.
-
(2001)
IEEE Micro
, vol.21
, Issue.1
, pp. 82-89
-
-
Samaras, W.A.1
Cherukuri, N.2
Venkataraman, S.3
-
34
-
-
0029666645
-
Missing the memory wall: The case for processor/memory integration
-
A. Saulsbury, F. Pong, and A. Nowatzyk, "Missing the Memory Wall: The Case for Processor/Memory Integration," Proc. 23rd Ann. Int'l Symp. Computer Architecure, pp. 90-103, 1996.
-
(1996)
Proc. 23rd Ann. Int'l Symp. Computer Architecure
, pp. 90-103
-
-
Saulsbury, A.1
Pong, F.2
Nowatzyk, A.3
-
35
-
-
0028324009
-
Decoupled sectored caches: Conciliating low tag implementation cost and low miss ratio
-
A. Seznec, "Decoupled Sectored Caches: Conciliating Low Tag Implementation Cost and Low Miss Ratio," Proc. 21st Ann. Int'l Symp. Computer Architecture, pp. 384-393, 1994.
-
(1994)
Proc. 21st Ann. Int'l Symp. Computer Architecture
, pp. 384-393
-
-
Seznec, A.1
-
36
-
-
0037340044
-
A decoupled predictor-directed stream prefetching architecture
-
Mar
-
T. Sherwood and B. Calder, "A Decoupled Predictor-Directed Stream Prefetching Architecture," IEEE Trans. Computers, vol. 52, no. 5, Mar. 2003.
-
(2003)
IEEE Trans. Computers
, vol.52
, Issue.5
-
-
Sherwood, T.1
Calder, B.2
-
37
-
-
0003450887
-
-
technical report, COMPAQ Western Research Lab, Aug
-
P. Shivakumar and N.P. Jouppi, "CACTI 3.0: An Integrated Cache Timing, Power, and Area Model," technical report, COMPAQ Western Research Lab, Aug. 2001.
-
(2001)
CACTI 3.0: an Integrated Cache Timing, Power, and Area Model
-
-
Shivakumar, P.1
Jouppi, N.P.2
-
40
-
-
0035272785
-
Pinnacle: IBM MXT in a memory controller chip
-
Mar/Apr
-
R.B. Tremaine, T.B. Smith, M. Wazlowski, D. Har, K.-K. Mak, and S. Arramreddy, "Pinnacle: IBM MXT in a Memory Controller Chip," IEEE Micro, vol. 21, no. 2, pp. 56-68, Mar/Apr. 2001.
-
(2001)
IEEE Micro
, vol.21
, Issue.2
, pp. 56-68
-
-
Tremaine, R.B.1
Smith, T.B.2
Wazlowski, M.3
Har, D.4
Mak, K.-K.5
Arramreddy, S.6
-
41
-
-
0029508817
-
A modified approach to data cache management
-
G. Tyson, M. Farrens, J. Matthews, and A.R. Pleszkun, "A Modified Approach to Data Cache Management," Proc. 28th Ann. Int'l Symp. Microarchitecture, pp. 93-103, 1995.
-
(1995)
Proc. 28th Ann. Int'l Symp. Microarchitecture
, pp. 93-103
-
-
Tyson, G.1
Farrens, M.2
Matthews, J.3
Pleszkun, A.R.4
-
44
-
-
0003999721
-
-
Technical Report UW CSE 97-03-04, Univ. of Washington, Feb
-
W. Wong and J.-L. Baer, "DRAM On-Chip Caching," Technical Report UW CSE 97-03-04, Univ. of Washington, Feb. 1997.
-
(1997)
DRAM On-Chip Caching
-
-
Wong, W.1
Baer, J.-L.2
-
46
-
-
0342340303
-
-
Technical Report CSL-TR-97-731, Computer Systems Laboratory, Stanford Univ., Aug
-
T. Yamauchi, L. Hammond, and K. Olukotun, "A Single Chip Multiprocessor Integrated with High Density DRAM," Technical Report CSL-TR-97-731, Computer Systems Laboratory, Stanford Univ., Aug. 1997.
-
(1997)
A Single Chip Multiprocessor Integrated With High Density DRAM
-
-
Yamauchi, T.1
Hammond, L.2
Olukotun, K.3
-
48
-
-
0032651228
-
Speculation techniques for improving load related instruction scheduling
-
A. Yoaz, M. Erez, R. Ronen, and S. Jourdan, "Speculation Techniques for Improving Load Related Instruction Scheduling," Proc. 26th Ann. Int'l Symp. Computer Architecture, pp. 42-53, 1999.
-
(1999)
Proc. 26th Ann. Int'l Symp. Computer Architecture
, pp. 42-53
-
-
Yoaz, A.1
Erez, M.2
Ronen, R.3
Jourdan, S.4
-
49
-
-
0034460897
-
A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality
-
Z. Zhang, Z. Zhu, and X. Zhang, "A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conflicts and Exploit Data Locality," Proc. 33rd IEEE/ACM Int'l Symp. Microarchitecture, pp. 32-41, 2000.
-
(2000)
Proc. 33rd IEEE/ACM Int'l Symp. Microarchitecture
, pp. 32-41
-
-
Zhang, Z.1
Zhu, Z.2
Zhang, X.3
-
50
-
-
0035389657
-
Cached DRAM: A simple and effective technique for memory access latency reduction on ILP processors
-
July/Aug
-
Z. Zhang, Z. Zhu, and X. Zhang, "Cached DRAM: A Simple and Effective Technique for Memory Access Latency Reduction on ILP Processors," IEEE Micro, vol. 21, no. 4, pp. 22-32, July/Aug. 2001.
-
(2001)
IEEE Micro
, vol.21
, Issue.4
, pp. 22-32
-
-
Zhang, Z.1
Zhu, Z.2
Zhang, X.3
-
51
-
-
84949752992
-
Fine-grain priority scheduling on multi-channel memory systems
-
Z. Zhu, Z. Zhang, and X. Zhang, "Fine-Grain Priority Scheduling on Multi-Channel Memory Systems," Proc. Eighth Int'l Symp. High-Performance Computer Architecture, pp. 107-116, 2002.
-
(2002)
Proc. Eighth Int'l Symp. High-Performance Computer Architecture
, pp. 107-116
-
-
Zhu, Z.1
Zhang, Z.2
Zhang, X.3
|