-
1
-
-
84976789640
-
Memory Bandwidth Optimizations for Wide-Bus Machines
-
Jan. (Incorrectly published under M.A. Alexander et al.)
-
M.J. Alexander, M.W. Bailey, B.R. Childers, J.W. Davidson, and S. Jinturkar, "Memory Bandwidth Optimizations for Wide-Bus Machines," Proc. IEEE 26th Hawaii Int'l Conf. Systems Sciences (HICSS-26), pp. 466-475, Jan. 1993. (Incorrectly published under M.A. Alexander et al.).
-
(1993)
Proc. IEEE 26th Hawaii Int'l Conf. Systems Sciences (HICSS-26)
, pp. 466-475
-
-
Alexander, M.J.1
Bailey, M.W.2
Childers, B.R.3
Davidson, J.W.4
Jinturkar, S.5
-
2
-
-
0026267802
-
An Effective On-Chip Preloading Scheme to Reduce Data Access Penalty
-
Nov.
-
J.-L. Baer and T.-F. Chen, "An Effective On-Chip Preloading Scheme to Reduce Data Access Penalty," Proc. Supercomputing '91, pp. 176-186, Nov. 1991.
-
(1991)
Proc. Supercomputing '91
, pp. 176-186
-
-
Baer, J.-L.1
Chen, T.-F.2
-
3
-
-
0029254155
-
Myrinet - A Gigabit-per-Second Local-Area Network
-
Feb.
-
N.J. Boden, D. Cohen, R.E. Felderman, A.E. Kulawik, C.L. Seitz, J.N. Seizovic, and W.-K. Su, "Myrinet - A Gigabit-per-Second Local-Area Network," IEEE Micro, vol. 15, no. 1, pp. 29-36, Feb. 1995.
-
(1995)
IEEE Micro
, vol.15
, Issue.1
, pp. 29-36
-
-
Boden, N.J.1
Cohen, D.2
Felderman, R.E.3
Kulawik, A.E.4
Seitz, C.L.5
Seizovic, J.N.6
Su, W.-K.7
-
5
-
-
0010232351
-
The Declining Effectiveness of Dynamic Caching for General-Purpose Microprocessors
-
Univ. Wisconsin
-
D. Burger, J.R. Goodman, and A. Kägi, "The Declining Effectiveness of Dynamic Caching for General-Purpose Microprocessors," Technical Report 1261, Univ. Wisconsin, 1995.
-
(1995)
Technical Report 1261
-
-
Burger, D.1
Goodman, J.R.2
Kägi, A.3
-
6
-
-
33749896339
-
Single PE Optimization Techniques for the Cray T3D System
-
Sept.
-
J. Brooks, "Single PE Optimization Techniques for the Cray T3D System," Proc. First European T3D Workshop, Sept. 1995.
-
(1995)
Proc. First European T3D Workshop
-
-
Brooks, J.1
-
7
-
-
0026308525
-
Using Lookahead to Reduce Memory Bank Contention for Decoupled Operand References
-
Nov.
-
P.L. Bird and R.A. Uhlig, "Using Lookahead to Reduce Memory Bank Contention for Decoupled Operand References," Proc. Supercomputing '91, pp. 187-196, Nov. 1991.
-
(1991)
Proc. Supercomputing '91
, pp. 187-196
-
-
Bird, P.L.1
Uhlig, R.A.2
-
8
-
-
0026980850
-
An Efficient Architecture for Loop Based Data Preloading
-
Dec.
-
W.Y. Chen, R.A. Bringmann, S.A. Mahlke, R.E. Hank, and J.E. Sicolo, "An Efficient Architecture for Loop Based Data Preloading," Proc. IEEE/ACM 25th Int'l Symp. Microarchitecture, pp. 92-101, Dec. 1992.
-
(1992)
Proc. IEEE/ACM 25th Int'l Symp. Microarchitecture
, pp. 92-101
-
-
Chen, W.Y.1
Bringmann, R.A.2
Mahlke, S.A.3
Hank, R.E.4
Sicolo, J.E.5
-
9
-
-
84944405613
-
Command Vector Memory Systems: High Performance at Low Cost
-
Oct.
-
J. Corbal, R. Espasa, and M. Valero, "Command Vector Memory Systems: High Performance at Low Cost," Proc. 1998 Int'l Conf. Parallel Architectures and Compilation Techniques, pp. 68-77, Oct. 1998.
-
(1998)
Proc. 1998 Int'l Conf. Parallel Architectures and Compilation Techniques
, pp. 68-77
-
-
Corbal, J.1
Espasa, R.2
Valero, M.3
-
10
-
-
0028757333
-
Sunder: A Programmable Hardware Prefetch Architecture for Numerical Loops
-
Nov.
-
T.-C. Chiueh, "Sunder: A Programmable Hardware Prefetch Architecture for Numerical Loops," Proc. Supercomputing '94, pp. 488-497, Nov. 1994.
-
(1994)
Proc. Supercomputing '94
, pp. 488-497
-
-
Chiueh, T.-C.1
-
11
-
-
0032761638
-
Impulse: Building a Smarter Memory Controller
-
Jan.
-
J.B. Carter, W.C. Hsieh, L.B. Stoller, M.R. Swanson, L. Zhang, E.L. Brunvand, A. Davis, C.-C. Kuo, R. Kuramkote, M.A. Parker, L. Schaelicke, and T. Tateyama, "Impulse: Building a Smarter Memory Controller," Proc. Fifth Ann. Symp. High Performance Computer Architecture, pp. 70-79, Jan. 1999.
-
(1999)
Proc. Fifth Ann. Symp. High Performance Computer Architecture
, pp. 70-79
-
-
Carter, J.B.1
Hsieh, W.C.2
Stoller, L.B.3
Swanson, M.R.4
Zhang, L.5
Brunvand, E.L.6
Davis, A.7
Kuo, C.-C.8
Kuramkote, R.9
Parker, M.A.10
Schaelicke, L.11
Tateyama, T.12
-
12
-
-
0024142664
-
Automatic Management of Programmable Caches
-
Aug.
-
R. Cytron, S. Karlovsky, and K.P. McAuliffe, "Automatic Management of Programmable Caches," Proc. 1988 Int'l Conf. Parallel Processing, pp. 229-238, Aug. 1988.
-
(1988)
Proc. 1988 Int'l Conf. Parallel Processing
, pp. 229-238
-
-
Cytron, R.1
Karlovsky, S.2
McAuliffe, K.P.3
-
13
-
-
0026138044
-
Software Prefetching
-
Apr.
-
D. Callahan, K. Kennedy, and A. Porterfield, "Software Prefetching," Proc. Fourth Symp. Architectural Support for Programming Languages and Operating Systems, pp. 40-52, Apr. 1991.
-
(1991)
Proc. Fourth Symp. Architectural Support for Programming Languages and Operating Systems
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
14
-
-
33646901785
-
Tolerating Data Access Latency with Register Preloading
-
Sept.
-
W.Y. Chen, S.A. Mahlke, and W.W. Hwu, "Tolerating Data Access Latency with Register Preloading," Proc. 1992 Int'l Conf. Supercomputing, pp. 104-113, Sept. 1992.
-
(1992)
Proc. 1992 Int'l Conf. Supercomputing
, pp. 104-113
-
-
Chen, W.Y.1
Mahlke, S.A.2
Hwu, W.W.3
-
15
-
-
0031274906
-
Direct Rambus Technology: The New Main Memory Standard
-
Nov./Dec.
-
R. Crisp, "Direct Rambus Technology: The New Main Memory Standard," IEEE Micro, vol. 17, no. 6, pp. 18-28, Nov./Dec. 1997.
-
(1997)
IEEE Micro
, vol.17
, Issue.6
, pp. 18-28
-
-
Crisp, R.1
-
18
-
-
0025402476
-
A Set of Level 3 Basic Linear Algebra Subprograms
-
Mar.
-
J.J. Dongarra, J. DuCroz, I. Duff, and S. Hammarling, "A Set of Level 3 Basic Linear Algebra Subprograms," ACM Trans. Math. Software, vol. 16, no. 1, pp. 1-17, Mar. 1990.
-
(1990)
ACM Trans. Math. Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
DuCroz, J.2
Duff, I.3
Hammarling, S.4
-
19
-
-
0029341212
-
Sequential Hardware Prefetching in Shared-Memory Multiprocessors
-
July
-
F. Dahlgren, M. Dubois, and P. Stenstrom, "Sequential Hardware Prefetching in Shared-Memory Multiprocessors," IEEE Trans. Parallel and Distributed Systems, vol. 6, no. 7, pp. 733-746, July 1995.
-
(1995)
IEEE Trans. Parallel and Distributed Systems
, vol.6
, Issue.7
, pp. 733-746
-
-
Dahlgren, F.1
Dubois, M.2
Stenstrom, P.3
-
21
-
-
0030679080
-
Memory-System Design Considerations for Dynamically-Scheduled Processors
-
June
-
K.I. Farkas, P. Chow, N. Jouppi, and Z. Vranesic, "Memory-System Design Considerations for Dynamically-Scheduled Processors," Proc. 24th Ann. Int'l Symp. Computer Architecture, pp. 133-143, June 1997.
-
(1997)
Proc. 24th Ann. Int'l Symp. Computer Architecture
, pp. 133-143
-
-
Farkas, K.I.1
Chow, P.2
Jouppi, N.3
Vranesic, Z.4
-
23
-
-
0027247331
-
The Chinese Remainder Theorem and the Prime Memory System
-
May
-
Q.S. Gao, "The Chinese Remainder Theorem and the Prime Memory System," Proc. 20th Ann. Int'l Symp. Computer Architecture, pp. 337-340, May 1993.
-
(1993)
Proc. 20th Ann. Int'l Symp. Computer Architecture
, pp. 337-340
-
-
Gao, Q.S.1
-
24
-
-
0023560346
-
Vector Access Performance in Parallel Memories Using a Skewed Storage Scheme
-
Dec.
-
D.T. Harper III and J.R. Jump, "Vector Access Performance in Parallel Memories Using a Skewed Storage Scheme," IEEE Trans. Computers, vol. 36, no. 12, pp. 1,440-1,449, Dec. 1987.
-
(1987)
IEEE Trans. Computers
, vol.36
, Issue.12
-
-
Harper III, D.T.1
Jump, J.R.2
-
25
-
-
0032785291
-
Access Order and Effective Bandwidth for Streams on a Direct Rambus Memory
-
Jan.
-
S.I. Hong, S.A. McKee, M.H. Salinas, R.H. Klenke, J.H. Aylor, and W.A. Wulf, "Access Order and Effective Bandwidth for Streams on a Direct Rambus Memory," Proc. Fifth Ann. Symp. High Performance Computer Architecture, pp. 80-89, Jan. 1999.
-
(1999)
Proc. Fifth Ann. Symp. High Performance Computer Architecture
, pp. 80-89
-
-
Hong, S.I.1
McKee, S.A.2
Salinas, M.H.3
Klenke, R.H.4
Aylor, J.H.5
Wulf, W.A.6
-
27
-
-
0025429331
-
Improving Direct-Mapped Cache Performance by the Addition of a Small Fully Associative Cache and Prefetch Buffers
-
May
-
N.P. Jouppi, "Improving Direct-Mapped Cache Performance by the Addition of a Small Fully Associative Cache and Prefetch Buffers," Proc. 17th Ann. Int'l Symp. Computer Architecture, pp. 364-373, May 1990.
-
(1990)
Proc. 17th Ann. Int'l Symp. Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
29
-
-
0028383937
-
A Shared-Memory MPP from Cray Research
-
R.K. Koeninger, M. Furtney, and M. Walker, "A Shared-Memory MPP from Cray Research," Digital Technical J., vol. 6, no. 2, pp. 8-21, 1994.
-
(1994)
Digital Technical J.
, vol.6
, Issue.2
, pp. 8-21
-
-
Koeninger, R.K.1
Furtney, M.2
Walker, M.3
-
31
-
-
33749912655
-
Breaking the Memory Bottleneck, Parts 1 & 2
-
Jan./Feb.
-
D. Loshin and D. Budge, "Breaking the Memory Bottleneck, Parts 1 & 2," Supercomputing Review, Jan./Feb. 1992.
-
(1992)
Supercomputing Review
-
-
Loshin, D.1
Budge, D.2
-
35
-
-
0003981343
-
The Livermore Fortran Kernels: A Computer Test of the Numerical Performance Range
-
Lawrence Livermore Nat'l Laboratory, Dec.
-
F.H. McMahon, "The Livermore Fortran Kernels: A Computer Test of the Numerical Performance Range," Technical Report UCRL-53745, Lawrence Livermore Nat'l Laboratory, Dec. 1986.
-
(1986)
Technical Report UCRL-53745
-
-
McMahon, F.H.1
-
37
-
-
0026918402
-
Design and Evaluation of a Compiler Algorithm for Prefetching
-
Oct.
-
T.C. Mowry, M.S. Lam, and A. Gupta, "Design and Evaluation of a Compiler Algorithm for Prefetching," Proc. Fifth Symp. Architectural Support for Programming Languages and Operating Systems, pp. 62-73, Oct. 1992.
-
(1992)
Proc. Fifth Symp. Architectural Support for Programming Languages and Operating Systems
, pp. 62-73
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
38
-
-
0034581564
-
Parallel Access Ordering for SDRAM Memories
-
Jan.
-
B.K. Mathew, S.A. McKee, J.B. Carter, and A. Davis, "Parallel Access Ordering for SDRAM Memories," Proc. Sixth Ann. Symp. High Performance Computer Architecture, pp. 39-48, Jan. 2000.
-
(2000)
Proc. Sixth Ann. Symp. High Performance Computer Architecture
, pp. 39-48
-
-
Mathew, B.K.1
McKee, S.A.2
Carter, J.B.3
Davis, A.4
-
39
-
-
0005052298
-
A Vectorizing Software Pipelining Compiler for LIW and Superscalar Architectures
-
L. Meadows, S. Nakamoto, and V. Schuster, "A Vectorizing Software Pipelining Compiler for LIW and Superscalar Architectures," Proc. RISC '92, pp. 331-343, 1992.
-
(1992)
Proc. RISC '92
, pp. 331-343
-
-
Meadows, L.1
Nakamoto, S.2
Schuster, V.3
-
45
-
-
33749976518
-
-
DL 0035-00.c0.5.28, Mar.
-
Rambus, Inc., "64M/72M Direct RDRAM Data Sheet," DL 0035-00.c0.5.28, Mar. 1998. http://www.rambus.com/html/documentation.html.
-
(1998)
64M/72M Direct RDRAM Data Sheet
-
-
-
48
-
-
0344300562
-
Prefetch Unit for Vector Operation on Scalar Computers
-
Sept.
-
I. Sklenar, "Prefetch Unit for Vector Operation on Scalar Computers," Computer Architecture News, vol. 20, no. 4, pp. 31-37, Sept. 1992.
-
(1992)
Computer Architecture News
, vol.20
, Issue.4
, pp. 31-37
-
-
Sklenar, I.1
-
49
-
-
0031594008
-
Increasing TLB Reach Using Superpages Backed by Shadow Memory
-
June
-
M.R. Swanson, L.B. Stoller, and J.B. Carter, "Increasing TLB Reach Using Superpages Backed by Shadow Memory," Proc. 25th Ann. Int'l Symp. Computer Architecture, pp. 204-213, June 1998.
-
(1998)
Proc. 25th Ann. Int'l Symp. Computer Architecture
, pp. 204-213
-
-
Swanson, M.R.1
Stoller, L.B.2
Carter, J.B.3
-
50
-
-
0026865523
-
Increasing the Number of Strides for Conflict-Free Vector Access
-
May
-
M. Valero, T. Lang, J.M. Llaberia, M. Peiron, E. Ayguade, and J.J. Navarro, "Increasing the Number of Strides for Conflict-Free Vector Access," Proc. 19th Ann. Int'l Symp. Computer Architecture, pp. 372-381, May 1992.
-
(1992)
Proc. 19th Ann. Int'l Symp. Computer Architecture
, pp. 372-381
-
-
Valero, M.1
Lang, T.2
Llaberia, J.M.3
Peiron, M.4
Ayguade, E.5
Navarro, J.J.6
-
51
-
-
25344457028
-
Conflict-Free Access for Streams in Multi-Module Memories
-
Universitat Politecnica de Catalunya, Barcelona, Spain
-
M. Valero, T. Lang, M. Peiron, and E. Ayguade, "Conflict-Free Access for Streams in Multi-Module Memories," Technical Report UPC-DAC-93-11, Universitat Politecnica de Catalunya, Barcelona, Spain, 1993.
-
(1993)
Technical Report UPC-DAC-93-11
-
-
Valero, M.1
Lang, T.2
Peiron, M.3
Ayguade, E.4
-
52
-
-
0003158656
-
Hitting the Wall: Implications of the Obvious
-
Mar.
-
W.A. Wulf and S.A. McKee, "Hitting the Wall: Implications of the Obvious," Computer Architecture News, vol. 23, no. 1, pp. 20-24, Mar. 1995.
-
(1995)
Computer Architecture News
, vol.23
, Issue.1
, pp. 20-24
-
-
Wulf, W.A.1
McKee, S.A.2
-
53
-
-
0033365581
-
Memory System Support for Image Processing
-
Oct.
-
L. Zhang, J.B. Carter, W.C. Hsieh, and S.A. McKee, "Memory System Support for Image Processing," Proc. 1999 Int'l Conf. Parallel Architectures and Compilation Techniques, pp. 98-107, Oct. 1999.
-
(1999)
Proc. 1999 Int'l Conf. Parallel Architectures and Compilation Techniques
, pp. 98-107
-
-
Zhang, L.1
Carter, J.B.2
Hsieh, W.C.3
McKee, S.A.4