-
1
-
-
0029181140
-
Data and Computation Transformations for Multiprocessors
-
July
-
J.M. Anderson, S.P. Amarasinghe, and M.S. Lam, "Data and Computation Transformations for Multiprocessors," Proc. Fifth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, pp. 166-178, July 1995.
-
(1995)
Proc. Fifth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming
, pp. 166-178
-
-
Anderson, J.M.1
Amarasinghe, S.P.2
Lam, M.S.3
-
4
-
-
0027843603
-
Data Locality and Load Balancing in COOL
-
May
-
R. Chandra, A. Gupta, and J.L. Hennessy, "Data Locality and Load Balancing in COOL," Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, pp. 249-259, May 1993.
-
(1993)
Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming
, pp. 249-259
-
-
Chandra, R.1
Gupta, A.2
Hennessy, J.L.3
-
5
-
-
33749875226
-
The Starfire SMP Interconnect
-
Nov.
-
A. Charlesworth, N. Aneshansley, M. Haakmeester, D. Drogichen, G. Gilbert, R. Williams, and A. Phelps, "The Starfire SMP Interconnect," Proc. Supercomputing '97, Nov. 1997.
-
(1997)
Proc. Supercomputing '97
-
-
Charlesworth, A.1
Aneshansley, N.2
Haakmeester, M.3
Drogichen, D.4
Gilbert, G.5
Williams, R.6
Phelps, A.7
-
7
-
-
0027940807
-
Performance Optimizations, Implementation, and Verification of the SGI Challenge Multiprocessor
-
Jan.
-
M. Galles and E. Williams, "Performance Optimizations, Implementation, and Verification of the SGI Challenge Multiprocessor," Proc. 27th Hawaii Int'l Conf. System Sciences, vol. 1, pp. 134-143, Jan. 1994.
-
(1994)
Proc. 27th Hawaii Int'l Conf. System Sciences
, vol.1
, pp. 134-143
-
-
Galles, M.1
Williams, E.2
-
9
-
-
0347304618
-
Data-Centric Multi-Level Blocking
-
May
-
I. Kodukula, N. Ahmed, and K. Pingali, "Data-Centric Multi-Level Blocking," Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation, pp. 346-357, May 1997.
-
(1997)
Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation
, pp. 346-357
-
-
Kodukula, I.1
Ahmed, N.2
Pingali, K.3
-
10
-
-
0029202473
-
Dynamic Self-Invalidation: Reducing Coherence Overhead in Shared-Memory Multiprocessors
-
A.R. Lebeck and D.A. Wood, "Dynamic Self-Invalidation: Reducing Coherence Overhead in Shared-Memory Multiprocessors," Proc. 22nd Annual Int'l Symp. Computer Architecture, pp. 48-59, 1995.
-
(1995)
Proc. 22nd Annual Int'l Symp. Computer Architecture
, pp. 48-59
-
-
Lebeck, A.R.1
Wood, D.A.2
-
11
-
-
0026137116
-
The Cache Performance and Optimizations of Blocked Algorithms
-
Apr.
-
M.S. Lam, E.E. Rothberg, and M.E. Wolf, "The Cache Performance and Optimizations of Blocked Algorithms," Proc. ASPLOS '91, pp. 63-74, Apr. 1991.
-
(1991)
Proc. ASPLOS '91
, pp. 63-74
-
-
Lam, M.S.1
Rothberg, E.E.2
Wolf, M.E.3
-
12
-
-
85023182347
-
Locality and Loop Scheduling on NUMA Multiprocessors
-
H. Li, S. Tandri, M. Stumm, and K.C. Sevcik, "Locality and Loop Scheduling on NUMA Multiprocessors," Int'l Conf. Parallel Processing, vol. II, pp. 140-144, 1993.
-
(1993)
Int'l Conf. Parallel Processing
, vol.2
, pp. 140-144
-
-
Li, H.1
Tandri, S.2
Stumm, M.3
Sevcik, K.C.4
-
13
-
-
0028419803
-
Using Processor Affinity in Loop Scheduling on Shared-Memory Multiprocessors
-
Apr.
-
E.P. Markatos and T.J. LeBlanc, "Using Processor Affinity in Loop Scheduling on Shared-Memory Multiprocessors," IEEE Trans. Parallel and Distributed Systems, vol. 5, no. 4, pp. 379-400, Apr. 1994.
-
(1994)
IEEE Trans. Parallel and Distributed Systems
, vol.5
, Issue.4
, pp. 379-400
-
-
Markatos, E.P.1
LeBlanc, T.J.2
-
14
-
-
0030190854
-
Improving Data Locality with Loop Transformations
-
July
-
K.S. McKinley, S. Carr, and C.W. Tseng, "Improving Data Locality with Loop Transformations," ACM Trans. Programming Languages and Systems, vol. 18, no. 4, pp. 424-453, July 1996.
-
(1996)
ACM Trans. Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.W.3
-
15
-
-
0442285975
-
A Quantitative Analysis of Loop Nest Locality
-
Oct.
-
K.S. McKinley and O. Temam, "A Quantitative Analysis of Loop Nest Locality," Proc. ASPLOS '96, pp. 94-104, Oct. 1996.
-
(1996)
Proc. ASPLOS '96
, pp. 94-104
-
-
McKinley, K.S.1
Temam, O.2
-
16
-
-
2842513495
-
Thread Scheduling for Cache Locality
-
Oct.
-
J. Philbin, J. Edler, O.J. Anshus, C.C. Douglas, and K. Li, "Thread Scheduling for Cache Locality," Proc. ASPLOS '96, pp. 60-71, Oct. 1996.
-
(1996)
Proc. ASPLOS '96
, pp. 60-71
-
-
Philbin, J.1
Edler, J.2
Anshus, O.J.3
Douglas, C.C.4
Li, K.5
-
17
-
-
0023535689
-
Guided Self-Scheduling: A Practical Self-Scheduling Scheme for Parallel Supercomputers
-
Dec.
-
C. Polychronopoulos and D. Kuck, "Guided Self-Scheduling: A Practical Self-Scheduling Scheme for Parallel Supercomputers," IEEE Trans. Computers, vol. 36, no. 12, pp. 1,425-1,439, Dec. 1987.
-
(1987)
IEEE Trans. Computers
, vol.36
, Issue.12
-
-
Polychronopoulos, C.1
Kuck, D.2
-
18
-
-
0030653560
-
Using the SimOS Machine Simulator to Study Complex Computer Systems
-
M. Rosenblum, E. Bugnion, S. Devine, and S.A. Herrod, "Using the SimOS Machine Simulator to Study Complex Computer Systems," ACM Trans. Modeling and Computer Simulation, vol. 7, no. 1, pp. 78-103, 1997.
-
(1997)
ACM Trans. Modeling and Computer Simulation
, vol.7
, Issue.1
, pp. 78-103
-
-
Rosenblum, M.1
Bugnion, E.2
Devine, S.3
Herrod, S.A.4
-
19
-
-
0030359246
-
The AlphaServer 4100 Cached Processor Module Architecture and Design
-
M.B. Steinman, G.J. Harris, A. Kocev, V.C. Lamere, and R.D. Pannell, "The AlphaServer 4100 Cached Processor Module Architecture and Design," Digital Technical J., vol. 8, no. 4, pp. 21-37, 1996.
-
(1996)
Digital Technical J.
, vol.8
, Issue.4
, pp. 21-37
-
-
Steinman, M.B.1
Harris, G.J.2
Kocev, A.3
Lamere, V.C.4
Pannell, R.D.5
-
20
-
-
0028446907
-
False Sharing and Spatial Locality in Multiprocessor Caches
-
June
-
J. Torrellas, M.S. Lam, and J.L. Hennessy, "False Sharing and Spatial Locality in Multiprocessor Caches," IEEE Trans. Computers, vol. 43, no. 6, pp. 651-663, June 1994.
-
(1994)
IEEE Trans. Computers
, vol.43
, Issue.6
, pp. 651-663
-
-
Torrellas, J.1
Lam, M.S.2
Hennessy, J.L.3
-
21
-
-
0028016738
-
MINT: A Front End for Efficient Simulation of Shared-Memory Multiprocessors
-
Jan.
-
J.E. Veenstra and R.J. Fowler, "MINT: A Front End for Efficient Simulation of Shared-Memory Multiprocessors," Proc. MASCOTS '94, pp. 201-207, Jan. 1994.
-
(1994)
Proc. MASCOTS '94
, pp. 201-207
-
-
Veenstra, J.E.1
Fowler, R.J.2
-
25
-
-
0030846981
-
Adaptively Scheduling Parallel Loops in Distributed Shared-Memory Systems
-
Jan.
-
Y. Yan, C.M. Jin, and X. Zhang, "Adaptively Scheduling Parallel Loops in Distributed Shared-Memory Systems," IEEE Trans. Parallel and Distributed Systems, vol. 8, no. 1, pp. 70-81, Jan. 1997.
-
(1997)
IEEE Trans. Parallel and Distributed Systems
, vol.8
, Issue.1
, pp. 70-81
-
-
Yan, Y.1
Jin, C.M.2
Zhang, X.3
-
26
-
-
0004871610
-
A Memory-Layout Oriented Runtime Technique for Locality Optimization
-
Aug.
-
Y. Yan, X. Zhang, and Z. Zhang, "A Memory-Layout Oriented Runtime Technique for Locality Optimization," Proc. 1998 Int'l Conf. Parallel Processing (ICPP '98), pp. 189-196, Aug. 1998.
-
(1998)
Proc. 1998 Int'l Conf. Parallel Processing (ICPP '98)
, pp. 189-196
-
-
Yan, Y.1
Zhang, X.2
Zhang, Z.3