-
2
-
-
0028744946
-
An Efficient Algorithm for the Run-Time Parallelization of DOACROSS Loops
-
Washington DC
-
D.-K. Chen, J. Torrellas, and P.-C. Yew. An Efficient Algorithm for the Run-Time Parallelization of DOACROSS Loops. In Supercomputing Conference, pages 518-527, Washington DC, 1994.
-
(1994)
Supercomputing Conference
, pp. 518-527
-
-
Chen, D.-K.1
Torrellas, J.2
Yew, P.-C.3
-
3
-
-
17244365237
-
Improving Cache Performance in Dynamic Applications through Data and Computation Reorganization at Run Time
-
Atlanta, GA
-
C. Ding and K. Kennedy. Improving Cache Performance in Dynamic Applications through Data and Computation Reorganization at Run Time. In ACM SIGPLAN'99 Conference on Programming Language Design and Implementation, pages 229-241, Atlanta, GA, 1999.
-
(1999)
ACM SIGPLAN'99 Conference on Programming Language Design and Implementation
, pp. 229-241
-
-
Ding, C.1
Kennedy, K.2
-
5
-
-
78650416004
-
Techniques for Speculative Run-Time Parallelization of Loops
-
Orlando, FL
-
M. Gupta and R. Nim. Techniques for Speculative Run-Time Parallelization of Loops. In Supercomputing Conference, Orlando, FL, 1998.
-
(1998)
Supercomputing Conference
-
-
Gupta, M.1
Nim, R.2
-
6
-
-
0034501403
-
Efficient Compiler and Run-Time Support for Parallel Irregular Reductions
-
H. Han and C.-W. Tseng. Efficient Compiler and Run-Time Support for Parallel Irregular Reductions. Parallel Computing, 26(13-14):1861-1887, 2000.
-
(2000)
Parallel Computing
, vol.26
, Issue.13-14
, pp. 1861-1887
-
-
Han, H.1
Tseng, C.-W.2
-
8
-
-
1642288498
-
Improving Fine-Grained Irregular Shared-Memory Benchmarks by Data Reordering
-
Dallas, TX
-
Y. C. Hu, A. L. Cox, and W. Zwaenepoel. Improving Fine-Grained Irregular Shared-Memory Benchmarks by Data Reordering. In Supercomputing Conference, Dallas, TX, 2000.
-
(2000)
Supercomputing Conference
-
-
Hu, Y.C.1
Cox, A.L.2
Zwaenepoel, W.3
-
9
-
-
0032318285
-
Improving Locality Using Loop and Data Transformations in an Integrated Framework
-
Dallas, TX
-
st Annual IEEE/ACM Int'l Symposium on Microarchitecture, pages 285-297, Dallas, TX, 1998.
-
(1998)
st Annual IEEE/ACM Int'l Symposium on Microarchitecture
, pp. 285-297
-
-
Kandemir, M.T.1
Choudhary, A.N.2
Ramanujam, J.3
Banerjee, P.4
-
10
-
-
0032131147
-
A Fast and High Quality Multi-level Scheme for Partitioning Irregular Graphs
-
G. Karypis and V. Kumar. A Fast and High Quality Multi-level Scheme for Partitioning Irregular Graphs. SIAM Journal on Scientific Computing, 20(1):359-392, 1999.
-
(1999)
SIAM Journal on Scientific Computing
, vol.20
, Issue.1
, pp. 359-392
-
-
Karypis, G.1
Kumar, V.2
-
13
-
-
0030190854
-
Improving Data Locality with Loop Transformations
-
K. S. McKinley, S. Carr, and C.-W. Tseng. Improving Data Locality with Loop Transformations. ACM Trans. on Programming Languages and Systems, 18(4):424-453, 1996.
-
(1996)
ACM Trans. on Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
14
-
-
0032684978
-
Improving Memory Hierarchy Performance for Irregular Applications
-
Rhodes, Greece
-
J. M. Mellor-Crummey, D. B. Whalley, and K. Kennedy. Improving Memory Hierarchy Performance for Irregular Applications. In ACM Int'l Conference on Supercomputing, pages 425-433, Rhodes, Greece, 1999.
-
(1999)
ACM Int'l Conference on Supercomputing
, pp. 425-433
-
-
Mellor-Crummey, J.M.1
Whalley, D.B.2
Kennedy, K.3
-
15
-
-
0023538229
-
Compiler Algorithms for Synchronization
-
S. P. Midkiff and D. A. Padua. Compiler Algorithms for Synchronization. IEEE Transactions on Computers, 36(12):1485-1495, 1987.
-
(1987)
IEEE Transactions on Computers
, vol.36
, Issue.12
, pp. 1485-1495
-
-
Midkiff, S.P.1
Padua, D.A.2
-
18
-
-
0026152428
-
Run-Time Parallelization and Scheduling of Loops
-
J. H. Saltz, R. Mirchandaney, and K. Crowley. Run-Time Parallelization and Scheduling of Loops. IEEE Transactions on Computers, 40(5):603-612, 1991.
-
(1991)
IEEE Transactions on Computers
, vol.40
, Issue.5
, pp. 603-612
-
-
Saltz, J.H.1
Mirchandaney, R.2
Crowley, K.3
-
22
-
-
0035335764
-
Time Stamp Algorithms for Runtime Parallelization of DOACROSS Loops with Dynamic Dependences
-
C. Xu and V. Chaudhary. Time Stamp Algorithms for Runtime Parallelization of DOACROSS Loops with Dynamic Dependences. IEEE Transactions on Parallel and Distributed Systems, 12(5):433-450, 2001.
-
(2001)
IEEE Transactions on Parallel and Distributed Systems
, vol.12
, Issue.5
, pp. 433-450
-
-
Xu, C.1
Chaudhary, V.2
-
23
-
-
0023362714
-
A Scheme to Enforce Data Dependence on Large Multiprocessor Systems
-
C.-Q. Zhu and P.-C. Yew. A Scheme to Enforce Data Dependence on Large Multiprocessor Systems. IEEE Transactions on Software Engineering, 13(6):726-739, 1987.
-
(1987)
IEEE Transactions on Software Engineering
, vol.13
, Issue.6
, pp. 726-739
-
-
Zhu, C.-Q.1
Yew, P.-C.2
|