-
1
-
-
0036590708
-
The data locality of work stealing
-
U. A. Acar, G. E. Blelloch, and R. D. Blumofe. The data locality of work stealing. Theory of Computing Systems, 35(3):321-347, 2002.
-
(2002)
Theory of Computing Systems
, vol.35
, Issue.3
, pp. 321-347
-
-
Acar, U.A.1
Blelloch, G.E.2
Blumofe, R.D.3
-
2
-
-
0024656760
-
An analytical cache model
-
A. Agarwal, M. Horowitz, and J. L. Hennessy. An analytical cache model. A CM Trans. on Computer Systems, 7(2): 184~215, 1989.
-
(1989)
A CM Trans. on Computer Systems
, vol.7
, Issue.2
, pp. 184-215
-
-
Agarwal, A.1
Horowitz, M.2
Hennessy, J.L.3
-
3
-
-
0036038481
-
Cache-oblivious priority queue and graph algorithm applications
-
May
-
L. Arge, M. A. Bender, E. D. Demaine, B. Holland-Minkley, and J. I. Munro. Cache-oblivious priority queue and graph algorithm applications. In Proc. 34th ACM Symp. on Theory of Computing (STOC), pages 268-276, May 2002.
-
(2002)
Proc. 34th ACM Symp. on Theory of Computing (STOC)
, pp. 268-276
-
-
Arge, L.1
Bender, M.A.2
Demaine, E.D.3
Holland-Minkley, B.4
Munro, J.I.5
-
4
-
-
0033722744
-
Piranha: A scalable architecture based on single-chip multiprocessing
-
June
-
L. A. Barroso, K. Gharachorloo, R. McNamara, A. Nowatzyk, S. Qadeer, B. Sano, S. Smith, R. Stets, and B. Verghese. Piranha: A scalable architecture based on single-chip multiprocessing. In Proc. 27th ACM International Symp. on Computer Architecture (ISCA), pages 282-293, June 2000.
-
(2000)
Proc. 27th ACM International Symp. on Computer Architecture (ISCA)
, pp. 282-293
-
-
Barroso, L.A.1
Gharachorloo, K.2
McNamara, R.3
Nowatzyk, A.4
Qadeer, S.5
Sano, B.6
Smith, S.7
Stets, R.8
Verghese, B.9
-
5
-
-
0033713203
-
Application-controlled paging for a shared cache
-
R. D. Barve, E. F. Grove, and J. S. Vitter. Application-controlled paging for a shared cache. SIAM Journal on Computing, 29(4):1290-1303, 2000.
-
(2000)
SIAM Journal on Computing
, vol.29
, Issue.4
, pp. 1290-1303
-
-
Barve, R.D.1
Grove, E.F.2
Vitter, J.S.3
-
6
-
-
0003003638
-
A study of replacment algorithms for virtual storage computers
-
L. A. Belady. A study of replacment algorithms for virtual storage computers. IBM Systems Journal, 5(2):78-101, 1966.
-
(1966)
IBM Systems Journal
, vol.5
, Issue.2
, pp. 78-101
-
-
Belady, L.A.1
-
7
-
-
0003575841
-
Provably efficient scheduling for languages with fine-grained parallelism
-
G. E. Blelloch, P. B. Gibbons, and Y. Matias. Provably efficient scheduling for languages with fine-grained parallelism. Journal of the ACM, 46(2):281-321, 1999.
-
(1999)
Journal of the ACM
, vol.46
, Issue.2
, pp. 281-321
-
-
Blelloch, G.E.1
Gibbons, P.B.2
Matias, Y.3
-
8
-
-
0030707347
-
Space-efficient scheduling of parallelism with synchronization variables
-
June
-
G. E. Blelloch, P. B. Gibbons, Y. Matias, and G. J. Narlikar. Space-efficient scheduling of parallelism with synchronization variables. In Proc. 9th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 12-23, June 1997.
-
(1997)
Proc. 9th ACM Symp. on Parallel Algorithms and Architectures (SPAA)
, pp. 12-23
-
-
Blelloch, G.E.1
Gibbons, P.B.2
Matias, Y.3
Narlikar, G.J.4
-
9
-
-
0030387154
-
An analysis of dag-consistent distributed shared-memory algorithms
-
June
-
R. D. Blumofe, M. Frigo, C. F. Joerg, C. E. Leiserson, and K. H. Randall. An analysis of dag-consistent distributed shared-memory algorithms. In Proc. 8th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 297-308, June 1996.
-
(1996)
Proc. 8th ACM Symp. on Parallel Algorithms and Architectures (SPAA)
, pp. 297-308
-
-
Blumofe, R.D.1
Frigo, M.2
Joerg, C.F.3
Leiserson, C.E.4
Randall, K.H.5
-
10
-
-
0000269759
-
Scheduling multithreaded computations by work stealing
-
R. D. Blumofe and C. E. Leiserson. Scheduling multithreaded computations by work stealing. Journal of the ACM, 46(5):720-748, 1999.
-
(1999)
Journal of the ACM
, vol.46
, Issue.5
, pp. 720-748
-
-
Blumofe, R.D.1
Leiserson, C.E.2
-
11
-
-
0032095557
-
Performance of shared caches on multithreaded architectures
-
Y.-Y. Chen, J.-K. Peir, and C.-T. King. Performance of shared caches on multithreaded architectures. Journal of Information Science and Engineering, 14(2):499-514, 1998.
-
(1998)
Journal of Information Science and Engineering
, vol.14
, Issue.2
, pp. 499-514
-
-
Chen, Y.-Y.1
Peir, J.-K.2
King, C.-T.3
-
12
-
-
0034832577
-
Low-contention depth-first scheduling of parallel computations with write-once synchronization variables
-
July
-
P. Fatourou. Low-contention depth-first scheduling of parallel computations with write-once synchronization variables. In Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 189-198, July 2001.
-
(2001)
Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA)
, pp. 189-198
-
-
Fatourou, P.1
-
13
-
-
0033350255
-
Cache-oblivious algorithms
-
Oct.
-
M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran. Cache-oblivious algorithms. In Proc. 40th IEEE Symp. on Foundations of Computer Science (FOCS), pages 285-298, Oct. 1999.
-
(1999)
Proc. 40th IEEE Symp. on Foundations of Computer Science (FOCS)
, pp. 285-298
-
-
Frigo, M.1
Leiserson, C.E.2
Prokop, H.3
Ramachandran, S.4
-
14
-
-
0033880036
-
The stanford hydra CMP
-
L. Hammond, B. A. Hubbert, M. Siu, M. K. Prabhu, M. Chen, and K. Olukotun. The Stanford Hydra CMP. IEEE Micro, 20(2):71-84, 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.2
, pp. 71-84
-
-
Hammond, L.1
Hubbert, B.A.2
Siu, M.3
Prabhu, M.K.4
Chen, M.5
Olukotun, K.6
-
15
-
-
0031235242
-
A single-chip multiprocessor
-
L. Hammond, B. Nayfeh, and K. Olukotun. A single-chip multiprocessor. IEEE Computer, 30(9):79-85, 1997.
-
(1997)
IEEE Computer
, vol.30
, Issue.9
, pp. 79-85
-
-
Hammond, L.1
Nayfeh, B.2
Olukotun, K.3
-
16
-
-
84958948520
-
Competitive analysis of paging
-
Springer. LNCS
-
S. Irani. Competitive analysis of paging. In Online Algorithms. Springer, 1998. LNCS, 1442:52-73.
-
(1998)
Online Algorithms
, vol.1442
, pp. 52-73
-
-
Irani, S.1
-
17
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
May
-
N. P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In Proc. 17th ACM International Symp. on Computer Architecture (ISCA), pages 364-373, May 1990.
-
(1990)
Proc. 17th ACM International Symp. on Computer Architecture (ISCA)
, pp. 364-373
-
-
Jouppi, N.P.1
-
19
-
-
0001087280
-
Hyper-threading technology architecture and microarchitecture, white paper
-
Feb.
-
D. T. Marr, F. Binns, D. L. Hill, G. Hinton, D. A. Koufaty, J. A. Miller, and M. Upton. Hyper-threading technology architecture and microarchitecture, white paper. Intel Technical Journal, 6(1), Feb. 2002.
-
(2002)
Intel Technical Journal
, vol.6
, Issue.1
-
-
Marr, D.T.1
Binns, F.2
Hill, D.L.3
Hinton, G.4
Koufaty, D.A.5
Miller, J.A.6
Upton, M.7
-
20
-
-
0034823696
-
Towards a first vertical prototyping of an extremely fine-grained parallel programming approach
-
July
-
D. Naishlos, J. Nuzman, C.-W. Tseng, and U. Vishkin. Towards a first vertical prototyping of an extremely fine-grained parallel programming approach. In Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 93-102, July 2001.
-
(2001)
Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA)
, pp. 93-102
-
-
Naishlos, D.1
Nuzman, J.2
Tseng, C.-W.3
Vishkin, U.4
-
21
-
-
0036489340
-
Scheduling threads for low space requirement and good locality
-
G. J. Narlikar. Scheduling threads for low space requirement and good locality. Theory of Computing Systems, 35(2):151-187, 2002.
-
(2002)
Theory of Computing Systems
, vol.35
, Issue.2
, pp. 151-187
-
-
Narlikar, G.J.1
-
24
-
-
0022012946
-
Amortized efficiency of list update and paging rules
-
D. D. Sleator and R. E. Tarjan. Amortized efficiency of list update and paging rules. Communications of the ACM, 28(2):202-208, 1985.
-
(1985)
Communications of the ACM
, vol.28
, Issue.2
, pp. 202-208
-
-
Sleator, D.D.1
Tarjan, R.E.2
-
26
-
-
1642371317
-
Dynamic partitioning of shared cache memory
-
G. E. Suh, L. Rudolph, and S. Devadas. Dynamic partitioning of shared cache memory. Journal of Supercomputing, 28(1):7-26, 2004.
-
(2004)
Journal of Supercomputing
, vol.28
, Issue.1
, pp. 7-26
-
-
Suh, G.E.1
Rudolph, L.2
Devadas, S.3
-
27
-
-
8344224107
-
Power4 system microarchitecture, technical white paper
-
IBM Server Group, Oct.
-
J. M. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy. Power4 system microarchitecture, technical white paper. Technical Report 20, IBM Server Group, Oct. 2001.
-
(2001)
Technical Report
, vol.20
-
-
Tendler, J.M.1
Dodson, S.2
Fields, S.3
Le, H.4
Sinharoy, B.5
-
29
-
-
0026881152
-
Improving disk cache hit-ratios through cache partitioning
-
D. Thibaut and H. S. Stone. Improving disk cache hit-ratios through cache partitioning. IEEE Transactions on Computers, 41(6):665-676, 1992.
-
(1992)
IEEE Transactions on Computers
, vol.41
, Issue.6
, pp. 665-676
-
-
Thibaut, D.1
Stone, H.S.2
-
30
-
-
0034316177
-
The MAJC architecture: A synthesis of parallelism and scalability
-
M. Tremblay, J. Chan, S. Chaudhry, A. W. Conigliaro, and S. S. Tse. The MAJC architecture: A synthesis of parallelism and scalability. IEEE Micro, 20(6): 12-25, 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.6
, pp. 12-25
-
-
Tremblay, M.1
Chan, J.2
Chaudhry, S.3
Conigliaro, A.W.4
Tse, S.S.5
|