-
1
-
-
84868887384
-
-
www.sun.com/processors/UltraSPARC-T1/, 2007.
-
(2007)
-
-
-
2
-
-
84868882190
-
-
www.tilera.com, 2007.
-
(2007)
-
-
-
6
-
-
0024082546
-
The input/output complexity of sorting and related problems
-
A. Aggarwal and J. S. Vitter. The input/output complexity of sorting and related problems. Communications of the ACM, 31(9), 1988.
-
(1988)
Communications of the ACM
, vol.31
, Issue.9
-
-
Aggarwal, A.1
Vitter, J.S.2
-
7
-
-
0023451961
-
Optimal parallel merging and sorting without memory conflicts
-
S. Akl and N. Santoro. Optimal parallel merging and sorting without memory conflicts. IEEE Transactions on Computers, 36(11), 1987.
-
(1987)
IEEE Transactions on Computers
, vol.36
, Issue.11
-
-
Akl, S.1
Santoro, N.2
-
8
-
-
0028483922
-
The uniform memory hierarchy model of computation
-
12(2/3), Springer
-
B. Alpern, L. Carter, E. Feig, and T. Selker. The uniform memory hierarchy model of computation. Algorithmica, 12(2/3), 1994. Springer.
-
(1994)
Algorithmica
-
-
Alpern, B.1
Carter, L.2
Feig, E.3
Selker, T.4
-
9
-
-
0033722744
-
Piranha: A scalable architecture based on single-chip multiprocessing
-
L. A. Barroso, K. Gharachorloo, R. McNamara, A. Nowatzyk, S. Qadeer, B. Sano, S. Smith, R. Stets, and B. Verghese. Piranha: A scalable architecture based on single-chip multiprocessing. In ACM ISCA, 2000.
-
(2000)
ACM ISCA
-
-
Barroso, L.A.1
Gharachorloo, K.2
McNamara, R.3
Nowatzyk, A.4
Qadeer, S.5
Sano, B.6
Smith, S.7
Stets, R.8
Verghese, B.9
-
10
-
-
35248813384
-
Optimal sparse matrix dense vector multiplication in the I/O-model
-
M. A. Bender, G. S. Brodal, R. Fagerberg, R. Jacob, and E. Vicari. Optimal sparse matrix dense vector multiplication in the I/O-model. In ACM SPAA, 2007.
-
(2007)
ACM SPAA
-
-
Bender, M.A.1
Brodal, G.S.2
Fagerberg, R.3
Jacob, R.4
Vicari, E.5
-
12
-
-
8344240379
-
Effectively sharing a cache among threads
-
G. E. Blelloch and P. B. Gibbons. Effectively sharing a cache among threads. In ACM SPAA, 2004.
-
(2004)
ACM SPAA
-
-
Blelloch, G.E.1
Gibbons, P.B.2
-
13
-
-
0003575841
-
Provably efficient scheduling for languages with fine-grained parallelism
-
G. E. Blelloch, P. B. Gibbons, and Y. Matias. Provably efficient scheduling for languages with fine-grained parallelism. Journal of the ACM, 46(2), 1999.
-
(1999)
Journal of the ACM
, vol.46
, Issue.2
-
-
Blelloch, G.E.1
Gibbons, P.B.2
Matias, Y.3
-
15
-
-
0030387154
-
An analysis of dag-consistent distributed shared-memory algorithms
-
R. D. Blumofe, M. Frigo, C. F. Joerg, C. E. Leiserson, and K. H. Randall. An analysis of dag-consistent distributed shared-memory algorithms. In ACM SPAA, 1996.
-
(1996)
ACM SPAA
-
-
Blumofe, R.D.1
Frigo, M.2
Joerg, C.F.3
Leiserson, C.E.4
Randall, K.H.5
-
16
-
-
0016046965
-
The parallel evaluation of general arithmetic expressions
-
R. Brent. The parallel evaluation of general arithmetic expressions. Journal of the ACM, 21:201-206, 1974.
-
(1974)
Journal of the ACM
, vol.21
, pp. 201-206
-
-
Brent, R.1
-
17
-
-
35248843628
-
SuperMatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures
-
E. Chan, E. S. Quintana-Orti, G. Quintana-Orti, and R. van de Geijn. SuperMatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures. In ACM SPAA, 2007.
-
(2007)
ACM SPAA
-
-
Chan, E.1
Quintana-Orti, E.S.2
Quintana-Orti, G.3
van de Geijn, R.4
-
18
-
-
35248852476
-
Scheduling threads for constructive cache sharing on CMPs
-
S. Chen, P. B. Gibbons, M. Kozuch, V. Liaskovitis, A. Ailamaki, G. E. Blelloch, B. Falsafi, L. Fix, N. Hardavellas, T. C. Mowry, and C. Wilkerson. Scheduling threads for constructive cache sharing on CMPs. In ACM SPAA, 2007.
-
(2007)
ACM SPAA
-
-
Chen, S.1
Gibbons, P.B.2
Kozuch, M.3
Liaskovitis, V.4
Ailamaki, A.5
Blelloch, G.E.6
Falsafi, B.7
Fix, L.8
Hardavellas, N.9
Mowry, T.C.10
Wilkerson, C.11
-
19
-
-
58449123296
-
The cache-oblivious Gaussian elimination paradigm: Theoretical framework, parallelization and experimental evaluation
-
R. Chowdhury and V. Ramachandran. The cache-oblivious Gaussian elimination paradigm: Theoretical framework, parallelization and experimental evaluation. In ACM SPAA, 2007.
-
(2007)
ACM SPAA
-
-
Chowdhury, R.1
Ramachandran, V.2
-
20
-
-
34548334096
-
Performance of multithreaded chip multiprocessors and implications for operating system design
-
A. Fedorova, M. Seltzer, C. Small, and D. Nussbaum. Performance of multithreaded chip multiprocessors and implications for operating system design. In USENIX Ann. Tech. Conf., 2005.
-
(2005)
USENIX Ann. Tech. Conf
-
-
Fedorova, A.1
Seltzer, M.2
Small, C.3
Nussbaum, D.4
-
22
-
-
58449120511
-
The cache complexity of multithreaded cache oblivious algorithms
-
M. Frigo and V. Strumpen. The cache complexity of multithreaded cache oblivious algorithms. In ACM SPAA, 2006.
-
(2006)
ACM SPAA
-
-
Frigo, M.1
Strumpen, V.2
-
23
-
-
58449128303
-
-
M. T. Goodrich, M. Nelson, and N. Sitchinava. Sorting in parallel external-memory multicores. Technical report, U.C. Irvine, 2007.
-
-
-
-
-
24
-
-
0033880036
-
The Stanford Hydra CMP
-
L. Hammond, B. A. Hubbert, M. Siu, M. K. Prabhu, M. Chen, and K. Olukotun. The Stanford Hydra CMP. IEEE Micro, 20(2), 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.2
-
-
Hammond, L.1
Hubbert, B.A.2
Siu, M.3
Prabhu, M.K.4
Chen, M.5
Olukotun, K.6
-
27
-
-
0036489340
-
Scheduling threads for low space requirement and good locality
-
Springer
-
G. J. Narlikar. Scheduling threads for low space requirement and good locality. Theory of Computing Systems, 35(2), 2002. Springer.
-
(2002)
Theory of Computing Systems
, vol.35
, Issue.2
-
-
Narlikar, G.J.1
-
28
-
-
0029666647
-
Evaluation of design alternatives for a multiprocessor microprocessor
-
B. A. Nayfeh, L. Hammond, and K. Olukotun. Evaluation of design alternatives for a multiprocessor microprocessor. In ACM ISCA, 1996.
-
(1996)
ACM ISCA
-
-
Nayfeh, B.A.1
Hammond, L.2
Olukotun, K.3
-
29
-
-
34250487811
-
Gaussian elimination is not optimal
-
Springer
-
V. Strassen. Gaussian elimination is not optimal. Numerische Mathematik, 13(4), 1969. Springer.
-
(1969)
Numerische Mathematik
, vol.13
, Issue.4
-
-
Strassen, V.1
-
30
-
-
34548030923
-
Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors
-
D. Tam, R. Azimi, and M. Stumm. Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors. In ACM EuroSys, 2007.
-
(2007)
ACM EuroSys
-
-
Tam, D.1
Azimi, R.2
Stumm, M.3