-
2
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan, "Larrabee: a many-core x86 architecture for visual computing," ACM Trans. Graph., vol. 27, 2008.
-
(2008)
ACM Trans. Graph.
, vol.27
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
Lake, A.8
Sugerman, J.9
Cavin, R.10
Espasa, R.11
Grochowski, E.12
Juan, T.13
Hanrahan, P.14
-
3
-
-
0025467711
-
A bridging model for parallel computation
-
L. G. Valiant, "A bridging model for parallel computation," Communications of the ACM, vol. 33, no. 8, 1990.
-
(1990)
Communications of the ACM
, vol.33
, Issue.8
-
-
Valiant, L.G.1
-
4
-
-
0030083764
-
Treadmarks: Shared memory computing on networks of workstations
-
C. Amza, A. L. Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, W. Yu, and W. Zwaenepoel, "Treadmarks: Shared memory computing on networks of workstations," Computer, vol. 29, no. 2, 1996.
-
(1996)
Computer
, vol.29
, Issue.2
-
-
Amza, C.1
Cox, A.L.2
Dwarkadas, S.3
Keleher, P.4
Lu, H.5
Rajamony, R.6
Yu, W.7
Zwaenepoel, W.8
-
5
-
-
84976770155
-
Munin: Distributed shared memory based on type-specific memory coherence
-
New York, NY, USA: ACM
-
J. K. Bennett, J. B. Carter, and W. Zwaenepoel, "Munin: distributed shared memory based on type-specific memory coherence," PPoPP'90. New York, NY, USA: ACM, 1990, pp. 168-176.
-
(1990)
PPoPP'90
, pp. 168-176
-
-
Bennett, J.K.1
Carter, J.B.2
Zwaenepoel, W.3
-
6
-
-
70450237431
-
Rigel: An architecture and scalable programming interface for a 1000-core accelerator
-
June
-
J. H. Kelm, D. R. Johnson, M. R. Johnson, N. C. Crago, W. Tuohy, A. Mahesri, S. S. Lumetta, M. I. Frank, and S. J. Patel, "Rigel: An architecture and scalable programming interface for a 1000-core accelerator," ISCA'09, June 2009.
-
(2009)
ISCA'09
-
-
Kelm, J.H.1
Johnson, D.R.2
Johnson, M.R.3
Crago, N.C.4
Tuohy, W.5
Mahesri, A.6
Lumetta, S.S.7
Frank, M.I.8
Patel, S.J.9
-
7
-
-
66749170578
-
Tradeoffs in designing accelerator architectures for visual computing
-
A. Mahesri, D. Johnson, N. Crago, and S. J. Patel, "Tradeoffs in designing accelerator architectures for visual computing," MICRO'08, 2008.
-
(2008)
MICRO'08
-
-
Mahesri, A.1
Johnson, D.2
Crago, N.3
Patel, S.J.4
-
8
-
-
0347507496
-
The implementation of the Cilk-5 multithreaded language
-
M. Frigo, C. E. Leiserson, and K. H. Randall, "The implementation of the Cilk-5 multithreaded language," SIGPLAN Not., vol. 33, no. 5, 1998.
-
(1998)
SIGPLAN Not.
, vol.33
, Issue.5
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
9
-
-
78651550268
-
Scalable parallel programming with CUDA
-
J. Nickolls, I. Buck, M. Garland, and K. Skadron, "Scalable parallel programming with CUDA," Queue, vol. 6, no. 2, 2008.
-
(2008)
Queue
, vol.6
, Issue.2
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
10
-
-
70349100958
-
-
1st ed., Khronos OpenCL Working Group, December
-
OpenCL Specification, 1st ed., Khronos OpenCL Working Group, December 2008.
-
(2008)
OpenCL Specification
-
-
-
13
-
-
33749064644
-
Recognition, mining and synthesis moves computers to the era of tera
-
Feb.
-
P. Dubey, "Recognition, mining and synthesis moves computers to the era of tera," Technology@Intel Magazine, Feb. 2005.
-
(2005)
Technology@Intel Magazine
-
-
Dubey, P.1
-
14
-
-
35348826095
-
Physical simulation for animation and visual effects: Parallelization and characterization for chip multiprocessors
-
C. J. Hughes, R. Grzeszczuk, E. Sifakis, D. Kim, S. Kumar, A. P. Selle, J. Chhugani, M. Holliman, and Y.-K. Chen, "Physical simulation for animation and visual effects: parallelization and characterization for chip multiprocessors," ISCA'07, 2007.
-
(2007)
ISCA'07
-
-
Hughes, C.J.1
Grzeszczuk, R.2
Sifakis, E.3
Kim, D.4
Kumar, S.5
Selle, A.P.6
Chhugani, J.7
Holliman, M.8
Chen, Y.-K.9
-
15
-
-
51549095074
-
-
Princeton University, Tech. Rep. TR-81108, January
-
C. Bienia, S. Kumar, J. P. Singh, and K. Li, "The PARSEC benchmark suite: Characterization and architectural implications," Princeton University, Tech. Rep. TR-81108, January 2008.
-
(2008)
The PARSEC Benchmark Suite: Characterization and Architectural Implications
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
16
-
-
70449675556
-
The ALPBench benchmark suite for complex multimedia applications
-
Oct.
-
M.-L. Li, R. Sasanka, S. Adve, Y.-K. Chen, and E. Debes, "The ALPBench benchmark suite for complex multimedia applications," IWCS'05, Oct. 2005.
-
(2005)
IWCS'05
-
-
Li, M.-L.1
Sasanka, R.2
Adve, S.3
Chen, Y.-K.4
Debes, E.5
-
17
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W.-m. W. Hwu, "Optimization principles and application performance evaluation of a multithreaded GPU using CUDA," in PPoPP'08, 2008.
-
(2008)
PPoPP'08
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.-M.W.6
-
18
-
-
35348861326
-
Comparing memory systems for chip multiprocessors
-
J. Leverich, H. Arakida, A. Solomatnikov, A. Firoozshahian, M. Horowitz, and C. Kozyrakis, "Comparing memory systems for chip multiprocessors," ISCA'07, 2007, pp. 358-368.
-
(2007)
ISCA'07
, pp. 358-368
-
-
Leverich, J.1
Arakida, H.2
Solomatnikov, A.3
Firoozshahian, A.4
Horowitz, M.5
Kozyrakis, C.6
-
19
-
-
0030402378
-
Scope consistency: A bridge between release consistency and entry consistency
-
L. Iftode, J. P. Singh, and K. Li, "Scope consistency: A bridge between release consistency and entry consistency," SPAA'96, 1996, pp. 277-287.
-
(1996)
SPAA'96
, pp. 277-287
-
-
Iftode, L.1
Singh, J.P.2
Li, K.3
-
20
-
-
0027307267
-
The midway distributed shared memory system
-
Feb
-
B. Bershad, M. Zekauskas, and W. Sawdon, "The midway distributed shared memory system," Compcon Spring '93, Digest of Papers, pp. 528-537, Feb. 1993.
-
(1993)
Compcon Spring '93, Digest of Papers
, pp. 528-537
-
-
Bershad, B.1
Zekauskas, M.2
Sawdon, W.3
-
21
-
-
0027699767
-
Cooperative shared memory: Software and hardware for scalable multiprocessors
-
M. D. Hill, J. R. Larus, S. K. Reinhardt, and D. A. Wood, "Cooperative shared memory: software and hardware for scalable multiprocessors," ACM Trans. Comput. Syst., vol. 11, no. 4, 1993.
-
(1993)
ACM Trans. Comput. Syst.
, vol.11
, Issue.4
-
-
Hill, M.D.1
Larus, J.R.2
Reinhardt, S.K.3
Wood, D.A.4
-
22
-
-
0029721693
-
Dag-consistent distributed shared memory
-
R. D. Blumofe, M. Frigo, C. F. Joerg, C. E. Leiserson, and K. H. Randall, "Dag-consistent distributed shared memory," IPPS'96, 1996, pp. 132-141.
-
(1996)
IPPS'96
, pp. 132-141
-
-
Blumofe, R.D.1
Frigo, M.2
Joerg, C.F.3
Leiserson, C.E.4
Randall, K.H.5
-
23
-
-
35348855586
-
Carbon: Architectural support for fine-grained parallelism on chip multiprocessors
-
S. Kumar, C. J. Hughes, and A. Nguyen, "Carbon: architectural support for fine-grained parallelism on chip multiprocessors," ISCA'07, 2007.
-
(2007)
ISCA'07
-
-
Kumar, S.1
Hughes, C.J.2
Nguyen, A.3
-
24
-
-
0004029273
-
Cache consistency and sequential consistency
-
March
-
J. Goodman, "Cache consistency and sequential consistency," SCI Working Grp., Tech. Rep. 61, March 1989.
-
(1989)
SCI Working Grp., Tech. Rep.
, no. 61
-
-
Goodman, J.1
-
25
-
-
0018518477
-
How to make a multiprocessor computer that correctly executes multiprocess programs
-
September
-
L. Lamport, "How to make a multiprocessor computer that correctly executes multiprocess programs," IEEE Transactions on Computers, vol. C-28, no. 9, pp. 690-691, September 1979.
-
(1979)
IEEE Transactions on Computers
, vol.C-28
, Issue.9
, pp. 690-691
-
-
Lamport, L.1
-
26
-
-
33749137885
-
-
Version 9, SPARC International Inc., September
-
SPARC Architecture Manual, Version 9, SPARC International Inc., September 2000.
-
(2000)
SPARC Architecture Manual
-
-
-
27
-
-
0025433676
-
Weak ordering-a new definition
-
S. V. Adve and M. D. Hill, "Weak ordering-a new definition," ISCA'90, 1990, pp. 2-14.
-
(1990)
ISCA'90
, pp. 2-14
-
-
Adve, S.V.1
Hill, M.D.2
-
28
-
-
0028115367
-
How to get good performance from the CM-5 data network
-
E. A. Brewer and B. C. Kuszmaul, "How to get good performance from the CM-5 data network," in IPPS'94, 1994, pp. 858-867.
-
(1994)
IPPS'94
, pp. 858-867
-
-
Brewer, E.A.1
Kuszmaul, B.C.2