-
2
-
-
0029180378
-
The MIT alewife machine: Architecture and performance
-
New York, NY, USA, ACM
-
A. Agarwal, R. Bianchini, D. Chaiken, K. L. Johnson, D. Kranz, J. Kubiatowicz, B.-H. Lim, K. Mackenzie, and D. Yeung. The MIT alewife machine: architecture and performance. In ISCA '95, pages 2-13, New York, NY, USA, 1995. ACM.
-
(1995)
ISCA '95
, pp. 2-13
-
-
Agarwal, A.1
Bianchini, R.2
Chaiken, D.3
Johnson, K.L.4
Kranz, D.5
Kubiatowicz, J.6
Lim, B.-H.7
Mackenzie, K.8
Yeung, D.9
-
3
-
-
0022767619
-
Linda and friends
-
Aug.
-
S. Ahuja, N. Carriero, and D. Gelernter. Linda and friends. IEEE Trans. on Computers, 19(8):26-34, Aug. 1986.
-
(1986)
IEEE Trans. on Computers
, vol.19
, Issue.8
, pp. 26-34
-
-
Ahuja, S.1
Carriero, N.2
Gelernter, D.3
-
4
-
-
35248832108
-
STAPL: An adaptive, generic parallel C++ library
-
P. An, A. Jula, S. Rus, S. Saunders, T. Smith, G. Tanase, N. Thomas, N. Amato, and L. Rauchwerger. STAPL: An adaptive, generic parallel C++ library. LNCS, pages 193-208, 2003.
-
(2003)
LNCS
, pp. 193-208
-
-
An, P.1
Jula, A.2
Rus, S.3
Saunders, S.4
Smith, T.5
Tanase, G.6
Thomas, N.7
Amato, N.8
Rauchwerger, L.9
-
5
-
-
0024131247
-
Distributed programming with shared data
-
Oct
-
H. Bal and A. Tanenbaum. Distributed programming with shared data. In ICCL '88, pages 82-91, Oct 1988.
-
(1988)
ICCL '88
, pp. 82-91
-
-
Bal, H.1
Tanenbaum, A.2
-
6
-
-
70449467862
-
Entering the petaflop era: The architecture and performance of roadrunner
-
Piscataway, NJ, USA, IEEE Press
-
K. J. Barker, K. Davis, A. Hoisie, D. J. Kerbyson, M. Lang, S. Pakin, and J. C. Sancho. Entering the petaflop era: the architecture and performance of roadrunner. In SC'08, pages 1-11, Piscataway, NJ, USA, 2008. IEEE Press.
-
(2008)
SC'08
, pp. 1-11
-
-
Barker, K.J.1
Davis, K.2
Hoisie, A.3
Kerbyson, D.J.4
Lang, M.5
Pakin, S.6
Sancho, J.C.7
-
7
-
-
34548265764
-
Cellss: A programming model for the cell be architecture
-
New York, NY, USA, ACM
-
P. Bellens, J. M. Perez, R. M. Badia, and J. Labarta. Cellss: a programming model for the cell be architecture. In SC'06, page 86, New York, NY, USA, 2006. ACM.
-
(2006)
SC'06
, pp. 86
-
-
Bellens, P.1
Perez, J.M.2
Badia, R.M.3
Labarta, J.4
-
9
-
-
0024055867
-
Multilanguage parallel programming of heterogeneous machines
-
Aug
-
R. Bisiani and A. Forin. Multilanguage parallel programming of heterogeneous machines. IEEE Trans. on Computers, 37(8):930-945, Aug 1988.
-
(1988)
IEEE Trans. on Computers
, vol.37
, Issue.8
, pp. 930-945
-
-
Bisiani, R.1
Forin, A.2
-
11
-
-
85088003777
-
GPU computing with NVIDIA CUDA
-
New York, NY, USA, ACM
-
I. Buck. GPU computing with NVIDIA CUDA. In SIGGRAPH '07, page 6, New York, NY, USA, 2007. ACM.
-
(2007)
SIGGRAPH '07
, pp. 6
-
-
Buck, I.1
-
12
-
-
84883300486
-
Implementation and performance of munin
-
New York, NY, USA, ACM
-
J. B. Carter, J. K. Bennett, and W. Zwaenepoel. Implementation and performance of munin. In SOSP '91, pages 152-164, New York, NY, USA, 1991. ACM.
-
(1991)
SOSP '91
, pp. 152-164
-
-
Carter, J.B.1
Bennett, J.K.2
Zwaenepoel, W.3
-
13
-
-
17144409441
-
Modular interprocedural pointer analysis using access paths: Design, implementation, and evaluation
-
New York, NY, USA, ACM
-
B.-C. Cheng and W. W. Hwu. Modular interprocedural pointer analysis using access paths: design, implementation, and evaluation. In PLDI '00, pages 57-69, New York, NY, USA, 2000. ACM.
-
(2000)
PLDI '00
, pp. 57-69
-
-
Cheng, B.-C.1
Hwu, W.W.2
-
14
-
-
84992015947
-
Parallel programming using skeleton functions
-
London, UK, Springer-Verlag
-
J. Darlington, A. J. Field, P. G. Harrison, P. H. J. Kelly, D. W. N. Sharp, and Q. Wu. Parallel programming using skeleton functions. In PARLE'93, pages 146-160, London, UK, 1993. Springer-Verlag.
-
(1993)
PARLE'93
, pp. 146-160
-
-
Darlington, J.1
Field, A.J.2
Harrison, P.G.3
Kelly, P.H.J.4
Sharp, D.W.N.5
Wu, Q.6
-
15
-
-
0043207371
-
The clouds distributed operating system
-
Nov
-
P. Dasgupta, J. LeBlanc, R.J., M. Ahamad, and U. Ramachandran. The clouds distributed operating system. IEEE Trans. on Computers, 24(11):34-44, Nov 1991.
-
(1991)
IEEE Trans. on Computers
, vol.24
, Issue.11
, pp. 34-44
-
-
Dasgupta, P.1
Leblanc, J.2
J, R.3
Ahamad, M.4
Ramachandran, U.5
-
16
-
-
84947663399
-
An analysis of memnet - An experiment in high-speed shared-memory local networking
-
New York, NY, USA, ACM
-
G. Delp, A. Sethi, and D. Farber. An analysis of memnet - an experiment in high-speed shared-memory local networking. In SIGCOMM '88, pages 165-174, New York, NY, USA, 1988. ACM.
-
(1988)
SIGCOMM '88
, pp. 165-174
-
-
Delp, G.1
Sethi, A.2
Farber, D.3
-
17
-
-
0024936732
-
Mirage: A coherent distributed shared memory design
-
New York, NY, USA, ACM
-
B. Fleisch and G. Popek. Mirage: a coherent distributed shared memory design. In SOSP '89, pages 211-223, New York, NY, USA, 1989. ACM.
-
(1989)
SOSP '89
, pp. 211-223
-
-
Fleisch, B.1
Popek, G.2
-
18
-
-
0027148844
-
The KSR 1: Bridging the gap between shared memory and MPPs
-
Feb
-
S. Frank, I. Burkhardt, H., and J. Rothnie. The KSR 1: bridging the gap between shared memory and MPPs. In Compcon Spring '93, pages 285-294, Feb 1993.
-
(1993)
Compcon Spring '93
, pp. 285-294
-
-
Frank, S.1
Burkhardt H, I.2
Rothnie, J.3
-
19
-
-
57349092386
-
CUBA: An architecture for efficient cpu/co-processor data communication
-
New York, NY, USA, ACM
-
I. Gelado, J. H. Kelm, S. Ryoo, S. S. Lumetta, N. Navarro, and W.W. Hwu. CUBA: an architecture for efficient cpu/co-processor data communication. In ICS '08, pages 299-308, New York, NY, USA, 2008. ACM.
-
(2008)
ICS '08
, pp. 299-308
-
-
Gelado, I.1
Kelm, J.H.2
Ryoo, S.3
Lumetta, S.S.4
Navarro, N.5
Hwu, W.W.6
-
20
-
-
0026818115
-
The scalable coherent interface and related standards projects
-
D. B. Gustavson. The scalable coherent interface and related standards projects. IEEE Micro, 12(1):10-22, 1992.
-
(1992)
IEEE Micro
, vol.12
, Issue.1
, pp. 10-22
-
-
Gustavson, D.B.1
-
21
-
-
1642364107
-
The chimaera reconfigurable functional unit
-
Feb.
-
S. H. Hauck, T. W. Fry, M. M. Hosler, and J. P. Kao. The chimaera reconfigurable functional unit. IEEE Trans. on VLSI, 12(2):206-217, Feb. 2004.
-
(2004)
IEEE Trans. on VLSI
, vol.12
, Issue.2
, pp. 206-217
-
-
Hauck, S.H.1
Fry, T.W.2
Hosler, M.M.3
Kao, J.P.4
-
22
-
-
0031360911
-
Garp: A MIPS processor with a reconfigurable coprocessor
-
Apr
-
J. R. Hauser and J. Wawrzynek. Garp: a MIPS processor with a reconfigurable coprocessor. In FCCM '97, pages 12-21, Apr 1997.
-
(1997)
FCCM '97
, pp. 12-21
-
-
Hauser, J.R.1
Wawrzynek, J.2
-
23
-
-
84976707130
-
The performance impact of flexibility in the Stanford FLASH multiprocessor
-
New York, NY, USA, ACM
-
M. Heinrich, J. Kuskin, D. Ofelt, J. Heinlein, J. Baxter, J. P. Singh, R. Simoni, K. Gharachorloo, D. Nakahira, M. Horowitz, A. Gupta, M. Rosenblum, and J. Hennessy. The performance impact of flexibility in the Stanford FLASH multiprocessor. In ASPLOS '94, pages 274-285, New York, NY, USA, 1994. ACM.
-
(1994)
ASPLOS '94
, pp. 274-285
-
-
Heinrich, M.1
Kuskin, J.2
Ofelt, D.3
Heinlein, J.4
Baxter, J.5
Singh, J.P.6
Simoni, R.7
Gharachorloo, K.8
Nakahira, D.9
Horowitz, M.10
Gupta, A.11
Rosenblum, M.12
Hennessy, J.13
-
26
-
-
67650692011
-
-
IMPACT Group. Parboil benchmark suite. http://impact.crhc.illinois.edu/ parboil.php.
-
Parboil Benchmark Suite
-
-
-
29
-
-
59049085159
-
Predictive runtime code scheduling for heterogeneous architectures
-
Berlin, Heidelberg, Springer-Verlag
-
V. Jiménez, L. Vilanova, I. Gelado, M. Gil, G. Fursin, and N. Navarro. Predictive runtime code scheduling for heterogeneous architectures. In HiPEAC '09, pages 19-33, Berlin, Heidelberg, 2009. Springer-Verlag.
-
(2009)
HiPEAC '09
, pp. 19-33
-
-
Jiménez, V.1
Vilanova, L.2
Gelado, I.3
Gil, M.4
Fursin, G.5
Navarro, N.6
-
30
-
-
25844503119
-
Introduction to the cell multiprocessor
-
J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer, and D. Shippy. Introduction to the cell multiprocessor. IBM J. Res. Dev., 49(4/5):589-604, 2005.
-
(2005)
IBM J. Res. Dev.
, vol.49
, Issue.4-5
, pp. 589-604
-
-
Kahle, J.A.1
Day, M.N.2
Hofstee, H.P.3
Johns, C.R.4
Maeurer, T.R.5
Shippy, D.6
-
31
-
-
81455130002
-
Treadmarks: Distributed shared memory on standard workstations and operating systems
-
Berkeley, CA, USA, USENIX Association
-
P. Keleher, A. L. Cox, S. Dwarkadas, and W. Zwaenepoel. Treadmarks: distributed shared memory on standard workstations and operating systems. In WTEC'94, pages 10-10, Berkeley, CA, USA, 1994. USENIX Association.
-
(1994)
WTEC'94
, pp. 10-10
-
-
Keleher, P.1
Cox, A.L.2
Dwarkadas, S.3
Zwaenepoel, W.4
-
32
-
-
70450237431
-
Rigel: An architecture and scalable programming interface for a 1000-core accelerator
-
New York, NY, USA, ACM
-
J. H. Kelm, D. R. Johnson, M. R. Johnson, N. C. Crago, W. Tuohy, A. Mahesri, S. S. Lumetta, M. I. Frank, and S. Patel. Rigel: an architecture and scalable programming interface for a 1000-core accelerator. In ISCA '09, pages 140-151, New York, NY, USA, 2009. ACM.
-
(2009)
ISCA '09
, pp. 140-151
-
-
Kelm, J.H.1
Johnson, D.R.2
Johnson, M.R.3
Crago, N.C.4
Tuohy, W.5
Mahesri, A.6
Lumetta, S.S.7
Frank, M.I.8
Patel, S.9
-
33
-
-
0025429467
-
The directory-based cache coherence protocol for the DASH multiprocessor
-
New York, NY, USA, ACM
-
D. Lenoski, J. Laudon, K. Gharachorloo, A. Gupta, and J. Hennessy. The directory-based cache coherence protocol for the DASH multiprocessor. In ISCA '90, pages 148-159, New York, NY, USA, 1990. ACM.
-
(1990)
ISCA '90
, pp. 148-159
-
-
Lenoski, D.1
Laudon, J.2
Gharachorloo, K.3
Gupta, A.4
Hennessy, J.5
-
34
-
-
0024771302
-
Memory coherence in shared virtual memory systems
-
K. Li and P. Hudak. Memory coherence in shared virtual memory systems. ACM Trans. Comput. Syst., 7(4):321-359, 1989.
-
(1989)
ACM Trans. Comput. Syst.
, vol.7
, Issue.4
, pp. 321-359
-
-
Li, K.1
Hudak, P.2
-
35
-
-
44849137198
-
NVIDIA tesla: A unified graphics and computing architecture
-
March-April
-
E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym. NVIDIA tesla: A unified graphics and computing architecture. IEEE Micro, 28(2):39-55, March-April 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
36
-
-
0025627049
-
Merlin: A superglue for multicomputer systems
-
C. Maples and L. Wittie. Merlin: A superglue for multicomputer systems. In Compcon Spring '90, volume 90, pages 73-81, 1990.
-
(1990)
Compcon Spring '90
, vol.90
, pp. 73-81
-
-
Maples, C.1
Wittie, L.2
-
37
-
-
0028732614
-
Global arrays: A portable "shared-memory" programming model for distributed memory computers
-
New York, NY, USA, ACM
-
J. Nieplocha, R. J. Harrison, and R. J. Littlefield. Global arrays: a portable "shared-memory" programming model for distributed memory computers. In SC'94, pages 340-349, New York, NY, USA, 1994. ACM.
-
(1994)
SC'94
, pp. 340-349
-
-
Nieplocha, J.1
Harrison, R.J.2
Littlefield, R.J.3
-
39
-
-
53749108455
-
Accelerator architectures
-
July-Aug.
-
S. Patel and W. W. Hwu. Accelerator architectures. IEEE Micro, 28(4):4-12, July-Aug. 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.4
, pp. 4-12
-
-
Patel, S.1
Hwu, W.W.2
-
40
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan. Larrabee: a many-core x86 architecture for visual computing. ACM Trans. Graph., 27(3):1-15, 2008.
-
(2008)
ACM Trans. Graph.
, vol.27
, Issue.3
, pp. 1-15
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
Lake, A.8
Sugerman, J.9
Cavin, R.10
Espasa, R.11
Grochowski, E.12
Juan, T.13
Hanrahan, P.14
-
41
-
-
0034187952
-
MorphoSys: An integrated reconfigurable system for data-parallel and computation-intensive applications
-
May
-
H. Singh, M.-H. Lee, G. Lu, F. J. Kurdahi, N. Bagherzadeh, and E. M. C. Filho. MorphoSys: an integrated reconfigurable system for data-parallel and computation-intensive applications. IEEE Trans. on Computers, 49(5):465-481, May 2000.
-
(2000)
IEEE Trans. on Computers
, vol.49
, Issue.5
, pp. 465-481
-
-
Singh, H.1
Lee, M.-H.2
Lu, G.3
Kurdahi, F.J.4
Bagherzadeh, N.5
Filho, E.M.C.6
-
42
-
-
0036892941
-
The programming model of ASSIST, an environment for parallel and distributed portable applications
-
DOI 10.1016/S0167-8191(02)00188-6, PII S0167819102001886
-
M. Vanneschi. The programming model of ASSIST, an environment for parallel and distributed portable applications. Parallel Comput., 28(12):1709-1732, 2002. (Pubitemid 35412373)
-
(2002)
Parallel Computing
, vol.28
, Issue.12
, pp. 1709-1732
-
-
Vanneschi, M.1
-
43
-
-
8744241430
-
The MOLEN polymorphic processor
-
S. Vassiliadis, S. Wong, G. Gaydadjiev, K. Bertels, G. Kuzmanov, and E. M. Panainte. The MOLEN polymorphic processor. IEEE Trans. on Computers, 53(11):1363-1375, 2004.
-
(2004)
IEEE Trans. on Computers
, vol.53
, Issue.11
, pp. 1363-1375
-
-
Vassiliadis, S.1
Wong, S.2
Gaydadjiev, G.3
Bertels, K.4
Kuzmanov, G.5
Panainte, E.M.6
-
44
-
-
0009725006
-
Data Diffusion Machine-a scalable shared virtual memory multiprocessor
-
Springer-Verlag
-
D. Warren and S. Haridi. Data Diffusion Machine-a scalable shared virtual memory multiprocessor. In Fifth Generation Computer Systems 1988, page 943. Springer-Verlag, 1988.
-
(1988)
Fifth Generation Computer Systems 1988
, pp. 943
-
-
Warren, D.1
Haridi, S.2
-
45
-
-
0027228907
-
Hardware assist for distributed shared memory
-
May
-
J. Wilson, A.W., J. LaRowe, R.P., and M. Teller. Hardware assist for distributed shared memory. In DCS '03, pages 246-255, May 1993.
-
(1993)
DCS '03
, pp. 246-255
-
-
Wilson, J.1
W, A.2
LaRowe, J.3
P, R.4
Teller, M.5
-
47
-
-
0025532322
-
Extending distributed shared memory to heterogeneous environments
-
May
-
S. Zhou, M. Stumm, and T. McInerney. Extending distributed shared memory to heterogeneous environments. In DCS '90, pages 30-37, May 1990.
-
(1990)
DCS '90
, pp. 30-37
-
-
Zhou, S.1
Stumm, M.2
McInerney, T.3
|