[1] Intel Thread Building Blocks. osstbb.intel.com.
[2] MPI. www.open-mpi.org.
[3] NVidia G80. www.nvidia.com.
[4] OpenMP. www.openmp.org.
[5] RStream Compiler. www.reservoir.com.
[6] T. Barth. Simplified discontinuous Galerkin methods for systems of conservation laws with convex extension. In Discontinuous Galerkin Methods, volume 11 of Lecture Notes in Computational Science and Engineering. Springer-Verlag, Heidelberg, 1999.
[7] Y. Basar and M. Itskov. Constitutive model and finite element formulation for large strain elasto-plastic analysis of shells. In Journal of Computational Mechanics, Jun 1999.
[8] N. Binkert, E. Hallnor, and S. Reinhardt. Network-oriented full system simulation using M5. In CAECW, 2003.
[9] Bratin Saha et al. McRT-STM: A high performance software transactional memory system for a multi-core runtime. In PPoPP, 2006.
[10] Bratin Saha et al. Enabling scalability and performance in a large scale CMP environment. In Eurosys, 2007.
[11] I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream computing on graphics hardware. In SIGGRAPH, 2004.
[12] C. Ranger et al. Evaluating MapReduce for Multicore and Multiprocessor Systems. In HPCA, 2007.
[13] W. Dally, P. Hanrahan, M. Erez, T. J. Knight, F. Labonte, J.-H. Ahn, N. Jayasena, U. J. Kapasi, A. Das, J. Gummaraju, and I. Buck. Merrimac: Supercomputing with streams. In SC, Nov 2003.
[14] A. Das, W. Dally, and P. Mattson. Compiling for Stream Processing. In PACT, 2006.
[15] M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. In PLDI, 1998.
[16] M. Gordon, W. Thies, and S. Amarasinghe. Exploiting coarse-grained task, data, and pipeline parallelism in stream programs. In ASPLOS, 2006.
[17] J. Gummaraju, M. Erez, J. Coburn, M. Rosenblum, and W. Dally. Architectural Support for the Stream Execution Model on General-Purpose Processors. In PACT, 2007.
[19] M. Herlihy and J. E. B. Moss. Transactional memory: Architectural support for lock-free data structures. In ISCA, 1993.
[20] H. P. Hofstee. Power efficient processor architecture and the Cell processor. In HPCA, Feb 2005.
[21] J. Leverich et al. Comparing Memory Systems for Chip Multiprocessors. In ISCA, 2007.
[22] K. Fatahalian et al. Sequoia: Programming the Memory Hierarchy. In SC, Nov 2006.
[23] K. Mahesh et al. Large eddy simulation of reacting turbulent flows in complex geometries. ASME J. of Applied Mechanics, May 2006.
[25] U. Kapasi, W. Dally, S. Rixner, J. Owens, and B. Khailany. The Imagine stream processor. In ICCD, Sep 2002.
[26] F. Labonte, P. Mattson, I. Buck, C. Kozyrakis, and M. Horowitz. The Stream Virtual Machine. In PACT, 2004.
[27] M. B. Taylor et al. The Raw microprocessor: A computational fabric for software circuits and general-purpose programs. IEEE Micro, 22:25-35, March 2002.
[28] M. Erez, J. Ahn, J. Gummaraju, M. Rosenblum, and W. Dally. Executing Irregular Scientific Applications on Stream Architectures. In ICS, 2007.
[29] M. Gordon et al. A Stream Compiler for Communication-Exposed Architectures. In ASPLOS, 2002.
[30] M. Houston et al. A Portable Run-time Interface for Multi-level Memory Hierarchies. In PPoPP, 2008.
[31] M. Isard et al. Dryad: Distributed Data Parallel Programs from Sequential Building Blocks. In Eurosys, 2007.
[32] M. D. McCool. Data-parallel programming on Cell BE and the GPU using the Rapidmind development platform. In GSPx Multicore Applications Conference, 2006.
[33] P. Charles et al. X10: An object-oriented approach to non-uniform cluster computing. In OOPSLA, 2005.
[34] T. Knight et al. Sequoia: Programming the Memory Hierarchy. In PPoPP, 2007.
[35] D. Tam, R. Azimi, and M. Stumm. Thread Clustering: A Share-aware Scheduling on SMP-CMP-SMT Multiprocessors. In EuroSys, 2007.
[36] D. Tarditi, S. Puri, and J. Oglesby. ACCELERATOR: Using data-parallelism to program GPUs for general-purpose uses. In ASPLOS, 2006.
[38] R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In SC, 2002.
[40] D. Zhang, Q. Li, R. Rabbah, and S. Amarasinghe. A Lightweight Streaming Layer for Multicore Execution. In Workshop on Design, Architecture, and Simulation of Chip Multiprocessors, Dec 2007.