-
1
-
-
4544293938
-
Future microprocessors and off-chip SOP interconnect
-
May
-
H. P. Hofstee. Future microprocessors and off-chip SOP interconnect. IEEE Trans. on Advanced Packaging, 27(2), May 2004.
-
(2004)
IEEE Trans. on Advanced Packaging
, vol.27
, Issue.2
-
-
Hofstee, H.P.1
-
2
-
-
0016026944
-
The parallel execution of DO loops
-
L. Lamport. The parallel execution of DO loops. Communications of ACM, 17(2), 1974.
-
(1974)
Communications of ACM
, vol.17
, Issue.2
-
-
Lamport, L.1
-
3
-
-
67650787207
-
Interprocedural dependence analysis and parallelization
-
M. Burke and R. Cytron. Interprocedural dependence analysis and parallelization. PLDI, 1986.
-
(1986)
PLDI
-
-
Burke, M.1
Cytron, R.2
-
5
-
-
0030645995
-
Maximizing parallelism and minimizing synchronization with affine transforms
-
ACM
-
A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. Parallel Computing, ACM, 1997.
-
(1997)
Parallel Computing
-
-
Lim, A.W.1
Lam, M.S.2
-
6
-
-
0008241757
-
Polaris: A new-generation parallelizing compiler for MPPs
-
Technical report, In CSRD No. 1306. UIUC, 1993
-
D. A. Padua, R. Eigenmann, et al. Polaris: A new-generation parallelizing compiler for MPPs. Technical report, In CSRD No. 1306. UIUC, 1993.
-
-
-
Padua, D.A.1
Eigenmann, R.2
-
7
-
-
0030380793
-
Maximizing multiprocessor performance with the SUIF compiler
-
M. W. Hall, J. M. Anderson, et al. Maximizing multiprocessor performance with the SUIF compiler. Computer, 29(12), 1996.
-
(1996)
Computer
, vol.29
, Issue.12
-
-
Hall, M.W.1
Anderson, J.M.2
-
8
-
-
70450241771
-
-
Open64. http://www.open64.net.
-
Open64
-
-
-
9
-
-
0031622953
-
The implementation of the Cilk-5 multithreaded language
-
F. Matteo, C. Leiserson, and K. Randall. The implementation of the Cilk-5 multithreaded language. PLDI, 1998.
-
(1998)
PLDI
-
-
Matteo, F.1
Leiserson, C.2
Randall, K.3
-
10
-
-
0036959649
-
A stream compiler for communication-exposed architectures
-
M. Gordon, W. Thies, M. Karczmarek, et al. A stream compiler for communication-exposed architectures. ASPLOS, 2002.
-
(2002)
ASPLOS
-
-
Gordon, M.1
Thies, W.2
Karczmarek, M.3
-
11
-
-
70450264346
-
-
P. Husbands Parry, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. SC, 2003.
-
P. Husbands Parry, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. SC, 2003.
-
-
-
-
12
-
-
34748894221
-
Praun. X10: Concurrent programming for modern architectures
-
V. A. Saraswat, V. Sarkar, and C von. Praun. X10: Concurrent programming for modern architectures. PPoPP, 2007.
-
(2007)
PPoPP
-
-
Saraswat, V.A.1
Sarkar, V.2
von, C.3
-
13
-
-
18844446223
-
SUIF Explorer: An interactive and interprocedural parallelizer
-
L. Shih-Wei, D. Amer, et al. SUIF Explorer: An interactive and interprocedural parallelizer. SIGPLAN Not., 34(8), 1999.
-
(1999)
SIGPLAN Not
, vol.34
, Issue.8
-
-
Shih-Wei, L.1
Amer, D.2
-
16
-
-
33847108581
-
Hierarchically tiled arrays for parallelism and locality
-
Jia Guo, Ganesh Bikshandi, et al. Hierarchically tiled arrays for parallelism and locality. IPDPS, 2006.
-
(2006)
IPDPS
-
-
Guo, J.1
Bikshandi, G.2
-
17
-
-
85040171708
-
Semantical interprocedural parallelization: An overview of the PIPS
-
project. ICS
-
F. Irigoin, P. Jouvelot, and R. Triolet. Semantical interprocedural parallelization: an overview of the PIPS project. ICS 1991
-
(1991)
-
-
Irigoin, F.1
Jouvelot, P.2
Triolet, R.3
-
18
-
-
0026191059
-
Interactive parallel programming using the Parascope editor
-
K. Kennedy, K. S. McKinley, and C. W. Tseng. Interactive parallel programming using the Parascope editor. IEEE TPDS, 2(3), 1991.
-
(1991)
IEEE TPDS
, vol.2
, Issue.3
-
-
Kennedy, K.1
McKinley, K.S.2
Tseng, C.W.3
-
19
-
-
0031121224
-
HPFIT: A set of integrated tools for the parallelization of applications using high performance Fortran. part I: HPFIT and the Transtool environment
-
T. Brandes, S. Chaumette, M. C. Counilh et al. HPFIT: a set of integrated tools for the parallelization of applications using high performance Fortran. part I: HPFIT and the Transtool environment. Parallel Comput., 23(1-2), 1997.
-
(1997)
Parallel Comput
, vol.23
, Issue.1-2
-
-
Brandes, T.1
Chaumette, S.2
Counilh, M.C.3
-
20
-
-
33645236572
-
Development and implementation of an interactive parallelization assistance tool for OpenMP: IPat/OMP
-
M. Ishihara, H. Honda, and M. Sato. Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP. IEICE Trans. Inf. Syst., E89-D(2), 2006.
-
(2006)
IEICE Trans. Inf. Syst
, vol.E89-D
, Issue.2
-
-
Ishihara, M.1
Honda, H.2
Sato, M.3
-
22
-
-
67650784128
-
Induction variable substitution and reduction recognition in the Polaris parallelizing compiler
-
Technical Report, UIUC, 1994
-
W. M. Pottenger. Induction variable substitution and reduction recognition in the Polaris parallelizing compiler. Technical Report, UIUC, 1994.
-
-
-
Pottenger, W.M.1
-
23
-
-
0036612639
-
Compile time barrier synchronization minimization
-
M. O'Boyle and E. Stöhr. Compile time barrier synchronization minimization. IEEE TPDS, 13(6), 2002.
-
(2002)
IEEE TPDS
, vol.13
, Issue.6
-
-
O'Boyle, M.1
Stöhr, E.2
-
25
-
-
20344377909
-
Evaluating heuristics in automatically mapping multi-loop applications to FPGAs
-
H. Ziegler and M. Hall. Evaluating heuristics in automatically mapping multi-loop applications to FPGAs. FPGA, 2005.
-
(2005)
FPGA
-
-
Ziegler, H.1
Hall, M.2
-
27
-
-
34548803705
-
A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs
-
R. E. Grant and A. Afsahi. A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs. IPDPS, 2007.
-
(2007)
IPDPS
-
-
Grant, R.E.1
Afsahi, A.2
-
29
-
-
84900342836
-
SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
-
V. Aslot, M. Domeika, et al. SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance. LNCS, 2001.
-
(2001)
LNCS
-
-
Aslot, V.1
Domeika, M.2
-
31
-
-
10444235596
-
Compiler estimation of load imbalance overhead in speculative parallelization
-
J. Dou and M. Cintra. Compiler estimation of load imbalance overhead in speculative parallelization. PACT, 2004.
-
(2004)
PACT
-
-
Dou, J.1
Cintra, M.2
-
32
-
-
67650825867
-
Toward thread-level speculation for coarse-grained parallelism of regular access patterns
-
R. Ramaseshan and F. Mueller. Toward thread-level speculation for coarse-grained parallelism of regular access patterns. MULTIPROG, 2008.
-
(2008)
MULTIPROG
-
-
Ramaseshan, R.1
Mueller, F.2
-
33
-
-
47349089048
-
Revisiting the sequential programming model for multi-core
-
M. Bridges, N. Vachharajani, et al. Revisiting the sequential programming model for multi-core. MICRO, 2007.
-
(2007)
MICRO
-
-
Bridges, M.1
Vachharajani, N.2
-
34
-
-
34548045548
-
-
S. Rus, M. Pennings, and L. Rauchwerger. Sensitivity analysis for automatic parallelization on multi-cores, 2007. ICS, 2007
-
S. Rus, M. Pennings, and L. Rauchwerger. Sensitivity analysis for automatic parallelization on multi-cores, 2007. ICS, 2007
-
-
-
-
35
-
-
67650848208
-
Dynamic dependence analysis: A novel method for data dependence evaluation
-
P. Peterson and D. Padua. Dynamic dependence analysis: A novel method for data dependence evaluation. LCPC, 1992.
-
(1992)
LCPC
-
-
Peterson, P.1
Padua, D.2
-
36
-
-
0038684218
-
The JRPM system for dynamically parallelizing Java programs
-
M. Chen and K. Olukotun. The JRPM system for dynamically parallelizing Java programs. ISCA, 2003.
-
(2003)
ISCA
-
-
Chen, M.1
Olukotun, K.2
-
37
-
-
67650800086
-
Hybrid dependence analysis for automatic parallelization
-
Technical Report, Dept. of CS, Texas A&M U, 2005
-
S. Rus and L. Rauchwerger. Hybrid dependence analysis for automatic parallelization. Technical Report, Dept. of CS, Texas A&M U., 2005.
-
-
-
Rus, S.1
Rauchwerger, L.2
-
38
-
-
35448991274
-
Software behavior oriented parallelization
-
C. Ding, X. Shen, et al. Software behavior oriented parallelization. PLDI, 2007.
-
(2007)
PLDI
-
-
Ding, C.1
Shen, X.2
-
39
-
-
47349118686
-
A practical approach to exploiting coarse-grained pipeline parallelism in C programs
-
W. Thies, V. Chandrasekhar, and S. Amarasinghe. A practical approach to exploiting coarse-grained pipeline parallelism in C programs. MICRO, 2007.
-
(2007)
MICRO
-
-
Thies, W.1
Chandrasekhar, V.2
Amarasinghe, S.3
-
41
-
-
67650060017
-
A compile-time cost model for OpenMP
-
C. Liao and B. Chapman. A compile-time cost model for OpenMP. IPDPS, 2007.
-
(2007)
IPDPS
-
-
Liao, C.1
Chapman, B.2
-
43
-
-
33746276805
-
Runtime empirical selection of loop schedulers on Hyperthreaded SMPs
-
Y. Zhang and M. Voss. Runtime empirical selection of loop schedulers on Hyperthreaded SMPs. IPDPS, 2005.
-
(2005)
IPDPS
-
-
Zhang, Y.1
Voss, M.2
-
44
-
-
0025467711
-
A bridging model for parallel computation
-
L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8), 1990.
-
(1990)
Communications of the ACM
, vol.33
, Issue.8
-
-
Valiant, L.G.1
-
45
-
-
85016062555
-
Optimizing for reduced code space using genetic algorithms
-
K. Cooper, P. Schielke, and D. Subramanian. Optimizing for reduced code space using genetic algorithms. LCTES, 1999.
-
(1999)
LCTES
-
-
Cooper, K.1
Schielke, P.2
Subramanian, D.3
-
46
-
-
4544251830
-
A machine learning approach to automatic production of compiler heuristics
-
A. Monsifrot, F. Bodin, and R. Quiniou. A machine learning approach to automatic production of compiler heuristics. Artificial Intelligence: Methodology, Systems, Applications, 2002.
-
(2002)
Artificial Intelligence: Methodology, Systems, Applications
-
-
Monsifrot, A.1
Bodin, F.2
Quiniou, R.3
-
47
-
-
57349167317
-
Iterative optimization in the polyhedral model: Part II, multidimensional time
-
L.N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos. Iterative optimization in the polyhedral model: part II, multidimensional time. PLDI, 2008.
-
(2008)
PLDI
-
-
Pouchet, L.N.1
Bastoul, C.2
Cohen, A.3
Cavazos, J.4
|