-
2
-
-
0031121224
-
HPFIT: A set of integrated tools for the parallelization of applications using High Performance Fortran. Part I: HPFIT and the TransTool environment
-
Environment and tools for parallel scientific computing
-
T. Brandes, S. Chaumette, M. C. Counilh, J. Roman, A. Darte, F. Desprez, and J. C. Mignot. HPFIT: A set of integrated tools for the parallelization of applications using High Performance Fortran. Part I: HPFIT and the TransTool environment. Parallel Computing, 23(1-2):71-87, 1997. Environment and tools for parallel scientific computing.
-
(1997)
Parallel Computing
, vol.23
, Issue.1-2
, pp. 71-87
-
-
Brandes, T.1
Chaumette, S.2
Counilh, M.C.3
Roman, J.4
Darte, A.5
Desprez, F.6
Mignot, J.C.7
-
3
-
-
3142692758
-
Interprocedural dependence analysis and parallelization
-
M. G. Burke and R. K. Cytron. Interprocedural dependence analysis and parallelization. SIGPLAN Not., 39(4):139-154, 2004.
-
(2004)
SIGPLAN Not.
, vol.39
, Issue.4
, pp. 139-154
-
-
Burke, M.G.1
Cytron, R.K.2
-
4
-
-
51549106553
-
MAPS: An integrated framework for MPSoC application parallelization
-
J. Ceng, J. Castrillon, W. Sheng, H. Scharwachter, R. Leupers, G. Ascheid, H. Meyr, T. Isshiki, and H. Kunieda. MAPS: an integrated framework for MPSoC application parallelization. In DAC 2008: Proceedings of the 45th Annual Design Automation Conference. ACM/IEEE, pages 754-759, 2008.
-
(2008)
DAC 2008: Proceedings of the 45th Annual Design Automation Conference. ACM/IEEE
, pp. 754-759
-
-
Ceng, J.1
Castrillon, J.2
Sheng, W.3
Scharwachter, H.4
Leupers, R.5
Ascheid, G.6
Meyr, H.7
Isshiki, T.8
Kunieda, H.9
-
5
-
-
78149273837
-
Open64 compiler infrastructure for emerging multicore/manycore architecture all symposium tutorial
-
S. C. Chan, G. R. Gao, B. Chapman, T. Linthicum, and A. Dasgupta. Open64 compiler infrastructure for emerging multicore/manycore architecture all symposium tutorial. In IPDPS 2008: 22nd IEEE International Symposium on Parallel and Distributed Processing, Miami, FL, USA, 2008.
-
(2008)
IPDPS 2008: 22nd IEEE International Symposium on Parallel and Distributed Processing, Miami, FL, USA
-
-
Chan, S.C.1
Gao, G.R.2
Chapman, B.3
Linthicum, T.4
Dasgupta, A.5
-
6
-
-
0023385308
-
The program dependence graph and its use in optimization
-
J. Ferrante, K. J. Ottenstein, and J. D. Warren. The program dependence graph and its use in optimization. ACM Trans. Program. Lang. Syst., 9(3):319-349, 1987.
-
(1987)
ACM Trans. Program. Lang. Syst.
, vol.9
, Issue.3
, pp. 319-349
-
-
Ferrante, J.1
Ottenstein, K.J.2
Warren, J.D.3
-
7
-
-
0347507496
-
-
New York, NY, USA, ACM
-
M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. volume 33, pages 212-223, New York, NY, USA, 1998. ACM.
-
(1998)
The Implementation of the Cilk-5 Multithreaded Language
, vol.33
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
8
-
-
0036959649
-
A stream compiler for communication-exposed architectures
-
New York, NY, USA, ACM
-
M. I. Gordon, W. Thies, M. Karczmarek, J. Lin, A. S. Meli, A. A. Lamb, C. Leger, J. Wong, H. Hoffmann, D. Maze, and S. Amarasinghe. A stream compiler for communication-exposed architectures. In ASPLOS-X: Proceedings of the 10th international conference on Architectural support for programming languages and operating systems, pages 291-303, New York, NY, USA, 2002. ACM.
-
(2002)
ASPLOS-X: Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 291-303
-
-
Gordon, M.I.1
Thies, W.2
Karczmarek, M.3
Lin, J.4
Meli, A.S.5
Lamb, A.A.6
Leger, C.7
Wong, J.8
Hoffmann, H.9
Maze, D.10
Amarasinghe, S.11
-
9
-
-
78149264907
-
-
volume 0, Los Alamitos, CA, USA, IEEE Computer Society
-
J. Guo, G. Bikshandi, D. Hoeflinger, G. Almasi, B. Fraguela, M. Garzaran, D. Padua, and C. von Praun. Hierarchically tiled arrays for parallelism and locality. volume 0, page 316, Los Alamitos, CA, USA, 2006. IEEE Computer Society.
-
(2006)
Hierarchically Tiled Arrays for Parallelism and Locality
, pp. 316
-
-
Guo, J.1
Bikshandi, G.2
Hoeflinger, D.3
Almasi, G.4
Fraguela, B.5
Garzaran, M.6
Padua, D.7
Von Praun, C.8
-
10
-
-
0030380793
-
Maximizing multiprocessor performance with the SUIF compiler
-
M. W. Hall, J. M. Anderson, S. P. Amarasinghe, B. R. Murphy, S.-W. Liao, E. Bugnion, and M. S. Lam. Maximizing multiprocessor performance with the SUIF compiler. Computer, 29(12):84-89, 1996.
-
(1996)
Computer
, vol.29
, Issue.12
, pp. 84-89
-
-
Hall, M.W.1
Anderson, J.M.2
Amarasinghe, S.P.3
Murphy, B.R.4
Liao, S.-W.5
Bugnion, E.6
Lam, M.S.7
-
11
-
-
1142293067
-
A performance analysis of the Berkeley UPC compiler
-
New York, NY, USA, ACM
-
P. Husbands, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. In ICS '03: Proceedings of the 17th annual international conference on Supercomputing, pages 63-73, New York, NY, USA, 2003. ACM.
-
(2003)
ICS '03: Proceedings of the 17th Annual International Conference on Supercomputing
, pp. 63-73
-
-
Husbands, P.1
Iancu, C.2
Yelick, K.3
-
12
-
-
85040171708
-
Semantical interprocedural parallelization: An overview of the PIPS project
-
New York, NY, USA, ACM
-
F. Irigoin, P. Jouvelot, and R. Triolet. Semantical interprocedural parallelization: an overview of the PIPS project. In ICS '91: Proceedings of the 5th international conference on Supercomputing, pages 244-251, New York, NY, USA, 1991. ACM.
-
(1991)
ICS '91: Proceedings of the 5th International Conference on Supercomputing
, pp. 244-251
-
-
Irigoin, F.1
Jouvelot, P.2
Triolet, R.3
-
13
-
-
33645236572
-
Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP
-
M. Ishihara, H. Honda, and M. Sato. Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP. IEICE - Trans. Inf. Syst., E89-D(2):399-407, 2006.
-
(2006)
IEICE - Trans. Inf. Syst.
, vol.E89-D
, Issue.2
, pp. 399-407
-
-
Ishihara, M.1
Honda, H.2
Sato, M.3
-
16
-
-
0026191059
-
Interactive parallel programming using the ParaScope editor
-
K. Kennedy, K. S. McKinley, and C. W. Tseng. Interactive parallel programming using the ParaScope editor. IEEE Trans. Parallel Distrib. Syst., 2(3):329-341, 1991.
-
(1991)
IEEE Trans. Parallel Distrib. Syst.
, vol.2
, Issue.3
, pp. 329-341
-
-
Kennedy, K.1
McKinley, K.S.2
Tseng, C.W.3
-
17
-
-
42549111870
-
Optimistic parallelism requires abstractions
-
New York, USA, ACM
-
M. Kulkarni, K. Pingali, B. Walter, G. Ramanarayanan, K. Bala, and L. Chew. Optimistic parallelism requires abstractions. In PLDI '07: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, pages 211-222, New York, USA, 2007. ACM.
-
(2007)
PLDI '07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 211-222
-
-
Kulkarni, M.1
Pingali, K.2
Walter, B.3
Ramanarayanan, G.4
Bala, K.5
Chew, L.6
-
18
-
-
0016026944
-
The parallel execution of do loops
-
L. Lamport. The parallel execution of do loops. Commun. ACM, 17(2):83-93, 1974.
-
(1974)
Commun. ACM
, vol.17
, Issue.2
, pp. 83-93
-
-
Lamport, L.1
-
19
-
-
0030645995
-
Maximizing parallelism and minimizing synchronization with affine transforms
-
New York, NY, USA, ACM
-
A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In POPL '97: Proceedings of the 24th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pages 201-214, New York, NY, USA, 1997. ACM.
-
(1997)
POPL '97: Proceedings of the 24th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages
, pp. 201-214
-
-
Lim, A.W.1
Lam, M.S.2
-
20
-
-
33749375700
-
Automatic thread extraction with decoupled software pipelining
-
0
-
G. Ottoni, R. Rangan, A. Stoler, and D. I. August. Automatic thread extraction with decoupled software pipelining. In MICRO-38: Proceedings of the 38th IEEE/ACM International Symposium on Microarchitecture, pages 105-118, 2005.
-
(2005)
MICRO-38: Proceedings of the 38th IEEE/ACM International Symposium on Microarchitecture
, pp. 105-118
-
-
Ottoni, G.1
Rangan, R.2
Stoler, A.3
August, D.I.4
-
21
-
-
0008241757
-
Polaris: A new-generation parallelizing compiler for MPPs
-
Technical report, Univ. of Illinois at Urbana-Champaign
-
D. A. Padua, R. Eigenmann, J. Hoeflinger, P. Petersen, P. Tu, S. Weatherford, and K. Faigin. Polaris: A new-generation parallelizing compiler for MPPs. Technical report, CSRD Rept. No. 1306, Univ. of Illinois at Urbana-Champaign, 1993.
-
(1993)
CSRD Rept. No. 1306
-
-
Padua, D.A.1
Eigenmann, R.2
Hoeflinger, J.3
Petersen, P.4
Tu, P.5
Weatherford, S.6
Faigin, K.7
-
22
-
-
76749098118
-
Polymorphic pipeline array: A flexible multicore accelerator with virtualized execution for mobile multimedia applications
-
New York, NY, USA, ACM
-
H. Park, Y. Park, and S. Mahlke. Polymorphic pipeline array: a flexible multicore accelerator with virtualized execution for mobile multimedia applications. In MICRO-42: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, pages 370-380, New York, NY, USA, 2009. ACM.
-
(2009)
MICRO-42: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 370-380
-
-
Park, H.1
Park, Y.2
Mahlke, S.3
-
23
-
-
77952281906
-
Speculative parallelization using software multi-threaded transactions
-
A. Raman, H. Kim, T. R. Mason, T. B. Jablin, and D. I. August. Speculative parallelization using software multi-threaded transactions. In ASPLOS XV: Proceedings of the Fifteenth International Conference on Architectural Support for Programming Languages and Operating Systems, March 2010.
-
ASPLOS XV: Proceedings of the Fifteenth International Conference on Architectural Support for Programming Languages and Operating Systems, March 2010
-
-
Raman, A.1
Kim, H.2
Mason, T.R.3
Jablin, T.B.4
August, D.I.5
-
24
-
-
43449113286
-
Parallel-stage decoupled software pipelining
-
New York, NY, USA, ACM
-
E. Raman, G. Ottoni, A. Raman, M. J. Bridges, and D. I. August. Parallel-stage decoupled software pipelining. In CGO '08: Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization, pages 114-123, New York, NY, USA, 2008. ACM.
-
(2008)
CGO '08: Proceedings of the 6th Annual IEEE/ACM International Symposium on Code Generation and Optimization
, pp. 114-123
-
-
Raman, E.1
Ottoni, G.2
Raman, A.3
Bridges, M.J.4
August, D.I.5
-
25
-
-
84886630310
-
Standard templates adaptive parallel library
-
L. Rauchwerger, F. Arzu, and K. Ouchi. Standard templates adaptive parallel library. In 4th International Workshop on Languages, Compilers and Run-Time Systems for Scalable Computers (LCR), pages 402-409, 1998.
-
(1998)
4th International Workshop on Languages, Compilers and Run-Time Systems for Scalable Computers (LCR)
, pp. 402-409
-
-
Rauchwerger, L.1
Arzu, F.2
Ouchi, K.3
-
27
-
-
34748894221
-
X10: Concurrent programming for modern architectures
-
New York, NY, USA, ACM
-
V. A. Saraswat, V. Sarkar, and C. von Praun. X10: concurrent programming for modern architectures. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 271-271, New York, NY, USA, 2007. ACM.
-
(2007)
PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 271-271
-
-
Saraswat, V.A.1
Sarkar, V.2
Von Praun, C.3
-
28
-
-
47349118686
-
A practical approach to exploiting coarse-grained pipeline parallelism in C programs
-
Washington, DC, USA, IEEE Computer Society
-
W. Thies, V. Chandrasekhar, and S. Amarasinghe. A practical approach to exploiting coarse-grained pipeline parallelism in C programs. In MICRO 40: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, pages 356-369, Washington, DC, USA, 2007. IEEE Computer Society.
-
(2007)
MICRO 40: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 356-369
-
-
Thies, W.1
Chandrasekhar, V.2
Amarasinghe, S.3
-
30
-
-
66749164066
-
Copy or discard execution model for speculative parallelization on multicores
-
Washington, DC, USA, IEEE Computer Society
-
C. Tian, M. Feng, V. Nagarajan, and R. Gupta. Copy or discard execution model for speculative parallelization on multicores. In MICRO 41: Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture, pages 330-341, Washington, DC, USA, 2008. IEEE Computer Society.
-
(2008)
MICRO 41: Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 330-341
-
-
Tian, C.1
Feng, M.2
Nagarajan, V.3
Gupta, R.4
-
31
-
-
70450278773
-
Towards a holistic approach to auto-parallelization: Integrating profile-driven parallelism detection and machine-learning based mapping
-
Dublin, Ireland, ACM
-
G. Tournavitis, Z. Wang, B. Franke, and M. F. O'Boyle. Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping. In PLDI '09: Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation, pages 177-187, Dublin, Ireland, 2009. ACM.
-
(2009)
PLDI '09: Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 177-187
-
-
Tournavitis, G.1
Wang, Z.2
Franke, B.3
O'Boyle, M.F.4
-
33
-
-
41349089872
-
Speculative decoupled software pipelining
-
Washington, DC, USA, IEEE Computer Society
-
N. Vachharajani, R. Rangan, E. Raman, M. J. Bridges, G. Ottoni, and D. I. August. Speculative decoupled software pipelining. In PACT '07: Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques, pages 49-59, Washington, DC, USA, 2007. IEEE Computer Society.
-
(2007)
PACT '07: Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques
, pp. 49-59
-
-
Vachharajani, N.1
Rangan, R.2
Raman, E.3
Bridges, M.J.4
Ottoni, G.5
August, D.I.6