SCOPUS 정보 검색 플랫폼

Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)

Volumn , Issue , 2009, Pages 177-187

Towards a holistic approach to auto-parallelization integrating profile-driven parallelism detection and machine-learning based mapping

(4) Tournavitis, Georgios a Wang, Zheng a Franke, Björn a O'Boyle, Michael F P a

a UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Auto parallelization; Machine learning based parallelism mapping; OpenMP; Profile driven parallelism detection

Indexed keywords

EXPERT PROGRAMMERS; HOLISTIC APPROACH; INTEGRATED APPROACH; MACHINE-LEARNING; MACHINE-LEARNING BASED PARALLELISM MAPPING; MULTI CORE; PARALLEL BENCHMARKS; PARALLEL CODE; PARALLELIZATION STRATEGIES; PARALLELIZATIONS; PARALLELIZING COMPILER; PERFORMANCE IMPROVEMENTS; PERFORMANCE LEVEL; TARGET ARCHITECTURES;

C (PROGRAMMING LANGUAGE); COMPUTER SOFTWARE; EDUCATION; LINGUISTICS; PARALLEL ARCHITECTURES; PROGRAM COMPILERS; SHAPE MEMORY EFFECT;

MAPPING;

EID: 70450278773 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1542476.1542496 Document Type: Conference Paper

Times cited : (115)

References (47)

1
- 4544293938
- Future microprocessors and off-chip SOP interconnect
- May
- H. P. Hofstee. Future microprocessors and off-chip SOP interconnect. IEEE Trans. on Advanced Packaging, 27(2), May 2004.
- (2004) IEEE Trans. on Advanced Packaging , vol.27 , Issue.2
- Hofstee, H.P.¹

2
- 0016026944
- The parallel execution of DO loops
- L. Lamport. The parallel execution of DO loops. Communications of ACM, 17(2), 1974.
- (1974) Communications of ACM , vol.17 , Issue.2
- Lamport, L.¹

3
- 67650787207
- Interprocedural dependence analysis and parallelization
- M. Burke and R. Cytron. Interprocedural dependence analysis and parallelization. PLDI, 1986.
- (1986) PLDI
- Burke, M.¹ Cytron, R.²

4
- 0037952146
- Morgan Kaufmann
- R. Allen and K. Kennedy. Optimizing Compilers for Modern Architectures: A Dependence-Based Approach. Morgan Kaufmann, 2002.
- (2002) Optimizing Compilers for Modern Architectures: A Dependence-Based Approach
- Allen, R.¹ Kennedy, K.²

5
- 0030645995
- Maximizing parallelism and minimizing synchronization with affine transforms
- ACM
- A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. Parallel Computing, ACM, 1997.
- (1997) Parallel Computing
- Lim, A.W.¹ Lam, M.S.²

6
- 0008241757
- Polaris: A new-generation parallelizing compiler for MPPs
- Technical report, In CSRD No. 1306. UIUC, 1993
- D. A. Padua, R. Eigenmann, et al. Polaris: A new-generation parallelizing compiler for MPPs. Technical report, In CSRD No. 1306. UIUC, 1993.
- Padua, D.A.¹ Eigenmann, R.²

7
- 0030380793
- Maximizing multiprocessor performance with the SUIF compiler
- M. W. Hall, J. M. Anderson, et al. Maximizing multiprocessor performance with the SUIF compiler. Computer, 29(12), 1996.
- (1996) Computer , vol.29 , Issue.12
- Hall, M.W.¹ Anderson, J.M.²

8
- 70450241771
- Open64. http://www.open64.net.
- Open64

9
- 0031622953
- The implementation of the Cilk-5 multithreaded language
- F. Matteo, C. Leiserson, and K. Randall. The implementation of the Cilk-5 multithreaded language. PLDI, 1998.
- (1998) PLDI
- Matteo, F.¹ Leiserson, C.² Randall, K.³

10
- 0036959649
- A stream compiler for communication-exposed architectures
- M. Gordon, W. Thies, M. Karczmarek, et al. A stream compiler for communication-exposed architectures. ASPLOS, 2002.
- (2002) ASPLOS
- Gordon, M.¹ Thies, W.² Karczmarek, M.³

11
- 70450264346
- P. Husbands Parry, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. SC, 2003.
- P. Husbands Parry, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. SC, 2003.

12
- 34748894221
- Praun. X10: Concurrent programming for modern architectures
- V. A. Saraswat, V. Sarkar, and C von. Praun. X10: Concurrent programming for modern architectures. PPoPP, 2007.
- (2007) PPoPP
- Saraswat, V.A.¹ Sarkar, V.² von, C.³

13
- 18844446223
- SUIF Explorer: An interactive and interprocedural parallelizer
- L. Shih-Wei, D. Amer, et al. SUIF Explorer: An interactive and interprocedural parallelizer. SIGPLAN Not., 34(8), 1999.
- (1999) SIGPLAN Not , vol.34 , Issue.8
- Shih-Wei, L.¹ Amer, D.²

14
- 35448941890
- Optimistic parallelism requires abstractions
- M. Kulkarni, K. Pingali, B. Walter, et al. Optimistic parallelism requires abstractions. PLDI'07, 2007.
- (2007) PLDI'07
- Kulkarni, M.¹ Pingali, K.² Walter, B.³

15
- 67650816175
- Standard Templates Adaptive Parallel Library
- L. Rauchwerger, F. Arzu, and K. Ouchi. Standard Templates Adaptive Parallel Library. Inter. Workshop LCR, 1998.
- (1998) Inter. Workshop LCR
- Rauchwerger, L.¹ Arzu, F.² Ouchi, K.³

16
- 33847108581
- Hierarchically tiled arrays for parallelism and locality
- Jia Guo, Ganesh Bikshandi, et al. Hierarchically tiled arrays for parallelism and locality. IPDPS, 2006.
- (2006) IPDPS
- Guo, J.¹ Bikshandi, G.²

17
- 85040171708
- Semantical interprocedural parallelization: An overview of the PIPS
- project. ICS
- F. Irigoin, P. Jouvelot, and R. Triolet. Semantical interprocedural parallelization: an overview of the PIPS project. ICS 1991
- (1991)
- Irigoin, F.¹ Jouvelot, P.² Triolet, R.³

18
- 0026191059
- Interactive parallel programming using the Parascope editor
- K. Kennedy, K. S. McKinley, and C. W. Tseng. Interactive parallel programming using the Parascope editor. IEEE TPDS, 2(3), 1991.
- (1991) IEEE TPDS , vol.2 , Issue.3
- Kennedy, K.¹ McKinley, K.S.² Tseng, C.W.³

19
- 0031121224
- HPFIT: A set of integrated tools for the parallelization of applications using high performance Fortran. part I: HPFIT and the Transtool environment
- T. Brandes, S. Chaumette, M. C. Counilh et al. HPFIT: a set of integrated tools for the parallelization of applications using high performance Fortran. part I: HPFIT and the Transtool environment. Parallel Comput., 23(1-2), 1997.
- (1997) Parallel Comput , vol.23 , Issue.1-2
- Brandes, T.¹ Chaumette, S.² Counilh, M.C.³

20
- 33645236572
- Development and implementation of an interactive parallelization assistance tool for OpenMP: IPat/OMP
- M. Ishihara, H. Honda, and M. Sato. Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP. IEICE Trans. Inf. Syst., E89-D(2), 2006.
- (2006) IEICE Trans. Inf. Syst , vol.E89-D , Issue.2
- Ishihara, M.¹ Honda, H.² Sato, M.³

21
- 67650812989
- A dynamic analysis tool for finding coarse-grain parallelism
- S. Rul, H. Vandierendonck, and K. De Bosschere. A dynamic analysis tool for finding coarse-grain parallelism. In HiPEAC Industrial Workshop, 2008.
- (2008) HiPEAC Industrial Workshop
- Rul, S.¹ Vandierendonck, H.² De Bosschere, K.³

22
- 67650784128
- Induction variable substitution and reduction recognition in the Polaris parallelizing compiler
- Technical Report, UIUC, 1994
- W. M. Pottenger. Induction variable substitution and reduction recognition in the Polaris parallelizing compiler. Technical Report, UIUC, 1994.
- Pottenger, W.M.¹

23
- 0036612639
- Compile time barrier synchronization minimization
- M. O'Boyle and E. Stöhr. Compile time barrier synchronization minimization. IEEE TPDS, 13(6), 2002.
- (2002) IEEE TPDS , vol.13 , Issue.6
- O'Boyle, M.¹ Stöhr, E.²

24
- 67650854441
- A training algorithm for optimal margin classifiers
- E. B. Bernhard, M. G. Isabelle, and N. V. Vladimir. A training algorithm for optimal margin classifiers. Workshop on Computational Learning Theory, 1992.
- (1992) Workshop on Computational Learning Theory
- Bernhard, E.B.¹ Isabelle, M.G.² Vladimir, N.V.³

25
- 20344377909
- Evaluating heuristics in automatically mapping multi-loop applications to FPGAs
- H. Ziegler and M. Hall. Evaluating heuristics in automatically mapping multi-loop applications to FPGAs. FPGA, 2005.
- (2005) FPGA
- Ziegler, H.¹ Hall, M.²

26
- 84973836157
- The NAS parallel benchmarks
- D. H. Bailey, E. Barszcz, et al. The NAS parallel benchmarks. The International Journal of Supercomputer Applications, 5(3), 1991.
- (1991) The International Journal of Supercomputer Applications , vol.5 , Issue.3
- Bailey, D.H.¹ Barszcz, E.²

27
- 34548803705
- A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs
- R. E. Grant and A. Afsahi. A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs. IPDPS, 2007.
- (2007) IPDPS
- Grant, R.E.¹ Afsahi, A.²

28
- 84876909982
- NAS Parallel Benchmarks 2.3, OpenMP C version. http://phase.hpcc.jp/Omni/benchmarks/NPB/index.html.
- NAS Parallel Benchmarks 2.3, OpenMP C version

29
- 84900342836
- SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
- V. Aslot, M. Domeika, et al. SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance. LNCS, 2001.
- (2001) LNCS
- Aslot, V.¹ Domeika, M.²

30
- 0031594005
- Threaded multiple path execution
- S. Wallace, B. Calder, and D. M. Tullsen. Threaded multiple path execution. ISCA, 1998.
- (1998) ISCA
- Wallace, S.¹ Calder, B.² Tullsen, D.M.³

31
- 10444235596
- Compiler estimation of load imbalance overhead in speculative parallelization
- J. Dou and M. Cintra. Compiler estimation of load imbalance overhead in speculative parallelization. PACT, 2004.
- (2004) PACT
- Dou, J.¹ Cintra, M.²

32
- 67650825867
- Toward thread-level speculation for coarse-grained parallelism of regular access patterns
- R. Ramaseshan and F. Mueller. Toward thread-level speculation for coarse-grained parallelism of regular access patterns. MULTIPROG, 2008.
- (2008) MULTIPROG
- Ramaseshan, R.¹ Mueller, F.²

33
- 47349089048
- Revisiting the sequential programming model for multi-core
- M. Bridges, N. Vachharajani, et al. Revisiting the sequential programming model for multi-core. MICRO, 2007.
- (2007) MICRO
- Bridges, M.¹ Vachharajani, N.²

34
- 34548045548
- S. Rus, M. Pennings, and L. Rauchwerger. Sensitivity analysis for automatic parallelization on multi-cores, 2007. ICS, 2007
- S. Rus, M. Pennings, and L. Rauchwerger. Sensitivity analysis for automatic parallelization on multi-cores, 2007. ICS, 2007

35
- 67650848208
- Dynamic dependence analysis: A novel method for data dependence evaluation
- P. Peterson and D. Padua. Dynamic dependence analysis: A novel method for data dependence evaluation. LCPC, 1992.
- (1992) LCPC
- Peterson, P.¹ Padua, D.²

36
- 0038684218
- The JRPM system for dynamically parallelizing Java programs
- M. Chen and K. Olukotun. The JRPM system for dynamically parallelizing Java programs. ISCA, 2003.
- (2003) ISCA
- Chen, M.¹ Olukotun, K.²

37
- 67650800086
- Hybrid dependence analysis for automatic parallelization
- Technical Report, Dept. of CS, Texas A&M U, 2005
- S. Rus and L. Rauchwerger. Hybrid dependence analysis for automatic parallelization. Technical Report, Dept. of CS, Texas A&M U., 2005.
- Rus, S.¹ Rauchwerger, L.²

38
- 35448991274
- Software behavior oriented parallelization
- C. Ding, X. Shen, et al. Software behavior oriented parallelization. PLDI, 2007.
- (2007) PLDI
- Ding, C.¹ Shen, X.²

39
- 47349118686
- A practical approach to exploiting coarse-grained pipeline parallelism in C programs
- W. Thies, V. Chandrasekhar, and S. Amarasinghe. A practical approach to exploiting coarse-grained pipeline parallelism in C programs. MICRO, 2007.
- (2007) MICRO
- Thies, W.¹ Chandrasekhar, V.² Amarasinghe, S.³

40
- 67650800084
- SC
- J. Ramanujam and P. Sadayappan. A methodology for parallelizing programs for multicomputers and complex memory multiprocessors. SC, 1989.
- (1989) A methodology for parallelizing programs for multicomputers and complex memory multiprocessors
- Ramanujam, J.¹ Sadayappan, P.²

41
- 67650060017
- A compile-time cost model for OpenMP
- C. Liao and B. Chapman. A compile-time cost model for OpenMP. IPDPS, 2007.
- (2007) IPDPS
- Liao, C.¹ Chapman, B.²

42
- 22944492682
- Performance-driven processor allocation
- J. Corbalan, X. Martorell, and J. Labarta. Performance-driven processor allocation. IEEE TPDS, 16(7), 2005.
- (2005) IEEE TPDS , vol.16 , Issue.7
- Corbalan, J.¹ Martorell, X.² Labarta, J.³

43
- 33746276805
- Runtime empirical selection of loop schedulers on Hyperthreaded SMPs
- Y. Zhang and M. Voss. Runtime empirical selection of loop schedulers on Hyperthreaded SMPs. IPDPS, 2005.
- (2005) IPDPS
- Zhang, Y.¹ Voss, M.²

44
- 0025467711
- A bridging model for parallel computation
- L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8), 1990.
- (1990) Communications of the ACM , vol.33 , Issue.8
- Valiant, L.G.¹

45
- 85016062555
- Optimizing for reduced code space using genetic algorithms
- K. Cooper, P. Schielke, and D. Subramanian. Optimizing for reduced code space using genetic algorithms. LCTES, 1999.
- (1999) LCTES
- Cooper, K.¹ Schielke, P.² Subramanian, D.³

46
- 4544251830
- A machine learning approach to automatic production of compiler heuristics
- A. Monsifrot, F. Bodin, and R. Quiniou. A machine learning approach to automatic production of compiler heuristics. Artificial Intelligence: Methodology, Systems, Applications, 2002.
- (2002) Artificial Intelligence: Methodology, Systems, Applications
- Monsifrot, A.¹ Bodin, F.² Quiniou, R.³

47
- 57349167317
- Iterative optimization in the polyhedral model: Part II, multidimensional time
- L.N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos. Iterative optimization in the polyhedral model: part II, multidimensional time. PLDI, 2008.
- (2008) PLDI
- Pouchet, L.N.¹ Bastoul, C.² Cohen, A.³ Cavazos, J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.