메뉴 건너뛰기




Volumn , Issue , 2009, Pages 177-187

Towards a holistic approach to auto-parallelization integrating profile-driven parallelism detection and machine-learning based mapping

Author keywords

Auto parallelization; Machine learning based parallelism mapping; OpenMP; Profile driven parallelism detection

Indexed keywords

EXPERT PROGRAMMERS; HOLISTIC APPROACH; INTEGRATED APPROACH; MACHINE-LEARNING; MACHINE-LEARNING BASED PARALLELISM MAPPING; MULTI CORE; PARALLEL BENCHMARKS; PARALLEL CODE; PARALLELIZATION STRATEGIES; PARALLELIZATIONS; PARALLELIZING COMPILER; PERFORMANCE IMPROVEMENTS; PERFORMANCE LEVEL; TARGET ARCHITECTURES;

EID: 70450278773     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1542476.1542496     Document Type: Conference Paper
Times cited : (115)

References (47)
  • 1
    • 4544293938 scopus 로고    scopus 로고
    • Future microprocessors and off-chip SOP interconnect
    • May
    • H. P. Hofstee. Future microprocessors and off-chip SOP interconnect. IEEE Trans. on Advanced Packaging, 27(2), May 2004.
    • (2004) IEEE Trans. on Advanced Packaging , vol.27 , Issue.2
    • Hofstee, H.P.1
  • 2
    • 0016026944 scopus 로고
    • The parallel execution of DO loops
    • L. Lamport. The parallel execution of DO loops. Communications of ACM, 17(2), 1974.
    • (1974) Communications of ACM , vol.17 , Issue.2
    • Lamport, L.1
  • 3
    • 67650787207 scopus 로고
    • Interprocedural dependence analysis and parallelization
    • M. Burke and R. Cytron. Interprocedural dependence analysis and parallelization. PLDI, 1986.
    • (1986) PLDI
    • Burke, M.1    Cytron, R.2
  • 5
    • 0030645995 scopus 로고    scopus 로고
    • Maximizing parallelism and minimizing synchronization with affine transforms
    • ACM
    • A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. Parallel Computing, ACM, 1997.
    • (1997) Parallel Computing
    • Lim, A.W.1    Lam, M.S.2
  • 6
    • 0008241757 scopus 로고    scopus 로고
    • Polaris: A new-generation parallelizing compiler for MPPs
    • Technical report, In CSRD No. 1306. UIUC, 1993
    • D. A. Padua, R. Eigenmann, et al. Polaris: A new-generation parallelizing compiler for MPPs. Technical report, In CSRD No. 1306. UIUC, 1993.
    • Padua, D.A.1    Eigenmann, R.2
  • 7
    • 0030380793 scopus 로고    scopus 로고
    • Maximizing multiprocessor performance with the SUIF compiler
    • M. W. Hall, J. M. Anderson, et al. Maximizing multiprocessor performance with the SUIF compiler. Computer, 29(12), 1996.
    • (1996) Computer , vol.29 , Issue.12
    • Hall, M.W.1    Anderson, J.M.2
  • 8
    • 70450241771 scopus 로고    scopus 로고
    • Open64. http://www.open64.net.
    • Open64
  • 9
    • 0031622953 scopus 로고    scopus 로고
    • The implementation of the Cilk-5 multithreaded language
    • F. Matteo, C. Leiserson, and K. Randall. The implementation of the Cilk-5 multithreaded language. PLDI, 1998.
    • (1998) PLDI
    • Matteo, F.1    Leiserson, C.2    Randall, K.3
  • 10
    • 0036959649 scopus 로고    scopus 로고
    • A stream compiler for communication-exposed architectures
    • M. Gordon, W. Thies, M. Karczmarek, et al. A stream compiler for communication-exposed architectures. ASPLOS, 2002.
    • (2002) ASPLOS
    • Gordon, M.1    Thies, W.2    Karczmarek, M.3
  • 11
    • 70450264346 scopus 로고    scopus 로고
    • P. Husbands Parry, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. SC, 2003.
    • P. Husbands Parry, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. SC, 2003.
  • 12
    • 34748894221 scopus 로고    scopus 로고
    • Praun. X10: Concurrent programming for modern architectures
    • V. A. Saraswat, V. Sarkar, and C von. Praun. X10: Concurrent programming for modern architectures. PPoPP, 2007.
    • (2007) PPoPP
    • Saraswat, V.A.1    Sarkar, V.2    von, C.3
  • 13
    • 18844446223 scopus 로고    scopus 로고
    • SUIF Explorer: An interactive and interprocedural parallelizer
    • L. Shih-Wei, D. Amer, et al. SUIF Explorer: An interactive and interprocedural parallelizer. SIGPLAN Not., 34(8), 1999.
    • (1999) SIGPLAN Not , vol.34 , Issue.8
    • Shih-Wei, L.1    Amer, D.2
  • 14
    • 35448941890 scopus 로고    scopus 로고
    • Optimistic parallelism requires abstractions
    • M. Kulkarni, K. Pingali, B. Walter, et al. Optimistic parallelism requires abstractions. PLDI'07, 2007.
    • (2007) PLDI'07
    • Kulkarni, M.1    Pingali, K.2    Walter, B.3
  • 16
    • 33847108581 scopus 로고    scopus 로고
    • Hierarchically tiled arrays for parallelism and locality
    • Jia Guo, Ganesh Bikshandi, et al. Hierarchically tiled arrays for parallelism and locality. IPDPS, 2006.
    • (2006) IPDPS
    • Guo, J.1    Bikshandi, G.2
  • 17
    • 85040171708 scopus 로고
    • Semantical interprocedural parallelization: An overview of the PIPS
    • project. ICS
    • F. Irigoin, P. Jouvelot, and R. Triolet. Semantical interprocedural parallelization: an overview of the PIPS project. ICS 1991
    • (1991)
    • Irigoin, F.1    Jouvelot, P.2    Triolet, R.3
  • 18
    • 0026191059 scopus 로고
    • Interactive parallel programming using the Parascope editor
    • K. Kennedy, K. S. McKinley, and C. W. Tseng. Interactive parallel programming using the Parascope editor. IEEE TPDS, 2(3), 1991.
    • (1991) IEEE TPDS , vol.2 , Issue.3
    • Kennedy, K.1    McKinley, K.S.2    Tseng, C.W.3
  • 19
    • 0031121224 scopus 로고    scopus 로고
    • HPFIT: A set of integrated tools for the parallelization of applications using high performance Fortran. part I: HPFIT and the Transtool environment
    • T. Brandes, S. Chaumette, M. C. Counilh et al. HPFIT: a set of integrated tools for the parallelization of applications using high performance Fortran. part I: HPFIT and the Transtool environment. Parallel Comput., 23(1-2), 1997.
    • (1997) Parallel Comput , vol.23 , Issue.1-2
    • Brandes, T.1    Chaumette, S.2    Counilh, M.C.3
  • 20
    • 33645236572 scopus 로고    scopus 로고
    • Development and implementation of an interactive parallelization assistance tool for OpenMP: IPat/OMP
    • M. Ishihara, H. Honda, and M. Sato. Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP. IEICE Trans. Inf. Syst., E89-D(2), 2006.
    • (2006) IEICE Trans. Inf. Syst , vol.E89-D , Issue.2
    • Ishihara, M.1    Honda, H.2    Sato, M.3
  • 22
    • 67650784128 scopus 로고    scopus 로고
    • Induction variable substitution and reduction recognition in the Polaris parallelizing compiler
    • Technical Report, UIUC, 1994
    • W. M. Pottenger. Induction variable substitution and reduction recognition in the Polaris parallelizing compiler. Technical Report, UIUC, 1994.
    • Pottenger, W.M.1
  • 23
    • 0036612639 scopus 로고    scopus 로고
    • Compile time barrier synchronization minimization
    • M. O'Boyle and E. Stöhr. Compile time barrier synchronization minimization. IEEE TPDS, 13(6), 2002.
    • (2002) IEEE TPDS , vol.13 , Issue.6
    • O'Boyle, M.1    Stöhr, E.2
  • 25
    • 20344377909 scopus 로고    scopus 로고
    • Evaluating heuristics in automatically mapping multi-loop applications to FPGAs
    • H. Ziegler and M. Hall. Evaluating heuristics in automatically mapping multi-loop applications to FPGAs. FPGA, 2005.
    • (2005) FPGA
    • Ziegler, H.1    Hall, M.2
  • 27
    • 34548803705 scopus 로고    scopus 로고
    • A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs
    • R. E. Grant and A. Afsahi. A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs. IPDPS, 2007.
    • (2007) IPDPS
    • Grant, R.E.1    Afsahi, A.2
  • 29
    • 84900342836 scopus 로고    scopus 로고
    • SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
    • V. Aslot, M. Domeika, et al. SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance. LNCS, 2001.
    • (2001) LNCS
    • Aslot, V.1    Domeika, M.2
  • 31
    • 10444235596 scopus 로고    scopus 로고
    • Compiler estimation of load imbalance overhead in speculative parallelization
    • J. Dou and M. Cintra. Compiler estimation of load imbalance overhead in speculative parallelization. PACT, 2004.
    • (2004) PACT
    • Dou, J.1    Cintra, M.2
  • 32
    • 67650825867 scopus 로고    scopus 로고
    • Toward thread-level speculation for coarse-grained parallelism of regular access patterns
    • R. Ramaseshan and F. Mueller. Toward thread-level speculation for coarse-grained parallelism of regular access patterns. MULTIPROG, 2008.
    • (2008) MULTIPROG
    • Ramaseshan, R.1    Mueller, F.2
  • 33
    • 47349089048 scopus 로고    scopus 로고
    • Revisiting the sequential programming model for multi-core
    • M. Bridges, N. Vachharajani, et al. Revisiting the sequential programming model for multi-core. MICRO, 2007.
    • (2007) MICRO
    • Bridges, M.1    Vachharajani, N.2
  • 34
    • 34548045548 scopus 로고    scopus 로고
    • S. Rus, M. Pennings, and L. Rauchwerger. Sensitivity analysis for automatic parallelization on multi-cores, 2007. ICS, 2007
    • S. Rus, M. Pennings, and L. Rauchwerger. Sensitivity analysis for automatic parallelization on multi-cores, 2007. ICS, 2007
  • 35
    • 67650848208 scopus 로고
    • Dynamic dependence analysis: A novel method for data dependence evaluation
    • P. Peterson and D. Padua. Dynamic dependence analysis: A novel method for data dependence evaluation. LCPC, 1992.
    • (1992) LCPC
    • Peterson, P.1    Padua, D.2
  • 36
    • 0038684218 scopus 로고    scopus 로고
    • The JRPM system for dynamically parallelizing Java programs
    • M. Chen and K. Olukotun. The JRPM system for dynamically parallelizing Java programs. ISCA, 2003.
    • (2003) ISCA
    • Chen, M.1    Olukotun, K.2
  • 37
    • 67650800086 scopus 로고    scopus 로고
    • Hybrid dependence analysis for automatic parallelization
    • Technical Report, Dept. of CS, Texas A&M U, 2005
    • S. Rus and L. Rauchwerger. Hybrid dependence analysis for automatic parallelization. Technical Report, Dept. of CS, Texas A&M U., 2005.
    • Rus, S.1    Rauchwerger, L.2
  • 38
    • 35448991274 scopus 로고    scopus 로고
    • Software behavior oriented parallelization
    • C. Ding, X. Shen, et al. Software behavior oriented parallelization. PLDI, 2007.
    • (2007) PLDI
    • Ding, C.1    Shen, X.2
  • 39
    • 47349118686 scopus 로고    scopus 로고
    • A practical approach to exploiting coarse-grained pipeline parallelism in C programs
    • W. Thies, V. Chandrasekhar, and S. Amarasinghe. A practical approach to exploiting coarse-grained pipeline parallelism in C programs. MICRO, 2007.
    • (2007) MICRO
    • Thies, W.1    Chandrasekhar, V.2    Amarasinghe, S.3
  • 41
    • 67650060017 scopus 로고    scopus 로고
    • A compile-time cost model for OpenMP
    • C. Liao and B. Chapman. A compile-time cost model for OpenMP. IPDPS, 2007.
    • (2007) IPDPS
    • Liao, C.1    Chapman, B.2
  • 42
    • 22944492682 scopus 로고    scopus 로고
    • Performance-driven processor allocation
    • J. Corbalan, X. Martorell, and J. Labarta. Performance-driven processor allocation. IEEE TPDS, 16(7), 2005.
    • (2005) IEEE TPDS , vol.16 , Issue.7
    • Corbalan, J.1    Martorell, X.2    Labarta, J.3
  • 43
    • 33746276805 scopus 로고    scopus 로고
    • Runtime empirical selection of loop schedulers on Hyperthreaded SMPs
    • Y. Zhang and M. Voss. Runtime empirical selection of loop schedulers on Hyperthreaded SMPs. IPDPS, 2005.
    • (2005) IPDPS
    • Zhang, Y.1    Voss, M.2
  • 44
    • 0025467711 scopus 로고
    • A bridging model for parallel computation
    • L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8), 1990.
    • (1990) Communications of the ACM , vol.33 , Issue.8
    • Valiant, L.G.1
  • 45
    • 85016062555 scopus 로고    scopus 로고
    • Optimizing for reduced code space using genetic algorithms
    • K. Cooper, P. Schielke, and D. Subramanian. Optimizing for reduced code space using genetic algorithms. LCTES, 1999.
    • (1999) LCTES
    • Cooper, K.1    Schielke, P.2    Subramanian, D.3
  • 47
    • 57349167317 scopus 로고    scopus 로고
    • Iterative optimization in the polyhedral model: Part II, multidimensional time
    • L.N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos. Iterative optimization in the polyhedral model: part II, multidimensional time. PLDI, 2008.
    • (2008) PLDI
    • Pouchet, L.N.1    Bastoul, C.2    Cohen, A.3    Cavazos, J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.