SCOPUS Information Search Platform

Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

Volume , Issue , 2010, Pages 377-388

Semi-automatic extraction and exploitation of hierarchical pipeline parallelism using profiling information

(2) Tournavitis, Georgios (a); Franke, Björn (a)

(a) UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

parallelization; pipeline parallelism; program dependence graph; streaming applications

Indexed keywords

APPLICATION PROGRAMS; EMBEDDED SYSTEMS; EXTRACTION; LEGACY SYSTEMS; MULTICORE PROGRAMMING; PARALLEL ARCHITECTURES; PARALLEL PROGRAMMING; PERSONAL COMPUTERS;

AUTOMATIC PARALLELIZATION; EMBEDDED COMPUTING SYSTEM; HIGH PERFORMANCE COMPUTING; PARALLEL PROGRAMMING MODEL; PARALLELIZATIONS; PIPELINE PARALLELISMS; PROGRAM DEPENDENCE GRAPH; STREAMING APPLICATIONS;

PIPELINES;

EID: 78149252926
PISSN: 1089795X
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/1854273.1854321
Document Type: Conference Paper

Times cited: 37

References (33)

1
- 0037952146
- Morgan Kaufmann Publishers Inc. San Francisco, CA, USA
- J. Allen and K. Kennedy. Optimizing compilers for modern architectures: a dependence-based approach. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA, 2001.
- (2001) Optimizing Compilers for Modern Architectures: A Dependence-based Approach
- Allen, J.¹ Kennedy, K.²

2
- 0031121224
- HPFIT: A set of integrated tools for the parallelization of applications using High Performance Fortran. Part I: HPFIT and the TransTOOL environment
- Environment and tools for parallel scientific computing
- T. Brandes, S. Chaumette, M. C. Counilh, J. Roman, A. Darte, F. Desprez, and J. C. Mignot. HPFIT: A set of integrated tools for the parallelization of applications using High Performance Fortran. Part I: HPFIT and the TransTOOL environment. Parallel Computing, 23(1-2):71-87, 1997. Environment and tools for parallel scientific computing.
- (1997) Parallel Computing , vol.23 , Issue.1-2 , pp. 71-87
- Brandes, T.¹ Chaumette, S.² Counilh, M.C.³ Roman, J.⁴ Darte, A.⁵ Desprez, F.⁶ Mignot, J.C.⁷

3
- 3142692758
- Interprocedural dependence analysis and parallelization
- M. G. Burke and R. K. Cytron. Interprocedural dependence analysis and parallelization. SIGPLAN Not., 39(4):139-154, 2004.
- (2004) SIGPLAN Not. , vol.39 , Issue.4 , pp. 139-154
- Burke, M.G.¹ Cytron, R.K.²

4
- 51549106553
- MAPS: An integrated framework for MPSoC application parallelization
- J. Ceng, J. Castrillon, W. Sheng, H. Scharwachter, R. Leupers, G. Ascheid, H. Meyr, T. Isshiki, and H. Kunieda. MAPS: an integrated framework for MPSoC application parallelization. In DAC 2008: Proceedings of the 45th Annual Design Automation Conference. ACM/IEEE, pages 754-759, 2008.
- (2008) DAC 2008: Proceedings of the 45th Annual Design Automation Conference. ACM/IEEE , pp. 754-759
- Ceng, J.¹ Castrillon, J.² Sheng, W.³ Scharwachter, H.⁴ Leupers, R.⁵ Ascheid, G.⁶ Meyr, H.⁷ Isshiki, T.⁸ Kunieda, H.⁹

5
- 78149273837
- Open64 compiler infrastructure for emerging multicore/manycore architecture all symposium tutorial
- S. C. Chan, G. R. Gao, B. Chapman, T. Linthicum, and A. Dasgupta. Open64 compiler infrastructure for emerging multicore/manycore architecture all symposium tutorial. In IPDPS 2008: 22nd IEEE International Symposium on Parallel and Distributed Processing, Miami, FL, USA, 2008.
- (2008) IPDPS 2008: 22nd IEEE International Symposium on Parallel and Distributed Processing, Miami, FL, USA
- Chan, S.C.¹ Gao, G.R.² Chapman, B.³ Linthicum, T.⁴ Dasgupta, A.⁵

6
- 0023385308
- The program dependence graph and its use in optimization
- J. Ferrante, K. J. Ottenstein, and J. D. Warren. The program dependence graph and its use in optimization. ACM Trans. Program. Lang. Syst., 9(3):319-349, 1987.
- (1987) ACM Trans. Program. Lang. Syst. , vol.9 , Issue.3 , pp. 319-349
- Ferrante, J.¹ Ottenstein, K.J.² Warren, J.D.³

7
- 0347507496
- New York, NY, USA, ACM
- M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. volume 33, pages 212-223, New York, NY, USA, 1998. ACM.
- (1998) The Implementation of the Cilk-5 Multithreaded Language , vol.33 , pp. 212-223
- Frigo, M.¹ Leiserson, C.E.² Randall, K.H.³

8
- 0036959649
- A stream compiler for communication-exposed architectures
- New York, NY, USA, ACM
- M. I. Gordon, W. Thies, M. Karczmarek, J. Lin, A. S. Meli, A. A. Lamb, C. Leger, J. Wong, H. Hoffmann, D. Maze, and S. Amarasinghe. A stream compiler for communication-exposed architectures. In ASPLOS-X: Proceedings of the 10th international conference on Architectural support for programming languages and operating systems, pages 291-303, New York, NY, USA, 2002. ACM.
- (2002) ASPLOS-X: Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 291-303
- Gordon, M.I.¹ Thies, W.² Karczmarek, M.³ Lin, J.⁴ Meli, A.S.⁵ Lamb, A.A.⁶ Leger, C.⁷ Wong, J.⁸ Hoffmann, H.⁹ Maze, D.¹⁰ Amarasinghe, S.¹¹

9
- 78149264907
- volume 0, Los Alamitos, CA, USA, IEEE Computer Society
- J. Guo, G. Bikshandi, D. Hoeflinger, G. Almasi, B. Fraguela, M. Garzaran, D. Padua, and C. von Praun. Hierarchically tiled arrays for parallelism and locality. volume 0, page 316, Los Alamitos, CA, USA, 2006. IEEE Computer Society.
- (2006) Hierarchically Tiled Arrays for Parallelism and Locality , pp. 316
- Guo, J.¹ Bikshandi, G.² Hoeflinger, D.³ Almasi, G.⁴ Fraguela, B.⁵ Garzaran, M.⁶ Padua, D.⁷ Von Praun, C.⁸

10
- 0030380793
- Maximizing multiprocessor performance with the SUIF compiler
- M. W. Hall, J. M. Anderson, S. P. Amarasinghe, B. R. Murphy, S.-W. Liao, E. Bugnion, and M. S. Lam. Maximizing multiprocessor performance with the SUIF compiler. Computer, 29(12):84-89, 1996.
- (1996) Computer , vol.29 , Issue.12 , pp. 84-89
- Hall, M.W.¹ Anderson, J.M.² Amarasinghe, S.P.³ Murphy, B.R.⁴ Liao, S.-W.⁵ Bugnion, E.⁶ Lam, M.S.⁷

11
- 1142293067
- A performance analysis of the Berkeley UPC compiler
- New York, NY, USA, ACM
- P. Husbands, C. Iancu, and K. Yelick. A performance analysis of the Berkeley UPC compiler. In ICS '03: Proceedings of the 17th annual international conference on Supercomputing, pages 63-73, New York, NY, USA, 2003. ACM.
- (2003) ICS '03: Proceedings of the 17th Annual International Conference on Supercomputing , pp. 63-73
- Husbands, P.¹ Iancu, C.² Yelick, K.³

12
- 85040171708
- Semantical interprocedural parallelization: An overview of the PIPS project
- New York, NY, USA, ACM
- F. Irigoin, P. Jouvelot, and R. Triolet. Semantical interprocedural parallelization: an overview of the PIPS project. In ICS '91: Proceedings of the 5th international conference on Supercomputing, pages 244-251, New York, NY, USA, 1991. ACM.
- (1991) ICS '91: Proceedings of the 5th International Conference on Supercomputing , pp. 244-251
- Irigoin, F.¹ Jouvelot, P.² Triolet, R.³

13
- 33645236572
- Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP
- M. Ishihara, H. Honda, and M. Sato. Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP. IEICE - Trans. Inf. Syst., E89-D(2):399-407, 2006.
- (2006) IEICE - Trans. Inf. Syst. , vol.E89-D , Issue.2 , pp. 399-407
- Ishihara, M.¹ Honda, H.² Sato, M.³

14
- 84949900964
- Exploiting fine- and coarse-grain parallelism in embedded programs
- I. Karkowski and H. Corporaal. Exploiting fine- and coarse-grain parallelism in embedded programs. In PACT '98: Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, pages 60-67, 1998.
- (1998) PACT '98: Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques , pp. 60-67
- Karkowski, I.¹ Corporaal, H.²

15
- 78149265397
- Overcoming the limitations of the traditional loop parallelization
- Jan
- I. Karkowski and H. Corporaal. Overcoming the limitations of the traditional loop parallelization. High-Performance Computing and Networking, Jan 1998.
- (1998) High-Performance Computing and Networking
- Karkowski, I.¹ Corporaal, H.²

16
- 0026191059
- Interactive parallel programming using the ParaScope editor
- K. Kennedy, K. S. McKinley, and C. W. Tseng. Interactive parallel programming using the ParaScope editor. IEEE Trans. Parallel Distrib. Syst., 2(3):329-341, 1991.
- (1991) IEEE Trans. Parallel Distrib. Syst. , vol.2 , Issue.3 , pp. 329-341
- Kennedy, K.¹ McKinley, K.S.² Tseng, C.W.³

17
- 42549111870
- Optimistic parallelism requires abstractions
- New York, USA, ACM
- M. Kulkarni, K. Pingali, B. Walter, G. Ramanarayanan, K. Bala, and L. Chew. Optimistic parallelism requires abstractions. In PLDI '07: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, pages 211-222, New York, USA, 2007. ACM.
- (2007) PLDI '07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 211-222
- Kulkarni, M.¹ Pingali, K.² Walter, B.³ Ramanarayanan, G.⁴ Bala, K.⁵ Chew, L.⁶

18
- 0016026944
- The parallel execution of do loops
- L. Lamport. The parallel execution of do loops. Commun. ACM, 17(2):83-93, 1974.
- (1974) Commun. ACM , vol.17 , Issue.2 , pp. 83-93
- Lamport, L.¹

19
- 0030645995
- Maximizing parallelism and minimizing synchronization with affine transforms
- New York, NY, USA, ACM
- A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In POPL '97: Proceedings of the 24th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pages 201-214, New York, NY, USA, 1997. ACM.
- (1997) POPL '97: Proceedings of the 24th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages , pp. 201-214
- Lim, A.W.¹ Lam, M.S.²

20
- 33749375700
- Automatic thread extraction with decoupled software pipelining
- G. Ottoni, R. Rangan, A. Stoler, and D. I. August. Automatic thread extraction with decoupled software pipelining. In MICRO-38: Proceedings of the 38th IEEE/ACM International Symposium on Microarchitecture, pages 105-118, 2005.
- (2005) MICRO-38: Proceedings of the 38th IEEE/ACM International Symposium on Microarchitecture , pp. 105-118
- Ottoni, G.¹ Rangan, R.² Stoler, A.³ August, D.I.⁴

21
- 0008241757
- Polaris: A new-generation parallelizing compiler for MPPs
- Technical report, Univ. of Illinois at Urbana-Champaign
- D. A. Padua, R. Eigenmann, J. Hoeflinger, P. Petersen, P. Tu, S. Weatherford, and K. Faigin. Polaris: A new-generation parallelizing compiler for MPPs. Technical report, CSRD Rept. No. 1306. Univ. of Illinois at Urbana-Champaign, 1993.
- (1993) CSRD Rept. No. 1306
- Padua, D.A.¹ Eigenmann, R.² Hoeflinger, J.³ Petersen, P.⁴ Tu, P.⁵ Weatherford, S.⁶ Faigin, K.⁷

22
- 76749098118
- Polymorphic pipeline array: A flexible multicore accelerator with virtualized execution for mobile multimedia applications
- New York, NY, USA, ACM
- H. Park, Y. Park, and S. Mahlke. Polymorphic pipeline array: a flexible multicore accelerator with virtualized execution for mobile multimedia applications. In MICRO-42: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, pages 370-380, New York, NY, USA, 2009. ACM.
- (2009) MICRO-42: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture , pp. 370-380
- Park, H.¹ Park, Y.² Mahlke, S.³

23
- 77952281906
- Speculative parallelization using software multi-threaded transactions
- A. Raman, H. Kim, T. R. Mason, T. B. Jablin, and D. I. August. Speculative parallelization using software multi-threaded transactions. In ASPLOS XV: Proceedings of the Fifteenth International Conference on Architectural Support for Programming Languages and Operating Systems, March 2010.
- ASPLOS XV: Proceedings of the Fifteenth International Conference on Architectural Support for Programming Languages and Operating Systems, March 2010
- Raman, A.¹ Kim, H.² Mason, T.R.³ Jablin, T.B.⁴ August, D.I.⁵

24
- 43449113286
- Parallel-stage decoupled software pipelining
- New York, NY, USA, ACM
- E. Raman, G. Ottoni, A. Raman, M. J. Bridges, and D. I. August. Parallel-stage decoupled software pipelining. In CGO '08: Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization, pages 114-123, New York, NY, USA, 2008. ACM.
- (2008) CGO '08: Proceedings of the 6th Annual IEEE/ACM International Symposium on Code Generation and Optimization , pp. 114-123
- Raman, E.¹ Ottoni, G.² Raman, A.³ Bridges, M.J.⁴ August, D.I.⁵

25
- 84886630310
- Standard templates adaptive parallel library
- L. Rauchwerger, F. Arzu, and K. Ouchi. Standard templates adaptive parallel library. In 4th International Workshop on Languages, Compilers and Run-Time Systems for Scalable Computers (LCR), pages 402-409, 1998.
- (1998) 4th International Workshop on Languages, Compilers and Run-Time Systems for Scalable Computers (LCR) , pp. 402-409
- Rauchwerger, L.¹ Arzu, F.² Ouchi, K.³

26
- 70349749486
- Extracting coarse-grain parallelism in general-purpose programs
- S. Rul, H. Vandierendonck, and K. Bosschere. Extracting coarse-grain parallelism in general-purpose programs. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, Feb 2008.
- PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Feb 2008
- Rul, S.¹ Vandierendonck, H.² Bosschere, K.³

27
- 34748894221
- X10: Concurrent programming for modern architectures
- New York, NY, USA, ACM
- V. A. Saraswat, V. Sarkar, and C. von Praun. X10: concurrent programming for modern architectures. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 271-271, New York, NY, USA, 2007. ACM.
- (2007) PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 271-271
- Saraswat, V.A.¹ Sarkar, V.² Von Praun, C.³

28
- 47349118686
- A practical approach to exploiting coarse-grained pipeline parallelism in C programs
- Washington, DC, USA, IEEE Computer Society
- W. Thies, V. Chandrasekhar, and S. Amarasinghe. A practical approach to exploiting coarse-grained pipeline parallelism in C programs. In MICRO 40: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, pages 356-369, Washington, DC, USA, 2007. IEEE Computer Society.
- (2007) MICRO 40: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture , pp. 356-369
- Thies, W.¹ Chandrasekhar, V.² Amarasinghe, S.³

29
- 84959045524
- StreamIt: A language for streaming applications
- W. Thies, M. Karczmarek, and S. Amarasinghe. StreamIt: A language for streaming applications. Lecture Notes in Computer Science, 2304:179-??, 2002.
- (2002) Lecture Notes in Computer Science , vol.2304 , pp. 179
- Thies, W.¹ Karczmarek, M.² Amarasinghe, S.³

30
- 66749164066
- Copy or discard execution model for speculative parallelization on multicores
- Washington, DC, USA, IEEE Computer Society
- C. Tian, M. Feng, V. Nagarajan, and R. Gupta. Copy or discard execution model for speculative parallelization on multicores. In MICRO 41: Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture, pages 330-341, Washington, DC, USA, 2008. IEEE Computer Society.
- (2008) MICRO 41: Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture , pp. 330-341
- Tian, C.¹ Feng, M.² Nagarajan, V.³ Gupta, R.⁴

31
- 70450278773
- Towards a holistic approach to auto-parallelization: Integrating profile-driven parallelism detection and machine-learning based mapping
- Dublin, Ireland, ACM
- G. Tournavitis, Z. Wang, B. Franke, and M. F. O'Boyle. Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping. In PLDI '09: Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation, pages 177-187, Dublin, Ireland, 2009. ACM.
- (2009) PLDI '09: Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 177-187
- Tournavitis, G.¹ Wang, Z.² Franke, B.³ O'Boyle, M.F.⁴

32
- 77949711818
- PhD thesis, Princeton University
- N. Vachharajani. Intelligent Speculation for Pipelined Multithreading. PhD thesis, Princeton University, 2008.
- (2008) Intelligent Speculation for Pipelined Multithreading
- Vachharajani, N.¹

33
- 41349089872
- Speculative decoupled software pipelining
- Washington, DC, USA, IEEE Computer Society
- N. Vachharajani, R. Rangan, E. Raman, M. J. Bridges, G. Ottoni, and D. I. August. Speculative decoupled software pipelining. In PACT '07: Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques, pages 49-59, Washington, DC, USA, 2007. IEEE Computer Society.
- (2007) PACT '07: Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques , pp. 49-59
- Vachharajani, N.¹ Rangan, R.² Raman, E.³ Bridges, M.J.⁴ Ottoni, G.⁵ August, D.I.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.