SCOPUS 정보 검색 플랫폼

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP

Volumn , Issue , 2007, Pages 215-225

Tight analysis of the performance potential of thread speculation using SPEC CPU 2006

(9) Kejariwal, Arun a Tian, Xinmin b Girkar, Milind b Li, Wei b Kozhukhov, Sergey b Banerjee, Utpal b Nicolau, Alexander a Veidenbaum, Alexander V a Polychronopoulos, Constantine D c

a UNIVERSITY OF CALIFORNIA (United States)

b INTEL CORPORATION (United States)

c University of California

Author keywords

Conflict probability; Misspeculation penalty; Performance evaluation; Speculative execution; Threading overhead

Indexed keywords

PARALLEL PROGRAMMING; PROBABILITY;

CONFLICT PROBABILITY; MISSPECULATION PENALTY; PERFORMANCE EVALUATION; SPECULATIVE EXECUTION; THREADING OVERHEAD;

PROGRAM PROCESSORS;

EID: 34748924163 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1229428.1229475 Document Type: Conference Paper

Times cited : (15)

References (35)

1
- 34547211682
- analysis, speculative execution
- A. Kejariwal and A. Nicolau. Reading list of performance analysis, speculative execution, http://www.ics.uci.edu/~akej ariw/ SpeculativeExecutionReadingList.pdf.
- Reading list of performance
- Kejariwal, A.¹ Nicolau, A.²

2
- 0019248192
- A controllable MIMD architectures
- St. Charles, IL, August
- S. F. Lundstrom and G. H. Barnes. A controllable MIMD architectures. In Proceedings of the 1980 International Conference on Parallel Processing, pages 19-27, St. Charles, IL, August 1980.
- (1980) Proceedings of the 1980 International Conference on Parallel Processing , pp. 19-27
- Lundstrom, S.F.¹ Barnes, G.H.²

3
- 34547414505
- On the performance potential of different types of speculative thread-level parallelism
- Cairns, Australia
- A. Kejariwal, X. Tian, W. Li, M. Girkar, S. Kozhukhov, H. Saito, U. Banerjee, A. Nicolau, A. V. Veidenbaum, and C. D. Polychronopoulos. On the performance potential of different types of speculative thread-level parallelism. In Proceedings of the 20th ACM. International Conference on Supercomputing, pages 24-35, Cairns, Australia, 2006.
- (2006) Proceedings of the 20th ACM. International Conference on Supercomputing , pp. 24-35
- Kejariwal, A.¹ Tian, X.² Li, W.³ Girkar, M.⁴ Kozhukhov, S.⁵ Saito, H.⁶ Banerjee, U.⁷ Nicolau, A.⁸ Veidenbaum, A.V.⁹ Polychronopoulos, C.D.¹⁰

4
- 85067328792
- SPEC CPU2006. http://www.spec.org/cpu2006.
- (2006)
- CPU, S.P.E.C.¹

5
- 85060036181
- Validity of the single processor approachtoachieving large scale computing capabilities
- G. M. Amdahl. Validity of the single processor approachtoachieving large scale computing capabilities. In AFIPS Conference Proceedings, pages 483-485, 1967.
- (1967) AFIPS Conference Proceedings , pp. 483-485
- Amdahl, G.M.¹

6
- 85067324496
- Open Research Compiler for Itanium™Processor Family
- Open Research Compiler for Itanium™Processor Family, http://ipf-orc.sourceforge.net/.

7
- 84870395439
- GCC, the GNU Compiler Collection, http://gcc.gnu.org/.
- GCC, the GNU Compiler Collection

8
- 85034806381
- Semantic parallelization: A practical exercise in abstract interpretation
- Munich, West Germany, January
- P. Jouvelot. Semantic parallelization: a practical exercise in abstract interpretation. In Proceedings of the Fourteenth Annual ACM Symposium on the Principles of Programming Languages, pages 39-48, Munich, West Germany, January 1987.
- (1987) Proceedings of the Fourteenth Annual ACM Symposium on the Principles of Programming Languages , pp. 39-48
- Jouvelot, P.¹

9
- 84976742360
- A unified semantic approach for the vectorization and parallelization of generalized reductions
- Crete, Greece, June
- P. Jouvelot and B. Dehbonei. A unified semantic approach for the vectorization and parallelization of generalized reductions. In Proceedings of the 3rd ACM International Conference on Supercomputing, pages 186-194, Crete, Greece, June 1989.
- (1989) Proceedings of the 3rd ACM International Conference on Supercomputing , pp. 186-194
- Jouvelot, P.¹ Dehbonei, B.²

10
- 35048873677
- College Station, TX, October
- D. J. Quinlan, M. Schordan, Q. Yi, and B. R. de Supinski. Semanticdriven parallelization of loops operating on user-defined containers, pages 524-538, College Station, TX, October 2003.
- (2003) Semanticdriven parallelization of loops operating on user-defined containers , pp. 524-538
- Quinlan, D.J.¹ Schordan, M.² Yi, Q.³ de Supinski, B.R.⁴

11
- 0027541302
- Automatic program parallelization
- February
- U. Banerjee, R. Eigenmann, A. Nicolau, and D. Padua. Automatic program parallelization. Proceedings of the IEEE, 81(2):211-243, February 1993.
- (1993) Proceedings of the IEEE , vol.81 , Issue.2 , pp. 211-243
- Banerjee, U.¹ Eigenmann, R.² Nicolau, A.³ Padua, D.⁴

12
- 77950300305
- ILP versus TLP on SMT
- Portland, OR
- N. Mitchell, L. Carter, J. Ferrante, and D. Tullsen. ILP versus TLP on SMT. In Proceedings of the 1999 ACM/EEE Conference on Supercomputing, page 37, Portland, OR, 1999.
- (1999) Proceedings of the 1999 ACM/EEE Conference on Supercomputing , pp. 37
- Mitchell, N.¹ Carter, L.² Ferrante, J.³ Tullsen, D.⁴

13
- 0003015894
- Some scheduling techniques and an easily schedulable horizontal architecture for high performancescientific computing
- Chatham, MA, December
- B. R. Rau and C. D. Glaeser. Some scheduling techniques and an easily schedulable horizontal architecture for high performancescientific computing. In Proceedings of the 14th annual workshop on Microprogramming, pages 183-198, Chatham, MA, December 1981.
- (1981) Proceedings of the 14th annual workshop on Microprogramming , pp. 183-198
- Rau, B.R.¹ Glaeser, C.D.²

14
- 34748836458
- Percolation scheduling
- August
- A. Nicolau. Percolation scheduling. In Proceedings of the 1985 International Conference on Parallel Processing, August 1985.
- (1985) Proceedings of the 1985 International Conference on Parallel Processing
- Nicolau, A.¹

15
- 33746662867
- Efficient techniques for advanced data dependence analysis
- St. Louis, MO
- K. Kyriakopoulos and K. Psarris. Efficient techniques for advanced data dependence analysis. In Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques, pages 143-156, St. Louis, MO, 2005.
- (2005) Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques , pp. 143-156
- Kyriakopoulos, K.¹ Psarris, K.²

16
- 3142719600
- Perfect pipelining: A new loop parallelization technique
- 87-873, Dept. of Computer Science, Cornell University
- A. Aiken and A. Nicolau. Perfect pipelining: A new loop parallelization technique. Technical/Report 87-873, Dept. of Computer Science, Cornell University, 1987.
- (1987) Technical/Report
- Aiken, A.¹ Nicolau, A.²

17
- 0033361788
- In search of speculative thread-level parallelism
- Newport Beach, CA, October
- J. T. Oplinger, D. L. Heine, and M. S. Lain. In search of speculative thread-level parallelism. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, pages 303-313, Newport Beach, CA, October 1999.
- (1999) Proceedings of the International Conference on Parallel Architectures and Compilation Techniques , pp. 303-313
- Oplinger, J.T.¹ Heine, D.L.² Lain, M.S.³

18
- 0033902726
- A quantitative assessment of threadlevel speculation techniques
- Cancun, Mexico, May
- P. Marcuello and A. Gonzalez. A quantitative assessment of threadlevel speculation techniques. In Proceedings of the 14th International Parallel and Distributed Processing Symposium, pages 595-604, Cancun, Mexico, May 2000.
- (2000) Proceedings of the 14th International Parallel and Distributed Processing Symposium , pp. 595-604
- Marcuello, P.¹ Gonzalez, A.²

19
- 32844465384
- Tasking with out-of-order spawn in tls chip multiprocessors: Microarchitecture and compilation
- Cambridge, MA
- J. Renau, J. Tuck, W. Liu, L. Ceze, K. Strauss, and J. Torrellas. Tasking with out-of-order spawn in tls chip multiprocessors: Microarchitecture and compilation. In Proceedings of the 19th ACM International Conference on Supercomputing, pages 179-188, Cambridge, MA, 2005.
- (2005) Proceedings of the 19th ACM International Conference on Supercomputing , pp. 179-188
- Renau, J.¹ Tuck, J.² Liu, W.³ Ceze, L.⁴ Strauss, K.⁵ Torrellas, J.⁶

20
- 0024664199
- Run-time disambiguation: Coping with statically unpredictable dependencies
- A. Nicolau. Run-time disambiguation: coping with statically unpredictable dependencies. IEEE Transactions on Computers, 38(5):633-678, 1989.
- (1989) IEEE Transactions on Computers , vol.38 , Issue.5 , pp. 633-678
- Nicolau, A.¹

21
- 0142149618
- Second Edition. Intel Press
- R. Gerber, A. J. C. Bik, K. B. Smith, and X. Tian. The Software Optimization Cookbook, Second Edition. Intel Press, 2006.
- (2006) The Software Optimization Cookbook
- Gerber, R.¹ Bik, A.J.C.² Smith, K.B.³ Tian, X.⁴

22
- 0031628367
- Optimizing direct-threaded code by selective inlining
- I. Piumarta and F. Riccardi. Optimizing direct-threaded code by selective inlining. Proceedings of the SIGPLAN '98 Conference on Programming Language Design and Implementation, pages 291-300, 1998.
- (1998) Proceedings of the SIGPLAN '98 Conference on Programming Language Design and Implementation , pp. 291-300
- Piumarta, I.¹ Riccardi, F.²

23
- 0038039851
- A compiler framework for speculative analysis and optimizations
- San Diego, CA
- J. Lin, T. Chen, W.-C. Hsu, P.-C. Yew, R. D.-C. Ju, T.-F. Ngai, and S. Chan. A compiler framework for speculative analysis and optimizations. In Proceedings of the SIGPLAN '03 Conference on Programming Language Design and Implementation, pages 289-299, San Diego, CA, 2003.
- (2003) Proceedings of the SIGPLAN '03 Conference on Programming Language Design and Implementation , pp. 289-299
- Lin, J.¹ Chen, T.² Hsu, W.-C.³ Yew, P.-C.⁴ Ju, R.D.-C.⁵ Ngai, T.-F.⁶ Chan, S.⁷

24
- 33646849245
- A general compiler framework for speculative optimizations using data speculative code motion
- San Jose, CA
- X. Dai, A. Zhai, W.-C. Hsu, and P.-C. Yew. A general compiler framework for speculative optimizations using data speculative code motion. In Proceedings of the International. Symposium, on Code Generation and Optimization, pages 280-290, San Jose, CA, 2005.
- (2005) Proceedings of the International. Symposium, on Code Generation and Optimization , pp. 280-290
- Dai, X.¹ Zhai, A.² Hsu, W.-C.³ Yew, P.-C.⁴

25
- 84949806622
- Thread spawning schemes for speculative multithreading
- Boston, MA, February
- P. Marcuello and A. Gonzalez. Thread spawning schemes for speculative multithreading. In Proceedings of the Eighth International Symposium on High-Performance Computer Architecture, pages 55-64, Boston, MA, February 2002.
- (2002) Proceedings of the Eighth International Symposium on High-Performance Computer Architecture , pp. 55-64
- Marcuello, P.¹ Gonzalez, A.²

26
- 85067328576
- Intel®VTune™Performance Analyzer 8.0 for Windows, http: //www.Intel.com/cd/software/products/asmo-na/eng/ vtune/219898.htm.
- Intel®VTune™Performance Analyzer 8.0 for Windows

27
- 1542698782
- February
- U. Drepper. The Native POSIX Thread Library for Linux, http://people.redhat.com/drepper/nptl-design.pdf, February 2005.
- (2005) The Native POSIX Thread Library for Linux
- Drepper, U.¹

28
- 85067326988
- EPCC OpenMP Microbenchinarks. http://www.epcc.ed.ac.uk/ research/openmpbench/openmp.lndex.html.
- OpenMP Microbenchinarks, E.P.C.C.¹

29
- 0022150790
- Allocating independent subtasks on parallel processors
- C. P. Kruskal and A. Weiss. Allocating independent subtasks on parallel processors. IEEE Transactions on Software Engineering, 11(10):1001-1016, 1985.
- (1985) IEEE Transactions on Software Engineering , vol.11 , Issue.10 , pp. 1001-1016
- Kruskal, C.P.¹ Weiss, A.²

30
- 0004230378
- Kluwer Academic Publishers, Boston, MA
- U. Banerjee. Dependence Analysis. Kluwer Academic Publishers, Boston, MA, 1997.
- (1997) Dependence Analysis
- Banerjee, U.¹

31
- 0028016652
- Redundant synchronization elimination, for DOACROSS loops
- Cancun, Mexico
- D.-K. Chen and P.-C. Yew. Redundant synchronization elimination, for DOACROSS loops. In Proceedings of the Eighth International Parallel Processing Symposium, pages 477-481, Cancun, Mexico, 1994.
- (1994) Proceedings of the Eighth International Parallel Processing Symposium , pp. 477-481
- Chen, D.-K.¹ Yew, P.-C.²

32
- 0029205026
- Automatic synchronization elimination in synchronous FORALLs
- McLean, VA, February
- M. Philippsen and E. Heinz. Automatic synchronization elimination in synchronous FORALLs. In Frontiers '95: The 5th Symposium on the Frontiers of Massively Parallel Computation, McLean, VA, February 1995.
- (1995) Frontiers '95: The 5th Symposium on the Frontiers of Massively Parallel Computation
- Philippsen, M.¹ Heinz, E.²

33
- 0003927035
- Addison-Wesley, Redwood City, CA
- M. J. Wolfe. High Performance Compilers for Parallel Computing. Addison-Wesley, Redwood City, CA, 1996.
- (1996) High Performance Compilers for Parallel Computing
- Wolfe, M.J.¹

34
- 1142268815
- Recycling waste: Exploiting wrong-path execution to improve branch prediction
- San Francisco, CA
- H. Akkary, S. T. Srinivasan, and K. Lai. Recycling waste: exploiting wrong-path execution to improve branch prediction. In Proceedings of the 17th ACM International Conference on Supercomputing, pages 12-21, San Francisco, CA, 2003.
- (2003) Proceedings of the 17th ACM International Conference on Supercomputing , pp. 12-21
- Akkary, H.¹ Srinivasan, S.T.² Lai, K.³

35
- 84976861423
- Branch prediction for free
- Albuquerque, NM, June
- T. Ball and J. Laras. Branch prediction for free. In Proceedings of the SIGPLAN '93 Conference on Programming Language Design and Implementation, pages 300-313, Albuquerque, NM, June 1993.
- (1993) Proceedings of the SIGPLAN '93 Conference on Programming Language Design and Implementation , pp. 300-313
- Ball, T.¹ Laras, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.