SCOPUS 정보 검색 플랫폼

Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010

Volumn , Issue , 2010, Pages

Speculative execution on multi-GPU systems

(2) Diamos, Gregory a Yalamanchili, Sudhakar a

a GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL CAPABILITY; DESIGN DECISIONS; EXECUTION MODEL; FUTURE GENERATIONS; HETEROGENEOUS SYSTEMS; MANY-CORE; MICRO ARCHITECTURES; PARALLEL PROGRAMMING MODEL; PARALLELIZATIONS; PERFORMANCE CHARACTERISTICS; PROGRAMMING MODELS; RUNTIMES; SEQUENTIAL PROGRAMMING; SPECULATIVE EXECUTION; TARGET SYSTEMS;

ACCELERATION; COMPUTER ARCHITECTURE; DISTRIBUTED PARAMETER NETWORKS; PARALLEL PROGRAMMING; PROGRAM PROCESSORS;

COMPUTER SYSTEMS PROGRAMMING;

EID: 77954021029 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2010.5470427 Document Type: Conference Paper

Times cited : (15)

References (28)

1
- 0041633858
- Parameter variations and impact on circuits and microarchitecture
- New York, NY, USA: ACM
- B. Shekhar, K. Tanay, N. Siva, T. Jim, K. Ali, and D. Vivek, "Parameter variations and impact on circuits and microarchitecture," in DAC '03: Proceedings of the 40th conference on Design automation. New York, NY, USA: ACM, 2003, pp. 338-342.
- (2003) DAC '03: Proceedings of the 40th Conference on Design Automation , pp. 338-342
- Shekhar, B.¹ Tanay, K.² Siva, N.³ Jim, T.⁴ Ali, K.⁵ Vivek, D.⁶

2
- 42149160020
- Nvidia cuda software and gpu parallel computing architecture
- New York, NY, USA: ACM
- D. Kirk, "Nvidia cuda software and gpu parallel computing architecture," in ISMM '07: Proceedings of the 6th international symposium on Memory management. New York, NY, USA: ACM, 2007, pp. 103-104.
- (2007) ISMM '07: Proceedings of the 6th International Symposium on Memory Management , pp. 103-104
- Kirk, D.¹

3
- 66749136924
- From soda to scotch: The evolution of a wireless baseband processor
- Washington, DC, USA: IEEE Computer Society
- W. Mark, L. Yuan, S. Sangwon, M. Scott, M. Trevor, C. Chaitali, B. Richard, K. Danny, R. Alastair, W. Mladen, and F. Krisztian, "From soda to scotch: The evolution of a wireless baseband processor," in MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture. Washington, DC, USA: IEEE Computer Society, 2008, pp. 152-163.
- (2008) MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture , pp. 152-163
- Mark, W.¹ Yuan, L.² Sangwon, S.³ Scott, M.⁴ Trevor, M.⁵ Chaitali, C.⁶ Richard, B.⁷ Danny, K.⁸ Alastair, R.⁹ Mladen, W.¹⁰ Krisztian, F.¹¹

4
- 0033722250
- An fpga implementation and performance evaluation of the serpent block cipher
- New York, NY, USA: ACM
- A. Elbirt and C. Paar, "An fpga implementation and performance evaluation of the serpent block cipher," in FPGA '00: Proceedings of the 2000 ACM/SIGDA eighth international symposium on Field programmable gate arrays. New York, NY, USA: ACM, 2000, pp. 33-40.
- (2000) FPGA '00: Proceedings of the 2000 ACM/SIGDA Eighth International Symposium on Field Programmable Gate Arrays , pp. 33-40
- Elbirt, A.¹ Paar, C.²

5
- 0013398077
- Ph.D. dissertation, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, May
- K. H. Randall, "Cilk: Efficient multithreaded computing," Ph.D. dissertation, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, May 1998.
- (1998) Cilk: Efficient Multithreaded Computing
- Randall, K.H.¹

6
- 85027692154
- Champaign, IL, USA, Tech. Rep.
- L. V. Kale and S. Krishnan, "Charm++: A portable concurrent object oriented system based on c++," Champaign, IL, USA, Tech. Rep., 1993.
- (1993) Charm++: A Portable Concurrent Object Oriented System Based on C++
- Kale, L.V.¹ Krishnan, S.²

7
- 33749377408
- Stream programming on general- purpose processors
- Barcelona, Spain, November
- J. Gummaraju and M. Rosenblum, "Stream Programming on General- Purpose Processors," in MICRO 38: Proceedings of the 38th annual ACM/IEEE international symposium on Microarchitecture, Barcelona, Spain, November 2005.
- (2005) MICRO 38: Proceedings of the 38th Annual ACM/IEEE International Symposium on Microarchitecture
- Gummaraju, J.¹ Rosenblum, M.²

8
- 84959045524
- Streamit: A language for streaming applications
- London, UK: SpringerVerlag
- W. Thies, M. Karczmarek, and S. P. Amarasinghe, "Streamit: A language for streaming applications," in CC '02: Proceedings of the 11th International Conference on Compiler Construction. London, UK: SpringerVerlag, 2002, pp. 179-196.
- (2002) CC '02: Proceedings of the 11th International Conference on Compiler Construction , pp. 179-196
- Thies, W.¹ Karczmarek, M.² Amarasinghe, S.P.³

9
- 70350656487
- AMD One AMD Place, Sunnyvale CA, 94088, Tech. Rep. [Online]
- AMD, "Ati stream computing - technical overview," One AMD Place, Sunnyvale CA, 94088, Tech. Rep. [Online]. Available: http://developer. amd.com/gpu-assets/Stream-Computing-Overview.pdf
- Ati Stream Computing - Technical Overview

10
- 67650694407
- NVIDIA, 2nd ed., NVIDIA Corporation, Santa Clara, California, October
- NVIDIA, NVIDIA CUDA Compute Unified Device Architecture, 2nd ed., NVIDIA Corporation, Santa Clara, California, October 2008.
- (2008) NVIDIA CUDA Compute Unified Device Architecture

11
- 70349100958
- December. [Online]
- K. O. W. Group, The OpenCL Specification, December 2008. [Online]. Available: http://www.khronos.Org/registry/cl/specs/opencl-1.0.29.pdf
- (2008) The OpenCL Specification
- Group, K.O.W.¹

12
- 77957759721
- Merge: A programming model for heterogeneous multi-core systems
- New York, NY, USA: ACM
- M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng, "Merge: a programming model for heterogeneous multi-core systems," in ASPLOS XIII: Proceedings of the 13th international conference on Architectural support for programming languages and operating systems. New York, NY, USA: ACM, 2008, pp. 287-296.
- (2008) ASPLOS XIII: Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 287-296
- Linderman, M.D.¹ Collins, J.D.² Wang, H.³ Meng, T.H.⁴

13
- 57349153933
- Harmony: An execution model and runtime for heterogeneous many core systems
- Boston, Massachusetts, USA: ACM, june
- G. Diamos and S. Yalamanchili, "Harmony: An execution model and runtime for heterogeneous many core systems," in HPDC'08. Boston, Massachusetts, USA: ACM, june 2008.
- (2008) HPDC'08
- Diamos, G.¹ Yalamanchili, S.²

14
- 76749140917
- Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
- New York, USA: IEEE, devember
- C. Luk, S. Hong, and H. Kim, "Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping," in MICRO'09. New York, USA: IEEE, devember 2009.
- (2009) MICRO'09
- Luk, C.¹ Hong, S.² Kim, H.³

15
- 34548207355
- Sequoia: Programming the memory hierarchy
- K. Fatahalian, T. J. Knight, M. Houston, M. Erez, D. R. Horn, L. Leem, J. Y. Park, M. Ren, A. Aiken, W. J. Dally, and P. Hanrahan, "Sequoia: Programming the memory hierarchy," in Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 2006.
- (2006) Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
- Fatahalian, K.¹ Knight, T.J.² Houston, M.³ Erez, M.⁴ Horn, D.R.⁵ Leem, L.⁶ Park, J.Y.⁷ Ren, M.⁸ Aiken, A.⁹ Dally, W.J.¹⁰ Hanrahan, P.¹¹

16
- 77954019175
- Program demultiplexing: Data-flow based speculative parallelization of methods in sequential programs
- S. Balakrishnan and G. S. Sohi, "Program demultiplexing: Data-flow based speculative parallelization of methods in sequential programs," SIGARCH Comput. Archit. News, vol.34, no.2, pp. 302-313, 2006.
- (2006) SIGARCH Comput. Archit. News , vol.34 , Issue.2 , pp. 302-313
- Balakrishnan, S.¹ Sohi, G.S.²

17
- 66749164066
- Copy or discard execution model for speculative parallelization on multicores
- Washington, DC, USA: IEEE Computer Society
- C. Tian, M. Feng, Nagarajan, Vijay, and R. Gupta, "Copy or discard execution model for speculative parallelization on multicores," in MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture. Washington, DC, USA: IEEE Computer Society, 2008, pp. 330-341.
- (2008) MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture , pp. 330-341
- Tian, C.¹ Feng, M.² Nagarajan³ Vijay⁴ Gupta, R.⁵

18
- 0031605470
- Data speculation support for a chip multiprocessor
- New York, NY, USA: ACM
- L. Hammond, M. Willey, and K. Olukotun, "Data speculation support for a chip multiprocessor," in ASPLOS-VIII: Proceedings of the eighth international conference on Architectural support for programming languages and operating systems. New York, NY, USA: ACM, 1998, pp. 58-69.
- (1998) ASPLOS-VIII: Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 58-69
- Hammond, L.¹ Willey, M.² Olukotun, K.³

19
- 70649102016
- NVIDIA, 1st ed., NVIDIA Corporation, Santa Clara, California, October
- NVIDIA, NVIDIA Compute PTX: Parallel Thread Execution, 1st ed., NVIDIA Corporation, Santa Clara, California, October 2008.
- (2008) NVIDIA Compute PTX: Parallel Thread Execution

20
- 67650692011
- [Online]
- IMPACT, "The parboil benchmark suite," 2007. [Online]. Available: http://www.crhc.uiuc.edu/IMPACT/parboil.php
- (2007) The Parboil Benchmark Suite

21
- 70649104826
- A characterization and analysis of ptx kernels
- Austin, TX, USA, October
- A. Kerr, G. Diamos, and S. Yalamanchili, "A characterization and analysis of ptx kernels," in IISWC09: IEEE International Symposium on Workload Characterization, Austin, TX, USA, October 2009.
- (2009) IISWC09: IEEE International Symposium on Workload Characterization
- Kerr, A.¹ Diamos, G.² Yalamanchili, S.³

22
- 0030645118
- Trading conflict and capacity aliasing in conditional branch predictors
- New York, NY, USA: ACM
- M. Pierre, S. Andre, and U. Richard, "Trading conflict and capacity aliasing in conditional branch predictors," in ISCA '97: Proceedings of the 24th annual international symposium on Computer architecture. New York, NY, USA: ACM, 1997, pp. 292-303.
- (1997) ISCA '97: Proceedings of the 24th Annual International Symposium on Computer Architecture , pp. 292-303
- Pierre, M.¹ Andre, S.² Richard, U.³

23
- 70350771131
- Benchmarking gpus to tune dense linear algebra
- Piscataway, NJ, USA: IEEE Press
- V. Volkov and J. W. Demmel, "Benchmarking gpus to tune dense linear algebra," in SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing. Piscataway, NJ, USA: IEEE Press, 2008, pp. 1-11.
- (2008) SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing , pp. 1-11
- Volkov, V.¹ Demmel, J.W.²

24
- 68949216895
- Practical symmetric key cryptography on modern graphics hardware
- Berkeley, CA, USA: USENIX Association
- O. Harrison and J. Waldron, "Practical symmetric key cryptography on modern graphics hardware," in SS'08: Proceedings of the 17th conference on Security symposium. Berkeley, CA, USA: USENIX Association, 2008, pp. 195-209.
- (2008) SS'08: Proceedings of the 17th Conference on Security Symposium , pp. 195-209
- Harrison, O.¹ Waldron, J.²

25
- 56449089553
- Characterizing and improving the performance of the intel threading building blocks runtime system
- September. [Online]
- G. Contreras and M. Martonosi, "Characterizing and improving the performance of the intel threading building blocks runtime system," in International Symposium on Workload Characterization (IISWC 2008), September 2008. [Online]. Available: http://www.gigascale.org/pubs/1350.html
- (2008) International Symposium on Workload Characterization (IISWC 2008
- Contreras, G.¹ Martonosi, M.²

26
- 0033689702
- Architectural support for scalable speculative parallelization in shared-memory multiprocessors
- M. Cintra, J. F. Martínez, and J. Torrellas, "Architectural support for scalable speculative parallelization in shared-memory multiprocessors," SIGARCH Comput. Archit. News, vol.28, no.2, pp. 13-24, 2000.
- (2000) SIGARCH Comput. Archit. News , vol.28 , Issue.2 , pp. 13-24
- Cintra, M.¹ Martínez, J.F.² Torrellas, J.³

27
- 0036957879
- A general compiler framework for speculative multithreading
- New York, NY, USA: ACM
- B. Anasua and F. Manoj, "A general compiler framework for speculative multithreading," in SPaAA '02: Proceedings of the fourteenth annual ACM symposium on Parallel algorithms and architectures. New York, NY, USA: ACM, 2002, pp. 99-108.
- (2002) SPaAA '02: Proceedings of the Fourteenth Annual ACM Symposium on Parallel Algorithms and Architectures , pp. 99-108
- Anasua, B.¹ Manoj, F.²

28
- 77953967887
- Extracting threads using traces for system on a chip
- A Coruna, Spain, January
- E. Petit and F. Bodin, "Extracting threads using traces for system on a chip," in 12th International Workshop on Compilers for Parallel Computers (CPC), A Coruna, Spain, January 2006.
- (2006) 12th International Workshop on Compilers for Parallel Computers (CPC)
- Petit, E.¹ Bodin, F.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.