-
1
-
-
0041633858
-
Parameter variations and impact on circuits and microarchitecture
-
New York, NY, USA: ACM
-
B. Shekhar, K. Tanay, N. Siva, T. Jim, K. Ali, and D. Vivek, "Parameter variations and impact on circuits and microarchitecture," in DAC '03: Proceedings of the 40th conference on Design automation. New York, NY, USA: ACM, 2003, pp. 338-342.
-
(2003)
DAC '03: Proceedings of the 40th Conference on Design Automation
, pp. 338-342
-
-
Shekhar, B.1
Tanay, K.2
Siva, N.3
Jim, T.4
Ali, K.5
Vivek, D.6
-
2
-
-
42149160020
-
Nvidia cuda software and gpu parallel computing architecture
-
New York, NY, USA: ACM
-
D. Kirk, "Nvidia cuda software and gpu parallel computing architecture," in ISMM '07: Proceedings of the 6th international symposium on Memory management. New York, NY, USA: ACM, 2007, pp. 103-104.
-
(2007)
ISMM '07: Proceedings of the 6th International Symposium on Memory Management
, pp. 103-104
-
-
Kirk, D.1
-
3
-
-
66749136924
-
From soda to scotch: The evolution of a wireless baseband processor
-
Washington, DC, USA: IEEE Computer Society
-
W. Mark, L. Yuan, S. Sangwon, M. Scott, M. Trevor, C. Chaitali, B. Richard, K. Danny, R. Alastair, W. Mladen, and F. Krisztian, "From soda to scotch: The evolution of a wireless baseband processor," in MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture. Washington, DC, USA: IEEE Computer Society, 2008, pp. 152-163.
-
(2008)
MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture
, pp. 152-163
-
-
Mark, W.1
Yuan, L.2
Sangwon, S.3
Scott, M.4
Trevor, M.5
Chaitali, C.6
Richard, B.7
Danny, K.8
Alastair, R.9
Mladen, W.10
Krisztian, F.11
-
4
-
-
0033722250
-
An fpga implementation and performance evaluation of the serpent block cipher
-
New York, NY, USA: ACM
-
A. Elbirt and C. Paar, "An fpga implementation and performance evaluation of the serpent block cipher," in FPGA '00: Proceedings of the 2000 ACM/SIGDA eighth international symposium on Field programmable gate arrays. New York, NY, USA: ACM, 2000, pp. 33-40.
-
(2000)
FPGA '00: Proceedings of the 2000 ACM/SIGDA Eighth International Symposium on Field Programmable Gate Arrays
, pp. 33-40
-
-
Elbirt, A.1
Paar, C.2
-
5
-
-
0013398077
-
-
Ph.D. dissertation, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, May
-
K. H. Randall, "Cilk: Efficient multithreaded computing," Ph.D. dissertation, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, May 1998.
-
(1998)
Cilk: Efficient Multithreaded Computing
-
-
Randall, K.H.1
-
6
-
-
85027692154
-
-
Champaign, IL, USA, Tech. Rep.
-
L. V. Kale and S. Krishnan, "Charm++: A portable concurrent object oriented system based on c++," Champaign, IL, USA, Tech. Rep., 1993.
-
(1993)
Charm++: A Portable Concurrent Object Oriented System Based on C++
-
-
Kale, L.V.1
Krishnan, S.2
-
8
-
-
84959045524
-
Streamit: A language for streaming applications
-
London, UK: SpringerVerlag
-
W. Thies, M. Karczmarek, and S. P. Amarasinghe, "Streamit: A language for streaming applications," in CC '02: Proceedings of the 11th International Conference on Compiler Construction. London, UK: SpringerVerlag, 2002, pp. 179-196.
-
(2002)
CC '02: Proceedings of the 11th International Conference on Compiler Construction
, pp. 179-196
-
-
Thies, W.1
Karczmarek, M.2
Amarasinghe, S.P.3
-
9
-
-
70350656487
-
-
AMD One AMD Place, Sunnyvale CA, 94088, Tech. Rep. [Online]
-
AMD, "Ati stream computing - technical overview," One AMD Place, Sunnyvale CA, 94088, Tech. Rep. [Online]. Available: http://developer. amd.com/gpu-assets/Stream-Computing-Overview.pdf
-
Ati Stream Computing - Technical Overview
-
-
-
10
-
-
67650694407
-
-
NVIDIA, 2nd ed., NVIDIA Corporation, Santa Clara, California, October
-
NVIDIA, NVIDIA CUDA Compute Unified Device Architecture, 2nd ed., NVIDIA Corporation, Santa Clara, California, October 2008.
-
(2008)
NVIDIA CUDA Compute Unified Device Architecture
-
-
-
11
-
-
70349100958
-
-
December. [Online]
-
K. O. W. Group, The OpenCL Specification, December 2008. [Online]. Available: http://www.khronos.Org/registry/cl/specs/opencl-1.0.29.pdf
-
(2008)
The OpenCL Specification
-
-
Group, K.O.W.1
-
12
-
-
77957759721
-
Merge: A programming model for heterogeneous multi-core systems
-
New York, NY, USA: ACM
-
M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng, "Merge: a programming model for heterogeneous multi-core systems," in ASPLOS XIII: Proceedings of the 13th international conference on Architectural support for programming languages and operating systems. New York, NY, USA: ACM, 2008, pp. 287-296.
-
(2008)
ASPLOS XIII: Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 287-296
-
-
Linderman, M.D.1
Collins, J.D.2
Wang, H.3
Meng, T.H.4
-
13
-
-
57349153933
-
Harmony: An execution model and runtime for heterogeneous many core systems
-
Boston, Massachusetts, USA: ACM, june
-
G. Diamos and S. Yalamanchili, "Harmony: An execution model and runtime for heterogeneous many core systems," in HPDC'08. Boston, Massachusetts, USA: ACM, june 2008.
-
(2008)
HPDC'08
-
-
Diamos, G.1
Yalamanchili, S.2
-
14
-
-
76749140917
-
Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
-
New York, USA: IEEE, devember
-
C. Luk, S. Hong, and H. Kim, "Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping," in MICRO'09. New York, USA: IEEE, devember 2009.
-
(2009)
MICRO'09
-
-
Luk, C.1
Hong, S.2
Kim, H.3
-
15
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
K. Fatahalian, T. J. Knight, M. Houston, M. Erez, D. R. Horn, L. Leem, J. Y. Park, M. Ren, A. Aiken, W. J. Dally, and P. Hanrahan, "Sequoia: Programming the memory hierarchy," in Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 2006.
-
(2006)
Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
-
-
Fatahalian, K.1
Knight, T.J.2
Houston, M.3
Erez, M.4
Horn, D.R.5
Leem, L.6
Park, J.Y.7
Ren, M.8
Aiken, A.9
Dally, W.J.10
Hanrahan, P.11
-
16
-
-
77954019175
-
Program demultiplexing: Data-flow based speculative parallelization of methods in sequential programs
-
S. Balakrishnan and G. S. Sohi, "Program demultiplexing: Data-flow based speculative parallelization of methods in sequential programs," SIGARCH Comput. Archit. News, vol.34, no.2, pp. 302-313, 2006.
-
(2006)
SIGARCH Comput. Archit. News
, vol.34
, Issue.2
, pp. 302-313
-
-
Balakrishnan, S.1
Sohi, G.S.2
-
17
-
-
66749164066
-
Copy or discard execution model for speculative parallelization on multicores
-
Washington, DC, USA: IEEE Computer Society
-
C. Tian, M. Feng, Nagarajan, Vijay, and R. Gupta, "Copy or discard execution model for speculative parallelization on multicores," in MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture. Washington, DC, USA: IEEE Computer Society, 2008, pp. 330-341.
-
(2008)
MICRO '08: Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture
, pp. 330-341
-
-
Tian, C.1
Feng, M.2
Nagarajan3
Vijay4
Gupta, R.5
-
18
-
-
0031605470
-
Data speculation support for a chip multiprocessor
-
New York, NY, USA: ACM
-
L. Hammond, M. Willey, and K. Olukotun, "Data speculation support for a chip multiprocessor," in ASPLOS-VIII: Proceedings of the eighth international conference on Architectural support for programming languages and operating systems. New York, NY, USA: ACM, 1998, pp. 58-69.
-
(1998)
ASPLOS-VIII: Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 58-69
-
-
Hammond, L.1
Willey, M.2
Olukotun, K.3
-
19
-
-
70649102016
-
-
NVIDIA, 1st ed., NVIDIA Corporation, Santa Clara, California, October
-
NVIDIA, NVIDIA Compute PTX: Parallel Thread Execution, 1st ed., NVIDIA Corporation, Santa Clara, California, October 2008.
-
(2008)
NVIDIA Compute PTX: Parallel Thread Execution
-
-
-
20
-
-
67650692011
-
-
[Online]
-
IMPACT, "The parboil benchmark suite," 2007. [Online]. Available: http://www.crhc.uiuc.edu/IMPACT/parboil.php
-
(2007)
The Parboil Benchmark Suite
-
-
-
21
-
-
70649104826
-
A characterization and analysis of ptx kernels
-
Austin, TX, USA, October
-
A. Kerr, G. Diamos, and S. Yalamanchili, "A characterization and analysis of ptx kernels," in IISWC09: IEEE International Symposium on Workload Characterization, Austin, TX, USA, October 2009.
-
(2009)
IISWC09: IEEE International Symposium on Workload Characterization
-
-
Kerr, A.1
Diamos, G.2
Yalamanchili, S.3
-
22
-
-
0030645118
-
Trading conflict and capacity aliasing in conditional branch predictors
-
New York, NY, USA: ACM
-
M. Pierre, S. Andre, and U. Richard, "Trading conflict and capacity aliasing in conditional branch predictors," in ISCA '97: Proceedings of the 24th annual international symposium on Computer architecture. New York, NY, USA: ACM, 1997, pp. 292-303.
-
(1997)
ISCA '97: Proceedings of the 24th Annual International Symposium on Computer Architecture
, pp. 292-303
-
-
Pierre, M.1
Andre, S.2
Richard, U.3
-
23
-
-
70350771131
-
Benchmarking gpus to tune dense linear algebra
-
Piscataway, NJ, USA: IEEE Press
-
V. Volkov and J. W. Demmel, "Benchmarking gpus to tune dense linear algebra," in SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing. Piscataway, NJ, USA: IEEE Press, 2008, pp. 1-11.
-
(2008)
SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing
, pp. 1-11
-
-
Volkov, V.1
Demmel, J.W.2
-
24
-
-
68949216895
-
Practical symmetric key cryptography on modern graphics hardware
-
Berkeley, CA, USA: USENIX Association
-
O. Harrison and J. Waldron, "Practical symmetric key cryptography on modern graphics hardware," in SS'08: Proceedings of the 17th conference on Security symposium. Berkeley, CA, USA: USENIX Association, 2008, pp. 195-209.
-
(2008)
SS'08: Proceedings of the 17th Conference on Security Symposium
, pp. 195-209
-
-
Harrison, O.1
Waldron, J.2
-
25
-
-
56449089553
-
Characterizing and improving the performance of the intel threading building blocks runtime system
-
September. [Online]
-
G. Contreras and M. Martonosi, "Characterizing and improving the performance of the intel threading building blocks runtime system," in International Symposium on Workload Characterization (IISWC 2008), September 2008. [Online]. Available: http://www.gigascale.org/pubs/1350.html
-
(2008)
International Symposium on Workload Characterization (IISWC 2008
-
-
Contreras, G.1
Martonosi, M.2
-
26
-
-
0033689702
-
Architectural support for scalable speculative parallelization in shared-memory multiprocessors
-
M. Cintra, J. F. Martínez, and J. Torrellas, "Architectural support for scalable speculative parallelization in shared-memory multiprocessors," SIGARCH Comput. Archit. News, vol.28, no.2, pp. 13-24, 2000.
-
(2000)
SIGARCH Comput. Archit. News
, vol.28
, Issue.2
, pp. 13-24
-
-
Cintra, M.1
Martínez, J.F.2
Torrellas, J.3
-
27
-
-
0036957879
-
A general compiler framework for speculative multithreading
-
New York, NY, USA: ACM
-
B. Anasua and F. Manoj, "A general compiler framework for speculative multithreading," in SPaAA '02: Proceedings of the fourteenth annual ACM symposium on Parallel algorithms and architectures. New York, NY, USA: ACM, 2002, pp. 99-108.
-
(2002)
SPaAA '02: Proceedings of the Fourteenth Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 99-108
-
-
Anasua, B.1
Manoj, F.2
|