-
1
-
-
33646421297
-
The Fortress language specification version 1.0beta
-
Technical report, Sun Microsystems, 2007
-
E. Allen, D. Chase, J. Hallet, V. Luchangco, J.-W. Maessen, S. Ryu, G. L. Steele, and S. Tobin-Hochstadt. The Fortress language specification version 1.0beta. Technical report, Sun Microsystems, 2007.
-
-
-
Allen, E.1
Chase, D.2
Hallet, J.3
Luchangco, V.4
Maessen, J.-W.5
Ryu, S.6
Steele, G.L.7
Tobin-Hochstadt, S.8
-
2
-
-
27544493676
-
Mitigating Amdahl's law through EPI throttling
-
M. Annavram, E. Grochowski, and J. Shen. Mitigating Amdahl's law through EPI throttling. In Proc. of ISCA, pages 298-309, 2005.
-
(2005)
Proc. of ISCA
, pp. 298-309
-
-
Annavram, M.1
Grochowski, E.2
Shen, J.3
-
4
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: stream computing on graphics hardware. ACM Trans. Graph., 23(3):777-786, 2004.
-
(2004)
ACM Trans. Graph
, vol.23
, Issue.3
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
5
-
-
0029452111
-
A metaobject protocol for C++
-
S. Chiba. A metaobject protocol for C++. In Proc. of OOPSLA, pages 285-299, 1995.
-
(1995)
Proc. of OOPSLA
, pp. 285-299
-
-
Chiba, S.1
-
6
-
-
85030321143
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In Proc. of OSDI, pages 137-149, 2004.
-
(2004)
Proc. of OSDI
, pp. 137-149
-
-
Dean, J.1
Ghemawat, S.2
-
7
-
-
33749064644
-
Recognition, mining and synthesis moves computers to the era of tera
-
P. Dubey. Recognition, mining and synthesis moves computers to the era of tera. Technology@Intel Magazine, 2005.
-
(2005)
Technology@Intel Magazine
-
-
Dubey, P.1
-
9
-
-
34548207355
-
Sequioa: Programming the memory hierarchy
-
K. Fatahalian, T. J. Knight, M. Houston, M. Erez, D. R. Horn, L. Leem, J. Y. Park, M. Ren, A. Aiken, W. J. Dally, and P. Hanrahan. Sequioa: Programming the memory hierarchy. In Proc. of ACM/IEEE Conf. on Supercomputing, page 83, 2006.
-
(2006)
Proc. of ACM/IEEE Conf. on Supercomputing
, pp. 83
-
-
Fatahalian, K.1
Knight, T.J.2
Houston, M.3
Erez, M.4
Horn, D.R.5
Leem, L.6
Park, J.Y.7
Ren, M.8
Aiken, A.9
Dally, W.J.10
Hanrahan, P.11
-
10
-
-
17644370078
-
Best of both latency and throughput
-
E. Grochowski, R. Ronen, J. Shen, and H. Wang. Best of both latency and throughput. In Proc. of ICCD, pages 236-243, 2004.
-
(2004)
Proc. of ICCD
, pp. 236-243
-
-
Grochowski, E.1
Ronen, R.2
Shen, J.3
Wang, H.4
-
12
-
-
33845884291
-
Multiple instruction stream processor
-
R. Hankins, G. Chinya, J. D. Collins, P. Wang, R. Rakvic, H. Wang, and J. Shen. Multiple instruction stream processor. In Proc. of ISCA, pages 114-127, 2006.
-
(2006)
Proc. of ISCA
, pp. 114-127
-
-
Hankins, R.1
Chinya, G.2
Collins, J.D.3
Wang, P.4
Rakvic, R.5
Wang, H.6
Shen, J.7
-
13
-
-
67650022403
-
-
Intel. Intel C++ compiler. http://www3.intel.com/cd/software/products/ asmona/eng/compilers/284132.htm.
-
Intel. Intel C++ compiler
-
-
-
15
-
-
0041562664
-
Programmable stream processors
-
U. Kapasi, S. Rixner, W. J. Dally, B. Khailany, J. H. Ahn, P. Mattson, and J. Owens. Programmable stream processors. IEEE Computer, 36(8):54-62, 2003. 295
-
(2003)
IEEE Computer
, vol.36
, Issue.8
-
-
Kapasi, U.1
Rixner, S.2
Dally, W.J.3
Khailany, B.4
Ahn, J.H.5
Mattson, P.6
Owens, J.7
-
16
-
-
20744444866
-
Telescoping languages: A system for automatic generation of domain languages
-
K. Kennedy, B. Broom, A. Chauhan, R. Fowler, J. Garvin, C. Koelbel, C. McCosh, and J. Mellor-Crummey. Telescoping languages: A system for automatic generation of domain languages. Proc. of the IEEE, 93:378-408, 2005.
-
(2005)
Proc. of the IEEE
, vol.93
, pp. 378-408
-
-
Kennedy, K.1
Broom, B.2
Chauhan, A.3
Fowler, R.4
Garvin, J.5
Koelbel, C.6
McCosh, C.7
Mellor-Crummey, J.8
-
18
-
-
67650073715
-
-
P. Lyman and H. R. Varian. How much information. http://www.sims. berkeley.edu/how-much-info-2003, 2003.
-
P. Lyman and H. R. Varian. How much information. http://www.sims. berkeley.edu/how-much-info-2003, 2003.
-
-
-
-
19
-
-
33646031235
-
Cg: A system for programming graphics hardware in a C-like language
-
896-907
-
W. Mark, R. Glanville, K. Akeley, and M. Kilgard. Cg: A system for programming graphics hardware in a C-like language. ACM Trans. Graph., 22(3):896-907, 2003.
-
(2003)
ACM Trans. Graph
, vol.22
, Issue.3
-
-
Mark, W.1
Glanville, R.2
Akeley, K.3
Kilgard, M.4
-
21
-
-
34548240213
-
Performance evaluation of GPUs using the RapidMind development platform
-
M. McCool, K. Wadleigh, B. Henderson, and H. Y. Lin. Performance evaluation of GPUs using the RapidMind development platform. In Proc. of ACM/IEEE Conf. on Supercomputing, page 81, 2006.
-
(2006)
Proc. of ACM/IEEE Conf. on Supercomputing
, pp. 81
-
-
McCool, M.1
Wadleigh, K.2
Henderson, B.3
Lin, H.Y.4
-
22
-
-
10944222433
-
Practical predicate dispatch
-
T. Millstein. Practical predicate dispatch. In Proc. of OOPSLA, pages 345-264, 2004.
-
(2004)
Proc. of OOPSLA
, pp. 345-264
-
-
Millstein, T.1
-
23
-
-
0003496079
-
A library based compiler to execute MATLAB programs on a heterogeneous platform
-
A. Nayak, M. Haldar, A. Kanhere, P. Joisha, N. Shenoy, A. Choudhary, and P. Banerjee. A library based compiler to execute MATLAB programs on a heterogeneous platform. In Proc. of Conf. on Parallel and Distributed Computing Systems, 2000.
-
(2000)
Proc. of Conf. on Parallel and Distributed Computing Systems
-
-
Nayak, A.1
Haldar, M.2
Kanhere, A.3
Joisha, P.4
Shenoy, N.5
Choudhary, A.6
Banerjee, P.7
-
24
-
-
84870629709
-
-
NVidia. Cuda. http://developer.nvidia.com/object/cuda.html.
-
NVidia. Cuda
-
-
-
25
-
-
42549089403
-
The PeakStream platform: High productivity software development for multi-core processors
-
Technical report, PeakStream Inc
-
Peakstream. The PeakStream platform: High productivity software development for multi-core processors. Technical report, PeakStream Inc., 2006.
-
(2006)
-
-
Peakstream1
-
26
-
-
27344435504
-
The design and implementation of a first-generation CELL processor
-
D. Pham, S. Asano, M. Bolliger, M. N. Day, H. P. Hofstee, C. Johns, J. Kahle, A. Kameyama, J. Keaty, Y. Masubuchi, M. Riley, D. Shippy, D. Stasiak, M. Suzuoki, M. Wang, J. Warnock, S. Weitzel, D. Wendel, T. Yamazaki, and K. Yazawa. The design and implementation of a first-generation CELL processor. In Proc. of ISSCC, pages 184-185, 2005.
-
(2005)
Proc. of ISSCC
, pp. 184-185
-
-
Pham, D.1
Asano, S.2
Bolliger, M.3
Day, M.N.4
Hofstee, H.P.5
Johns, C.6
Kahle, J.7
Kameyama, A.8
Keaty, J.9
Masubuchi, Y.10
Riley, M.11
Shippy, D.12
Stasiak, D.13
Suzuoki, M.14
Wang, M.15
Warnock, J.16
Weitzel, S.17
Wendel, D.18
Yamazaki, T.19
Yazawa, K.20
more..
-
27
-
-
34547679939
-
Evaluating mapreduce for multi-core and multiprocessor systems
-
C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis. Evaluating mapreduce for multi-core and multiprocessor systems. In Proc. of HPCA, pages 13-24, 2007.
-
(2007)
Proc. of HPCA
, pp. 13-24
-
-
Ranger, C.1
Raghuraman, R.2
Penmetsa, A.3
Bradski, G.4
Kozyrakis, C.5
-
28
-
-
41649093013
-
A performance-oriented data parallel virtual machines for GPUs
-
Technical report, ATI Technologies
-
M. Segal and M. Peercy. A performance-oriented data parallel virtual machines for GPUs. Technical report, ATI Technologies, 2006.
-
(2006)
-
-
Segal, M.1
Peercy, M.2
-
29
-
-
33947595619
-
Accelerator: Using data parallelism to program GPUs for general-purpose uses
-
D. Tarditi, S. Puri, and J. Oglesby. Accelerator: Using data parallelism to program GPUs for general-purpose uses. In Proc. of ASPLOS, pages 325-335, 2006.
-
(2006)
Proc. of ASPLOS
, pp. 325-335
-
-
Tarditi, D.1
Puri, S.2
Oglesby, J.3
-
31
-
-
35448978324
-
EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system
-
P. H. Wang, J. D. Collins, G. N. Chinya, H. Jiang, X. Tian, M. Girkar, N. Y. Yang, G.-Y. Lueh, and H. Wang. EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system. In Proc. of PLDI, pages 156-166, 2007.
-
(2007)
Proc. of PLDI
, pp. 156-166
-
-
Wang, P.H.1
Collins, J.D.2
Chinya, G.N.3
Jiang, H.4
Tian, X.5
Girkar, M.6
Yang, N.Y.7
Lueh, G.-Y.8
Wang, H.9
-
32
-
-
26444465132
-
Power feasibility of implantable digital spike sorting circuits for neural prosthetic systems
-
Z. S. Zumsteg, C. Kemere, S. O'Driscoll, G. Santhanam, R. E. Ahmed, K. V. Shenoy, and T. H. Meng. Power feasibility of implantable digital spike sorting circuits for neural prosthetic systems. IEEE Trans Neural Syst Rehabil Eng, 13(3):272-279, 2005.
-
(2005)
IEEE Trans Neural Syst Rehabil Eng
, vol.13
, Issue.3
, pp. 272-279
-
-
Zumsteg, Z.S.1
Kemere, C.2
O'Driscoll, S.3
Santhanam, G.4
Ahmed, R.E.5
Shenoy, K.V.6
Meng, T.H.7
|