-
3
-
-
67650530896
-
-
AMD. ATI CTM Guide. http://ati.amd.com/companyinfo/researcher/documents/ ATI-CTM-Guide.pdf.
-
ATI CTM Guide
-
-
-
6
-
-
54249087677
-
A novel asynchronous software cache implementation for the cell/be processor
-
J. Balart, M. Gonzalez, X. Martorell, E. Ayguade, Z. Sura, T. Chen, T. Zhang, K. O'brien, and K. O'Brien. A novel asynchronous software cache implementation for the cell/be processor. In LCPC '07: Proceedings of the 20th International Workshop on Languages and Compilers for Parallel Computing, October 2007.
-
LCPC '07: Proceedings of the 20th International Workshop on Languages and Compilers for Parallel Computing, October 2007
-
-
Balart, J.1
Gonzalez, M.2
Martorell, X.3
Ayguade, E.4
Sura, Z.5
Chen, T.6
Zhang, T.7
O'brien, K.8
O'Brien, K.9
-
7
-
-
0024702539
-
A Technique for Summarizing Data Access and Its Use in Parallelism Enhancing Transformations
-
New York, NY, USA, ACM
-
V. Balasundaram and K. Kennedy. A Technique for Summarizing Data Access and Its Use in Parallelism Enhancing Transformations. In PLDI '89: Proceedings of the ACM SIGPLAN 1989 Conference on Programming Language Design and Implementation, pages 41-53, New York, NY, USA, 1989. ACM.
-
(1989)
PLDI '89: Proceedings of the ACM SIGPLAN 1989 Conference on Programming Language Design and Implementation
, pp. 41-53
-
-
Balasundaram, V.1
Kennedy, K.2
-
8
-
-
63549095070
-
The PARSEC Benchmark Suite: Characterization and Architectural Implications
-
October
-
C. Bienia, S. Kumar, J. P. Singh, and K. Li. The PARSEC Benchmark Suite: Characterization and Architectural Implications. In PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, pages 72-81, October 2008.
-
(2008)
PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 72-81
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
9
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous
-
Oct
-
S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S.-H. Lee, and K. Skadron. Rodinia: A benchmark suite for heterogeneous. In IISWC '09: Proceedings of the IEEE International Symposium on Workload Characterization, pages 44-54, Oct 2009.
-
(2009)
IISWC '09: Proceedings of the IEEE International Symposium on Workload Characterization
, pp. 44-54
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Lee, S.-H.6
Skadron, K.7
-
10
-
-
57349153918
-
Orchestrating Data Transfer for the cell/B.E. Processor
-
New York, NY, USA, ACM
-
T. Chen, H. Lin, T. Zhang, K. M. O'Brien, and J. K. O'Brien. Orchestrating Data Transfer for the cell/B.E. Processor. In ICS'08: Proceedings of the 22nd annual International Conference on Supercomputing, pages 289-298, New York, NY, USA, 2008. ACM.
-
(2008)
ICS'08: Proceedings of the 22nd Annual International Conference on Supercomputing
, pp. 289-298
-
-
Chen, T.1
Lin, H.2
Zhang, T.3
O'Brien, K.M.4
O'Brien, J.K.5
-
11
-
-
33745203631
-
Communication Optimizations for Fine-Grained UPC Applications
-
Washington, DC, USA, IEEE Computer Society
-
W.-Y. Chen, C. Iancu, and K. Yelick. Communication Optimizations for Fine-Grained UPC Applications. In PACT'05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques, pages 267-278, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
PACT'05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
, pp. 267-278
-
-
Chen, W.-Y.1
Iancu, C.2
Yelick, K.3
-
12
-
-
84947585636
-
The SPMD Model: Past, Present and Future
-
January
-
F. Darema. The SPMD Model: Past, Present and Future. Lecture Notes in Computer Science, 2131(1):1-1, January 2001.
-
(2001)
Lecture Notes in Computer Science
, vol.2131
, Issue.1
, pp. 1-1
-
-
Darema, F.1
-
13
-
-
33646558229
-
Using advanced compiler technology to exploit the performance of the Cell Broadband EngineTM architecture
-
January
-
A. E. Eichenberger, J. K. O'Brien, K. M. O'Brien, P. Wu, T. Chen, P. H. Oden, D. A. Prener, J. C. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, M. K. Gschwind, R. Archambault, Y. Gao, and R. Koo. Using advanced compiler technology to exploit the performance of the Cell Broadband EngineTM architecture. IBM Systems Journal, 45(1):59-84, January 2006.
-
(2006)
IBM Systems Journal
, vol.45
, Issue.1
, pp. 59-84
-
-
Eichenberger, A.E.1
O'Brien, J.K.2
O'Brien, K.M.3
Wu, P.4
Chen, T.5
Oden, P.H.6
Prener, D.A.7
Shepherd, J.C.8
So, B.9
Sura, Z.10
Wang, A.11
Zhang, T.12
Zhao, P.13
Gschwind, M.K.14
Archambault, R.15
Gao, Y.16
Koo, R.17
-
14
-
-
85084161591
-
Portable Multithreading: The Signal Stack Trick for User-Space Thread Creation
-
June
-
R. S. Engelschall. Portable Multithreading: The Signal Stack Trick For User-Space Thread Creation. In Proceedings of 2000 USENIX Annual Technical Conference, pages 155-164, June 2000.
-
(2000)
Proceedings of 2000 USENIX Annual Technical Conference
, pp. 155-164
-
-
Engelschall, R.S.1
-
15
-
-
0025433762
-
Memory consistency and event ordering in scalable shared-memory multiprocessors
-
May
-
K. Gharachorloo, D. Lenoski, J. Laudon, P. Gibbons, A. Gupta, and J. Hennessy. Memory consistency and event ordering in scalable shared-memory multiprocessors. In ISCA '90: Proceedings of the 17th Annual International Symposium on Computer Architecture, pages 15-26, May 1990.
-
(1990)
ISCA '90: Proceedings of the 17th Annual International Symposium on Computer Architecture
, pp. 15-26
-
-
Gharachorloo, K.1
Lenoski, D.2
Laudon, J.3
Gibbons, P.4
Gupta, A.5
Hennessy, J.6
-
16
-
-
63549142252
-
Hybrid Access-specific Software Cache Techniques for the Cell BE Architecture
-
October
-
M. González, N. Vujic, X. Martorell, E. Ayguadé, A. E. Eichenberger, T. Chen, Z. Sura, T. Zhang, K. O'Brien, and K. M. O'Brien. Hybrid Access-specific Software Cache Techniques for the Cell BE Architecture. In PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, pages 292-302, October 2008.
-
(2008)
PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 292-302
-
-
González, M.1
Vujic, N.2
Martorell, X.3
Ayguadé, E.4
Eichenberger, A.E.5
Chen, T.6
Sura, Z.7
Zhang, T.8
O'Brien, K.9
O'Brien, K.M.10
-
17
-
-
33646015987
-
Synergistic Processing in Cell's Multicore Architecture
-
March/April
-
M. Gschwind, H. P. Hofstee, B. Flachs, M. Hopkins, Y. Watanabe, and T. Yamazaki. Synergistic Processing in Cell's Multicore Architecture. IEEE Micro, 26(2):10-24, March/April 2006.
-
(2006)
IEEE Micro
, vol.26
, Issue.2
, pp. 10-24
-
-
Gschwind, M.1
Hofstee, H.P.2
Flachs, B.3
Hopkins, M.4
Watanabe, Y.5
Yamazaki, T.6
-
18
-
-
33745195144
-
HUNTing the Overlap
-
Washington, DC, USA, IEEE Computer Society
-
C. Iancu, P. Husbands, and P. Hargrove. HUNTing the Overlap. In PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques, pages 279-290, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
, pp. 279-290
-
-
Iancu, C.1
Husbands, P.2
Hargrove, P.3
-
21
-
-
78149260482
-
-
Sony, and Toshiba. IBM
-
IBM, Sony, and Toshiba. Cell Broadband Engine Architecture. IBM, 2009. http://www.ibm.com/developerworks/power/cell/.
-
(2009)
Cell Broadband Engine Architecture
-
-
-
22
-
-
0001841724
-
TreadMarks: Distributed Shared Memory on Standard Workstations and Operating Systems
-
January
-
P. Keleher, A. L. Cox, S. Dwarkadas, and W. Zwaenepoel. TreadMarks: Distributed Shared Memory on Standard Workstations and Operating Systems. In WTEC'94: Proceedings of the USENIX Winter 1994 Technical Conference, pages 115-131, January 1994.
-
(1994)
WTEC'94: Proceedings of the USENIX Winter 1994 Technical Conference
, pp. 115-131
-
-
Keleher, P.1
Cox, A.L.2
Dwarkadas, S.3
Zwaenepoel, W.4
-
23
-
-
77952342828
-
-
Khronos Group
-
Khronos OpenCL Working Group. The OpenCL Specification Version 1.0. Khronos Group, 2009. http://www.khronos.org/opencl.
-
(2009)
The OpenCL Specification Version 1.0
-
-
-
24
-
-
77952566140
-
COMIC++: A Software SVM System for Heterogeneous Multicore Accelerator Clusters
-
IEEE Computer Society, January
-
J. Lee, J. Lee, S. Seo, J. Kim, S. Kim, and Z. Sura. COMIC++: A Software SVM System for Heterogeneous Multicore Accelerator Clusters. In HPCA'10: Proceedings of the 15th International Symposium on High Performance Computer Architecture. IEEE Computer Society, January 2010.
-
(2010)
HPCA'10: Proceedings of the 15th International Symposium on High Performance Computer Architecture
-
-
Lee, J.1
Lee, J.2
Seo, S.3
Kim, J.4
Kim, S.5
Sura, Z.6
-
25
-
-
63549088652
-
COMIC: A Coherent Shared Memory Interface for Cell BE
-
October
-
J. Lee, S. Seo, C. Kim, J. Kim, P. Chun, Z. Sura, J. Kim, and S. Han. COMIC: A Coherent Shared Memory Interface for Cell BE. In PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, pages 303-314, October 2008.
-
(2008)
PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 303-314
-
-
Lee, J.1
Seo, S.2
Kim, C.3
Kim, J.4
Chun, P.5
Sura, Z.6
Kim, J.7
Han, S.8
-
26
-
-
70449709562
-
DBDB: Optimizing DMA Transfer for the Cell BE Architecture
-
New York, NY, USA, ACM
-
T. Liu, H. Lin, T. Chen, J. K. O'Brien, and L. Shao. DBDB: Optimizing DMA Transfer for the Cell BE Architecture. In ICS'09: Proceedings of the 23rd International Conference on Supercomputing, pages 36-45, New York, NY, USA, 2009. ACM.
-
(2009)
ICS'09: Proceedings of the 23rd International Conference on Supercomputing
, pp. 36-45
-
-
Liu, T.1
Lin, H.2
Chen, T.3
O'Brien, J.K.4
Shao, L.5
-
29
-
-
2442670256
-
-
NASA Advanced Supercomputing Division. NAS Parallel Benchmarks. http://www.nas.nasa.gov/Resources/Software/npb.html.
-
NAS Parallel Benchmarks
-
-
-
30
-
-
78149247686
-
-
NVIDIA. OpenCL for NVIDIA. http://www.nvidia.com/object/cuda-opencl.html.
-
OpenCL for NVIDIA
-
-
-
33
-
-
67650844223
-
Programming Model for a Heterogeneous x86 Platform
-
New York, NY, USA, ACM
-
B. Saha, X. Zhou, H. Chen, Y. Gao, S. Yan, M. Rajagopalan, J. Fang, P. Zhang, R. Ronen, and A. Mendelson. Programming Model for a Heterogeneous x86 Platform. In PLDI'09: Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 431-440, New York, NY, USA, 2009. ACM.
-
(2009)
PLDI'09: Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 431-440
-
-
Saha, B.1
Zhou, X.2
Chen, H.3
Gao, Y.4
Yan, S.5
Rajagopalan, M.6
Fang, J.7
Zhang, P.8
Ronen, R.9
Mendelson, A.10
-
34
-
-
49249086142
-
Larrabee: A Many-Core x86 Architecture for Visual Computing
-
Article 18, August
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, , and P. Hanrahan. Larrabee: A Many-Core x86 Architecture for Visual Computing. ACM Transactions on Graphics, 27(3):Article 18, August 2008.
-
(2008)
ACM Transactions on Graphics
, vol.27
, Issue.3
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
Lake, A.8
Sugerman, J.9
Cavin, R.10
Espasa, R.11
Grochowski, E.12
Juan, T.13
Hanrahan, P.14
-
36
-
-
67650692011
-
-
The IMPACT Research Group. Parboil Benchmark Suite. http://impact.crhc. illinois.edu/parboil.php, 2009.
-
(2009)
Parboil Benchmark Suite
-
-
-
37
-
-
70450081438
-
Using many-core hardware to correlate radio astronomy signals
-
New York, NY, USA, ACM
-
R. V. van Nieuwpoort and J. W. Romein. Using many-core hardware to correlate radio astronomy signals. In ICS '09: Proceedings of the 23rd international conference on Supercomputing, pages 440-449, New York, NY, USA, 2009. ACM.
-
(2009)
ICS '09: Proceedings of the 23rd International Conference on Supercomputing
, pp. 440-449
-
-
Van Nieuwpoort, R.V.1
Romein, J.W.2
-
38
-
-
67650085808
-
EXOCHI: Architecture and Programming Environment for a Heterogeneous Multi-core Multithreaded System
-
New York, NY, USA, ACM
-
P. H. Wang, J. D. Collins, G. N. Chinya, H. Jiang, X. Tian, M. Girkar, N. Y. Yang, G.-Y. Lueh, and H. Wang. EXOCHI: Architecture and Programming Environment for a Heterogeneous Multi-core Multithreaded System. In PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 156-166, New York, NY, USA, 2007. ACM.
-
(2007)
PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 156-166
-
-
Wang, P.H.1
Collins, J.D.2
Chinya, G.N.3
Jiang, H.4
Tian, X.5
Girkar, M.6
Yang, N.Y.7
Lueh, G.-Y.8
Wang, H.9
|