-
1
-
-
33646558229
-
Using advanced compiler technology to exploit the performance of the cell broadband enginetm architecture
-
A. E. Eichenberger, J. K. O'Brien, K. M. O'Brien, P. Wu, T. Chen, P. H. Oden, D. A. Prener, J. C. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, M. K. Gschwind, R. Archambault, Y. Gao, and R. Koo. Using advanced compiler technology to exploit the performance of the cell broadband enginetm architecture. IBM Syst. J., 45(1):59-84, 2006.
-
(2006)
IBM Syst. J.
, vol.45
, Issue.1
, pp. 59-84
-
-
Eichenberger, A.E.1
O'Brien, J.K.2
O'Brien, K.M.3
Wu, P.4
Chen, T.5
Oden, P.H.6
Prener, D.A.7
Shepherd, J.C.8
So, B.9
Sura, Z.10
Wang, A.11
Zhang, T.12
Zhao, P.13
Gschwind, M.K.14
Archambault, R.15
Gao, Y.16
Koo, R.17
-
2
-
-
77952123736
-
A 48-core ia-32 message-passing processor with dvfs in 45nm cmos
-
feb.
-
J. Howard, S. Dighe, Y. Hoskote, S. Vangal, D. Finan, G. Ruhl, D. Jenkins, H. Wilson, N. Borkar, G. Schrom, F. Pailet, S. Jain, T. Jacob, S. Yada, S. Marella, P. Salihundam, V. Erraguntla, M. Konow, M. Riepen, G. Droege, J. Lindemann, M. Gries, T. Apel, K. Henriss, T. Lund-Larsen, S. Steibl, S. Borkar, V. De, R. Van Der Wijngaart, and T. Mattson. A 48-core ia-32 message-passing processor with dvfs in 45nm cmos. In Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2010 IEEE International, pages 108-109, feb. 2010.
-
(2010)
Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2010 IEEE International
, pp. 108-109
-
-
Howard, J.1
Dighe, S.2
Hoskote, Y.3
Vangal, S.4
Finan, D.5
Ruhl, G.6
Jenkins, D.7
Wilson, H.8
Borkar, N.9
Schrom, G.10
Pailet, F.11
Jain, S.12
Jacob, T.13
Yada, S.14
Marella, S.15
Salihundam, P.16
Erraguntla, V.17
Konow, M.18
Riepen, M.19
Droege, G.20
Lindemann, J.21
Gries, M.22
Apel, T.23
Henriss, K.24
Lund-Larsen, T.25
Steibl, S.26
Borkar, S.27
De, V.28
Van Der Wijngaart, R.29
Mattson, T.30
more..
-
3
-
-
0033723498
-
A fully associative software-managed cache design
-
Erik G. Hallnor and Steven K. Reinhardt. A fully associative software-managed cache design. SIGARCH Comput. Archit. News, 28(2):107-116, 2000.
-
(2000)
SIGARCH Comput. Archit. News
, vol.28
, Issue.2
, pp. 107-116
-
-
Hallnor, E.G.1
Reinhardt, S.K.2
-
4
-
-
63549142252
-
Hybrid access-specific software cache techniques for the cell be architecture
-
New York, NY, USA, ACM
-
Marc Gonzàlez, Nikola Vujic, Xavier Martorell, Eduard Ayguadé, Alexandre E. Eichenberger, Tong Chen, Zehra Sura, Tao Zhang, Kevin O'Brien, and Kathryn O'Brien. Hybrid access-specific software cache techniques for the cell be architecture. In PACT '08: Proceedings of the 17th international conference on Parallel architectures and compilation techniques, pages 292-302, New York, NY, USA, 2008. ACM.
-
(2008)
PACT '08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 292-302
-
-
Gonzàlez, M.1
Vujic, N.2
Martorell, X.3
Ayguadé, E.4
Eichenberger, A.E.5
Chen, T.6
Sura, Z.7
Zhang, T.8
O'Brien, K.9
O'Brien, K.10
-
5
-
-
64949198379
-
Design and implementation of software-managed caches for multicores with local memory
-
Feb.
-
Sangmin Seo, Jaejin Lee, and Z. Sura. Design and implementation of software-managed caches for multicores with local memory. In High Performance Computer Architecture, 2009. HPCA 2009. IEEE 15th International Symposium on, pages 55-66, Feb. 2009.
-
(2009)
High Performance Computer Architecture, 2009. HPCA 2009. IEEE 15th International Symposium on
, pp. 55-66
-
-
Seo, S.1
Lee, J.2
Sura, Z.3
-
6
-
-
63549093766
-
A tuning framework for software-managed memory hierarchies
-
New York, NY, USA, ACM
-
Manman Ren, Ji Young Park, Mike Houston, Alex Aiken, and William J. Dally. A tuning framework for software-managed memory hierarchies. In PACT '08: Proceedings of the 17th international conference on Parallel architectures and compilation techniques, pages 280-291, New York, NY, USA, 2008. ACM.
-
(2008)
PACT '08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 280-291
-
-
Ren, M.1
Park, J.Y.2
Houston, M.3
Aiken, A.4
Dally, W.J.5
-
7
-
-
43449138842
-
Prefetching irregular references for software cache on cell
-
New York, NY, USA, ACM
-
Tong Chen, Tao Zhang, Zehra Sura, and Mar Gonzales Tallada. Prefetching irregular references for software cache on cell. In CGO '08: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, pages 155-164, New York, NY, USA, 2008. ACM.
-
(2008)
CGO '08: Proceedings of the Sixth Annual IEEE/ACM International Symposium on Code Generation and Optimization
, pp. 155-164
-
-
Chen, T.1
Zhang, T.2
Sura, Z.3
Tallada, M.G.4
-
8
-
-
54249087677
-
A novel asynchronous software cache implementation for the cell-be processor
-
Revised Selected Papers
-
Jairo Balart, Marc Gonzalez, Xavier Martorell, Eduard Ayguade, Zehra Sura, Tong Chen, Tao Zhang, Kevin O'Brien, and Kathryn O'Brien. A novel asynchronous software cache implementation for the cell-be processor. Languages and Compilers for Parallel Computing: 20th International Workshop, LCPC 2007, Urbana, IL, USA, October 11-13, 2007, Revised Selected Papers, pages 125-140, 2008.
-
(2008)
Languages and Compilers for Parallel Computing: 20th International Workshop, LCPC 2007, Urbana, IL, USA, October 11-13, 2007
, pp. 125-140
-
-
Balart, J.1
Gonzalez, M.2
Martorell, X.3
Ayguade, E.4
Sura, Z.5
Chen, T.6
Zhang, T.7
O'Brien, K.8
O'Brien, K.9
-
9
-
-
63549095070
-
The parsec benchmark suite: Characterization and architectural implications
-
New York, NY, USA, ACM
-
Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, and Kai Li. The parsec benchmark suite: characterization and architectural implications. In PACT '08: Proceedings of the 17th international conference on Parallel architectures and compilation techniques, pages 72-81, New York, NY, USA, 2008. ACM.
-
(2008)
PACT '08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 72-81
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
10
-
-
77950964060
-
A study of an infrastructure for research and development of many-core processors
-
0
-
Koh Uehara, Shimpei Sato, Takefumi Miyoshi, and Kenji Kise. A study of an infrastructure for research and development of many-core processors. Parallel and Distributed Computing Applications and Technologies, International Conference on, 0:414-419, 2009.
-
(2009)
Parallel and Distributed Computing Applications and Technologies, International Conference on
, pp. 414-419
-
-
Uehara, K.1
Sato, S.2
Miyoshi, T.3
Kise, K.4
-
11
-
-
84955506994
-
Runahead execution: An alternative to very large instruction windows for out-of-order processors
-
Washington, DC, USA, IEEE Computer Society
-
Onur Mutlu, Jared Stark, Chris Wilkerson, and Yale N. Patt. Runahead execution: An alternative to very large instruction windows for out-of-order processors. In HPCA '03: Proceedings of the 9th International Symposium on High-Performance Computer Architecture, page 129, Washington, DC, USA, 2003. IEEE Computer Society.
-
(2003)
HPCA '03: Proceedings of the 9th International Symposium on High-Performance Computer Architecture
, pp. 129
-
-
Mutlu, O.1
Stark, J.2
Wilkerson, C.3
Patt, Y.N.4
-
12
-
-
12844269176
-
Continual flow pipelines
-
New York, NY, USA, ACM
-
Srikanth T. Srinivasan, Ravi Rajwar, Haitham Akkary, Amit Gandhi, and Mike Upton. Continual flow pipelines. In ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, pages 107-119, New York, NY, USA, 2004. ACM.
-
(2004)
ASPLOS-XI: Proceedings of the 11th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 107-119
-
-
Srinivasan, S.T.1
Rajwar, R.2
Akkary, H.3
Gandhi, A.4
Upton, M.5
-
13
-
-
64949124579
-
icfp: Tolerating all-level cache misses in in-order processors
-
feb.
-
A. Hilton, S. Nagarakatte, and A. Roth. icfp: Tolerating all-level cache misses in in-order processors. In High Performance Computer Architecture, 2009. HPCA 2009. IEEE 15th International Symposium on, pages 431-442, feb. 2009.
-
(2009)
High Performance Computer Architecture, 2009. HPCA 2009. IEEE 15th International Symposium on
, pp. 431-442
-
-
Hilton, A.1
Nagarakatte, S.2
Roth, A.3
-
14
-
-
70449709562
-
Dbdb: Optimizing dmatransfer for the cell be architecture
-
New York, NY, USA, ACM
-
Tao Liu, Haibo Lin, Tong Chen, John Kevin O'Brien, and Ling Shao. Dbdb: optimizing dmatransfer for the cell be architecture. In ICS '09: Proceedings of the 23rd international conference on Supercomputing, pages 36-45, New York, NY, USA, 2009. ACM.
-
(2009)
ICS '09: Proceedings of the 23rd International Conference on Supercomputing
, pp. 36-45
-
-
Liu, T.1
Lin, H.2
Chen, T.3
O'Brien, J.K.4
Shao, L.5
|