-
1
-
-
70449467862
-
Entering the petaflop era: The architecture and performance of roadrunner
-
IEEE Press, Austin, TX, November
-
K. Barker, K. Davis, A. Hoisie, D. Kerbyson, M. Lang, S. Pakin, and J. C. Sancho, "Entering the Petaflop Era: The Architecture and Performance of Roadrunner," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413372.
-
(2008)
Proceedings of the ACM/IEEE SC2008 Conference
-
-
Barker, K.1
Davis, K.2
Hoisie, A.3
Kerbyson, D.4
Lang, M.5
Pakin, S.6
Sancho, J.C.7
-
2
-
-
57649229517
-
GPU acceleration of numerical weather prediction
-
J. Michalakes and M. Vachharajani, "GPU Acceleration of Numerical Weather Prediction," Parallel Processing Lett. 18, No.4, 531-548 (2008).
-
(2008)
Parallel Processing Lett.
, vol.18
, Issue.4
, pp. 531-548
-
-
Michalakes, J.1
Vachharajani, M.2
-
3
-
-
70350754499
-
Adapting a message-driven parallel application to GPU-Accelerated clusters
-
IEEE Press, Austin, TX, November
-
J. C. Phillips, J. E. Stone, and K. Schulten, "Adapting a Message-Driven Parallel Application to GPU-Accelerated Clusters," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413379.
-
(2008)
Proceedings of the ACM/IEEE SC2008 Conference
-
-
Phillips, J.C.1
Stone, J.E.2
Schulten, K.3
-
4
-
-
25844503119
-
Introduction to the cell multiprocessor
-
J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer, and D. Shippy, "Introduction to the Cell Multiprocessor," IBM J. Res. & Dev. 49, No. 4/5, 589-604 (2005).
-
(2005)
IBM J. Res. & Dev.
, vol.49
, Issue.4-5
, pp. 589-604
-
-
Kahle, J.A.1
Day, M.N.2
Hofstee, H.P.3
Johns, C.R.4
Maeurer, T.R.5
Shippy, D.6
-
5
-
-
73449148916
-
0.374 Pflop/s trillion-particle particle-in-cell modeling of laser plasma interactions on roadrunner
-
IEEE Press, Austin, TX, November
-
K. J. Bowers, B. J. Albright, B. K. Bergen, L. Yin, K. J. Barker, and D. J. Kerbyson, "0.374 Pflop/s Trillion-Particle Particle-in-Cell Modeling of Laser Plasma Interactions on Roadrunner," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413435.
-
(2008)
Proceedings of the ACM/IEEE SC2008 Conference
-
-
Bowers, K.J.1
Albright, B.J.2
Bergen, B.K.3
Yin, L.4
Barker, K.J.5
Kerbyson, D.J.6
-
6
-
-
70350780323
-
369 Tflop/s molecular dynamics simulations on the roadrunner general-purpose heterogeneous supercomputer
-
IEEE Press, Austin, TX, November
-
S. Swaminarayan, K. Kadau, T. C. Germann, and G. C. Fossum, "369 Tflop/s Molecular Dynamics Simulations on the Roadrunner General-Purpose Heterogeneous Supercomputer," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413436.
-
(2008)
Proceedings of the ACM/IEEE SC2008 Conference
-
-
Swaminarayan, S.1
Kadau, K.2
Germann, T.C.3
Fossum, G.C.4
-
7
-
-
37249009316
-
A buffered-mode MPI implementation for the cell BE processor
-
Y. Shi, G. D. van Albada, J. Dongarra, and P. M. A. Sloot, Eds., Lecture Notes in Computer Science, Beijing, China, Springer, May 27-30
-
A. Kumar, G. Senthilkumar, M. Krishna, N. Jayam, P. K. Baruah, R. Sharma, A. Srinivasan, and S. Kapoor, "A Buffered-Mode MPI Implementation for the Cell BE Processor," Y. Shi, G. D. van Albada, J. Dongarra, and P. M. A. Sloot, Eds., Proceedings of the 7th International Conference on Computational Science (ICCS 2007), Part I, Vol.4487, Lecture Notes in Computer Science, Beijing, China, Springer, May 27-30, 2007, pp. 603-610.
-
(2007)
Proceedings of the 7th International Conference on Computational Science (ICCS 2007)
, vol.4487
, Issue.PART I
, pp. 603-610
-
-
Kumar, A.1
Senthilkumar, G.2
Krishna, M.3
Jayam, N.4
Baruah, P.K.5
Sharma, R.6
Srinivasan, A.7
Kapoor, S.8
-
8
-
-
38149088064
-
A synchronous mode MPI implementation on the cell BE architecture
-
I. Stojmenovic, R. K. Thulasiram, L. T. Yang, W. Jia, M. Guo, and R. Fernandes de Mello, Eds., Lecture Notes in Computer Science, Niagara Falls, Canada, Springer, August 29-31
-
M. Krishna, A. Kumar, N. Jayam, G. Senthilkumar, P. Baruah, R. Sharma, S. Kapoor, and A. Srinivasan, "A Synchronous Mode MPI Implementation on the Cell BE Architecture," I. Stojmenovic, R. K. Thulasiram, L. T. Yang, W. Jia, M. Guo, and R. Fernandes de Mello, Eds., Proceedings of the 5th International Symposium on Parallel and Distributed Processing and Applications (ISPA 2007), Vol.4742, Lecture Notes in Computer Science, Niagara Falls, Canada, Springer, August 29-31, 2007, pp. 982-991.
-
(2007)
Proceedings of the 5th International Symposium on Parallel and Distributed Processing and Applications (ISPA 2007)
, vol.4742
, pp. 982-991
-
-
Krishna, M.1
Kumar, A.2
Jayam, N.3
Senthilkumar, G.4
Baruah, P.5
Sharma, R.6
Kapoor, S.7
Srinivasan, A.8
-
9
-
-
33646596525
-
MPI microtask for programming the cell broadband engine processor
-
M. Ohara, H. Inoue, Y. Sohda, H. Komatsu, and T. Nakatani, "MPI Microtask for Programming the Cell Broadband Engine Processor," IBM Syst. J. 45, No.1, 85-102 (2006).
-
(2006)
IBM Syst. J.
, vol.45
, Issue.1
, pp. 85-102
-
-
Ohara, M.1
Inoue, H.2
Sohda, Y.3
Komatsu, H.4
Nakatani, T.5
-
10
-
-
34548265764
-
CellSs: A programming model for the cell BE architecture
-
Tampa, FL, IEEE Press, November
-
P. Bellens, J. M. Perez, R. M. Badia, and J. Labarta, "CellSs: A Programming Model for the Cell BE Architecture," Proceedings of the ACM/IEEE SC2006 Conference (SC'06), Tampa, FL, IEEE Press, November 11-17, 2006.
-
(2006)
Proceedings of the ACM/IEEE SC2006 Conference (SC'06)
-
-
Bellens, P.1
Perez, J.M.2
Badia, R.M.3
Labarta, J.4
-
11
-
-
77955083410
-
-
IBM Corporation Accelerated Library Framework Programmer's Guide and API Reference, Publication number SC33-8333-8403, product number 5724-S84, version 3, release 1
-
IBM Corporation, Accelerated Library Framework Programmer's Guide and API Reference, 2009. Publication number SC33-8333-8403, product number 5724-S84, version 3, release 1.
-
(2009)
-
-
-
12
-
-
43849085367
-
Supporting openMP on cell
-
Kevin O'Brien, Kathryn O'Brien, Z. Sura, T. Chen, and T. Zhang, "Supporting OpenMP on Cell," Intl. J. Parallel Programming 36, No.3, 289-311 (2008).
-
(2008)
Intl. J. Parallel Programming
, vol.36
, Issue.3
, pp. 289-311
-
-
O'Brien, K.1
O'Brien, K.2
Sura, Z.3
Chen, T.4
Zhang, T.5
-
13
-
-
0002806690
-
OpenMP: An industry-standard API for shared-memory programming
-
L. Dagum and R. Menon, "OpenMP: An Industry-Standard API for Shared-Memory Programming," IEEE Computational Sci. Eng. 5, No.1, 46-55 (1998).
-
(1998)
IEEE Computational Sci. Eng.
, vol.5
, Issue.1
, pp. 46-55
-
-
Dagum, L.1
Menon, R.2
-
14
-
-
77955073916
-
Gedae: A tool for implementing software radio on heterogeneous systems
-
SDR Forum, Phoenix, AZ, November
-
J. Steed, W. Lundgren, and K. Barnes, "Gedae: A Tool for Implementing Software Radio on Heterogeneous Systems," Proceedings of the 2004 Software Defined Radio Technical Conference and Product Exposition, SDR Forum, Phoenix, AZ, November 15-18, 2004; see http://www.gedae.com/documents/SDR%20-%20GEDAE.pdf.
-
(2004)
Proceedings of the 2004 Software Defined Radio Technical Conference and Product Exposition
-
-
Steed, J.1
Lundgren, W.2
Barnes, K.3
-
16
-
-
0000881430
-
Solution of the first-order form of the 3-D discrete ordinates equation on a massively parallel processor
-
K. R. Koch, R. S. Baker, and R. E. Alcouffe, "Solution of the First-Order Form of the 3-D Discrete Ordinates Equation on a Massively Parallel Processor," Trans. Am. Nuclear Soc. 65, No.108, 198-199 (1992).
-
(1992)
Trans. Am. Nuclear Soc.
, vol.65
, Issue.108
, pp. 198-199
-
-
Koch, K.R.1
Baker, R.S.2
Alcouffe, R.E.3
-
17
-
-
80052028645
-
-
Computer Engineering Series, CRC Press, November 15
-
D. J. Kerbyson and A. Hoisie, "A Performance Analysis of Two-Level Heterogeneous Systems on Wavefront Algorithms," E. John and J. Rubio, Eds., Unique Chips and Systems, Vol.4, Computer Engineering Series, CRC Press, November 15, 2007, pp. 259-279.
-
(2007)
Unique Chips and Systems
, vol.4
, pp. 259-279
-
-
Kerbyson, D.J.1
Hoisie, A.2
-
18
-
-
34548757858
-
Multicore surprises: lessons learned from optimizing sweep3d on the cell broadband engine
-
Long Beach, CA, March
-
F. Petrini, G. Fossum, J. Fernández, A. L. Varbanescu, M. Kistler, and M. Perrone, "Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine," Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, March 26-30, 2007.
-
(2007)
Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007)
-
-
Petrini, F.1
Fossum, G.2
Fernández, J.3
Varbanescu, A.L.4
Kistler, M.5
Perrone, M.6
-
19
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, et al., "Larrabee: A Many-Core x86 Architecture for Visual Computing," ACM Trans. Graphics (TOG) 27, No.3, 18:1-18:15 (2008).
-
(2008)
ACM Trans. Graphics (TOG)
, vol.27
, Issue.3
, pp. 18:1-18:15
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
-
20
-
-
49549108733
-
TILE64 processor: A 64-core SoC with Mesh Interconnect
-
Digest of Technical Papers, February 3-7
-
S. Bell, B. Edwards, J. Amann, R. Conlin, K. Joyce, V. Leung, J. MacKay, et al., "TILE64 Processor: A 64-core SoC with Mesh Interconnect," 2008 IEEE International Solid-State Circuits Conference (ISSCC), Digest of Technical Papers, February 3-7, 2008, pp. 88-89, 598.
-
(2008)
IEEE International Solid-State Circuits Conference (ISSCC)
, pp. 88-89, 598
-
-
Bell, S.1
Edwards, B.2
Amann, J.3
Conlin, R.4
Joyce, K.5
Leung, V.6
MacKay, J.7
-
21
-
-
77955081478
-
-
Top500 Organization, Top500 List, June and November
-
Top500 Organization, Top500 List, June and November 2008; see http://www.top500.org/.
-
(2008)
-
-
-
22
-
-
0037957323
-
The AMD opteron processor for multiprocessor servers
-
C. N. Keltcher, K. J. McGrath, A. Ahmed, and P. Conway, "The AMD Opteron Processor for Multiprocessor Servers," IEEE Micro 23, No.2, 66-76 (2003).
-
(2003)
IEEE Micro
, vol.23
, Issue.2
, pp. 66-76
-
-
Keltcher, C.N.1
McGrath, K.J.2
Ahmed, A.3
Conway, P.4
-
23
-
-
51349140993
-
Packaging the cell broadband engine microprocessor for supercomputer applications
-
Lake Buena Vista, FL, May 27-30
-
P. Harvey, R. Mandrekar, Y. Zhou, J. Zheng, J. Maloney, S. Cain, K. Kawasaki, et al., "Packaging the Cell Broadband Engine Microprocessor for Supercomputer Applications," Proceedings of the 58th Electronic Components and Technology Conference (ECTC), Lake Buena Vista, FL, May 27-30, 2008, pp. 1368-1371.
-
(2008)
Proceedings of the 58th Electronic Components and Technology Conference (ECTC)
, pp. 1368-1371
-
-
Harvey, P.1
Mandrekar, R.2
Zhou, Y.3
Zheng, J.4
Maloney, J.5
Cain, S.6
Kawasaki, K.7
-
24
-
-
0042674307
-
-
Concurrency and Computation: Practice and Experience
-
J. J. Dongarra, P. Luszczek, and A. Petitet, "The LINPACK Benchmark: Past, Present and Future," Concurrency and Computation: Practice and Experience, 15, No.9, 803-820 (2003).
-
(2003)
The LINPACK Benchmark: Past, Present and Future
, vol.15
, Issue.9
, pp. 803-820
-
-
Dongarra, J.J.1
Luszczek, P.2
Petitet, A.3
-
25
-
-
0022141776
-
Fat-trees: Universal networks for hardware-efficient supercomputing
-
C. E. Leiserson, "Fat-Trees: Universal Networks for Hardware-Efficient Supercomputing," IEEE Trans. Computers C-34, No.10, 892-901 (1985).
-
(1985)
IEEE Trans. Computers
, vol.C-34
, Issue.10
, pp. 892-901
-
-
Leiserson, C.E.1
-
26
-
-
84887601584
-
-
Jin H., Cortes T., Buyya R., Eds., An introduction to the infiniband architecture, Chapter 42, G. F. Pfister, Wiley Press and IEEE Press, November 26
-
H. Jin, T. Cortes, and R. Buyya, Eds., An Introduction to the InfiniBand Architecture, Chapter 42, G. F. Pfister, High Performance Mass Storage and Parallel I/O: Technologies and Applications, Wiley Press and IEEE Press, November 26, 2001, pp. 617-632.
-
(2001)
High Performance Mass Storage and Parallel I/O: Technologies and Applications
, pp. 617-632
-
-
-
27
-
-
77952579072
-
-
Technical Report IBM Corporation Research Triangle Park North Carolina August 21, see
-
D. M. Pase and M. A. Eckl, "Performance of the AMD Opteron LS21 for IBM BladeCenter," Technical Report, IBM Corporation, Research Triangle Park, North Carolina, August 21, 2006; see ftp://ftp.software.ibm.com/eserver/benchmarks/wp-ls21-081506.pdf.
-
(2006)
Performance of the AMD opteron LS21 for IBM Bladecenter
-
-
Pase, D.M.1
Eckl, M.A.2
-
29
-
-
42449128855
-
The playstation 3 for high-performance scientific computing
-
J. Kurzak, A. Buttari, P. Luszczek, and J. Dongarra, "The PlayStation 3 for High-Performance Scientific Computing," Computing Sci. Eng. 10, No.3, 84-87 (2008).
-
(2008)
Computing Sci. Eng.
, vol.10
, Issue.3
, pp. 84-87
-
-
Kurzak, J.1
Buttari, A.2
Luszczek, P.3
Dongarra, J.4
-
30
-
-
34247349114
-
The potential of the cell processor for scientific computing
-
Ischia, Italy, May 3-5
-
S. Williams, J. Shalf, L. Oliker, S. Kamil, P. Husbands, and K. Yelick, "The Potential of the Cell Processor for Scientific Computing," Proceedings of the 3rd Conference on Computing Frontiers, Ischia, Italy, May 3-5, 2006, pp. 9-20.
-
(2006)
Proceedings of the 3rd Conference on Computing Frontiers
, pp. 9-20
-
-
Williams, S.1
Shalf, J.2
Oliker, L.3
Kamil, S.4
Husbands, P.5
Yelick, K.6
-
31
-
-
34250216007
-
Scientific computing kernels on the cell processor
-
S. Williams, J. Shalf, L. Oliker, S. Kamil, P. Husbands, and K. Yelick, "Scientific Computing Kernels on the Cell Processor," Int. J. Parallel Programming 35, No.3, 263-298 (2007).
-
(2007)
Int. J. Parallel Programming
, vol.35
, Issue.3
, pp. 263-298
-
-
Williams, S.1
Shalf, J.2
Oliker, L.3
Kamil, S.4
Husbands, P.5
Yelick, K.6
-
32
-
-
33746923043
-
Cell multiprocessor communication network: Built for speed
-
M. Kistler, M. Perrone, and F. Petrini, "Cell Multiprocessor Communication Network: Built for Speed," IEEE Micro 26, No.3, 10-23 (2006).
-
(2006)
IEEE Micro
, vol.26
, Issue.3
, pp. 10-23
-
-
Kistler, M.1
Perrone, M.2
Petrini, F.3
-
33
-
-
25844490996
-
Clocking and circuit design for a parallel I/O on a first-generation CELL processor
-
Digest of Technical Papers, San Francisco, CA, February 6-10, 615
-
K. Chang, S. Pamarti, K. Kaviani, E. Alon, X. Shi, T. J. Chin, J. Shen, et al., "Clocking and Circuit Design for a Parallel I/O on a First-Generation CELL Processor," 2005 IEEE International Solid-State Circuits Conference (ISSCC), Digest of Technical Papers, San Francisco, CA, February 6-10, 2005, pp. 526-527, 615.
-
(2005)
2005 IEEE International Solid-State Circuits Conference (ISSCC)
, pp. 526-527
-
-
Chang, K.1
Pamarti, S.2
Kaviani, K.3
Alon, E.4
Shi, X.5
Chin, T.J.6
Shen, J.7
-
34
-
-
84944041691
-
PCI express and advanced switching: Evolutionary path to building next generation interconnects
-
Palo Alto, CA, August 20-22
-
D. Mayhew and V. Krishnan, "PCI Express and Advanced Switching: Evolutionary Path to Building Next Generation Interconnects," Proceedings of the 11th Symposium on High Performance Interconnects (HotI), Palo Alto, CA, August 20-22, 2003, pp. 21-29.
-
(2003)
Proceedings of the 11th Symposium on High Performance Interconnects (HotI)
, pp. 21-29
-
-
Mayhew, D.1
Krishnan, V.2
-
35
-
-
51049097698
-
Receiver-initiated message passing over RDMA networks
-
Miami, FL, April
-
S. Pakin, "Receiver-Initiated Message Passing over RDMA Networks," 22nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2008), Miami, FL, April 14-18, 2008.
-
(2008)
22nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2008)
-
-
Pakin, S.1
-
36
-
-
35648931511
-
Cell/B.E. blades: Building blocks for scalable, real-time, interactive, and digital media servers
-
A. K. Nanda, J. R. Moulic, R. E. Hanson, G. Goldrian, M. N. Day, B. D. D'Amora, and S. Kesavarapu, "Cell/B.E. Blades: Building Blocks for Scalable, Real-Time, Interactive, and Digital Media Servers," IBM J. Res. & Dev. 51, No.5, 573-582 (2007).
-
(2007)
IBM J. Res. & Dev.
, vol.51
, Issue.5
, pp. 573-582
-
-
Nanda, A.K.1
Moulic, J.R.2
Hanson, R.E.3
Goldrian, G.4
Day, M.N.5
D'Amora, B.D.6
Kesavarapu, S.7
-
37
-
-
0003710740
-
-
The MPI Core, 2nd edition, MIT Press, Cambridge, MA, September
-
M. Snir, S. Otto, S. Huss-Lederman, D. Walker, and J. Dongarra, MPI: The Complete Reference, Vol.1, The MPI Core, 2nd edition, MIT Press, Cambridge, MA, September 1998.
-
(1998)
MPI: The Complete Reference
, vol.1
-
-
Snir, M.1
Otto, S.2
Huss-Lederman, S.3
Walker, D.4
Dongarra, J.5
-
38
-
-
38849100280
-
De novo ultrascale atomistic simulations on high-end parallel supercomputers
-
A. Nakano, R. K. Kalia, K. Nomura, A. Sharma, P. Vashishta, F. Shimojo, A. C. T. van Duin, et al., "De Novo Ultrascale Atomistic Simulations on High-End Parallel Supercomputers," Int. J. High Performance Computing Applic. 22, No.1, 113-128 (2008).
-
(2008)
Int. J. High Performance Computing Applic.
, vol.22
, Issue.1
, pp. 113-128
-
-
Nakano, A.1
Kalia, R.K.2
Nomura, K.3
Sharma, A.4
Vashishta, P.5
Shimojo, F.6
Van Duin, A.C.T.7
-
39
-
-
51849160421
-
Parallel lattice boltzmann flow simulation on emerging multi-core platforms
-
Lecture Notes in Computer Science, Las Palmas de Gran Canaria, Spain, Springer, August 26-29
-
L. Peng, K. Nomura, T. Oyakawa, R. K. Kalia, A. Nakano, and P. Vashishta, "Parallel Lattice Boltzmann Flow Simulation on Emerging Multi-core Platforms," Proceedings of the 14th International Euro-Par Conference, No.5168, Lecture Notes in Computer Science, Las Palmas de Gran Canaria, Spain, Springer, August 26-29, 2008, pp. 763-777.
-
(2008)
Proceedings of the 14th International Euro-Par Conference
, vol.5168
, pp. 763-777
-
-
Peng, L.1
Nomura, K.2
Oyakawa, T.3
Kalia, R.K.4
Nakano, A.5
Vashishta, P.6
-
40
-
-
70450091067
-
-
in press
-
H. Dursun, K. J. Barker, D. J. Kerbyson, and S. Pakin, "Application Profiling on Cell-Based Clusters," Workshop on Large-Scale Parallel Processing (LSPP), Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rome, Italy, May 29, 2009 (in press).
-
(2009)
Workshop on Large-Scale Parallel Processing (LSPP), Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS)
-
-
Dursun, H.1
Barker, K.J.2
Kerbyson, D.J.3
Pakin, S.4
-
41
-
-
85021214943
-
-
Annapolis, MD, February 21-25
-
A. Hoisie, O. Lubeck, and H. Wasserman, "Scalability Analysis of Multidimensional Wavefront Algorithms on Large-scale SMP Clusters," Proceedings of The 7th Symposium on the Frontiers of Massively Parallel Computation (Frontiers'99), Annapolis, MD, February 21-25, 1999, pp. 4-15.
-
(1999)
Proceedings of The 7th Symposium on the Frontiers of Massively Parallel Computation (Frontiers'99)
, pp. 4-15
-
-
Hoisie, A.1
Lubeck, O.2
Wasserman, H.3
-
42
-
-
60649094971
-
Implementation and performance modeling of deterministic particle transport (Sweep3D) on the IBM Cell/B.E.
-
O. Lubeck, M. Lang, R. Srinivasan, and G. Johnson, "Implementation and Performance Modeling of Deterministic Particle Transport (Sweep3D) on the IBM Cell/B.E.," Scientific Programming 17, No.2, 199-208 (2008).
-
(2008)
Scientific Programming
, vol.17
, Issue.2
, pp. 199-208
-
-
Lubeck, O.1
Lang, M.2
Srinivasan, R.3
Johnson, G.4
-
43
-
-
77955082601
-
-
IBM Corporation C/C++ Language Extensions for Cell Broadband Engine Architecture February 27, Version 2.5; see
-
IBM Corporation, C/C++ Language Extensions for Cell Broadband Engine Architecture, February 27, 2008, Version 2.5; see http://www.ibm.com/developerworks/power/cell/documents.html.
-
(2008)
-
-
-
44
-
-
77955072712
-
-
IBM Corporation, Data Communication and Synchronization for Hybrid-x86 Programmer's Guide and API Reference, October 19, publication number SC33-8408-8500, product number 5724-S84, version 3, release 0
-
IBM Corporation, Data Communication and Synchronization for Hybrid-x86 Programmer's Guide and API Reference, October 19, 2007, publication number SC33-8408-8500, product number 5724-S84, version 3, release 0.
-
(2007)
-
-
-
45
-
-
46049104179
-
-
Open MPI: A High-Performance, Heterogeneous MPI, Fifth International Workshop on Algorithms, Barcelona, Spain, September 25-28; see
-
R. L. Graham, G. M. Shipman, B. W. Barrett, R. H. Castain, G. Bosilca, and A. Lumsdaine, "Open MPI: A High-Performance, Heterogeneous MPI," Fifth International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks (HeteroPar'06), Barcelona, Spain, September 25-28, 2006, pp. 1-9; see http://www.open-mpi.org/papers/heteropar-2006/heteropar-2006-paper.pdf.
-
(2006)
Fifth International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks (HeteroPar'06)
, pp. 1-9
-
-
Graham, R.L.1
Shipman, G.M.2
Barrett, B.W.3
Castain, R.H.4
Bosilca, G.5
Lumsdaine, A.6