



Volume 53, Issue 5, 2009, Pages

The reverse-acceleration model for programming petascale hybrid systems

Author keywords

[No Author keywords available]

Indexed keywords

CODES (SYMBOLS); EMBEDDED SYSTEMS; GENERAL PURPOSE COMPUTERS; HYBRID SYSTEMS; SUPERCOMPUTERS;

EID: 77955075376     PISSN: 00188646     EISSN: 00188646     Source Type: Journal    
DOI: 10.1147/JRD.2009.5429074     Document Type: Article
Times cited: 17

References (45)
  • 2
    • 57649229517
    • GPU acceleration of numerical weather prediction
    • J. Michalakes and M. Vachharajani, "GPU Acceleration of Numerical Weather Prediction," Parallel Processing Lett. 18, No.4, 531-548 (2008).
    • (2008) Parallel Processing Lett. , vol.18 , Issue.4 , pp. 531-548
    • Michalakes, J.1    Vachharajani, M.2
  • 3
    • 70350754499
    • Adapting a message-driven parallel application to GPU-Accelerated clusters
    • J. C. Phillips, J. E. Stone, and K. Schulten, "Adapting a Message-Driven Parallel Application to GPU-Accelerated Clusters," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413379.
    • (2008) Proceedings of the ACM/IEEE SC2008 Conference
    • Phillips, J.C.1    Stone, J.E.2    Schulten, K.3
  • 5
    • 73449148916
    • 0.374 Pflop/s trillion-particle particle-in-cell modeling of laser plasma interactions on roadrunner
    • K. J. Bowers, B. J. Albright, B. K. Bergen, L. Yin, K. J. Barker, and D. J. Kerbyson, "0.374 Pflop/s Trillion-Particle Particle-in-Cell Modeling of Laser Plasma Interactions on Roadrunner," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413435.
    • (2008) Proceedings of the ACM/IEEE SC2008 Conference
    • Bowers, K.J.1    Albright, B.J.2    Bergen, B.K.3    Yin, L.4    Barker, K.J.5    Kerbyson, D.J.6
  • 6
    • 70350780323
    • 369 Tflop/s molecular dynamics simulations on the roadrunner general-purpose heterogeneous supercomputer
    • S. Swaminarayan, K. Kadau, T. C. Germann, and G. C. Fossum, "369 Tflop/s Molecular Dynamics Simulations on the Roadrunner General-Purpose Heterogeneous Supercomputer," Proceedings of the ACM/IEEE SC2008 Conference, IEEE Press, Austin, TX, November 15-21, 2008; see http://portal.acm.org/citation.cfm?id=1413436.
    • (2008) Proceedings of the ACM/IEEE SC2008 Conference
    • Swaminarayan, S.1    Kadau, K.2    Germann, T.C.3    Fossum, G.C.4
  • 9
    • 33646596525
    • MPI microtask for programming the cell broadband engine processor
    • M. Ohara, H. Inoue, Y. Sohda, H. Komatsu, and T. Nakatani, "MPI Microtask for Programming the Cell Broadband Engine Processor," IBM Syst. J. 45, No.1, 85-102 (2006).
    • (2006) IBM Syst. J. , vol.45 , Issue.1 , pp. 85-102
    • Ohara, M.1    Inoue, H.2    Sohda, Y.3    Komatsu, H.4    Nakatani, T.5
  • 11
    • 77955083410
    • IBM Corporation Accelerated Library Framework Programmer's Guide and API Reference, Publication number SC33-8333-8403, product number 5724-S84, version 3, release 1
    • IBM Corporation, Accelerated Library Framework Programmer's Guide and API Reference, 2009. Publication number SC33-8333-8403, product number 5724-S84, version 3, release 1.
    • (2009)
  • 13
    • 0002806690
    • OpenMP: An industry-standard API for shared-memory programming
    • L. Dagum and R. Menon, "OpenMP: An Industry-Standard API for Shared-Memory Programming," IEEE Computational Sci. Eng. 5, No.1, 46-55 (1998).
    • (1998) IEEE Computational Sci. Eng. , vol.5 , Issue.1 , pp. 46-55
    • Dagum, L.1    Menon, R.2
  • 16
    • 0000881430
    • Solution of the first-order form of the 3-D discrete ordinates equation on a massively parallel processor
    • K. R. Koch, R. S. Baker, and R. E. Alcouffe, "Solution of the First-Order Form of the 3-D Discrete Ordinates Equation on a Massively Parallel Processor," Trans. Am. Nuclear Soc. 65, No.108, 198-199 (1992).
    • (1992) Trans. Am. Nuclear Soc. , vol.65 , Issue.108 , pp. 198-199
    • Koch, K.R.1    Baker, R.S.2    Alcouffe, R.E.3
  • 21
    • 77955081478
    • Top500 List, June and November 2008
    • Top500 Organization, Top500 List, June and November 2008; see http://www.top500.org/.
    • (2008)
  • 22
    • 0037957323
    • The AMD opteron processor for multiprocessor servers
    • C. N. Keltcher, K. J. McGrath, A. Ahmed, and P. Conway, "The AMD Opteron Processor for Multiprocessor Servers," IEEE Micro 23, No.2, 66-76 (2003).
    • (2003) IEEE Micro , vol.23 , Issue.2 , pp. 66-76
    • Keltcher, C.N.1    McGrath, K.J.2    Ahmed, A.3    Conway, P.4
  • 25
    • 0022141776
    • Fat-trees: Universal networks for hardware-efficient supercomputing
    • C. E. Leiserson, "Fat-Trees: Universal Networks for Hardware-Efficient Supercomputing," IEEE Trans. Computers C-34, No.10, 892-901 (1985).
    • (1985) IEEE Trans. Computers , vol.C-34 , Issue.10 , pp. 892-901
    • Leiserson, C.E.1
  • 26
    • 84887601584
    • An introduction to the InfiniBand architecture
    • H. Jin, T. Cortes, and R. Buyya, Eds., An Introduction to the InfiniBand Architecture, Chapter 42, G. F. Pfister, High Performance Mass Storage and Parallel I/O: Technologies and Applications, Wiley Press and IEEE Press, November 26, 2001, pp. 617-632.
    • (2001) High Performance Mass Storage and Parallel I/O: Technologies and Applications , pp. 617-632
  • 27
    • 77952579072
    • Performance of the AMD Opteron LS21 for IBM BladeCenter
    • D. M. Pase and M. A. Eckl, "Performance of the AMD Opteron LS21 for IBM BladeCenter," Technical Report, IBM Corporation, Research Triangle Park, North Carolina, August 21, 2006; see ftp://ftp.software.ibm.com/eserver/benchmarks/wp-ls21-081506.pdf.
    • (2006) Technical Report, IBM Corporation, Research Triangle Park, North Carolina
    • Pase, D.M.1    Eckl, M.A.2
  • 29
    • 42449128855
    • The playstation 3 for high-performance scientific computing
    • J. Kurzak, A. Buttari, P. Luszczek, and J. Dongarra, "The PlayStation 3 for High-Performance Scientific Computing," Computing Sci. Eng. 10, No.3, 84-87 (2008).
    • (2008) Computing Sci. Eng. , vol.10 , Issue.3 , pp. 84-87
    • Kurzak, J.1    Buttari, A.2    Luszczek, P.3    Dongarra, J.4
  • 32
    • 33746923043
    • Cell multiprocessor communication network: Built for speed
    • M. Kistler, M. Perrone, and F. Petrini, "Cell Multiprocessor Communication Network: Built for Speed," IEEE Micro 26, No.3, 10-23 (2006).
    • (2006) IEEE Micro , vol.26 , Issue.3 , pp. 10-23
    • Kistler, M.1    Perrone, M.2    Petrini, F.3
  • 33
    • 25844490996
    • Clocking and circuit design for a parallel I/O on a first-generation CELL processor
    • K. Chang, S. Pamarti, K. Kaviani, E. Alon, X. Shi, T. J. Chin, J. Shen, et al., "Clocking and Circuit Design for a Parallel I/O on a First-Generation CELL Processor," 2005 IEEE International Solid-State Circuits Conference (ISSCC), Digest of Technical Papers, San Francisco, CA, February 6-10, 2005, pp. 526-527, 615.
    • (2005) 2005 IEEE International Solid-State Circuits Conference (ISSCC) , pp. 526-527
    • Chang, K.1    Pamarti, S.2    Kaviani, K.3    Alon, E.4    Shi, X.5    Chin, T.J.6    Shen, J.7
  • 34
    • 84944041691
    • PCI express and advanced switching: Evolutionary path to building next generation interconnects
    • D. Mayhew and V. Krishnan, "PCI Express and Advanced Switching: Evolutionary Path to Building Next Generation Interconnects," Proceedings of the 11th Symposium on High Performance Interconnects (HotI), Palo Alto, CA, August 20-22, 2003, pp. 21-29.
    • (2003) Proceedings of the 11th Symposium on High Performance Interconnects (HotI) , pp. 21-29
    • Mayhew, D.1    Krishnan, V.2
  • 36
  • 39
    • 51849160421
    • Parallel lattice boltzmann flow simulation on emerging multi-core platforms
    • L. Peng, K. Nomura, T. Oyakawa, R. K. Kalia, A. Nakano, and P. Vashishta, "Parallel Lattice Boltzmann Flow Simulation on Emerging Multi-core Platforms," Proceedings of the 14th International Euro-Par Conference, No.5168, Lecture Notes in Computer Science, Las Palmas de Gran Canaria, Spain, Springer, August 26-29, 2008, pp. 763-777.
    • (2008) Proceedings of the 14th International Euro-Par Conference 5168 , pp. 763-777
    • Peng, L.1    Nomura, K.2    Oyakawa, T.3    Kalia, R.K.4    Nakano, A.5    Vashishta, P.6
  • 42
    • 60649094971
    • Implementation and performance modeling of deterministic particle transport (Sweep3D) on the IBM Cell/B.E.
    • O. Lubeck, M. Lang, R. Srinivasan, and G. Johnson, "Implementation and Performance Modeling of Deterministic Particle Transport (Sweep3D) on the IBM Cell/B.E.," Scientific Programming 17, No.2, 199-208 (2008).
    • (2008) Scientific Programming , vol.17 , Issue.2 , pp. 199-208
    • Lubeck, O.1    Lang, M.2    Srinivasan, R.3    Johnson, G.4
  • 43
    • 77955082601
    • C/C++ Language Extensions for Cell Broadband Engine Architecture, Version 2.5
    • IBM Corporation, C/C++ Language Extensions for Cell Broadband Engine Architecture, February 27, 2008, Version 2.5; see http://www.ibm.com/developerworks/power/cell/documents.html.
    • (2008)
  • 44
    • 77955072712
    • IBM Corporation, Data Communication and Synchronization for Hybrid-x86 Programmer's Guide and API Reference, October 19, publication number SC33-8408-8500, product number 5724-S84, version 3, release 0
    • IBM Corporation, Data Communication and Synchronization for Hybrid-x86 Programmer's Guide and API Reference, October 19, 2007, publication number SC33-8408-8500, product number 5724-S84, version 3, release 0.
    • (2007)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.