메뉴 건너뛰기




Volumn 33, Issue 10-11, 2007, Pages 700-719

Runtime scheduling of dynamic parallelism on accelerator-based multi-core systems

Author keywords

Accelerator based parallel architectures; Cell broadband engine; Heterogeneous multi core processors; Runtime systems for parallel programming

Indexed keywords

CODES (SYMBOLS); COMPUTATIONAL METHODS; PARALLEL PROCESSING SYSTEMS; PROGRAM PROCESSORS;

EID: 35748984416     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2007.09.004     Document Type: Article
Times cited : (18)

References (27)
  • 1
    • 35748979833 scopus 로고    scopus 로고
    • PowerPC Microprocessor Family: Vector/SIMD Multimedia Extension Technology Programming Environments Manual. http://www-306.ibm.com/chips/techlib.
  • 2
    • 34548718683 scopus 로고    scopus 로고
    • D. Bader, V. Agarwal, K. Madduri, On the design and analysis of irregular algorithms on the cell processor: a case study on list ranking, in: Proceedings of the 21st International Parallel and Distributed Processing Symposium, Long Beach, CA, March 2007.
  • 3
    • 0035158540 scopus 로고    scopus 로고
    • D.A. Bader, B.M.E. Moret, L. Vawter, Industrial applications of high-performance computing for phylogeny reconstruction, in: Proceedings of SPIE ITCom, vol. 4528, 2001, pp. 159-168.
  • 4
    • 35748956227 scopus 로고    scopus 로고
    • P. Bellens, J. Perez, R. Badia, J. Labarta, CellSs: a programming model for the Cell BE architecture, in: Proceedings of Supercomputing 2006, Tampa, FL, November 2006.
  • 5
    • 42649120679 scopus 로고    scopus 로고
    • Carsten Benthin, Ingo Wald, Michael Scherbaum, Heiko Friedrich, Ray tracing on the CELL Processor, in: Proceedings of the 2006 IEEE Symposium on Interactive Ray Tracing, 2006.
  • 6
    • 34748917568 scopus 로고    scopus 로고
    • Filip Blagojevic, Dimitrios S. Nikolopoulos, Alexandros Stamatakis, Christos D. Antonopoulos, Dynamic multigrain parallelization on the cell broadband engine, in: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, San Jose, CA, March 2007, pp. 90-100.
  • 7
    • 34548781271 scopus 로고    scopus 로고
    • Filip Blagojevic, Alexandros Stamatakis, Christos D. Antonopoulos, Dimitrios S. Nikolopoulos, RAxML-Cell: parallel phyolgenetic tree construction on the Cell broadband engine, in: Proceedings of the 21st IEEE/ACM International Parallel and Distributed Processing Symposium, Long Beach, CA, March 2007.
  • 8
    • 35748946168 scopus 로고    scopus 로고
    • T. Chen, Z. Sura, K. O'Brien, K. O'Brien, Optimizing the use of static buffers for DMA on a Cell chip, in: Proceedings of the 19th International Workshop on Languages and Compilers for Parallel Computing, New Orleans, LA, November 2006.
  • 9
    • 34548772303 scopus 로고    scopus 로고
    • Cell Broadband Engine architecture and its first implementation
    • Chen T., Raghavan R., Dale J., and Iwata E. Cell Broadband Engine architecture and its first implementation. IBM developer Works November (2005)
    • (2005) IBM developer Works , Issue.November
    • Chen, T.1    Raghavan, R.2    Dale, J.3    Iwata, E.4
  • 11
    • 35748985389 scopus 로고    scopus 로고
    • B. Flachs et al., The microarchitecture of the streaming processor for a CELL Processor, in: Proceedings of the IEEE International Solid-State Circuits Symposium, February 2005, pp. 184-185.
  • 12
    • 35748961235 scopus 로고    scopus 로고
    • K. Fatahalian et al., Sequoia: programming the memory hierarchy, in: Proceedings of Supercomputing 2006, Tampa, FL, November 2006.
  • 13
    • 0019797407 scopus 로고
    • Evolutionary trees from DNA sequences: a maximum likelihood approach
    • Felsenstein J. Evolutionary trees from DNA sequences: a maximum likelihood approach. Journal of Molecular Evolution 17 (1981) 368-376
    • (1981) Journal of Molecular Evolution , vol.17 , pp. 368-376
    • Felsenstein, J.1
  • 14
    • 35748937470 scopus 로고    scopus 로고
    • X. Feng, K. Cameron, D. Buell, PBPI: a high performance implementation of Bayesian phylogenetic inference, in: Proceedings of Supercomputing 2006, Tampa, FL, November 2006.
  • 15
    • 34548754973 scopus 로고    scopus 로고
    • X. Feng, K. Cameron, B. Smith, C. Sosa, Building the tree of life on terascale systems, in: Proceedings of the 21st International Parallel and Distributed Processing Symposium, Long Beach, CA, March 2007.
  • 17
    • 33846471996 scopus 로고    scopus 로고
    • M. Gordon, W. Thies, S. Amarasinghe, Exploiting coarse-grained task, data and pipelined parallelism in stream programs, in: Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, October 2006, pp. 151-162.
  • 18
    • 35748976679 scopus 로고    scopus 로고
    • Nils Hjelte, Smoothed particle hydrodynamics on the Cell Broadband Engine, Master's thesis, Umeå University, Department of Computer Science, June 2006.
  • 19
    • 33746923043 scopus 로고    scopus 로고
    • Mike Kistler, Michael Perrone, Fabrizio Petrini, Cell multiprocessor interconnection network: built for speed, IEEE Micro, 26(3), May-June 2006. Available from http://hpc.pnl.gov/people/fabrizio/papers/ieeemicro-cell.pdf.
  • 20
    • 35748971613 scopus 로고    scopus 로고
    • Julie Langou, Julien Langou, Piotr Luszczek, Jakub Kurzak, Alfredo Buttari, Jack Dongarra, Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems), in: Proceedings of Supercomputing 2006, Tampa, FL, November 2006.
  • 21
    • 34548757858 scopus 로고    scopus 로고
    • F. Petrini, G. Fossum, A. Varbanescu, M. Perrone, M. Kistler, J. Fernandez Periador, Multi-core surprises: lessons learned from optimized Sweep3D on the Cell Broadband Engine, in: Proceedings of the 21st International Parallel and Distributed Processing Symposium, Long Beach, CA, March 2007.
  • 22
    • 34548704782 scopus 로고    scopus 로고
    • Fabrizio Petrini, Daniel Scarpazza, Oreste Villa, Juan Fernandez, Challenges in mapping graph exploration algorithms on advanced multi-core processors, in: Proceedings of the 21st International Parallel and Distributed Processing Symposium, Long Beach, CA, March 2007.
  • 23
    • 33751067549 scopus 로고    scopus 로고
    • I. Sharapov, R. Kroeger, G. Delamater, R. Cheveresan, M. Ramsay, A case study in top-down performance estimation for a large-scale parallel application, in: Proceedings of the 11th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, New York, NY, March 2006, pp. 81-89.
  • 24
    • 33750403801 scopus 로고    scopus 로고
    • RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models
    • Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics (2006) btl446
    • (2006) Bioinformatics
    • Stamatakis, A.1
  • 25
    • 0009928676 scopus 로고    scopus 로고
    • Optimal use of mixed task and data parallelism for pipelined computations
    • Subhlok J., and Vondran G. Optimal use of mixed task and data parallelism for pipelined computations. Journal of Parallel and Distributed Computing 60 3 (2000) 297-319
    • (2000) Journal of Parallel and Distributed Computing , vol.60 , Issue.3 , pp. 297-319
    • Subhlok, J.1    Vondran, G.2
  • 26
    • 34247349114 scopus 로고    scopus 로고
    • Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine Yelick, The potential of the Cell processor for scientific computing, in: ACM International Conference on Computing Frontiers, May 3-6, 2006.
  • 27
    • 35748965701 scopus 로고    scopus 로고
    • Y. Zhao, K. Kennedy, Dependence-driven code generation for a Cell processor, in: Proceedings of the 19th International Workshop on Languages and Compilers for Parallel Computing, New Orleans, LA, November 2006.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.