메뉴 건너뛰기




Volumn 33, Issue 10-11, 2007, Pages 720-740

High performance combinatorial algorithm design on the Cell Broadband Engine processor

Author keywords

Cell Broadband Engine processor; Combinatorial algorithms; Graph algorithms; List ranking; Multicore; Novel architectures; Parallel algorithms; Performance; zlib

Indexed keywords

COMPUTATIONAL METHODS; MICROPROCESSOR CHIPS; PARALLEL ALGORITHMS; PARALLEL PROCESSING SYSTEMS; PROBLEM SOLVING; PROGRAM PROCESSORS;

EID: 35748930871     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2007.09.005     Document Type: Article
Times cited : (24)

References (26)
  • 1
    • 12444293815 scopus 로고    scopus 로고
    • D.A. Bader, G. Cong. A fast, parallel spanning tree algorithm for symmetric multiprocessors (SMPs), in: Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS 2004), Santa Fe, NM, April 2004.
  • 2
    • 12444298999 scopus 로고    scopus 로고
    • D.A. Bader, G. Cong, Fast shared-memory algorithms for computing the minimum spanning forest of sparse graphs, in: Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS 2004), Santa Fe, NM, April 2004.
  • 3
    • 33745125067 scopus 로고    scopus 로고
    • D.A. Bader, G. Cong, J. Feo, On the architectural requirements for efficient execution of graph algorithms, in: Proceedings of the 34th International Conference on Parallel Processing (ICPP), Oslo, Norway, June 2005.
  • 4
    • 84945304588 scopus 로고    scopus 로고
    • Evaluating arithmetic expressions using tree contraction: a fast and scalable parallel implementation for symmetric multiprocessors (SMPs)
    • Proceedings of the Ninth International Conference on High Performance Computing (HiPC 2002), Bangalore, India. Sahni S., Prasanna V.K., and Shukla U. (Eds), Springer-Verlag
    • Bader D.A., Sreshta S., and Weisse-Bernstein N. Evaluating arithmetic expressions using tree contraction: a fast and scalable parallel implementation for symmetric multiprocessors (SMPs). In: Sahni S., Prasanna V.K., and Shukla U. (Eds). Proceedings of the Ninth International Conference on High Performance Computing (HiPC 2002), Bangalore, India. Lecture Notes in Computer Science vol. 2552 (2002), Springer-Verlag 63-75
    • (2002) Lecture Notes in Computer Science , vol.2552 , pp. 63-75
    • Bader, D.A.1    Sreshta, S.2    Weisse-Bernstein, N.3
  • 5
    • 35748954748 scopus 로고    scopus 로고
    • D.A. Brokenshire, Maximizing the power of the Cell Broadband Engine Processor: 25 tips to optimal application performance, IBM developerWorks technical article, 2006.
  • 6
    • 35748970019 scopus 로고    scopus 로고
    • T. Chen, R. Raghavan, J. Dale, E. Iwata, Cell Broadband Engine Architecture and its first implementation, IBM developerWorks technical article, 2005.
  • 7
    • 0024684158 scopus 로고
    • Faster optimal prefix sums and list ranking
    • Cole R., and Vishkin U. Faster optimal prefix sums and list ranking. Information and Computation 81 3 (1989) 344-352
    • (1989) Information and Computation , vol.81 , Issue.3 , pp. 344-352
    • Cole, R.1    Vishkin, U.2
  • 8
    • 84976790986 scopus 로고    scopus 로고
    • D.E. Culler, R.M. Karp, D.A. Patterson, A. Sahay, K.E. Schauser, E. Santos, R. Subramonian, T. von Eicken, LogP: towards a realistic model of parallel computation, in: Proceedings of the Fourth Symposium Principles and Practice of Parallel Programming, ACM SIGPLAN, May 1993, pp. 1-12.
  • 9
    • 35748962747 scopus 로고    scopus 로고
    • P. Deutsch, J.-L. Gailly, zlib compressed data format specification version 3.3, Internet RFCs, 1996.
  • 10
    • 27644524078 scopus 로고    scopus 로고
    • B. Flachs et al., A streaming processor unit for a Cell processor, in: Proceedings of the International Solid State Circuits Conference, vol. 1, San Fransisco, CA, USA, February 2005, pp. 134-135.
  • 11
    • 84979025439 scopus 로고    scopus 로고
    • Designing practical efficient algorithms for symmetric multiprocessors
    • Algorithm Engineering and Experimentation (ALENEX'99), Baltimore, MD, Springer-Verlag
    • Helman D.R., and JáJá J. Designing practical efficient algorithms for symmetric multiprocessors. Algorithm Engineering and Experimentation (ALENEX'99), Baltimore, MD. Lecture Notes in Computer Science vol. 1619 (1999), Springer-Verlag 37-56
    • (1999) Lecture Notes in Computer Science , vol.1619 , pp. 37-56
    • Helman, D.R.1    JáJá, J.2
  • 13
    • 84938015047 scopus 로고
    • A method for the construction of minimum-redundancy codes
    • Huffman D.A. A method for the construction of minimum-redundancy codes. Proceedings of the IRE 40 9 (1952) 1098-1101
    • (1952) Proceedings of the IRE , vol.40 , Issue.9 , pp. 1098-1101
    • Huffman, D.A.1
  • 14
    • 27944446098 scopus 로고    scopus 로고
    • C. Jacobi, H.-J. Oh, K.D. Tran, S.R. Cottier, B.W. Michael, H. Nishikawa, Y. Totsuka, T. Namatame, N. Yano, The vector floating-point unit in a synergistic processor element of a Cell processor, in: Proceedings of the 17th IEEE Symposium on Computer Arithmetic, Washington, DC, USA, 2005, IEEE (ARITH'05) Computer Society, pp. 59-67.
  • 18
    • 33746923043 scopus 로고    scopus 로고
    • Cell multiprocessor communication network: built for speed
    • Kistler M., Perrone M., and Petrini F. Cell multiprocessor communication network: built for speed. IEEE Micro 26 3 (2006) 10-23
    • (2006) IEEE Micro , vol.26 , Issue.3 , pp. 10-23
    • Kistler, M.1    Perrone, M.2    Petrini, F.3
  • 19
    • 35748952029 scopus 로고    scopus 로고
    • B. Lin, Altivec solutions to sequential problems: calculating CRC with scalable congruent equivalent compression, Freescale application note AN2926, January 2006.
  • 20
    • 35748929507 scopus 로고    scopus 로고
    • K. Margaritis, Vectorization of algorithm Adler32 using altivec. freevec.org whitepaper, 2005.
  • 21
    • 27344435504 scopus 로고    scopus 로고
    • D. Pham et al. The design and implementation of a first-generation Cell processor, in: Proceedings of the International Solid State Circuits Conference, vol. 1, San Fransisco, CA, USA, February 2005, pp. 184-185.
  • 22
    • 0347725261 scopus 로고    scopus 로고
    • A general-purpose shared-memory model for parallel computation
    • Heath M.T., Ranade A., and Schreiber R.S. (Eds), Springer-Verlag, New York
    • Ramachandran V. A general-purpose shared-memory model for parallel computation. In: Heath M.T., Ranade A., and Schreiber R.S. (Eds). Algorithms for Parallel Processing vol. 105 (1999), Springer-Verlag, New York 1-18
    • (1999) Algorithms for Parallel Processing , vol.105 , pp. 1-18
    • Ramachandran, V.1
  • 24
    • 0025467711 scopus 로고
    • A bridging model for parallel computation
    • Valiant L.G. A bridging model for parallel computation. Communications of the ACM 33 8 (1990) 103-111
    • (1990) Communications of the ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 25
    • 34247349114 scopus 로고    scopus 로고
    • S. Williams, J. Shalf, L. Oliker, S. Kamil, P. Husbands, K. Yelick, The potential of the Cell processor for scientific computing, in: Proceedings of the Third Conference on Computing Frontiers CF'06, 2006, ACM Press, New York, NY, USA, pp. 9-20.
  • 26
    • 0017493286 scopus 로고
    • A universal algorithm for sequential data compression
    • Ziv J., and Lempel A. A universal algorithm for sequential data compression. IEEE Transactions on Information Theory 23 3 (1977) 337-343
    • (1977) IEEE Transactions on Information Theory , vol.23 , Issue.3 , pp. 337-343
    • Ziv, J.1    Lempel, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.