메뉴 건너뛰기




Volumn , Issue , 2009, Pages 140-151

Rigel: An architecture and scalable programming interface for a 1000-core accelerator

Author keywords

Accelerator; Computer architecture; Low level programming interface

Indexed keywords

A-DENSITY; ACCELERATOR ARCHITECTURES; ADDRESS SPACE; BARRIER OPERATIONS; DESIGN ANALYSIS; DOMAIN SPECIFIC; EXECUTION MODEL; EXPERIMENTAL ANALYSIS; HARDWARE SUPPORTS; INITIAL DESIGN; LOAD-BALANCING; PARALLEL COMPUTATION; POWER EFFICIENCY; PROGRAMMING INTERFACE; PROGRAMMING MODELS; SCALABILITY ISSUE; SOFTWARE TECHNIQUES; SPECIALIZED HARDWARE; TASK DISTRIBUTION; WORK DISTRIBUTION;

EID: 70450237431     PISSN: 10636897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1555754.1555774     Document Type: Conference Paper
Times cited : (94)

References (31)
  • 1
    • 34547471544 scopus 로고    scopus 로고
    • Design tradeoffs for tiled CMP on-chip networks
    • J. Balfour and W. J. Dally. Design tradeoffs for tiled CMP on-chip networks. In ICS'06, 2006.
    • (2006) ICS'06
    • Balfour, J.1    Dally, W.J.2
  • 2
    • 0024770039 scopus 로고
    • Scans as primitive parallel operations
    • G. E. Blelloch. Scans as primitive parallel operations. IEEE Trans. Comput., 38(11), 1989.
    • (1989) IEEE Trans. Comput , vol.38 , Issue.11
    • Blelloch, G.E.1
  • 3
    • 0029666646 scopus 로고    scopus 로고
    • Memory bandwidth limitations of future microprocessors
    • D. Burger, J. R. Goodman, and A. Kägi. Memory bandwidth limitations of future microprocessors. In ISCA'96, 1996.
    • (1996) ISCA'96
    • Burger, D.1    Goodman, J.R.2    Kägi, A.3
  • 4
    • 0029209574 scopus 로고
    • A hierarchical task queue organization for shared-memory multiprocessor systems
    • S. P. Dandamudi and P. S. P. Cheng. A hierarchical task queue organization for shared-memory multiprocessor systems. IEEE Trans. Parallel Distrib. Syst., 6(1), 1995.
    • (1995) IEEE Trans. Parallel Distrib. Syst , vol.6 , Issue.1
    • Dandamudi, S.P.1    Cheng, P.S.P.2
  • 7
    • 56649087761 scopus 로고    scopus 로고
    • GPUs: A closer look
    • K. Fatahalian and M. Houston. GPUs: a closer look. Queue, 6(2):18-28, 2008.
    • (2008) Queue , vol.6 , Issue.2 , pp. 18-28
    • Fatahalian, K.1    Houston, M.2
  • 10
    • 34247376580 scopus 로고    scopus 로고
    • Chip multiprocessing and the Cell broadband engine
    • New York, NY, USA
    • M. Gschwind. Chip multiprocessing and the Cell broadband engine. In CF'06, pages 1-8, New York, NY, USA, 2006.
    • (2006) CF'06 , pp. 1-8
    • Gschwind, M.1
  • 12
    • 70450249565 scopus 로고    scopus 로고
    • Intel. Intel microprocessor export compliance metrics, Februrary 2009.
    • Intel. Intel microprocessor export compliance metrics, Februrary 2009.
  • 14
    • 35348855586 scopus 로고    scopus 로고
    • Carbon: Architectural support for fine-grained parallelism on chip multiprocessors
    • New York, NY, USA
    • S. Kumar, C. J. Hughes, and A. Nguyen. Carbon: architectural support for fine-grained parallelism on chip multiprocessors. In ISCA'07, pages 162-173, New York, NY, USA, 2007.
    • (2007) ISCA'07 , pp. 162-173
    • Kumar, S.1    Hughes, C.J.2    Nguyen, A.3
  • 16
    • 44849137198 scopus 로고    scopus 로고
    • NVIDIA tesla: A unified graphics and computing architecture
    • E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym. NVIDIA tesla: A unified graphics and computing architecture. IEEE Micro, 28(2), 2008.
    • (2008) IEEE Micro , vol.28 , Issue.2
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 17
    • 66749170578 scopus 로고    scopus 로고
    • Tradeoffs in designing accelerator architectures for visual computing
    • A. Mahesri, D. Johnson, N. Crago, and S. J. Patel. Tradeoffs in designing accelerator architectures for visual computing. In MICRO'08, 2008.
    • (2008) MICRO'08
    • Mahesri, A.1    Johnson, D.2    Crago, N.3    Patel, S.J.4
  • 18
    • 84976718540 scopus 로고
    • Algorithms for scalable synchronization on shared-memory multiprocessors
    • J. M. Mellor-Crummey and M. L. Scott. Algorithms for scalable synchronization on shared-memory multiprocessors. ACM Trans. Comput. Syst., 9(1):21-65, 1991.
    • (1991) ACM Trans. Comput. Syst , vol.9 , Issue.1 , pp. 21-65
    • Mellor-Crummey, J.M.1    Scott, M.L.2
  • 20
    • 78651550268 scopus 로고    scopus 로고
    • Scalable parallel programming with CUDA
    • J. Nickolls, I. Buck, M. Garland, and K. Skadron. Scalable parallel programming with CUDA. Queue, 6(2), 2008.
    • (2008) Queue , vol.6 , Issue.2
    • Nickolls, J.1    Buck, I.2    Garland, M.3    Skadron, K.4
  • 24
    • 0030259457 scopus 로고    scopus 로고
    • Synchronization and communication in the T3E multiprocessor
    • S. L. Scott. Synchronization and communication in the T3E multiprocessor. In ASPLOS'96, pages 26-36, 1996.
    • (1996) ASPLOS'96 , pp. 26-36
    • Scott, S.L.1
  • 26
    • 0009384049 scopus 로고
    • The architecture of HEP
    • Massachusetts Institute of Technology
    • B. Smith. The architecture of HEP. In On Parallel MIMD computation, pages 41-55. Massachusetts Institute of Technology, 1985.
    • (1985) On Parallel MIMD computation , pp. 41-55
    • Smith, B.1
  • 30
    • 49549084422 scopus 로고    scopus 로고
    • A third-generation 65nm 16-core 32-thread plus 32-scout-thread CMT SPARC processor
    • Feb
    • M. Tremblay and S. Chaudhry. A third-generation 65nm 16-core 32-thread plus 32-scout-thread CMT SPARC processor. In ISSCC'08, Feb. 2008.
    • (2008) ISSCC'08
    • Tremblay, M.1    Chaudhry, S.2
  • 31
    • 0025467711 scopus 로고
    • A bridging model for parallel computation
    • L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8):103-111, 1990.
    • (1990) Communications of the ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.