Volume , Issue 2008 PROCEEDINGS, 2008, Pages 35-46

Virtual tree coherence: Leveraging regions and in-network multicast trees for scalable cache coherence

Author keywords

[No Author keywords available]

Indexed keywords

CACHE COHERENCE; COARSE-GRAINED; COHERENCE PROTOCOL; COMMERCIAL APPLICATIONS; COMPUTATION POWER; CORE SYSTEMS; EXECUTION TIME; GREEDY PROTOCOLS; HIGH BANDWIDTH COMMUNICATION; IN-NETWORK; MANY-CORE; MANY-CORE ARCHITECTURE; MULTICAST TREE; MULTICASTS; RUNTIME; SCALABLE COMMUNICATION; SERVER CONSOLIDATION; VIRTUAL TREE;

EID: 66749163103     PISSN: 10724451     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/MICRO.2008.4771777     Document Type: Conference Paper
Times cited : (65)

References (37)
  • 1
    • 44149085697
    • Reducing the interconnection network cost of chip multiprocessors
    • P. Abad, V. Puente, and J. Gregorio, "Reducing the interconnection network cost of chip multiprocessors," in NOCS, 2008.
    • (2008) NOCS
    • Abad, P.1    Puente, V.2    Gregorio, J.3
  • 2
    • 35348819913
    • Rotary router: An efficient architecture for cmp interconnection networks
    • P. Abad, V. Puente, J. Gregorio, and P. Prieto, "Rotary router: an efficient architecture for cmp interconnection networks," in ISCA, 2007.
    • (2007) ISCA
    • Abad, P.1    Puente, V.2    Gregorio, J.3    Prieto, P.4
  • 4
    • 27544481926
    • Variability in architectural simulations of multi-threaded workloads
    • A. R. Alameldeen and D. A. Wood, "Variability in architectural simulations of multi-threaded workloads," in Proceedings of HPCA-9, 2003.
    • (2003) Proceedings of HPCA-9
    • Alameldeen, A.R.1    Wood, D.A.2
  • 6
    • 0027309859
    • The performance of cache-coherent ring-based multiprocessors
    • L. A. Barroso and M. Dubois, "The performance of cache-coherent ring-based multiprocessors," in ISCA-20, 1993.
    • (1993) ISCA-20
    • Barroso, L.A.1    Dubois, M.2
  • 9
    • 27544506862
    • Improving multiprocessor performance with coarse-grain coherence tracking
    • J. F. Cantin, M. H. Lipasti, and J. E. Smith, "Improving multiprocessor performance with coarse-grain coherence tracking," in ISCA-32, 2005.
    • (2005) ISCA-32
    • Cantin, J.F.1    Lipasti, M.H.2    Smith, J.E.3
  • 11
    • 0033099692
    • An efficient tree cache coherence protocol for distributed shared memory multiprocessors
    • Y. Chang and L. N. Bhuyan, "An efficient tree cache coherence protocol for distributed shared memory multiprocessors," IEEE Transactions on Computers, vol. 48, no. 3, 1998.
    • (1998) IEEE Transactions on Computers , vol.48 , Issue.3
    • Chang, Y.1    Bhuyan, L.N.2
  • 12
    • 0025433355
    • Virtual-channel flow control
    • W. J. Dally, "Virtual-channel flow control," in ISCA, 1990.
    • (1990) ISCA
    • Dally, W.J.1
  • 14
    • 52649171528
    • Virtual circuit tree multicasting: A case for on-chip hardware multicast support
    • N. Enright Jerger, L.-S. Peh, and M. H. Lipasti, "Virtual circuit tree multicasting: A case for on-chip hardware multicast support," in Proceedings of ISCA-35, 2008.
    • (2008) Proceedings of ISCA-35
    • Enright Jerger, N.1    Peh, L.-S.2    Lipasti, M.H.3
  • 15
    • 47349085587
    • An evaluation of server consolidation workloads for multi-core designs
    • N. Enright Jerger, D. Vantrease, and M. Lipasti, "An evaluation of server consolidation workloads for multi-core designs," in IISWC, 2007.
    • (2007) IISWC
    • Enright Jerger, N.1    Vantrease, D.2    Lipasti, M.3
  • 17
    • 0030685588
    • The SGI Origin: A ccNUMA highly scalable server
    • J. Laudon and D. Lenoski, "The SGI Origin: a ccNUMA highly scalable server," in ISCA-24, 1997.
    • (1997) ISCA-24
    • Laudon, J.1    Lenoski, D.2
  • 18
    • 84968853465
    • Redeeming IPC as a performance metric for multithreaded programs
    • K. M. Lepak, H. W. Cain, and M. H. Lipasti, "Redeeming IPC as a performance metric for multithreaded programs," in Proceeding of 12th PACT, 2003, pp. 232-243.
    • (2003) Proceeding of 12th PACT , pp. 232-243
    • Lepak, K.M.1    Cain, H.W.2    Lipasti, M.H.3
  • 19
    • 0038684776
    • Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors
    • June
    • M. M. K. Martin, P. J. Harper, D. J. Sorin, M. D. Hill, and D. A. Wood, "Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors," in Proceedings of the 30th ISCA, June 2003.
    • (2003) Proceedings of the 30th ISCA
    • Martin, M.M.K.1    Harper, P.J.2    Sorin, D.J.3    Hill, M.D.4    Wood, D.A.5
  • 20
    • 0038346234
    • Token coherence: Decoupling performance and correctness
    • M. M. K. Martin, M. D. Hill, and D. A. Wood, "Token coherence: Decoupling performance and correctness," in ISCA-30, 2003.
    • (2003) ISCA-30
    • Martin, M.M.K.1    Hill, M.D.2    Wood, D.A.3
  • 24
    • 40349100696
    • Coherence ordering for ring-based chip multiprocessors
    • December
    • M. R. Marty and M. D. Hill, "Coherence ordering for ring-based chip multiprocessors," in MICRO-39, December 2006.
    • (2006) MICRO-39
    • Marty, M.R.1    Hill, M.D.2
  • 25
    • 35348900723
    • Virtual hierarchies to support server consolidation
    • M. R. Marty, "Virtual hierarchies to support server consolidation," in ISCA-34, 2007.
    • (2007) ISCA-34
    • Marty, M.R.1
  • 26
    • 27544455733
    • RegionScout: Exploiting coarse grain sharing in snoop-based coherence
    • A. Moshovos, "RegionScout: Exploiting coarse grain sharing in snoop-based coherence," in ISCA-32, 2005.
    • (2005) ISCA-32
    • Moshovos, A.1
  • 28
    • 0002979865
    • The scalable tree protocol - a cache coherence approach for large-scale multiprocessors
    • H. Nilsson and P. Stenstrom, "The scalable tree protocol - a cache coherence approach for large-scale multiprocessors," in IPDPS, 1992.
    • (1992) IPDPS
    • Nilsson, H.1    Stenstrom, P.2
  • 30
    • 24644502365
    • SPEC
    • SPEC, "SPEC benchmarks," http://www.spec.org.
    • SPEC benchmarks
  • 32
    • 47349125701
    • Uncorq: Unconstrained snoop request delivery in embedded-ring multiprocessors
    • K. Strauss, "Uncorq: Unconstrained snoop request delivery in embedded-ring multiprocessors," in MICRO-40, 2007.
    • (2007) MICRO-40
    • Strauss, K.1
  • 33
    • 84871283702
    • TPC
    • TPC, "TPC benchmarks," http://www.tpc.org.
    • TPC benchmarks
  • 35
    • 36849030305
    • On-chip interconnection architecture of the tile processor
    • D. Wentzlaff, P. Griffin, H. Hoffman, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. B. III, and A. Agarwal, "On-chip interconnection architecture of the tile processor," IEEE Micro, pp. 15-31, 2007.
  • 36
    • 0029194459
    • The SPLASH-2 programs: Characterization and methodological considerations
    • June
    • S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta, "The SPLASH-2 programs: Characterization and methodological considerations," in ISCA-22, June 1995.
    • (1995) ISCA-22
    • Woo, S.1    Ohara, M.2    Torrie, E.3    Singh, J.4    Gupta, A.5
  • 37
    • 47349115313
    • A framework for coarse-grain optimizations in the on-chip memory hierarchy
    • J. Zebchuk, E. Safi, and A. Moshovos, "A framework for coarse-grain optimizations in the on-chip memory hierarchy," in MICRO-40, 2007.
    • (2007) MICRO-40
    • Zebchuk, J.1    Safi, E.2    Moshovos, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.