SCOPUS 정보 검색 플랫폼

Proceedings of the Annual International Symposium on Microarchitecture, MICRO

Volumn , Issue 2008 PROCEEDINGS, 2008, Pages 35-46

Virtual tree coherence: Leveraging regions and in-network multicast trees for scalable cache coherence

(3) Jerger, Natalie D Enright a Peh, Li Shiuan b Lipasti, Mikko H a

a University of Wisconsin Madison (United States)

b Princeton University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CACHE COHERENCE; COARSE-GRAINED; COHERENCE PROTOCOL; COMMERCIAL APPLICATIONS; COMPUTATION POWER; CORE SYSTEMS; EXECUTION TIME; GREEDY PROTOCOLS; HIGH BANDWIDTH COMMUNICATION; IN-NETWORK; MANY-CORE; MANY-CORE ARCHITECTURE; MULTICAST TREE; MULTICASTS; RUNTIME; SCALABLE COMMUNICATION; SERVER CONSOLIDATION; VIRTUAL TREE;

MULTICASTING; PROGRAM PROCESSORS;

TELECOMMUNICATION SYSTEMS;

EID: 66749163103 PISSN: 10724451 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/MICRO.2008.4771777 Document Type: Conference Paper

Times cited : (65)

References (37)

1
- 44149085697
- Reducing the interconnection network cost of chip multiprocessors
- P. Abad, V. Puente, and J. Gregorio, "Reducing the interconnection network cost of chip multiprocessors," in NOCS, 2008.
- (2008) NOCS
- Abad, P.¹ Puente, V.² Gregorio, J.³

2
- 35348819913
- Rotary router: An efficient architecture for cmp interconnection networks
- P. Abad, V. Puente, J. Gregorio, and P. Prieto, "Rotary router: an efficient architecture for cmp interconnection networks," in ISCA, 2007.
- (2007) ISCA
- Abad, P.¹ Puente, V.² Gregorio, J.³ Prieto, P.⁴

3
- 66749188683
- Power-Aware DRAM Speculation
- N. Aggarwal, J. Cantin, M. Lipasti, and J. E. Smith, "Power-Aware DRAM Speculation," in HPCA-12, 2008.
- (2008) HPCA-12
- Aggarwal, N.¹ Cantin, J.² Lipasti, M.³ Smith, J.E.⁴

4
- 27544481926
- Variability in architectural simulations of multi-threaded workloads
- A. R. Alameldeen and D. A. Wood, "Variability in architectural simulations of multi-threaded workloads," in Proceedings of HPCA-9, 2003.
- (2003) Proceedings of HPCA-9
- Alameldeen, A.R.¹ Wood, D.A.²

5
- 34547471544
- Design tradeoffs for tiled cmp on-chip networks
- J. Balfour and W. Dally, "Design tradeoffs for tiled cmp on-chip networks," in International Conference on Supercomputing, 2006.
- (2006) International Conference on Supercomputing
- Balfour, J.¹ Dally, W.²

6
- 0027309859
- The performance of cache-coherent ringbased multiprocessors
- L. A. Barroso and M. Dubois, "The performance of cache-coherent ringbased multiprocessors," in ISCA-20, 1993.
- (1993) ISCA-20
- Barroso, L.A.¹ Dubois, M.²

7
- 0032647513
- Multicast snooping: A new coherence method using a multicast address network
- May
- E. Bilir, R. Dickson, Y. Hu, M. Plakal, D. Sorin, M. Hill, and D. Wood, "Multicast snooping: A new coherence method using a multicast address network," in Proc. of ISCA, May 1999.
- (1999) Proc. of ISCA
- Bilir, E.¹ Dickson, R.² Hu, Y.³ Plakal, M.⁴ Sorin, D.⁵ Hill, M.⁶ Wood, D.⁷

8
- 0005369980
- Precise and accurate processor simulation
- H. Cain, K. Lepak, B. Schwarz, and M. H. Lipasti, "Precise and accurate processor simulation," in Workshop On Computer Architecture Evaluation using Commercial Workloads, 2002.
- (2002) Workshop On Computer Architecture Evaluation using Commercial Workloads
- Cain, H.¹ Lepak, K.² Schwarz, B.³ Lipasti, M.H.⁴

9
- 27544506862
- Improving multiprocessor performance with coarse-grain coherence tracking
- J. F. Cantin, M. H. Lipasti, and J. E. Smith, "Improving multiprocessor performance with coarse-grain coherence tracking," in ISCA-32, 2005.
- (2005) ISCA-32
- Cantin, J.F.¹ Lipasti, M.H.² Smith, J.E.³

10
- 34547449252
- Stealth prefetching
- J. F. Cantin, "Stealth prefetching," in International Conference on Architectural Support for Programming Languages and Operating Systems, 2006.
- (2006) International Conference on Architectural Support for Programming Languages and Operating Systems
- Cantin, J.F.¹

11
- 0033099692
- An efficient tree cache coherence protocol for distributed shared memory multiprocessors
- Y. Chang and L. N. Bhuyan, "An efficient tree cache coherence protocol for distributed shared memory multiprocessors," IEEE Transactions on Computers, vol. 48, no. 3, 1998.
- (1998) IEEE Transactions on Computers , vol.48 , Issue.3
- Chang, Y.¹ Bhuyan, L.N.²

12
- 0025433355
- Virtual-channel flow control
- W J. Dally, "Virtual-channel flow control," in ISCA, 1990.
- (1990) ISCA
- Dally, W.J.¹

13
- 40349091835
- In-network cache coherence
- N. Eisley, L.-S. Peh, and L. Shang, "In-network cache coherence," in International Symposium on Microarchitecture, 2006.
- (2006) International Symposium on Microarchitecture
- Eisley, N.¹ Peh, L.-S.² Shang, L.³

14
- 52649171528
- Virtual circuit tree multicasting: A case for on-chip hardware multicast support
- N. Enright Jerger, L.-S. Peh, and M. H. Lipasti, "Virtual circuit tree multicasting: A case for on-chip hardware multicast support," in Proceedings of ISCA-35, 2008.
- (2008) Proceedings of ISCA-35
- Enright Jerger, N.¹ Peh, L.-S.² Lipasti, M.H.³

15
- 47349085587
- An evaluation of server consolidation workloads for multi-core designs
- N. Enright Jerger, D. Vanatrease, and M. Lipasti, "An evaluation of server consolidation workloads for multi-core designs," in IISWC, 2007.
- (2007) IISWC
- Enright Jerger, N.¹ Vanatrease, D.² Lipasti, M.³

16
- 0039335269
- Architecture and design of AlphaServer GS320
- K. Gharachorloo, M. Sharma, S. Steely, and S. V Doren, "Architecture and design of AlphaServer GS320," in Architectural Support for Programming Languages and Operating Systems, 2000.
- (2000) Architectural Support for Programming Languages and Operating Systems
- Gharachorloo, K.¹ Sharma, M.² Steely, S.³ Doren, S.V.⁴

17
- 0030685588
- The SGI Origin: A ccNUMA highly scalable server
- J. Laudon and D. Lenoski, "The SGI Origin: a ccNUMA highly scalable server," in ISCA-24, 1997.
- (1997) ISCA-24
- Laudon, J.¹ Lenoski, D.²

18
- 84968853465
- Redeeming IPC as a performance metric for multithreaded programs
- K. M. Lepak, H. W Cain, and M. H. Lipasti, "Redeeming IPC as a performance metric for multithreaded programs," in Proceeding of 12th PACT, 2003, pp. 232-243.
- (2003) Proceeding of 12th PACT , pp. 232-243
- Lepak, K.M.¹ Cain, H.W.² Lipasti, M.H.³

19
- 0038684776
- Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors
- June
- M. M. K. Martin, P. J. Harper, D. J. Sorin, M. D. Hill, and D. A. Wood, "Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors," in Proceedings of the 30th ISCA, June 2003.
- (2003) Proceedings of the 30th ISCA
- Martin, M.M.K.¹ Harper, P.J.² Sorin, D.J.³ Hill, M.D.⁴ Wood, D.A.⁵

20
- 0038346234
- Token coherence: Decoupling performance and correctness
- M. M. K. Martin, M. D. Hill, and D. A. Wood, "Token coherence: Decoupling performance and correctness," in ISCA-30, 2003.
- (2003) ISCA-30
- Martin, M.M.K.¹ Hill, M.D.² Wood, D.A.³

21
- 33745130597
- Bandwidth adaptive snooping
- M. M. K. Martin, D. J. Sorin, M. D. Hill, and D. A. Wood, "Bandwidth adaptive snooping," in HPCA-8, 2002.
- (2002) HPCA-8
- Martin, M.M.K.¹ Sorin, D.J.² Hill, M.D.³ Wood, D.A.⁴

22
- 66749153837
- Cache coherence techniques for multicore processors
- M. R. Marty, Cache coherence techniques for multicore processors," in PhD Dissertation, University of Wisconsin - Madison, 2008.
- (2008) PhD Dissertation, University of Wisconsin - Madison
- Marty, M.R.¹

23
- 28444472751
- Improving multiple-cmp systems using token coherenece
- February
- M. R. Marty, J. D. Bingham, M. D. Hill, A. J. Hu, M. M. K. Martin, and D. A. Wood, "Improving multiple-cmp systems using token coherenece," in HPCA, February 2005.
- (2005) HPCA
- Marty, M.R.¹ Bingham, J.D.² Hill, M.D.³ Hu, A.J.⁴ Martin, M.M.K.⁵ Wood, D.A.⁶

24
- 40349100696
- Coherence ordering for ring-based chip multiprocessors
- December
- M. R. Marty and M. D. Hill, "Coherence ordering for ring-based chip multiprocessors," in MICRO-39, December 2006.
- (2006) MICRO-39
- Marty, M.R.¹ Hill, M.D.²

25
- 35348900723
- Virtual hierarchies to support server consolidation
- M. R. Marty, "Virtual hierarchies to support server consolidation," in ISCA-34, 2007.
- (2007) ISCA-34
- Marty, M.R.¹

26
- 27544455733
- Regionscout: Exploiting coarse grain sharing in snoop-based coherence
- A. Moshovos, "Regionscout: Exploiting coarse grain sharing in snoop-based coherence." in ISCA-32, 2005.
- (2005) ISCA-32
- Moshovos, A.¹

27
- 0036167929
- The Alpha 21364 network architecture
- S. S. Mukherjee, P. Bannon, S. Lang, A. Spink, and D. Webb, "The Alpha 21364 network architecture," IEEE Micro, vol. 22, no. 1, pp. 26-35, 2002.
- (2002) IEEE Micro , vol.22 , Issue.1 , pp. 26-35
- Mukherjee, S.S.¹ Bannon, P.² Lang, S.³ Spink, A.⁴ Webb, D.⁵

28
- 0002979865
- The scalable tree protocol - a cache coherence approach for large-scale multiprocessors
- H. Nilsson and P. Stenstrom, "The scalable tree protocol - a cache coherence approach for large-scale multiprocessors," in IPDPS, 1992.
- (1992) IPDPS
- Nilsson, H.¹ Stenstrom, P.²

29
- 25844437046
- Power5 system microarchitecture
- B. Sinharoy, R. Kalla, J. Tendler, R. Eickemeyer, and J. Joyner, "Power5 system microarchitecture," IBM Journal of Research and Development, vol. 49, no. 4, 2005.
- (2005) IBM Journal of Research and Development , vol.49 , Issue.4
- Sinharoy, B.¹ Kalla, R.² Tendler, J.³ Eickemeyer, R.⁴ Joyner, J.⁵

30
- 24644502365
- SPEC
- SPEC, "SPEC benchmarks," http://www.spec.org.
- SPEC benchmarks

31
- 33845886092
- Flexible snooping: Adaptive forwarding and filtering of snoops in embedded ring multiprocessors
- K. Strauss, X. Shen, and J. Torrellas, 'Flexible snooping: Adaptive forwarding and filtering of snoops in embedded ring multiprocessors," in International Symposium on Computer Architecture, 2006.
- (2006) International Symposium on Computer Architecture
- Strauss, K.¹ Shen, X.² Torrellas, J.³

32
- 47349125701
- Uncorq: Unconstrained snoop request delivery in embedded-ring multiprocessors
- K. Strauss, "Uncorq: Unconstrained snoop request delivery in embedded-ring multiprocessors," in MICRO-40, 2007.
- (2007) MICRO-40
- Strauss, K.¹

33
- 84871283702
- TPC
- TPC, "TPC benchmarks," http://www.tpc.org.
- TPC benchmarks

34
- 34548858682
- An 80-tile 1.28 tflops network-on-chip in 65nm cmos
- S. Vangal, J. Howard, G. Ruhl, S. Dighe, H. Wilson, J. Tschanz, D. Finan, P. Iyer, A. Singh, T. Jacob, S. Jain, S. Venkataraman, Y. Hoskote, and N. Borkar, "An 80-tile 1.28 tflops network-on-chip in 65nm cmos," in IEEE International Solid State Circuit Conference, 2007.
- (2007) IEEE International Solid State Circuit Conference
- Vangal, S.¹ Howard, J.² Ruhl, G.³ Dighe, S.⁴ Wilson, H.⁵ Tschanz, J.⁶ Finan, D.⁷ Iyer, P.⁸ Singh, A.⁹ Jacob, T.¹⁰ Jain, S.¹¹ Venkataraman, S.¹² Hoskote, Y.¹³ Borkar, N.¹⁴

35
- 36849030305
- D. Wentzlaff, P. Griffin, H. Hoffman, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. B. III, and A. Agarwal, On-chip interconnection architecture of the tile processor, IEEE Micro, pp. 15-31, 2007.
- D. Wentzlaff, P. Griffin, H. Hoffman, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. B. III, and A. Agarwal, "On-chip interconnection architecture of the tile processor," IEEE Micro, pp. 15-31, 2007.

36
- 0029194459
- The SPLASH-2 programs: Characterization and methodological considerations
- June
- S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta, "The SPLASH-2 programs: Characterization and methodological considerations," in ISCA-22, June 1995.
- (1995) ISCA-22
- Woo, S.¹ Ohara, M.² Torrie, E.³ Singh, J.⁴ Gupta, A.⁵

37
- 47349115313
- A framework for coarse-grain optimizations in the on-chip memory hierarchy
- J. Zebchuk, E. Safi, and A. Moshovos, "A framework for coarse-grain optimizations in the on-chip memory hierarchy," in MICRO-40, 2007.
- (2007) MICRO-40
- Zebchuk, J.¹ Safi, E.² Moshovos, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.