메뉴 건너뛰기




Volumn , Issue , 2005, Pages 247-254

System-wide performance monitors and their application to the optimization of coherent memory accesses

Author keywords

Coherence traffic; Performance monitors

Indexed keywords

COHERENCE PROTOCOL; COHERENCE TRAFFIC; PERFORMANCE MONITORS;

EID: 31844432634     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1065944.1065977     Document Type: Conference Paper
Times cited : (2)

References (16)
  • 1
    • 31844449896 scopus 로고    scopus 로고
    • Owner prediction for accelerating cache-to-cache transfer misses in a cc-NUMA architecture
    • M. E. Acacio et al. Owner prediction for accelerating cache-to-cache transfer misses in a cc-NUMA architecture. In Proc. of Supercomputing SC02, pages 1-12, 2002.
    • (2002) Proc. of Supercomputing SC02 , pp. 1-12
    • Acacio, M.E.1
  • 2
    • 31844454881 scopus 로고    scopus 로고
    • Design and experience: Using the Intel Itanium2 processor performance monitoring unit to implement feedback optimizations
    • Y. Choi, A. Knies, G. Vedaraman, and J. Williamson. Design and experience: Using the Intel Itanium2 processor performance monitoring unit to implement feedback optimizations. In EPIC2 Workshop, 2002.
    • (2002) EPIC2 Workshop
    • Choi, Y.1    Knies, A.2    Vedaraman, G.3    Williamson, J.4
  • 3
    • 0027699767 scopus 로고
    • Cooperative shared memory: Software and hardware for scalable multiprocessors
    • Nov.
    • M. D. Hill, J. R. Larus, S. K. Reinhardt, and D. A. Wood. Cooperative shared memory: Software and hardware for scalable multiprocessors. ACM Trans. on Comp. Sys., 11(4):300-318, Nov. 1993.
    • (1993) ACM Trans. on Comp. Sys. , vol.11 , Issue.4 , pp. 300-318
    • Hill, M.D.1    Larus, J.R.2    Reinhardt, S.K.3    Wood, D.A.4
  • 5
    • 3042569221 scopus 로고    scopus 로고
    • Physical experimentation with prefetching helper threads on Intel's hyper-threaded processors
    • Palo Alto, CA, Mar.
    • D. Kim et al. Physical experimentation with prefetching helper threads on Intel's hyper-threaded processors. In Proc. 2nd Symp. on Code Gen. and Optim (CGO), pages 27-38, Palo Alto, CA, Mar. 2004.
    • (2004) Proc. 2nd Symp. on Code Gen. and Optim (CGO) , pp. 27-38
    • Kim, D.1
  • 6
    • 31844447723 scopus 로고    scopus 로고
    • Compiler support for data forwarding in scalable shared-memory multiprocessors
    • D. Koufaty and J. Torrellas. Compiler support for data forwarding in scalable shared-memory multiprocessors. In Intl. Conf. on Parallel Proc., 1999.
    • (1999) Intl. Conf. on Parallel Proc.
    • Koufaty, D.1    Torrellas, J.2
  • 8
    • 3042613777 scopus 로고    scopus 로고
    • Ispike: A post-link optimizer for the Intel Itanium2 architecture
    • Palo Alto, CA, Mar.
    • C. Luk et al. Ispike: A post-link optimizer for the Intel Itanium2 architecture. In Proc. 2nd Intl. Symp. on Code Generation and Optimization (CGO), pages 15-26, Palo Alto, CA, Mar. 2004.
    • (2004) Proc. 2nd Intl. Symp. on Code Generation and Optimization (CGO) , pp. 15-26
    • Luk, C.1
  • 9
    • 0038684776 scopus 로고    scopus 로고
    • Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors
    • M. M. K. Martin et al. Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors. In Proc. 30th Intl. Symp. on Computer Arch. (ISCA), pages 206-217, 2003.
    • (2003) Proc. 30th Intl. Symp. on Computer Arch. (ISCA) , pp. 206-217
    • Martin, M.M.K.1
  • 11
    • 8344246921 scopus 로고    scopus 로고
    • Detailed cache coherence characterization for OpenMP benchmarks
    • Saint-Malo, France, June
    • A. Nagarajan, J. Marathe, and F. Mueller. Detailed cache coherence characterization for OpenMP benchmarks. In Proc. Intl. Conf. on Supercomputing (ICS), pages 287-297, Saint-Malo, France, June 2004.
    • (2004) Proc. Intl. Conf. on Supercomputing (ICS) , pp. 287-297
    • Nagarajan, A.1    Marathe, J.2    Mueller, F.3
  • 12
    • 85117253089 scopus 로고    scopus 로고
    • SMP system interconnect instrumentation for performance analysis
    • Baltimore, Maryland, Nov.
    • L. Noordergraaf and R. Zak. SMP system interconnect instrumentation for performance analysis. In Proc. of Supercomputing, SC-2002, Baltimore, Maryland, Nov. 2002.
    • (2002) Proc. of Supercomputing, SC-2002
    • Noordergraaf, L.1    Zak, R.2
  • 13
    • 31844447052 scopus 로고    scopus 로고
    • Owl: Next generation system monitoring
    • Ischia, Italy, May
    • M. S. others. Owl: Next generation system monitoring. In Proc. of Computing Frontiers 2005, Ischia, Italy, May 2005.
    • (2005) Proc. of Computing Frontiers 2005
    • Others, M.S.1
  • 15
    • 31844442351 scopus 로고    scopus 로고
    • Evaluating system-wide monitoring capsule design using Xilinx Virtex-II Pro FPGA
    • San Francisco, CA, Feb.
    • T. Suh et al. Evaluating system-wide monitoring capsule design using Xilinx Virtex-II Pro FPGA. In Workshop on Arch. Res. using FPGA Platforms, in conj. with HPCA05, San Francisco, CA, Feb. 2005.
    • (2005) Workshop on Arch. Res. Using FPGA Platforms, in Conj. with HPCA05
    • Suh, T.1
  • 16
    • 0029179077 scopus 로고
    • The SPLASH-2 programs: Characterization and methodological considerations
    • Santa Margherita Ligure, Italy, June
    • S. C. Woo et al. The SPLASH-2 programs: Characterization and methodological considerations. In Proc. 22nd Intl, Symp. on Computer Arch., pages 24-36, Santa Margherita Ligure, Italy, June 1995.
    • (1995) Proc. 22nd Intl, Symp. on Computer Arch. , pp. 24-36
    • Woo, S.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.