메뉴 건너뛰기




Volumn , Issue , 2009, Pages 229-239

Effective performance measurement and analysis of multithreaded applications

Author keywords

Call Path Profiling; HPCTOOLKIT; Multithreaded Programming Models; Performance Analysis

Indexed keywords

CALL PATH PROFILING; EFFECTIVE PERFORMANCE; HPCTOOLKIT; LEVEL MODEL; MEASUREMENT COSTS; MULTI-CORE PROCESSOR; MULTI-THREADED APPLICATION; MULTI-THREADED PROGRAMS; MULTITHREADED PROGRAMMING; MULTITHREADED PROGRAMMING MODELS; PARALLELIZATION; PERFORMANCE ANALYSIS; PERFORMANCE METRICS; POSTMORTEM ANALYSIS; PRACTICAL IMPORTANCE; PROGRAMMING MODELS; PTHREADS; RICE UNIVERSITY; RUNTIME; SHARED MEMORIES;

EID: 67650034867     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1504176.1504210     Document Type: Conference Paper
Times cited : (57)

References (26)
  • 2
    • 0030645124 scopus 로고    scopus 로고
    • Exploiting Hardware Performance Counters with Flow and Context Sensitive Profiling
    • G. Ammons, T. Ball, and J. R. Larus. Exploiting hardware performance counters with flow and context sensitive profiling. In SIGPLAN Conference on Programming Language Design and Implementation, pages 85-96, New York, NY, USA, 1997. ACM Press. (Pubitemid 127453689)
    • (1997) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.32 , Issue.5 , pp. 85-96
    • Ammons, G.1    Ball, T.2    Larus, J.R.3
  • 3
    • 0025567275 scopus 로고
    • Quartz. A tool for tuning parallel program performance
    • Proc 1990 ACM Sigmetrics Conf Meas Model Comput Syst
    • T. E. Anderson and E. D. Lazowska. Quartz: a tool for tuning parallel program performance. SIGMETRICS Perform. Eval. Rev., 18(1):115-125, 1990. (Pubitemid 20728309)
    • (1990) SIGMETRICS Perform. Eval. Rev. , pp. 115-125
    • Anderson Thomas, E.1    Lazowska Edward, D.2
  • 4
    • 84869364257 scopus 로고    scopus 로고
    • Apple Computer. Shark. http://developer.apple.com/tools/ sharkoptimize.html.
    • Shark
  • 5
    • 33646598714 scopus 로고    scopus 로고
    • Portable and accurate sampling profiling for Java
    • W. Binder. Portable and accurate sampling profiling for Java. Softw. Pract. Exper., 36(6):615-650, 2006.
    • (2006) Softw. Pract. Exper. , vol.36 , Issue.6 , pp. 615-650
    • Binder, W.1
  • 7
    • 0004224686 scopus 로고    scopus 로고
    • Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA
    • D. R. Butenhof. Programming with POSIX threads. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1997.
    • (1997) Programming with POSIX Threads
    • Butenhof, D.R.1
  • 9
    • 33846480613 scopus 로고    scopus 로고
    • A performance counter architecture for computing accurate CPI components
    • S. Eyerman, L. Eeckhout, T. Karkhanis, and J. E. Smith. A performance counter architecture for computing accurate CPI components. SIGPLAN Not., 41(11):175-184, 2006. (Pubitemid 46160722)
    • (2006) ACM SIGPLAN Notices , vol.41 , Issue.11 , pp. 175-184
    • Eyerman, S.1    Eeckhout, L.2    Karkhanis, T.3    Smith, J.E.4
  • 10
    • 0031622953 scopus 로고    scopus 로고
    • The Implementation of the Cilk-5 Multithreaded Language
    • M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. In Proceedings of the ACM SIGPLAN '98 Conference on Programming Language Design and Implementation, pages 212-223, Montreal, Quebec, Canada, June 1998. Proceedings published ACM SIGPLAN Notices, Vol.33, No. 5, May, 1998. (Pubitemid 128454798)
    • (1998) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.33 , Issue.5 , pp. 212-223
    • Frigo, M.1    Leiserson, C.E.2    Randall, K.H.3
  • 11
    • 32844470371 scopus 로고    scopus 로고
    • Low-overhead call path profiling of unmodified, optimized code
    • DOI 10.1145/1088149.1088161, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
    • N. Froyd, J. Mellor-Crummey, and R. Fowler. Low-overhead call path profiling of unmodified, optimized code. In ICS '05: Proceedings of the 19th annual International Conference on Supercomputing, pages 81-90, New York, NY, USA, 2005. ACM Press. (Pubitemid 43251312)
    • (2005) Proceedings of the International Conference on Supercomputing , pp. 81-90
    • Froyd, N.1    Mellor-Crummey, J.2    Fowler, R.3
  • 18
  • 19
    • 42549162260 scopus 로고    scopus 로고
    • Power/performance/thermal design-space exploration for multicore architectures
    • DOI 10.1109/TPDS.2007.70756
    • M. Monchiero, R. Canal, and A. Gonzalez. Power/performance/thermal design-space exploration for multicore architectures. IEEE Transactions on Parallel and Distributed Systems, 19(5):666-681, May 2008. (Pubitemid 351583569)
    • (2008) IEEE Transactions on Parallel and Distributed Systems , vol.19 , Issue.5 , pp. 666-681
    • Monchiero, M.1    Canal, R.2    Gonzalez, A.3
  • 21
    • 35348907981 scopus 로고    scopus 로고
    • Identifying potential parallelism via loop-centric profiling
    • DOI 10.1145/1242531.1242554, 2007 Computing Frontiers, Conference Proceedings
    • T. Moseley, D. A. Connors, D. Grunwald, and R. Peri. Identifying potential parallelism via loop-centric profiling. In CF '07: Proceedings of the 4th international conference on Computing frontiers, pages 143-152, New York, NY, USA, 2007. ACM. (Pubitemid 47582219)
    • (2007) 2007 Computing Frontiers, Conference Proceedings , pp. 143-152
    • Moseley, T.1    Connors, D.A.2    Grunwald, D.3    Peri, R.4
  • 22
    • 84869340092 scopus 로고    scopus 로고
    • OpenMP Architecture Review Board
    • May
    • OpenMP Architecture Review Board. OpenMP application program interface, version 3.0. http://www.openmp.org/mp-documents/spec30.pdf, May 2008.
    • (2008) OpenMP Application Program Interface Version 3.0


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.