Volume 22, Issue 6, 2010, Pages 685-701

HPCTOOLKIT: Tools for performance analysis of optimized parallel programs

Author keywords

Binary analysis; Call path profiling; Execution monitoring; Performance tools; Tracing

Indexed keywords

PROGRAM COMPILERS; SPACE TIME CODES; USER INTERFACES;

EID: 77950611743     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe     Document Type: Article
Times cited: 584

References (44)
  • 4
    • 32844470371
    • Low-overhead call path profiling of unmodified, optimized code
    • DOI 10.1145/1088149.1088161, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
    • Froyd N, Mellor-Crummey JM, Fowler R. Low-overhead call path profiling of unmodified, optimized code. Proceedings of the 19th Annual International Conference on Supercomputing. ACM Press: New York, NY, U.S.A., 2005; 81-90. (Pubitemid 43251312)
    • (2005) Proceedings of the International Conference on Supercomputing , pp. 81-90
    • Froyd, N.1    Mellor-Crummey, J.2    Fowler, R.3
  • 5
    • 77950623122
    • Intel Corporation. Intel VTune performance analyzer. Available at, 2 December
    • Intel Corporation. Intel VTune performance analyzer. Available at: http://software.intel.com/en-us/intel-vtune [2 December 2009].
    • (2009)
  • 6
    • 77950608876
    • Intel Corporation. Intel Performance Tuning Utility. Available at, 2 December
    • Intel Corporation. Intel Performance Tuning Utility. Available at: http://software.intel.com/en-us/articles/intel-performancetuning-utility [2 December 2009].
    • (2009)
  • 11
  • 16
    • 0032544628
    • Turbulent transport reduction by zonal flows: Massively parallel simulations
    • DOI 10.1126/science.281.5384.1835
    • Lin Z, Hahm TS, Lee WW, Tang WM, White RB. Turbulent transport reduction by zonal flows: Massively parallel simulations. Science 1998; 281(5384):1835-1837. (Pubitemid 28450499)
    • (1998) Science , vol.281 , Issue.5384 , pp. 1835-1837
    • Lin, Z.1    Hahm, T.S.2    Lee, W.W.3    Tang, W.M.4    White, R.B.5
  • 21
    • 84974695561
    • A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications
    • OpenMP Shared Memory Parallel Programming International Workshop on OpenMP Applications and Tools, WOMPAT 2001 West Lafayette, IN, USA, July 30-31, 2001 Proceedings
    • Caubet J, Gimenez J, Labarta J, DeRose L, Vetter JS. A dynamic tracing mechanism for performance analysis of OpenMP applications. Proceedings of the International Workshop on OpenMP Applications and Tools. Springer: London, U.K., 2001; 53-67. (Pubitemid 33315607)
    • (2001) LECTURE NOTES IN COMPUTER SCIENCE , Issue.2104 , pp. 53-67
    • Caubet, J.1    Gimenez, J.2    Labarta, J.3    DeRose, L.4    Vetter, J.5
  • 24
    • 0005973264
    • Origin 2000 and Onyx2 performance tuning and optimization guide
    • Silicon Graphics, Inc.
    • Cortesi D, Fier J, Wilson J, Boney J. Origin 2000 and Onyx2 performance tuning and optimization guide. Technical Report 007-3430-003, Silicon Graphics, Inc., 2001.
    • (2001) Technical Report 007-3430-003
    • Cortesi, D.1    Fier, J.2    Wilson, J.3    Boney, J.4
  • 27
    • 51849136706
    • SpeedShop user's guide
    • Silicon Graphics Inc. (SGI), SGI
    • Silicon Graphics, Inc. (SGI). SpeedShop User's Guide. Technical Report 007-3311-011, SGI, 2003.
    • (2003) Technical Report 007-3311-011
  • 28
    • 84875944868
    • Krell Institute, Available at
    • Krell Institute. Open SpeedShop for Linux. Available at: http://www.openspeedshop.org.
    • Open SpeedShop for Linux
  • 32
    • 0036036949
    • Dynamic statistical profiling of communication activity in distributed applications
    • Vetter J. Dynamic statistical profiling of communication activity in distributed applications. Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. ACM Press: New York, NY, U.S.A., 2002; 240-250. (Pubitemid 35009526)
    • (2002) Performance Evaluation Review , vol.30 , Issue.1 , pp. 240-250
    • Vetter, J.1
  • 34
    • 77950605550
    • GASP! A standardized performance analysis tool interface for global address space programming models
    • Lawrence Berkeley National Laboratory
    • Su H-H, Bonachea D, Leko A, Sherburne H, Billingsley III M, George AD. GASP! A standardized performance analysis tool interface for global address space programming models. Technical Report LBNL-61659, Lawrence Berkeley National Laboratory, 2006.
    • (2006) Technical Report LBNL-61659
    • Su, H.-H.1    Bonachea, D.2    Leko, A.3    Sherburne, H.4    Billingsley III, M.5    George, A.D.6
  • 43
    • 0033691589
    • Performance analysis of distributed applications using automatic classification of communication inefficiencies
    • Santa Fe, NM, U.S.A.
    • Vetter J. Performance analysis of distributed applications using automatic classification of communication inefficiencies. International Conference on Supercomputing, Santa Fe, NM, U.S.A., 2000; 245-254.
    • (2000) International Conference on Supercomputing , pp. 245-254
    • Vetter, J.1
  • 44
    • 33646137721
    • Efficient Pattern Search in Large Traces Through Successive Refinement
    • Euro-Par 2004 Parallel Processing
    • Wolf F, Mohr B, Dongarra J, Moore S. Efficient pattern search in large traces through successive refinement. Proceedings of the European Conference on Parallel Computing, Pisa, Italy, August 2004. (Pubitemid 39217254)
    • (2004) LECTURE NOTES IN COMPUTER SCIENCE , Issue.3149 , pp. 47-54
    • Wolf, F.1    Mohr, B.2    Dongarra, J.3    Moore, S.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.