-
1
-
-
0036679608
-
HPCView: A tool for top-down analysis of node performance
-
Mellor-Crummey JM, Fowler R, Marin G, Tallent N. HPCView: A tool for top-down analysis of node performance. The Journal of Supercomputing 2002; 23(1):81-104.
-
(2002)
The Journal of Supercomputing
, vol.23
, Issue.1
, pp. 81-104
-
-
Mellor-Crummey, J.M.1
Fowler, R.2
Marin, G.3
Tallent, N.4
-
2
-
-
67650844203
-
Producing wrong data without doing anything obviously wrong!
-
ACM: New York, NY, U.S.A.
-
Mytkowicz T, Diwan A, Hauswirth M, Sweeney PF. Producing wrong data without doing anything obviously wrong! Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems. ACM: New York, NY, U.S.A., 2009; 265-276.
-
(2009)
Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 265-276
-
-
Mytkowicz, T.1
Diwan, A.2
Hauswirth, M.3
Sweeney, P.F.4
-
3
-
-
84976736522
-
Gprof: A call graph execution profiler
-
ACM Press: New York, NY, U.S.A.
-
Graham SL, Kessler PB, McKusick MK. Gprof: A call graph execution profiler. Proceedings of the 1982 SIGPLAN Symposium on Compiler Construction. ACM Press: New York, NY, U.S.A., 1982; 120-126.
-
(1982)
Proceedings of the 1982 SIGPLAN Symposium on Compiler Construction
, pp. 120-126
-
-
Graham, S.L.1
Kessler, P.B.2
McKusick, M.K.3
-
4
-
-
32844470371
-
Low-overhead call path profiling of unmodified, optimized code
-
DOI 10.1145/1088149.1088161, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
-
Froyd N, Mellor-Crummey JM, Fowler R. Low-overhead call path profiling of unmodified, optimized code. Proceedings of the 19th Annual International Conference on Supercomputing. ACM Press: New York, NY, U.S.A., 2005; 81-90. (Pubitemid 43251312)
-
(2005)
Proceedings of the International Conference on Supercomputing
, pp. 81-90
-
-
Froyd, N.1
Mellor-Crummey, J.2
Fowler, R.3
-
5
-
-
77950623122
-
-
Intel Corporation. Intel VTune performance analyzer. Available at, 2 December
-
Intel Corporation. Intel VTune performance analyzer. Available at: http://software.intel.com/en-us/intel-vtune [2 December 2009].
-
(2009)
-
-
-
6
-
-
77950608876
-
-
Intel Corporation. Intel Performance Tuning Utility. Available at, 2 December
-
Intel Corporation. Intel Performance Tuning Utility. Available at: http://software.intel.com/en-us/articles/intel-performancetuning-utility [2 December 2009].
-
(2009)
-
-
-
7
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 05
-
Luk C-K, Cohn R, Muth R, Patil H, Klauser A, Lowney G, Wallace S, Reddi VJ, Hazelwood K. Pin: Building customized program analysis tools with dynamic instrumentation. Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation. ACM Press: New York, NY, U.S.A., 2005; 190-200. (Pubitemid 43185951)
-
(2005)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
, pp. 190-200
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
8
-
-
70450255123
-
Binary analysis for measurement and attribution of program performance
-
ACM: New York, NY, U.S.A.
-
Tallent NR, Mellor-Crummey JM, Fagan MW. Binary analysis for measurement and attribution of program performance. Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation. ACM: New York, NY, U.S.A., 2009; 441-452.
-
Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation
, vol.2009
, pp. 441-452
-
-
Tallent, N.R.1
Mellor-Crummey, J.M.2
Fagan, M.W.3
-
10
-
-
0030645124
-
Exploiting Hardware Performance Counters with Flow and Context Sensitive Profiling
-
Ammons G, Ball T, Larus JR. Exploiting hardware performance counters with flow and context sensitive profiling. SIGPLAN Conference on Programming Language Design and Implementation. ACM: New York, NY, U.S.A., 1997; 85-96. (Pubitemid 127453689)
-
(1997)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.32
, Issue.5
, pp. 85-96
-
-
Ammons, G.1
Ball, T.2
Larus, J.R.3
-
11
-
-
34548010778
-
Scalability analysis of SPMD codes using expectations
-
DOI 10.1145/1274971.1274976, Proceedings of ICS07: 21st ACM International Conference on Supercomputing
-
Coarfa C, Mellor-Crummey JM, Froyd N, Dotsenko Y. Scalability analysis of SPMD codes using expectations. ICS'07: Proceedings of the 21st Annual International Conference on Supercomputing. ACM: New York, NY, U.S.A., 2007; 13-22. (Pubitemid 47281602)
-
(2007)
Proceedings of the International Conference on Supercomputing
, pp. 13-22
-
-
Coarfa, C.1
Mellor-Crummey, J.2
Froyd, N.3
Dotsenko, Y.4
-
12
-
-
74049095154
-
Diagnosing performance bottlenecks in emerging petascale applications
-
ACM: New York, NY, DOI
-
Tallent NR, Mellor-Crummey JM, Adhianto L, Fagan MW, Krentel M. Diagnosing performance bottlenecks in emerging petascale applications. Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. SC'09. ACM: New York, NY, 2009; 1-11. DOI: http://doi.acm.org/10. 1145/1654059.1654111.
-
(2009)
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. SC'09
, pp. 1-11
-
-
Tallent, N.R.1
Mellor-Crummey, J.M.2
Adhianto, L.3
Fagan, M.W.4
Krentel, M.5
-
15
-
-
0031622953
-
The Implementation of the Cilk-5 Multithreaded Language
-
Frigo M, Leiserson CE, Randall KH. The implementation of the Cilk-5 multithreaded language. Proceedings of the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation, Montreal, Que., Canada, 1998; 212-223. (Pubitemid 128454798)
-
(1998)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.33
, Issue.5
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
16
-
-
0032544628
-
Turbulent transport reduction by zonal flows: Massively parallel simulations
-
DOI 10.1126/science.281.5384.1835
-
Lin Z, Hahm TS, Lee WW, Tang WM, White RB. Turbulent transport reduction by zonal flows: Massively parallel simulations. Science 1998; 281(5384):1835-1837. (Pubitemid 28450499)
-
(1998)
Science
, vol.281
, Issue.5384
, pp. 1835-1837
-
-
Lin, Z.1
Hahm, T.S.2
Lee, W.W.3
Tang, W.M.4
White, R.B.5
-
17
-
-
0002438680
-
VAMPIR: Visualization and analysis of MPI resources
-
Nagel WE, Arnold A, Weber M, Hoppe HC, Solchenbach K. VAMPIR: Visualization and analysis of MPI resources. Supercomputer 1996; 12(1):69-80. (Pubitemid 126796012)
-
(1996)
Supercomputer
, vol.12
, Issue.1
, pp. 69-80
-
-
Nagel, W.E.1
Arnold, A.2
Weber, M.3
Hoppe, H.-Ch.4
Solchenbach, K.5
-
18
-
-
0032139230
-
Falcon: On-line monitoring for steering parallel programs
-
Gu W, Eisenhauer G, Schwan K, Vetter J. Falcon: On-line monitoring for steering parallel programs. Concurrency: Practice and Experience 1998; 10(9):699-736. (Pubitemid 128445432)
-
(1998)
Concurrency Practice and Experience
, vol.10
, Issue.9
, pp. 699-736
-
-
Gu, W.1
Eisenhauer, G.2
Schwan, K.3
Vetter, J.4
-
19
-
-
0032593334
-
Toward scalable performance visualization with Jumpshot
-
Zaki O, Lusk E, Gropp W, Swider D. Toward scalable performance visualization with Jumpshot. High Performance Computing Applications 1999; 13(2):277-288.
-
(1999)
High Performance Computing Applications
, vol.13
, Issue.2
, pp. 277-288
-
-
Zaki, O.1
Lusk, E.2
Gropp, W.3
Swider, D.4
-
21
-
-
84974695561
-
A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications
-
OpenMP Shared Memory Parallel Programming International Workshop on OpenMP Applications and Tools, WOMPAT 2001 West Lafayette, IN, USA, July 30-31, 2001 Proceedings
-
Caubet J, Gimenez J, Labarta J, Rose LD, Vetter JS. A dynamic tracing mechanism for performance analysis of OpenMP applications. Proceedings of the International Workshop on OpenMP Applications and Tools. Springer: London, U.K., 2001; 53-67. (Pubitemid 33315607)
-
(2001)
LECTURE NOTES IN COMPUTER SCIENCE
, Issue.2104
, pp. 53-67
-
-
Caubet, J.1
Gimenez, J.2
Labarta, J.3
DeRose, L.4
Vetter, J.5
-
24
-
-
0005973264
-
Origin 2000 and Onyx2 performance tuning and optimization guide
-
Silicon Graphics, Inc.
-
Cortesi D, Fier J, Wilson J, Boney J. Origin 2000 and Onyx2 performance tuning and optimization guide. Technical Report 007-3430-003, Silicon Graphics, Inc., 2001.
-
(2001)
Technical Report 007-3430-003
-
-
Cortesi, D.1
Fier, J.2
Wilson, J.3
Boney, J.4
-
25
-
-
38049186498
-
On using incremental profiling for the performance analysis of shared memory parallel applications
-
Rennes, France
-
Fürlinger K, Gerndt M, Dongarra J. On using incremental profiling for the performance analysis of shared memory parallel applications. Proceedings of the 13th International Euro-Par Conference on Parallel Processing, Rennes, France, 2007; 62-71.
-
(2007)
Proceedings of the 13th International Euro-Par Conference on Parallel Processing
, pp. 62-71
-
-
Fürlinger, K.1
Gerndt, M.2
Dongarra, J.3
-
26
-
-
51849091556
-
Observing performance dynamics using parallel profile snapshots
-
Springer: Berlin, Heidelberg
-
Morris A, Spear W, Malony AD, Shende S. Observing performance dynamics using parallel profile snapshots. Proceedings of the 14th International Euro-Par Conference on Parallel Processing. Springer: Berlin, Heidelberg, 2008; 162-171.
-
(2008)
Proceedings of the 14th International Euro-Par Conference on Parallel Processing
, pp. 162-171
-
-
Morris, A.1
Spear, W.2
Malony, A.D.3
Shende, S.4
-
27
-
-
51849136706
-
SpeedShop user's guide
-
Silicon Graphics Inc. (SGI), SGI
-
Silicon Graphics, Inc. (SGI). SpeedShop User's Guide. Technical Report 007-3311-011, SGI, 2003.
-
(2003)
Technical Report 007-3311-011
-
-
-
28
-
-
84875944868
-
-
Krell Institute, Available at
-
Krell Institute. Open SpeedShop for Linux. Available at: http://www.openspeedshop.org.
-
Open SpeedShop for Linux
-
-
-
31
-
-
85052019260
-
From trace generation to visualization: A performance framework for distributed parallel systems
-
IEEE Computer Society: Washington, DC, U.S.A.
-
Wu CE, Bolmarcich A, Snir M, Wootton D, Parpia F, Chan A, Lusk E, Gropp W. From trace generation to visualization: A performance framework for distributed parallel systems. Proceedings of the ACM/IEEE Conference on Supercomputing. IEEE Computer Society: Washington, DC, U.S.A., 2000.
-
(2000)
Proceedings of the ACM/IEEE Conference on Supercomputing
-
-
Wu, C.E.1
Bolmarcich, A.2
Snir, M.3
Wootton, D.4
Parpia, F.5
Chan, A.6
Lusk, E.7
Gropp, W.8
-
32
-
-
0036036949
-
Dynamic statistical profiling of communication activity in distributed applications
-
Vetter J. Dynamic statistical profiling of communication activity in distributed applications. Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. ACM Press: New York, NY, U.S.A., 2002; 240-250. (Pubitemid 35009526)
-
(2002)
Performance Evaluation Review
, vol.30
, Issue.1
, pp. 240-250
-
-
Vetter, J.1
-
33
-
-
35048825254
-
Design and prototype of a performance tool interface for OpenMP
-
Santa Fe, NM, October
-
Mohr B, Malony AD, Shende S, Wolf F. Design and prototype of a performance tool interface for OpenMP. Proceedings of the Los Alamos Computer Science Institute Second Annual Symposium, Santa Fe, NM, October 2001.
-
(2001)
Proceedings of the Los Alamos Computer Science Institute Second Annual Symposium
-
-
Mohr, B.1
Malony, A.D.2
Shende, S.3
Wolf, F.4
-
34
-
-
77950605550
-
GASP! A standardized performance analysis tool interface for global address space programming models
-
Lawrence Berkeley National Laboratory
-
Su H-H, Bonachea D, Leko A, Sherburne H Billingsley III M. George AD. GASP! A standardized performance analysis tool interface for global address space programming models. Technical Report LBNL-61659, Lawrence Berkeley National Laboratory, 2006.
-
(2006)
Technical Report LBNL-61659
-
-
Su, H.-H.1
Bonachea, D.2
Leko, A.3
Sherburne, H.4
Billingsley III, M.5
George, A.D.6
-
35
-
-
85040770718
-
Scalable performance analysis: The Pablo performance analysis environment
-
IEEE Computer Society: Silver Spring, MD
-
Reed DA, Aydt RA, Noe RJ, Roth PC, Shields KA, Schwartz BW, Tavera LF. Scalable performance analysis: The Pablo performance analysis environment. Proceedings of the Scalable Parallel Libraries Conference. IEEE Computer Society: Silver Spring, MD, 1993; 104-113.
-
(1993)
Proceedings of the Scalable Parallel Libraries Conference
, pp. 104-113
-
-
Reed, D.A.1
Aydt, R.A.2
Noe, R.J.3
Roth, P.C.4
Shields, K.A.5
Schwartz, B.W.6
Tavera, L.F.7
-
36
-
-
33646427877
-
A performance monitoring interface for OpenMP
-
Rome, Italy
-
Mohr B, Malony AD, Hoppe H-C, Schlimbach F, Haab G, Hoeflinger J, Shah S. A performance monitoring interface for OpenMP. Proceedings of the Fourth European Workshop on OpenMP, Rome, Italy, 2002.
-
(2002)
Proceedings of the Fourth European Workshop on OpenMP
-
-
Mohr, B.1
Malony, A.D.2
Hoppe, H.-C.3
Schlimbach, F.4
Haab, G.5
Hoeflinger, J.6
Shah, S.7
-
37
-
-
77952005316
-
OmpP: A profiling tool for OpenMP
-
Eugene, OR, U.S.A.
-
Fürlinger K, Gerndt M. ompP: A profiling tool for OpenMP. Proceedings of the First and Second International Workshops on OpenMP (Lecture Notes in Computer Science, vol. 4315), Eugene, OR, U.S.A., 2005; 12-23.
-
(2005)
Proceedings of the First and Second International Workshops on OpenMP (Lecture Notes in Computer Science, vol. 4315)
, pp. 12-23
-
-
Fürlinger, K.1
Gerndt, M.2
-
40
-
-
84981167256
-
The dynamic probe class library - An infrastructure for developing instrumentation for performance tools
-
San Francisco, CA, U.S.A., April
-
DeRose L, Ted Hoover J, Hollingsworth JK. The dynamic probe class library-An infrastructure for developing instrumentation for performance tools. Proceedings of the International Parallel and Distributed Processing Symposium, San Francisco, CA, U.S.A., April 2001.
-
(2001)
Proceedings of the International Parallel and Distributed Processing Symposium
-
-
DeRose, L.1
Ted Hoover, J.2
Hollingsworth, J.K.3
-
41
-
-
0029408429
-
The Paradyn parallel performance measurement tool
-
Miller BP, Callaghan MD, Cargille JM, Hollingsworth JK, Irvin RB, Karavanic KL, Kunchithapadam K, Newhall T. The Paradyn parallel performance measurement tool. IEEE Computer 1995; 28(11):37-46.
-
(1995)
IEEE Computer
, vol.28
, Issue.11
, pp. 37-46
-
-
Miller, B.P.1
Callaghan, M.D.2
Cargille, J.M.3
Hollingsworth, J.K.4
Irvin, R.B.5
Karavanic, K.L.6
Kunchithapadam, K.7
Newhall, T.8
-
43
-
-
0033691589
-
Performance analysis of distributed applications using automatic classification of communication inefficiencies
-
Santa Fe, NM, U.S.A.
-
Vetter J. Performance analysis of distributed applications using automatic classification of communication inefficiencies. International Conference on Supercomputing, Santa Fe, NM, U.S.A., 2000; 245-254.
-
(2000)
International Conference on Supercomputing
, pp. 245-254
-
-
Vetter, J.1
-
44
-
-
33646137721
-
Efficient Pattern Search in Large Traces Through Successive Refinement
-
Euro-Par 2004 Parallel Processing
-
Wolf F, Mohr B, Dongarra J, Moore S. Efficient pattern search in large traces through successive refinement. Proceedings of the European Conference on Parallel Computing, Pisa, Italy, August 2004. (Pubitemid 39217254)
-
(2004)
LECTURE NOTES IN COMPUTER SCIENCE
, Issue.3149
, pp. 47-54
-
-
Wolf, F.1
Mohr, B.2
Dongarra, J.3
Moore, S.4
|