-
1
-
-
0029428724
-
An integrated compilation and performance analysis environment for data parallel programs
-
New York, NY, USA, ACM Press
-
V. S. Adve, J. Mellor-Crummey, M. Anderson, J.-C. Wang, D. A. Reed, and K. Kennedy. An integrated compilation and performance analysis environment for data parallel programs. In Supercomputing '95: Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM), page 50, New York, NY, USA, 1995. ACM Press.
-
(1995)
Supercomputing '95: Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM)
, pp. 50
-
-
Adve, V.S.1
Mellor-Crummey, J.2
Anderson, M.3
Wang, J.-C.4
Reed, D.A.5
Kennedy, K.6
-
2
-
-
84869534109
-
-
Apple Computer. Shark. http://developer.apple.com/tools/ sharkoptimize.html.
-
Apple Computer. Shark
-
-
-
3
-
-
0027002303
-
A new approach to debugging optimized code
-
New York, NY, USA, ACM Press
-
G. Brooks, G. J. Hansen, and S. Simmons. A new approach to debugging optimized code. In PLDI '92: Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation, pages 1-11, New York, NY, USA, 1992. ACM Press.
-
(1992)
PLDI '92: Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
, pp. 1-11
-
-
Brooks, G.1
Hansen, G.J.2
Simmons, S.3
-
6
-
-
0030393280
-
Hot cold optimization of large Windows/NT applications
-
Washington, DC, USA, IEEE Computer Society
-
R. Cohn and P. G. Lowney. Hot cold optimization of large Windows/NT applications. In MICRO 29: Proceedings of the 29th Annual ACM/IEEE International Symposium on Microarchitecture, pages 80-89, Washington, DC., USA, 1996. IEEE Computer Society.
-
(1996)
MICRO 29: Proceedings of the 29th Annual ACM/IEEE International Symposium on Microarchitecture
, pp. 80-89
-
-
Cohn, R.1
Lowney, P.G.2
-
7
-
-
67650793277
-
Introduction to FLASH 3.0, with application to supersonic turbulence
-
A. Dubey, L. Reid, and R. Fisher. Introduction to FLASH 3.0, with application to supersonic turbulence. Physica Scripta, 132:014046, 2008.
-
(2008)
Physica Scripta
, vol.132
, pp. 014046
-
-
Dubey, A.1
Reid, L.2
Fisher, R.3
-
8
-
-
84869526531
-
-
Free Standards Group. DWARF debugging information format, version 3, 2005
-
Free Standards Group. DWARF debugging information format, version 3. http://dwarf.freestandards.org. 20 December, 2005.
-
-
-
-
9
-
-
32844470371
-
Low-overhead call path profiling of unmodified, optimized code
-
New York, NY, USA, ACM Press
-
N. Froyd, J. Mellor-Crummey, and R. Fowler. Low-overhead call path profiling of unmodified, optimized code. In ICS '05: Proceedings of the 19th annual International Conference on Supercomputing, pages 81-90, New York, NY, USA, 2005. ACM Press.
-
(2005)
ICS '05: Proceedings of the 19th annual International Conference on Supercomputing
, pp. 81-90
-
-
Froyd, N.1
Mellor-Crummey, J.2
Fowler, R.3
-
10
-
-
67650806214
-
Call path profiling for unmodified, optimized binaries
-
N. Froyd, N. Tallent, J. Mellor-Crummey, and R. Fowler. Call path profiling for unmodified, optimized binaries. In GCC Summit '06: Proceedings of the GCC Developers' Summit, 2006, pages 21-36, 2006.
-
(2006)
GCC Summit '06: Proceedings of the GCC Developers' Summit
, pp. 21-36
-
-
Froyd, N.1
Tallent, N.2
Mellor-Crummey, J.3
Fowler, R.4
-
11
-
-
84976736522
-
Gprof: A call graph execution profiler
-
New York, NY, USA, ACM Press
-
S. L. Graham, P. B. Kessler, and M. K. McKusick. Gprof: A call graph execution profiler. In SIGPLAN '82: Proceedings of the 1982 SIGPLAN Symposium on Compiler Construction, pages 120-126, New York, NY, USA, 1982. ACM Press.
-
(1982)
SIGPLAN '82: Proceedings of the 1982 SIGPLAN Symposium on Compiler Construction
, pp. 120-126
-
-
Graham, S.L.1
Kessler, P.B.2
McKusick, M.K.3
-
12
-
-
0026867697
-
Call path profiling
-
New York, NY, USA, ACM Press
-
R. J. Hall. Call path profiling. In ICSE '92: Proceedings of the 14th international conference on Software engineering, pages 296-306, New York, NY, USA, 1992. ACM Press.
-
(1992)
ICSE '92: Proceedings of the 14th international conference on Software engineering
, pp. 296-306
-
-
Hall, R.J.1
-
13
-
-
0031186224
-
Nesting of reducible and irreducible loops
-
P. Havlak. Nesting of reducible and irreducible loops. ACM Trans. Program. Lang. Syst., 19(4):557-567, 1997.
-
(1997)
ACM Trans. Program. Lang. Syst
, vol.19
, Issue.4
, pp. 557-567
-
-
Havlak, P.1
-
14
-
-
84887449506
-
-
Intel Corporation. Intel performance tuning utility. http:// software.intel.com/en-us/articles/intel-performance-tuning-utility.
-
Intel performance tuning utility
-
-
-
16
-
-
84869510094
-
-
ITAPS working group
-
ITAPS working group. The ITAPS iMesh interface. http: //www.tstt-scidac.org/software/documentation/iMesh-userguide.pdf.
-
The ITAPS iMesh interface. http
-
-
-
17
-
-
84872126872
-
-
J. Levon et al. OProfile. http://oprofile.sourceforge.net.
-
OProfile
-
-
Levon, J.1
-
18
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
New York, NY, USA, ACM Press
-
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: building customized program analysis tools with dynamic instrumentation. In PLDI '05: Proceedings of the 2005 ACM SIGPLAN conference on programming language design and implementation, pages 190-200, New York, NY, USA, 2005. ACM Press.
-
(2005)
PLDI '05: Proceedings of the 2005 ACM SIGPLAN conference on programming language design and implementation
, pp. 190-200
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
19
-
-
0036679608
-
HPCView: A tool for top-down analysis of node performance
-
J. Mellor-Crummey, R. Fowler, G. Marin, and N. Tallent. HPCView: A tool for top-down analysis of node performance. The Journal of Supercomputing, 23(1):81-104, 2002.
-
(2002)
The Journal of Supercomputing
, vol.23
, Issue.1
, pp. 81-104
-
-
Mellor-Crummey, J.1
Fowler, R.2
Marin, G.3
Tallent, N.4
-
22
-
-
35348907981
-
Identifying potential parallelism via loop-centric profiling
-
New York, NY, USA, ACM
-
T. Moseley, D. A. Connors, D. Grunwald, and R. Peri. Identifying potential parallelism via loop-centric profiling. In CF '07: Proceedings of the 4th international conference on Computing frontiers, pages 143-152, New York, NY, USA, 2007. ACM.
-
(2007)
CF '07: Proceedings of the 4th international conference on Computing frontiers
, pp. 143-152
-
-
Moseley, T.1
Connors, D.A.2
Grunwald, D.3
Peri, R.4
-
24
-
-
57749201346
-
Learning to analyze binary computer code
-
N. Rosenblum, X. Zhu, B. Miller, and K. Hunt. Learning to analyze binary computer code. In Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008), pages 798-804, 2008.
-
(2008)
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008)
, pp. 798-804
-
-
Rosenblum, N.1
Zhu, X.2
Miller, B.3
Hunt, K.4
-
25
-
-
84869511425
-
-
21 October 2007
-
S. Sandmann. Sysprof. http://www.daimi.au.dk/∼sandmann/ sysprof. 21 October 2007.
-
Sysprof
-
-
Sandmann, S.1
-
27
-
-
84873458651
-
-
SPEC Corporation, //. 3 November 2007
-
SPEC Corporation. SPEC CPU2006 benchmark suite. http: //www.spec.org/cpu2006. 3 November 2007.
-
SPEC CPU2006 benchmark suite. http
-
-
-
29
-
-
7644236117
-
MOAB-SD: Integrated structured and unstructured mesh representation
-
T. J. Tautges. MOAB-SD: integrated structured and unstructured mesh representation. Eng. Comput. (Lond.), 20(3):286-293, 2004.
-
(2004)
Eng. Comput. (Lond.)
, vol.20
, Issue.3
, pp. 286-293
-
-
Tautges, T.J.1
-
31
-
-
33746100320
-
Accurate, efficient, and adaptive calling context profiling
-
New York, NY, USA, ACM
-
X. Zhuang, M. J. Serrano, H. W. Cain, and J.-D. Choi. Accurate, efficient, and adaptive calling context profiling. In PLDI '06: Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation, pages 263-271, New York, NY, USA, 2006. ACM.
-
(2006)
PLDI '06: Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
, pp. 263-271
-
-
Zhuang, X.1
Serrano, M.J.2
Cain, H.W.3
Choi, J.-D.4
|