-
1
-
-
67650060166
-
HPCToolkit: Tools for performance analysis of optimized parallel programs
-
Rice University
-
L. Adhianto, S. Banerjee, M. Fagan, M. Krentel, G. Marin, J. Mellor-Crummey, and N. R. Tallent. HPCToolkit: Tools for performance analysis of optimized parallel programs. Technical Report TR08-06, Rice University, 2008.
-
(2008)
Technical Report TR08-06
-
-
Adhianto, L.1
Banerjee, S.2
Fagan, M.3
Krentel, M.4
Marin, G.5
Mellor-Crummey, J.6
Tallent, N.R.7
-
2
-
-
0030645124
-
Exploiting hardware performance counters with flow and context sensitive profiling
-
New York, NY, USA, ACM Press
-
G. Ammons, T. Ball, and J. R. Larus. Exploiting hardware performance counters with flow and context sensitive profiling. In SIGPLAN Conference on Programming Language Design and Implementation, pages 85-96, New York, NY, USA, 1997. ACM Press.
-
(1997)
SIGPLAN Conference on Programming Language Design and Implementation
, pp. 85-96
-
-
Ammons, G.1
Ball, T.2
Larus, J.R.3
-
3
-
-
0025567275
-
Quartz: A tool for tuning parallel program performance
-
T. E. Anderson and E. D. Lazowska. Quartz: a tool for tuning parallel program performance. SIGMETRICS Perform. Eval. Rev., 18(1):115-125, 1990.
-
(1990)
SIGMETRICS Perform. Eval. Rev.
, vol.18
, Issue.1
, pp. 115-125
-
-
Anderson, T.E.1
Lazowska, E.D.2
-
4
-
-
84869662412
-
-
Apple Computer.
-
Apple Computer. Shark. http://developer.apple.com/tools/sharkoptimize. html.
-
Shark
-
-
-
5
-
-
33646598714
-
Portable and accurate sampling profiling for Java
-
W. Binder. Portable and accurate sampling profiling for Java. Softw. Pract. Exper., 36(6):615-650, 2006.
-
(2006)
Softw. Pract. Exper.
, vol.36
, Issue.6
, pp. 615-650
-
-
Binder, W.1
-
7
-
-
0004224686
-
-
Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA
-
D. R. Butenhof. Programming with POSIX threads. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1997.
-
(1997)
Programming with POSIX Threads
-
-
Butenhof, D.R.1
-
8
-
-
0028756159
-
Parallel performance using lost cycles analysis
-
Los Alamitos, CA, USA, 1994. IEEE Computer Society Press
-
M. E. Crovella and T. J. LeBlanc. Parallel performance using lost cycles analysis. In Supercomputing '94: Proceedings of the 1994 conference on Supercomputing, pages 600-609, Los Alamitos, CA, USA, 1994. IEEE Computer Society Press.
-
Supercomputing '94: Proceedings of the 1994 Conference on Supercomputing
, pp. 600-609
-
-
Crovella, M.E.1
LeBlanc, T.J.2
-
9
-
-
33846480613
-
A performance counter architecture for computing accurate CPI components
-
S. Eyerman, L. Eeckhout, T. Karkhanis, and J. E. Smith. A performance counter architecture for computing accurate CPI components. SIGPLAN Not., 41(11):175-184, 2006.
-
(2006)
SIGPLAN Not.
, vol.41
, Issue.11
, pp. 175-184
-
-
Eyerman, S.1
Eeckhout, L.2
Karkhanis, T.3
Smith, J.E.4
-
10
-
-
0031622953
-
The implementation of the Cilk-5 multithreaded language
-
Montreal, Quebec, Canada, June, Proceedings published ACM SIGPLAN Notices, May,1998
-
M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. In Proceedings of the ACM SIGPLAN '98 Conference on Programming Language Design and Implementation, pages 212-223, Montreal, Quebec, Canada, June 1998. Proceedings published ACM SIGPLAN Notices, Vol. 33, No. 5, May, 1998.
-
(1998)
Proceedings of the ACM SIGPLAN '98 Conference on Programming Language Design and Implementation
, vol.33
, Issue.5
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
11
-
-
32844470371
-
Low-overhead call path profiling of unmodified, optimized code
-
New York, NY, USA, ACM Press
-
N. Froyd, J. Mellor-Crummey, and R. Fowler. Low-overhead call path profiling of unmodified, optimized code. In ICS '05: Proceedings of the 19th annual International Conference on Supercomputing, pages 81-90, New York, NY, USA, 2005. ACM Press.
-
(2005)
ICS '05: Proceedings of the 19th annual International Conference on Supercomputing
, pp. 81-90
-
-
Froyd, N.1
Mellor-Crummey, J.2
Fowler, R.3
-
12
-
-
0026867697
-
Call path profiling
-
New York, NY, USA, ACM Press
-
R. J. Hall. Call path profiling. In ICSE '92: Proceedings of the 14th international conference on Software engineering, pages 296-306, New York, NY, USA, 1992. ACM Press.
-
(1992)
ICSE '92: Proceedings of the 14th International Conference on Software Engineering
, pp. 296-306
-
-
Hall, R.J.1
-
13
-
-
84887449506
-
-
Intel Corporation.Linked from
-
Intel Corporation. Intel performance tuning utility. Linked from http://whatif.intel.com/.
-
Intel Performance Tuning Utility
-
-
-
14
-
-
70350590471
-
-
Intel Corporation
-
Intel Corporation. Intel thread profiler. http://www.intel.com/software/ products/tpwin.
-
Intel Thread Profiler
-
-
-
15
-
-
19644399541
-
-
Intel Corporation
-
Intel Corporation. Intel VTune performance analyzers. http: //www.intel.com/software/products/vtune/.
-
Intel VTune Performance Analyzers
-
-
-
18
-
-
84872126872
-
-
J. Levon et al. OProfile. http://oprofile.sourceforge.net/.
-
OProfile
-
-
Levon, J.1
-
19
-
-
42549162260
-
Power/performance/thermal design-space exploration for multicore architectures
-
May
-
M. Monchiero, R. Canal, and A. Gonzalez. Power/performance/thermal design-space exploration for multicore architectures. IEEE Transactions on Parallel and Distributed Systems, 19(5):666-681, May 2008.
-
(2008)
IEEE Transactions on Parallel and Distributed Systems
, vol.19
, Issue.5
, pp. 666-681
-
-
Monchiero, M.1
Canal, R.2
Gonzalez, A.3
-
21
-
-
35348907981
-
Identifying potential parallelism via loop-centric profiling
-
New York, NY, USA, ACM
-
T. Moseley, D. A. Connors, D. Grunwald, and R. Peri. Identifying potential parallelism via loop-centric profiling. In CF '07: Proceedings of the 4th international conference on Computing frontiers, pages 143-152, New York, NY, USA, 2007. ACM.
-
(2007)
CF '07: Proceedings of the 4th international conference on Computing Frontiers
, pp. 143-152
-
-
Moseley, T.1
Connors, D.A.2
Grunwald, D.3
Peri, R.4
-
22
-
-
33745612838
-
-
OpenMP Architecture Review Board, May
-
OpenMP Architecture Review Board. OpenMP application program interface, version 3.0. http://www.openmp.org/mp-documents/ spec30.pdf, May 2008.
-
(2008)
OpenMP Application Program Interface, Version 3.0.
-
-
-
25
-
-
84968724570
-
An efficient online path profiling framework for Java just-in-time compilers
-
Washington, DC, USA,IEEE Computer Society
-
T. Yasue, T. Suganuma, H. Komatsu, and T. Nakatani. An efficient online path profiling framework for Java just-in-time compilers. In PACT '03: Proceedings of the 12th International Conference on Parallel Architectures and Compilation Techniques, page 148, Washington, DC, USA, 2003. IEEE Computer Society.
-
(2003)
PACT '03: Proceedings of the 12th International Conference on Parallel Architectures and Compilation Techniques
, pp. 148
-
-
Yasue, T.1
Suganuma, T.2
Komatsu, H.3
Nakatani, T.4
-
26
-
-
33746100320
-
Accurate efficient, and adaptive calling context profiling
-
New York, NY, USA,. ACM
-
X. Zhuang, M. J. Serrano, H. W. Cain, and J.-D. Choi. Accurate, efficient, and adaptive calling context profiling. In PLDI '06: Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation, pages 263-271, New York, NY, USA, 2006. ACM.
-
(2006)
PLDI '06: Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 263-271
-
-
Zhuang, X.1
Serrano, M.J.2
Cain, H.W.3
J.-Choi, D.4
|