-
1
-
-
67650060166
-
HPCToolkit: Tools for performance analysis of optimized parallel programs
-
L. Adhianto, S. Banerjee, M. Fagan, M. Krentel, G. Marin, J. Mellor- Crummey, and N. R. Tallent. HPCToolkit: Tools for performance analysis of optimized parallel programs. Technical Report TR08-06, Rice University, 2008.
-
(2008)
Technical Report TR08-06 Rice University
-
-
Adhianto, L.1
Banerjee, S.2
Fagan, M.3
Krentel, M.4
Marin, G.5
Mellor-Crummey, J.6
Tallent, N.R.7
-
2
-
-
0030645124
-
Exploiting Hardware Performance Counters with Flow and Context Sensitive Profiling
-
G. Ammons, T. Ball, and J. R. Larus. Exploiting hardware performance counters with flow and context sensitive profiling. In SIGPLAN Conference on Programming Language Design and Implementation, pages 85-96, New York, NY, USA, 1997. ACM Press. (Pubitemid 127453689)
-
(1997)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.32
, Issue.5
, pp. 85-96
-
-
Ammons, G.1
Ball, T.2
Larus, J.R.3
-
3
-
-
0025567275
-
Quartz. A tool for tuning parallel program performance
-
Proc 1990 ACM Sigmetrics Conf Meas Model Comput Syst
-
T. E. Anderson and E. D. Lazowska. Quartz: a tool for tuning parallel program performance. SIGMETRICS Perform. Eval. Rev., 18(1):115-125, 1990. (Pubitemid 20728309)
-
(1990)
SIGMETRICS Perform. Eval. Rev.
, pp. 115-125
-
-
Anderson Thomas, E.1
Lazowska Edward, D.2
-
4
-
-
84869364257
-
-
Apple Computer. Shark. http://developer.apple.com/tools/ sharkoptimize.html.
-
Shark
-
-
-
5
-
-
33646598714
-
Portable and accurate sampling profiling for Java
-
W. Binder. Portable and accurate sampling profiling for Java. Softw. Pract. Exper., 36(6):615-650, 2006.
-
(2006)
Softw. Pract. Exper.
, vol.36
, Issue.6
, pp. 615-650
-
-
Binder, W.1
-
7
-
-
0004224686
-
-
Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA
-
D. R. Butenhof. Programming with POSIX threads. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1997.
-
(1997)
Programming with POSIX Threads
-
-
Butenhof, D.R.1
-
8
-
-
0028756159
-
Parallel performance using lost cycles analysis
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
M. E. Crovella and T. J. LeBlanc. Parallel performance using lost cycles analysis. In Supercomputing '94: Proceedings of the 1994 conference on Supercomputing, pages 600-609, Los Alamitos, CA, USA, 1994. IEEE Computer Society Press.
-
(1994)
Supercomputing '94: Proceedings of the 1994 Conference on Supercomputing
, pp. 600-609
-
-
Crovella, M.E.1
Le Blanc, T.J.2
-
9
-
-
33846480613
-
A performance counter architecture for computing accurate CPI components
-
S. Eyerman, L. Eeckhout, T. Karkhanis, and J. E. Smith. A performance counter architecture for computing accurate CPI components. SIGPLAN Not., 41(11):175-184, 2006. (Pubitemid 46160722)
-
(2006)
ACM SIGPLAN Notices
, vol.41
, Issue.11
, pp. 175-184
-
-
Eyerman, S.1
Eeckhout, L.2
Karkhanis, T.3
Smith, J.E.4
-
10
-
-
0031622953
-
The Implementation of the Cilk-5 Multithreaded Language
-
M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. In Proceedings of the ACM SIGPLAN '98 Conference on Programming Language Design and Implementation, pages 212-223, Montreal, Quebec, Canada, June 1998. Proceedings published ACM SIGPLAN Notices, Vol.33, No. 5, May, 1998. (Pubitemid 128454798)
-
(1998)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.33
, Issue.5
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
11
-
-
32844470371
-
Low-overhead call path profiling of unmodified, optimized code
-
DOI 10.1145/1088149.1088161, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
-
N. Froyd, J. Mellor-Crummey, and R. Fowler. Low-overhead call path profiling of unmodified, optimized code. In ICS '05: Proceedings of the 19th annual International Conference on Supercomputing, pages 81-90, New York, NY, USA, 2005. ACM Press. (Pubitemid 43251312)
-
(2005)
Proceedings of the International Conference on Supercomputing
, pp. 81-90
-
-
Froyd, N.1
Mellor-Crummey, J.2
Fowler, R.3
-
18
-
-
84872126872
-
-
J. Levon et al. OProfile. http://oprofile.sourceforge.net/.
-
OProfile
-
-
Levon, J.1
-
19
-
-
42549162260
-
Power/performance/thermal design-space exploration for multicore architectures
-
DOI 10.1109/TPDS.2007.70756
-
M. Monchiero, R. Canal, and A. Gonzalez. Power/performance/thermal design-space exploration for multicore architectures. IEEE Transactions on Parallel and Distributed Systems, 19(5):666-681, May 2008. (Pubitemid 351583569)
-
(2008)
IEEE Transactions on Parallel and Distributed Systems
, vol.19
, Issue.5
, pp. 666-681
-
-
Monchiero, M.1
Canal, R.2
Gonzalez, A.3
-
21
-
-
35348907981
-
Identifying potential parallelism via loop-centric profiling
-
DOI 10.1145/1242531.1242554, 2007 Computing Frontiers, Conference Proceedings
-
T. Moseley, D. A. Connors, D. Grunwald, and R. Peri. Identifying potential parallelism via loop-centric profiling. In CF '07: Proceedings of the 4th international conference on Computing frontiers, pages 143-152, New York, NY, USA, 2007. ACM. (Pubitemid 47582219)
-
(2007)
2007 Computing Frontiers, Conference Proceedings
, pp. 143-152
-
-
Moseley, T.1
Connors, D.A.2
Grunwald, D.3
Peri, R.4
-
22
-
-
84869340092
-
OpenMP Architecture Review Board
-
May
-
OpenMP Architecture Review Board. OpenMP application program interface, version 3.0. http://www.openmp.org/mp-documents/spec30.pdf, May 2008.
-
(2008)
OpenMP Application Program Interface Version 3.0
-
-
-
25
-
-
84968724570
-
An efficient online path profiling framework for java just-in-time compilers
-
Washington, DC, USA, IEEE Computer Society
-
T. Yasue, T. Suganuma, H. Komatsu, and T. Nakatani. An efficient online path profiling framework for Java just-in-time compilers. In PACT '03: Proceedings of the 12th International Conference on Parallel Architectures and Compilation Techniques, page 148, Washington, DC, USA, 2003. IEEE Computer Society.
-
(2003)
In PACT '03: Proceedings of the 12th International Conference on Parallel Architectures and Compilation Techniques
, pp. 148
-
-
Yasue, T.1
Suganuma, T.2
Komatsu, H.3
Nakatani, T.4
-
26
-
-
33746100320
-
Accurate, efficient, and adaptive calling context profiling
-
DOI 10.1145/1133255.1134012, PLDI 2006 - Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation
-
X. Zhuang, M. J. Serrano, H. W. Cain, and J.-D. Choi. Accurate, efficient, and adaptive calling context profiling. In PLDI '06: Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation, pages 263-271, New York, NY, USA, 2006. ACM. (Pubitemid 44074938)
-
(2006)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
, vol.2006
, pp. 263-271
-
-
Zhuang, X.1
Serrano, M.J.2
Cain, H.W.3
Choi, J.-D.4
|