-
1
-
-
34548041538
-
-
US Lattice Quantum Chromodynamics Software
-
US Lattice Quantum Chromodynamics Software. http://www.usqcd.org/usqcd- software, 2006.
-
(2006)
-
-
-
2
-
-
0030645124
-
-
G. Ammons, T. Ball, and J. R.. Larus. Exploiting hardware performance counters with flow and context sensitive profiling. In SIGPLAN Conference on Programming Language Design and Implementation, pages 85-96, New York, NY, USA, 1997. ACM Press.
-
G. Ammons, T. Ball, and J. R.. Larus. Exploiting hardware performance counters with flow and context sensitive profiling. In SIGPLAN Conference on Programming Language Design and Implementation, pages 85-96, New York, NY, USA, 1997. ACM Press.
-
-
-
-
3
-
-
0025567275
-
Quartz: A tool for tuning parallel program performance
-
Boulder, CO, USA
-
T. E. Anderson and E. D. Lazowska. Quartz: a tool for tuning parallel program performance. In Proc. of the ACM SIGMETRICS Conf. on Measurement and Modeling of Computer Systems, pages 115-125, Boulder, CO, USA, 1990.
-
(1990)
Proc. of the ACM SIGMETRICS Conf. on Measurement and Modeling of Computer Systems
, pp. 115-125
-
-
Anderson, T.E.1
Lazowska, E.D.2
-
4
-
-
0003605996
-
The NAS parallel benchmarks 2.0
-
Technical Report NAS-95-020, NASA Ames Research Center, Dec
-
D. Bailey, T. Harris, W. Saphir, R. van der Wijngaart, A. Woo, and M. Yarrow. The NAS parallel benchmarks 2.0. Technical Report NAS-95-020, NASA Ames Research Center, Dec. 1995.
-
(1995)
-
-
Bailey, D.1
Harris, T.2
Saphir, W.3
van der Wijngaart, R.4
Woo, A.5
Yarrow, M.6
-
7
-
-
0003510632
-
Introduction to UPC and language specification
-
Technical Report CCS-TR-99-157, IDA Ctr. for Computing Sciences, May 1999
-
W. W. Carlson et al. Introduction to UPC and language specification. Technical Report CCS-TR-99-157, IDA Ctr. for Computing Sciences, May 1999.
-
-
-
Carlson, W.W.1
-
8
-
-
84974695561
-
A dynamic tracing mechanism for performance analysis of OpenMP applications
-
London, UK, Springer-Verlag
-
J. Caubet et al. A dynamic tracing mechanism for performance analysis of OpenMP applications. In Proc. of the Intl. Workshop on OpenMP Appl. and Tools, pages 53-67, London, UK, 2001. Springer-Verlag.
-
(2001)
Proc. of the Intl. Workshop on OpenMP Appl. and Tools
, pp. 53-67
-
-
Caubet, J.1
-
12
-
-
0005973264
-
Origin 2000 and Onyx2 performance tuning and optimization guide
-
Technical Report 007-3430-003, Silicon Graphics, Inc
-
D. Cortesi, J. Fier, J. Wilson, and J. Boney. Origin 2000 and Onyx2 performance tuning and optimization guide. Technical Report 007-3430-003, Silicon Graphics, Inc., 2001.
-
(2001)
-
-
Cortesi, D.1
Fier, J.2
Wilson, J.3
Boney, J.4
-
14
-
-
10444273118
-
A Multiplatform Co-Array Fortran Compiler
-
Antibes Juan-les-Pins, France, September 29, October 3
-
Y. Dotsenko, C. Coarfa, and J. Mellor-Crummey. A Multiplatform Co-Array Fortran Compiler. In Proceedings of the 13th Intl. Conference of Parallel Architectures and Compilation Techniques, Antibes Juan-les-Pins, France, September 29 - October 3 2004.
-
(2004)
Proceedings of the 13th Intl. Conference of Parallel Architectures and Compilation Techniques
-
-
Dotsenko, Y.1
Coarfa, C.2
Mellor-Crummey, J.3
-
15
-
-
34548038067
-
Efficient call-stack profiling of unmodified, optimized code
-
Cambridge, MA
-
N. Froyd, J. Mellor-Crummey, and R. Fowler. Efficient call-stack profiling of unmodified, optimized code. In Proceedings of the 19th ACM International Conference. on Supercomputing, Cambridge, MA, 2002.
-
(2002)
Proceedings of the 19th ACM International Conference. on Supercomputing
-
-
Froyd, N.1
Mellor-Crummey, J.2
Fowler, R.3
-
16
-
-
67650806214
-
Call path profiling for unmodified, optimized binaries
-
Ottawa, Canada, June
-
N. Froyd, N. Tallent, J. Mellor-Crummey, and R. Fowler. Call path profiling for unmodified, optimized binaries. In Proceedings of GCC Summit, Ottawa, Canada, June 2006.
-
(2006)
Proceedings of GCC Summit
-
-
Froyd, N.1
Tallent, N.2
Mellor-Crummey, J.3
Fowler, R.4
-
17
-
-
0032139230
-
Falcon: On-line monitoring for steering parallel programs
-
W. Gu, G. Eisenhauer, K. Schwan, and J. Vetter. Falcon: On-line monitoring for steering parallel programs. Concurrency: Practice and Experience, 10(9):699-736, 1998.
-
(1998)
Concurrency: Practice and Experience
, vol.10
, Issue.9
, pp. 699-736
-
-
Gu, W.1
Eisenhauer, G.2
Schwan, K.3
Vetter, J.4
-
19
-
-
34548028139
-
-
Krell Institute
-
Krell Institute. Open SpeedShop for Linux, http://www.openspeedshop.org, 2007.
-
(2007)
Open SpeedShop for Linux
-
-
-
21
-
-
0036679608
-
HPCView: A tool for top-down analysis of node performance
-
J. Mellor-Crummey, R. Fowler, G. Marin, and N. Tallent. HPCView: A tool for top-down analysis of node performance. The Journal of Supercomputing, 23:81-101, 2002.
-
(2002)
The Journal of Supercomputing
, vol.23
, pp. 81-101
-
-
Mellor-Crummey, J.1
Fowler, R.2
Marin, G.3
Tallent, N.4
-
22
-
-
34548024299
-
-
Message Passing Interface Forum. MPI-2: Extensions to the Message Passing Interface Standard, 1997.
-
Message Passing Interface Forum. MPI-2: Extensions to the Message Passing Interface Standard, 1997.
-
-
-
-
23
-
-
34548030127
-
-
Message Passing Interface Forum. MPI: A Message Passing Interface Standard, 1999.
-
Message Passing Interface Forum. MPI: A Message Passing Interface Standard, 1999.
-
-
-
-
24
-
-
0029408429
-
The Paradyn parallel performance measurement tool
-
B. P. Miller et al. The Paradyn parallel performance measurement tool. IEEE Computer, 28(11): 37-46, 1995.
-
(1995)
IEEE Computer
, vol.28
, Issue.11
, pp. 37-46
-
-
Miller, B.P.1
-
26
-
-
35048825254
-
Design and prototype of a performance tool interface for OpenMP
-
Santa Fe, NM, Oct, CD-ROM
-
B. Mohr, A. D. Malony, S. Shende, and F. Wolf. Design and prototype of a performance tool interface for OpenMP. In Proceedings of the Los Alamos Computer Science Institute Second Annual Symposium, Santa Fe, NM, Oct. 2001. CD-ROM.
-
(2001)
Proceedings of the Los Alamos Computer Science Institute Second Annual Symposium
-
-
Mohr, B.1
Malony, A.D.2
Shende, S.3
Wolf, F.4
-
27
-
-
33646152753
-
A Scalable Approach to MPI Application Performance Analysis
-
of, Springer-Verlag
-
S. Moore et al. A Scalable Approach to MPI Application Performance Analysis, volume 3666 of Lecture Notes in Computer Science, pages 309-316. Springer-Verlag, 2005.
-
(2005)
Lecture Notes in Computer Science
, vol.3666
, pp. 309-316
-
-
Moore, S.1
-
29
-
-
0002438680
-
VAMPIR: Visualization and analysis of MPI resources
-
W. E. Nagel, A. Arnold, M. Weber, H. C. Hoppe, and K. Solchenbach. VAMPIR: Visualization and analysis of MPI resources. Supercomputer, 12(1):69-80, 1996.
-
(1996)
Supercomputer
, vol.12
, Issue.1
, pp. 69-80
-
-
Nagel, W.E.1
Arnold, A.2
Weber, M.3
Hoppe, H.C.4
Solchenbach, K.5
-
30
-
-
84957632942
-
ARMCI: A Portable Remote Memory Copy Library for Distributed Array Libraries and Compiler Run- Time Systems
-
of, Springer-Verlag
-
J. Nieplocha and B. Carpenter. ARMCI: A Portable Remote Memory Copy Library for Distributed Array Libraries and Compiler Run- Time Systems, volume 1586 of Lecture Notes in Computer Science, pages 533-546. Springer-Verlag, 1999.
-
(1999)
Lecture Notes in Computer Science
, vol.1586
, pp. 533-546
-
-
Nieplocha, J.1
Carpenter, B.2
-
31
-
-
34548051007
-
Benchmarking information referenced in the NSF 05-625 High Performance Computing System Acquisition: Towards a Petascale Computing Environment for Science and Engineering
-
Technical Report NSF0605, Nov
-
Benchmarking information referenced in the NSF 05-625 High Performance Computing System Acquisition: Towards a Petascale Computing Environment for Science and Engineering. Technical Report NSF0605, Nov. 2005.
-
(2005)
-
-
-
32
-
-
0038335680
-
Co-Array Fortran for parallel programming
-
Technical Report RAL-TR-1998-060, Rutherford Appleton Laboratory, August
-
R. W. Numrich and J. K. Reid. Co-Array Fortran for parallel programming. Technical Report RAL-TR-1998-060, Rutherford Appleton Laboratory, August 1998.
-
(1998)
-
-
Numrich, R.W.1
Reid, J.K.2
-
33
-
-
84934325826
-
Scientific computations on modern parallel vector systems
-
Pittsburgh, November
-
L. Oliker, A. Canning, J. Carter, J. Shalf, and S. Ethier. Scientific computations on modern parallel vector systems. In Proceedings of Supercomputing 2004, Pittsburgh, November 2004.
-
(2004)
Proceedings of Supercomputing 2004
-
-
Oliker, L.1
Canning, A.2
Carter, J.3
Shalf, J.4
Ethier, S.5
-
34
-
-
85040770718
-
Scalable performance analysis: The Pablo performance analysis environment
-
IEEE Computer Society
-
D. A. Reed et al. Scalable performance analysis: The Pablo performance analysis environment. In Proc. of the Scalable Parallel Libraries Conference, pages 104-113. IEEE Computer Society, 1993.
-
(1993)
Proc. of the Scalable Parallel Libraries Conference
, pp. 104-113
-
-
Reed, D.A.1
-
36
-
-
0003977887
-
-
Silicon Graphics, Inc, SGI, SpeedShop, Technical Report 007-3311-011, SGI
-
Silicon Graphics, Inc. (SGI). SpeedShop User's Guide. Technical Report 007-3311-011, SGI, 2003.
-
(2003)
User's Guide
-
-
-
37
-
-
0003710740
-
-
MIT Press
-
M. Snir, S. W. Otto, S. Huss-Lederman, D. W. Walker, and J. Dongarra. MPI: The Complete Reference. MIT Press, 1995.
-
(1995)
MPI: The Complete Reference
-
-
Snir, M.1
Otto, S.W.2
Huss-Lederman, S.3
Walker, D.W.4
Dongarra, J.5
-
38
-
-
34548049441
-
-
H.-H. Su et al. GASP! a standardized performance analysis tool interface for global address space programming models. Technical Report LBNL-61659, Lawrence Berkeley National Laboratory, 2006.
-
H.-H. Su et al. GASP! a standardized performance analysis tool interface for global address space programming models. Technical Report LBNL-61659, Lawrence Berkeley National Laboratory, 2006.
-
-
-
-
41
-
-
0033691589
-
Performance analysis of distributed applications using automatic classification of communication inefficiencies
-
J. Vetter. Performance analysis of distributed applications using automatic classification of communication inefficiencies. In International Conference on Supercomputing, pages 245-254, 2000.
-
(2000)
International Conference on Supercomputing
, pp. 245-254
-
-
Vetter, J.1
-
42
-
-
0036036949
-
Dynamic statistical profiling of communication activity in distributed applications
-
NY, NY, USA, ACM Press
-
J. Vetter. Dynamic statistical profiling of communication activity in distributed applications. In Proc. of the ACM SIGMETRICS Intl. Conf. on Measurement and Modeling of Computer Systems, pages 240-250, NY, NY, USA, 2002. ACM Press.
-
(2002)
Proc. of the ACM SIGMETRICS Intl. Conf. on Measurement and Modeling of Computer Systems
, pp. 240-250
-
-
Vetter, J.1
-
44
-
-
33745149889
-
EPILOG binary trace-data format
-
Technical Report FZJ-ZAM-IB-2004-06, Forschungszentrum Julich, May
-
F. Wolf and B. Mohr. EPILOG binary trace-data format. Technical Report FZJ-ZAM-IB-2004-06, Forschungszentrum Julich, May 2004.
-
(2004)
-
-
Wolf, F.1
Mohr, B.2
-
45
-
-
27144442408
-
Efficient pattern search in large traces through successive refinement
-
Pisa, Italy, Aug
-
F. Wolf, B. Mohr, J. Dongarra, and S. Moore. Efficient pattern search in large traces through successive refinement. In Proc. of the European Conference on Parallel Computing, Pisa, Italy, Aug. 2004.
-
(2004)
Proc. of the European Conference on Parallel Computing
-
-
Wolf, F.1
Mohr, B.2
Dongarra, J.3
Moore, S.4
-
47
-
-
33750427372
-
From trace generation to visualization: A performance framework for distributed parallel systems
-
Washington, DC, USA, IEEE Computer Society
-
C. E. Wu et al. From trace generation to visualization: A performance framework for distributed parallel systems. In Proceedings of the A CM/IEEE Conference on Supercomputing, Washington, DC, USA, 2000. IEEE Computer Society.
-
(2000)
Proceedings of the A CM/IEEE Conference on Supercomputing
-
-
Wu, C.E.1
-
48
-
-
0032593334
-
Toward scalable performance visualization with Jumpshot
-
Fall
-
O. Zaki, E. Lusk, W. Gropp, and D. Swider. Toward scalable performance visualization with Jumpshot. High Performance Computing Applications, 13(2):277-288, Fall 1999.
-
(1999)
High Performance Computing Applications
, vol.13
, Issue.2
, pp. 277-288
-
-
Zaki, O.1
Lusk, E.2
Gropp, W.3
Swider, D.4
|