-
1
-
-
85054422128
-
-
Frequently Asked Questions
-
ATLAS Frequently Asked Questions. http://math-atlas.sourceforge.net/faq.html
-
-
-
-
2
-
-
85054451643
-
-
BLAS: Basic linear algebra subprograms. http://www.netlib.org/blas
-
-
-
-
3
-
-
85054431715
-
-
CactusEinstein toolkit home page. http://www.cactuscode.org/Community/NumericalRelativity
-
-
-
-
4
-
-
85054462117
-
-
GEO 600.
-
-
-
-
5
-
-
85054448244
-
-
Gnu standard: Formatting error messages. http://www.gnu.org/prep/standards/html_node/Errors.html
-
-
-
-
6
-
-
85054445397
-
-
Kranc: Automated code generation.
-
-
-
-
7
-
-
85054463898
-
-
LIGO: Laser Interferometer Gravitational wave Observatory.
-
-
-
-
8
-
-
85054461122
-
-
LISA: Laser Interferometer Space Antenna.
-
-
-
-
9
-
-
85054468897
-
-
Mesh refinement with Carpet.
-
-
-
-
10
-
-
85054446144
-
-
Netlib repository. http://www.netlib.org
-
-
-
-
11
-
-
85054449561
-
-
Queen Bee, the core supercomputer of LONI
-
Queen Bee, the core supercomputer of LONI.
-
-
-
-
12
-
-
85054439712
-
-
Sun Constellation Linux Cluster: Ranger.
-
-
-
-
13
-
-
85054451362
-
-
Top500 Supercomputer Sites. http://www.top500.org
-
-
-
-
14
-
-
85054466880
-
-
Optimizing applications on the Cray X1TM system, 2009. http://docs.cray.com/books/S-2315-50/html-S-2315-50/z1055157958smg.html
-
(2009)
-
-
-
15
-
-
85054468206
-
-
ROSE Web Reference, 2010. http://www.rosecompiler.org
-
(2010)
ROSE Web Reference
-
-
-
17
-
-
0010540283
-
An automatic design optimization tool and its application to computational fluid dynamics
-
New York, NY, ACM
-
D. Abramson, A. Lewis, T. Peachey, and C. Fletcher. An automatic design optimization tool and its application to computational fluid dynamics. In Proceedings of the ACM/IEEE Conference on Supercomputing (SC01), pages 25-25, New York, NY, 2001. ACM.
-
(2001)
Proceedings of the ACM/IEEE Conference on Supercomputing (SC01)
, pp. 25
-
-
Abramson, D.1
Lewis, A.2
Peachey, T.3
Fletcher, C.4
-
21
-
-
0037670448
-
Parallel multigrid smoothing: Polynomial versus Gauss-Seidel
-
M.F. Adams, M. Brezina, J. J. Hu, and R.S. Tuminaro. Parallel multigrid smoothing: Polynomial versus Gauss-Seidel. Journal of Computational Physics, 188(2): 593-610, 2003.
-
(2003)
Journal of Computational Physics
, vol.188
, Issue.2
, pp. 593-610
-
-
Adams, M.F.1
Brezina, M.2
Hu, J.J.3
Tuminaro, R.S.4
-
22
-
-
77950611743
-
HPCToolkit: Tools for performance analysis of optimized parallel programs
-
L. Adhianto, S. Banerjee, M. Fagan, M. Krentel, G. Marin, J. Mellor-Crummey, and N.R. Tallent. HPCToolkit: Tools for performance analysis of optimized parallel programs. Concurrency and Computation: Practice and Experience, 2010. http://dx.doi.org/10.1002/cpe.1553
-
(2010)
Concurrency and Computation: Practice and Experience
-
-
Adhianto, L.1
Banerjee, S.2
Fagan, M.3
Krentel, M.4
Marin, G.5
Mellor-Crummey, J.6
Tallent, N.R.7
-
26
-
-
0038102538
-
Gauge conditions for long-term numerical black hole evolutions without excision
-
M. Alcubierre, B. Brügmann, P. Diener, M. Koppitz, D. Pollney, E. Seidel, and R. Takahashi. Gauge conditions for long-term numerical black hole evolutions without excision. Physical Review D, 67: 084023, 2003.
-
(2003)
Physical Review D
, vol.67
, pp. 084023
-
-
Alcubierre, M.1
Brügmann, B.2
Diener, P.3
Koppitz, M.4
Pollney, D.5
Seidel, E.6
Takahashi, R.7
-
27
-
-
17144408376
-
Towards a stable numerical evolution of strongly gravitating systems in general relativity: The conformal treatments
-
M. Alcubierre, B. Brügmann, T. Dramlitsch, J.A. Font, P. Papadopoulos, E. Seidel, N. Stergioulas, and R. Takahashi. Towards a stable numerical evolution of strongly gravitating systems in general relativity: The conformal treatments. Physical Review D, 62: 044034, 2000.
-
(2000)
Physical Review D
, vol.62
, pp. 044034
-
-
Alcubierre, M.1
Brügmann, B.2
Dramlitsch, T.3
Font, J.A.4
Papadopoulos, P.5
Seidel, E.6
Stergioulas, N.7
Takahashi, R.8
-
28
-
-
0008688174
-
-
A.S. Almgren, J.B. Bell, P. Colella, L.H. Howell, and M. Welcome. A conservative adaptive projection method for the variable density incompressible Navier-Stokes equations. 142: 1-46, 1998.
-
(1998)
A conservative adaptive projection method for the variable density incompressible Navier-Stokes equations
, vol.142
, pp. 1-46
-
-
Almgren, A.S.1
Bell, J.B.2
Colella, P.3
Howell, L.H.4
Welcome, M.5
-
29
-
-
85054451287
-
-
Alpaca: Cactus tools for application-level profiling and correctness analysis. http://www.cct.lsu.edu/~eschnett/Alpaca
-
-
-
-
31
-
-
0030645124
-
Exploiting hardware performance counterswith flow and context sensitive profiling
-
New York, NY, USA, ACM
-
G. Ammons, T. Ball, and J.R. Larus. Exploiting hardware performance counterswith flow and context sensitive profiling. In SIGPLAN Conference on Programming Language Design and Implementation, pages 85-96, New York, NY, USA, 1997. ACM.
-
(1997)
SIGPLAN Conference on Programming Language Design and Implementation
, pp. 85-96
-
-
Ammons, G.1
Ball, T.2
Larus, J.R.3
-
32
-
-
0031270220
-
Continuous profiling: Where have all the cycles gone?
-
J.M. Anderson, L.M. Berc, J. Dean, S. Ghemawat, M.R. Henzinger, S-T A. Leung, R.L. Sites, M.T. Vandevoorde, C.A. Waldspurger, and W.E. Weihl. Continuous profiling: Where have all the cycles gone? ACM Transactions on Computer Systems, 15(4): 357-390, 1997.
-
(1997)
ACM Transactions on Computer Systems
, vol.15
, Issue.4
, pp. 357-390
-
-
Anderson, J.M.1
Berc, L.M.2
Dean, J.3
Ghemawat, S.4
Henzinger, M.R.5
Leung, S.-T.A.6
Sites, R.L.7
Vandevoorde, M.T.8
Waldspurger, C.A.9
Weihl, W.E.10
-
33
-
-
85054464091
-
-
home page
-
Astrophysics Simulation Collaboratory (ASC) home page.
-
-
-
-
34
-
-
70350635626
-
An extension of the StarSs programming model for platforms with multiple GPUs
-
Spinger
-
E. Ayguade, R.M. Badia, F.D. Igual, J. Labarta, R. Mayo, and E.S. Quintana-Orti. An extension of the StarSs programming model for platforms with multiple GPUs. In Procs. of the 15th international Euro-Par Conference (Euro-Par 2009), pages 851-862. Spinger, 2009.
-
(2009)
Procs. of the 15th international Euro-Par Conference (Euro-Par 2009)
, pp. 851-862
-
-
Ayguade, E.1
Badia, R.M.2
Igual, F.D.3
Labarta, J.4
Mayo, R.5
Quintana-Orti, E.S.6
-
36
-
-
85054439613
-
-
September
-
L. Bachega, S. Chatterjee, K. Dockser, J. Gunnels, M. Gupta, F. Gustavson, C. Lapkowski, G. Liu, M. Mendell, C. Wait, and T.J.C. Ward. A high-performance SIMD floating point unit design for BlueGene/L: Architecture, compilation, and algorithm design. September 2004.
-
(2004)
A high-performance SIMD floating point unit design for BlueGene/L: Architecture, compilation, and algorithm design
-
-
Bachega, L.1
Chatterjee, S.2
Dockser, K.3
Gunnels, J.4
Gupta, M.5
Gustavson, F.6
Lapkowski, C.7
Liu, G.8
Mendell, M.9
Wait, C.10
Ward, T.J.C.11
-
37
-
-
33646425180
-
Programming grid applications with grid superscalar
-
R. Badia, J. Labarta, R. Sirvent, J.M. Perez, J.M. .Cela, and R. Grima. Programming grid applications with grid superscalar. Journal of Grid Computing, 1(2): 151-170, 2003.
-
(2003)
Journal of Grid Computing
, vol.1
, Issue.2
, pp. 151-170
-
-
Badia, R.1
Labarta, J.2
Sirvent, R.3
Perez, J.M.4
Cela, J.M.5
Grima, R.6
-
38
-
-
0002404913
-
The NAS parallel benchmarks
-
D. Bailey, E. Barszcz, J. Barton, D. Browning, R. Carter, L. Dagum, R. Fatoohi, S. Fineberg, P. Frederickson, T. Lasinski, R. Schreiber, H. Simon, V. Venkatakrishman, and S. Weeratunga. The NAS parallel benchmarks. International Journal of Supercomputer Applications, 5: 66-73, 1991.
-
(1991)
International Journal of Supercomputer Applications
, vol.5
, pp. 66-73
-
-
Bailey, D.1
Barszcz, E.2
Barton, J.3
Browning, D.4
Carter, R.5
Dagum, L.6
Fatoohi, R.7
Fineberg, S.8
Frederickson, P.9
Lasinski, T.10
Schreiber, R.11
Simon, H.12
Venkatakrishman, V.13
Weeratunga, S.14
-
39
-
-
0041638552
-
Twelve ways to fool the masses when giving performance results on parallel computers
-
August
-
D.H. Bailey. Twelve ways to fool the masses when giving performance results on parallel computers. Supercomputing Review, pages 54-55, August 1991.
-
(1991)
Supercomputing Review
, pp. 54-55
-
-
Bailey, D.H.1
-
40
-
-
34147135028
-
Misleading performance reporting in the supercomputing field
-
D.H. Bailey. Misleading performance reporting in the supercomputing field. Scientific Programming, 1: 141-151, 1992.
-
(1992)
Scientific Programming
, vol.1
, pp. 141-151
-
-
Bailey, D.H.1
-
43
-
-
0003660984
-
PETSc users manual
-
Argonne National Laboratory
-
S. Balay, K. Buschelman, V. Eijkhout, W.D. Gropp, D. Kaushik, M.G. Knepley, L.C. McInnes, B.F. Smith, and H. Zhang. PETSc users manual. Technical Report ANL-95/11 -Revision 3.0.0, Argonne National Laboratory, 2008.
-
(2008)
Technical Report ANL-95/11 -Revision 3.0.0
-
-
Balay, S.1
Buschelman, K.2
Eijkhout, V.3
Gropp, W.D.4
Kaushik, D.5
Knepley, M.G.6
McInnes, L.C.7
Smith, B.F.8
Zhang, H.9
-
44
-
-
67650069905
-
Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors
-
Raleigh, North Carolina, February
-
M.M. Baskaran, N. Vydyanathan, U. Bonkhugula, J. Ramanujam, A. Rountev, and P. Sadayappan. Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. In 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Raleigh, North Carolina, February 2009.
-
(2009)
14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
-
-
Baskaran, M.M.1
Vydyanathan, N.2
Bonkhugula, U.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
45
-
-
33645441190
-
Micromechanics of the human vertebral body
-
San Francisco
-
H.H. Bayraktar, M.F. Adams, P.F. Hoffmann, D.C. Lee, A. Gupta., P. Papadopoulos, and T.M. Keaveny. Micromechanics of the human vertebral body. In Transactions of the Orthopaedic Research Society, volume 29, page 1129, San Francisco, 2004.
-
(2004)
Transactions of the Orthopaedic Research Society
, vol.29
, pp. 1129
-
-
Bayraktar, H.H.1
Adams, M.F.2
Hoffmann, P.F.3
Lee, D.C.4
Gupta, A.5
Papadopoulos, P.6
Keaveny, T.M.7
-
47
-
-
37549015666
-
Bell’s law for the birth and death of computer classes
-
January
-
G. Bell. Bell’s law for the birth and death of computer classes. Communications of the ACM, 5(1): 86-94, January 2008.
-
(2008)
Communications of the ACM
, vol.5
, Issue.1
, pp. 86-94
-
-
Bell, G.1
-
48
-
-
0000843403
-
-
J. Bell, M. Berger, J. Saltzman, and M. Welcome. A three-dimensional adaptive mesh refinement for hyperbolic conservation laws. 15(1): 127-138, 1994.
-
(1994)
A three-dimensional adaptive mesh refinement for hyperbolic conservation laws
, vol.15
, Issue.1
, pp. 127-138
-
-
Bell, J.1
Berger, M.2
Saltzman, J.3
Welcome, M.4
-
49
-
-
85054429184
-
A portable, extensible, and scalable tool for parallel performance profile analysis
-
R. Bell, A. Malony, and S. Shende. A portable, extensible, and scalable tool for parallel performance profile analysis. In Proceedings of European Conference on Parallel Computing, 2003.
-
(2003)
Proceedings of European Conference on Parallel Computing
-
-
Bell, R.1
Malony, A.2
Shende, S.3
-
51
-
-
11744289966
-
Local adaptive mesh refinement for shock hydrodynamics
-
May
-
M.J. Berger and P. Colella. Local adaptive mesh refinement for shock hydrodynamics. Journal of Computational Physics, 82(1): 64-84, May 1989.
-
(1989)
Journal of Computational Physics
, vol.82
, Issue.1
, pp. 64-84
-
-
Berger, M.J.1
Colella, P.2
-
52
-
-
48749141209
-
Adaptive mesh refinement for hyperbolic partial differential equations
-
M.J. Berger and J. Oliger. Adaptive mesh refinement for hyperbolic partial differential equations. Journal of Computational Physics, 53: 484-512, 1984.
-
(1984)
Journal of Computational Physics
, vol.53
, pp. 484-512
-
-
Berger, M.J.1
Oliger, J.2
-
53
-
-
0029428752
-
Lattice QCD on the IBM scalable POWERParallel systems SP2
-
San Diego, California, November
-
C. Bernard, C. DeTar, S. Gottlieb, U.M. Heller, J. Hetrick, N. Ishizuka, L. Kärkkäinen, S.R. Lantz, K. Rummukainen, R. Sugar, D. Toussaint, and M. Wingate. Lattice QCD on the IBM scalable POWERParallel systems SP2. In ACM/IEEE Proceedings of SC 1995: High Performance Networking and Computing, San Diego, California, November 1995.
-
(1995)
ACM/IEEE Proceedings of SC 1995: High Performance Networking and Computing
-
-
Bernard, C.1
DeTar, C.2
Gottlieb, S.3
Heller, U.M.4
Hetrick, J.5
Ishizuka, N.6
Kärkkäinen, L.7
Lantz, S.R.8
Rummukainen, K.9
Sugar, R.10
Toussaint, D.11
Wingate, M.12
-
54
-
-
23844515651
-
A component architecture for high-performance scientific computing
-
ACTS Collection Special Issue
-
D.E. Bernholdt, B.A. Allan, R. Armstrong, F. Bertrand, K. Chiu, T.L. Dahlgren, K. Damevski, W.R. Elwasif, T.G.W. Epperly, M. Govindaraju, D.S. Katz, J.A. Kohl, M. Krishnan, G. Kumfert, J.W. Larson, S. Lefantzi, M.J. Lewis, A.D. Malony, L.C. McInnes, J. Nieplocha, B. Norris, S.G. Parker, J. Ray, S. Shende, T.L. Windus, and S. Zhou. A component architecture for high-performance scientific computing. Intl. Journal of High-Performance Computing Applications, ACTS Collection Special Issue, 2005.
-
(2005)
Intl. Journal of High-Performance Computing Applications
-
-
Bernholdt, D.E.1
Allan, B.A.2
Armstrong, R.3
Bertrand, F.4
Chiu, K.5
Dahlgren, T.L.6
Damevski, K.7
Elwasif, W.R.8
Epperly, T.G.W.9
Govindaraju, M.10
Katz, D.S.11
Kohl, J.A.12
Krishnan, M.13
Kumfert, G.14
Larson, J.W.15
Lefantzi, S.16
Lewis, M.J.17
Malony, A.D.18
McInnes, L.C.19
Nieplocha, J.20
Norris, B.21
Parker, S.G.22
Ray, J.23
Shende, S.24
Windus, T.L.25
Zhou, S.26
more..
-
55
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Vienna, Austria
-
J. Bilmes, K. Asanovic, C-W Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In International Conference on Supercomputing, pages 340-347, Vienna, Austria, 1997.
-
(1997)
International Conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
57
-
-
0033407555
-
An energy-conserving thermodynamic model of sea ice
-
C.M. Bitz and W.H. Lipscomb. An energy-conserving thermodynamic model of sea ice. Journal of Geophysical Research, 104: 15669-15677, 1999.
-
(1999)
Journal of Geophysical Research
, vol.104
, pp. 15669-15677
-
-
Bitz, C.M.1
Lipscomb, W.H.2
-
58
-
-
0003615167
-
-
SIAM, Philadelphia
-
L.S. Blackford, J. Choi, A. Cleary, E. DAzevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walke, and R.C. Whaley. ScaLAPACK Users Guide. SIAM, Philadelphia, 1997.
-
(1997)
ScaLAPACK Users Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
Dazevedo, E.4
Demmel, J.5
Dhillon, I.6
Dongarra, J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walke, D.12
Whaley, R.C.13
-
59
-
-
0030382364
-
Parallel programming with polaris
-
December
-
W. Blume, R. Doallo, R. Eigenmann, J. Grout, J. Hoeflinger, T. Lawrence, J. Lee, D. Padua, Y. Paek, B. Pottenger, L. Rauchwerger, and P. Tu. Parallel programming with polaris. Computer, 29(12), December 1996.
-
(1996)
Computer
, vol.29
, Issue.12
-
-
Blume, W.1
Doallo, R.2
Eigenmann, R.3
Grout, J.4
Hoeflinger, J.5
Lawrence, T.6
Lee, J.7
Padua, D.8
Paek, Y.9
Pottenger, B.10
Rauchwerger, L.11
Tu, P.12
-
61
-
-
0032046628
-
Performance modeling for SPMD messagepassing programs
-
J. Brehm, P.H. Worley, and M. Madhukar. Performance modeling for SPMD messagepassing programs. Concurrency: Practice and Experience, 10(5): 333-357, 1998.
-
(1998)
Concurrency: Practice and Experience
, vol.10
, Issue.5
, pp. 333-357
-
-
Brehm, J.1
Worley, P.H.2
Madhukar, M.3
-
62
-
-
62549150832
-
Turduckening black holes: An analytical and computational study
-
D. Brown, P. Diener, O. Sarbach, E. Schnetter, and M. Tiglio. Turduckening black holes: An analytical and computational study. Physical Review D (submitted), 2008.
-
(2008)
Physical Review D (submitted)
-
-
Brown, D.1
Diener, P.2
Sarbach, O.3
Schnetter, E.4
Tiglio, M.5
-
63
-
-
0033708935
-
Semicoarsening multigrid on distributed memory machines
-
P.N. Brown, R.D. Falgout, and J.E. Jones. Semicoarsening multigrid on distributed memory machines. SIAM Journal on Scientific Computing, 21(5): 1823-1834, 2000.
-
(2000)
SIAM Journal on Scientific Computing
, vol.21
, Issue.5
, pp. 1823-1834
-
-
Brown, P.N.1
Falgout, R.D.2
Jones, J.E.3
-
64
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. The International Journal of High Performance Computing Applications, 14(4): 189-204, 2000.
-
(2000)
The International Journal of High Performance Computing Applications
, vol.14
, Issue.4
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
65
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. International Journal of High Performance Computing Applications, 14(3): 189-204, 2000.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
66
-
-
0242339524
-
Online remote trace analysis of parallel applications on high-performance clusters
-
Springer
-
H. Brunst, A.D. Malony, S. Shende, and R. Bell. Online remote trace analysis of parallel applications on high-performance clusters. In Proceedings of the ISHPC Conference (LNCS 2858), pages 440-449. Springer, 2003.
-
(2003)
Proceedings of the ISHPC Conference (LNCS 2858)
, pp. 440-449
-
-
Brunst, H.1
Malony, A.D.2
Shende, S.3
Bell, R.4
-
70
-
-
85054454180
-
Perfexpert: An automated HPC performance measurement and analysis tool with optimization recommendations
-
New York, NY, November, ACM
-
M. Burtscher, B.D. Kim, J. Diamond, J. McCalpin, L. Koesterke, and J. Browne. Perfexpert: An automated HPC performance measurement and analysis tool with optimization recommendations. In Proceedings of ACM/IEEE Conference on Supercomputing (SC10), New York, NY, November 2010. ACM.
-
(2010)
Proceedings of ACM/IEEE Conference on Supercomputing (SC10)
-
-
Burtscher, M.1
Kim, B.D.2
Diamond, J.3
McCalpin, J.4
Koesterke, L.5
Browne, J.6
-
71
-
-
58149269099
-
A class of parallel tiled linear algebra algorithms for multicore architectures
-
A. Buttari, J. Langou, J. Kurzak, and J. Dongarra. A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Computing, 35(1): 38-53, 2009.
-
(2009)
Parallel Computing
, vol.35
, Issue.1
, pp. 38-53
-
-
Buttari, A.1
Langou, J.2
Kurzak, J.3
Dongarra, J.4
-
72
-
-
85054441479
-
-
home page
-
Cactus computational toolkit home page. http://www.cactuscode.org
-
-
-
-
73
-
-
0000493064
-
Estimating interlock and improving balance for pipelined architectures
-
D. Callahan, J. Cocke, and K. Kennedy. Estimating interlock and improving balance for pipelined architectures. Journal of Parallel and Distributed Computing, 5(4): 334-358, 1988.
-
(1988)
Journal of Parallel and Distributed Computing
, vol.5
, Issue.4
, pp. 334-358
-
-
Callahan, D.1
Cocke, J.2
Kennedy, K.3
-
76
-
-
0003510632
-
Introduction to upc and language specification
-
17100 Science Dr., Bowie, MD 20715, May
-
W.W. Carlson, J.M. Draper, D.E. Culler, K. Yelick, E. Brooks, and K. Warren. Introduction to upc and language specification. Technical Report CCS-TR-99-157, Center for Computing Sciences, 17100 Science Dr., Bowie, MD 20715, May 1999.
-
(1999)
Technical Report CCS-TR-99-157, Center for Computing Sciences
-
-
Carlson, W.W.1
Draper, J.M.2
Culler, D.E.3
Yelick, K.4
Brooks, E.5
Warren, K.6
-
77
-
-
0028549474
-
Improving the ratio of memory operations to floating-point operations in loops
-
S. Carr and K. Kennedy. Improving the ratio of memory operations to floating-point operations in loops. ACM Transactions on Programming Languages and Systems, 16(6): 1768-1810, 1994.
-
(1994)
ACM Transactions on Programming Languages and Systems
, vol.16
, Issue.6
, pp. 1768-1810
-
-
Carr, S.1
Kennedy, K.2
-
79
-
-
34250161860
-
Applying an automated framework to produce accurate blind performance predictions of full-scale HPC applications
-
June
-
L. Carrington, N. Wolter, A. Snavely, and C.B. Lee. Applying an automated framework to produce accurate blind performance predictions of full-scale HPC applications. DoD Users Group Conference (UGC2004), June 2004.
-
(2004)
DoD Users Group Conference (UGC2004)
-
-
Carrington, L.1
Wolter, N.2
Snavely, A.3
Lee, C.B.4
-
83
-
-
33646073716
-
Multiple page size modeling and optimization
-
17-21 September
-
C. Cascaval, E. Duesterwald, P.F. Sweeney, and R.W. Wisniewski. Multiple page size modeling and optimization. Parallel Architectures and Compilation Techniques, 2005. PACT 2005. 14th International Conference on, pages 339-349, 17-21 September 2005.
-
(2005)
Parallel Architectures and Compilation Techniques, 2005. PACT 2005. 14th International Conference on
, pp. 339-349
-
-
Cascaval, C.1
Duesterwald, E.2
Sweeney, P.F.3
Wisniewski, R.W.4
-
84
-
-
85054462630
-
-
CCSM Software Engineering Group. http://www.ccsm.ucar.edu/cseg
-
-
-
-
85
-
-
85054427463
-
-
CCSM Software Engineering Working Group. http://www.ccsm.ucar.edu/csm/working_groups/Software
-
-
-
-
86
-
-
85054462150
-
-
National Energy Research Scientific Computing Center. Parallel total energy code, 2009.
-
(2009)
Parallel total energy code
-
-
-
91
-
-
77954007684
-
-
April
-
D. Chen, N. Vachharajani, R. Hundt, S.W. Liao, V. Ramasamy, P. Yuan, W. Chen, and W. Zheng. Taming hardware event samples for FDO compilation. pages 42-53, April 2010.
-
(2010)
Taming hardware event samples for FDO compilation
, pp. 42-53
-
-
Chen, D.1
Vachharajani, N.2
Hundt, R.3
Liao, S.W.4
Ramasamy, V.5
Yuan, P.6
Chen, W.7
Zheng, W.8
-
92
-
-
0029204978
-
Scalable linear algebra software libraries for distributed memory concurrent computers
-
Washington, DC, USA, IEEE Computer Society
-
J. Choi and J.J. Dongarra. Scalable linear algebra software libraries for distributed memory concurrent computers. In FTDCS '95: Proceedings of the 5th IEEE Workshop on Future Trends of Distributed Computing Systems, page 170, Washington, DC, USA, 1995. IEEE Computer Society.
-
(1995)
FTDCS '95: Proceedings of the 5th IEEE Workshop on Future Trends of Distributed Computing Systems
, pp. 170
-
-
Choi, J.1
Dongarra, J.J.2
-
93
-
-
84934324812
-
Using Information from Prior Runs to Improve Automated Tuning Systems
-
Washington, DC, USA, IEEE Computer Society
-
I-H Chung and J.K. Hollingsworth. Using Information from Prior Runs to Improve Automated Tuning Systems. In Proceedings of the 2004 ACM/IEEE conference on Supercomputing (SC04), page 30, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
Proceedings of the 2004 ACM/IEEE conference on Supercomputing (SC04)
, pp. 30
-
-
Chung, I.-H.1
Hollingsworth, J.K.2
-
95
-
-
34548010778
-
Scalability analysis of SPMD codes using expectations
-
New York, NY, ACM
-
C. Coarfa, J. Mellor-Crummey, N. Froyd, and Y. Dotsenko. Scalability analysis of SPMD codes using expectations. In ICS '07: Proceedings of the 21st annual International Conference on Supercomputing, pages 13-22, New York, NY, 2007. ACM.
-
(2007)
ICS '07: Proceedings of the 21st annual International Conference on Supercomputing
, pp. 13-22
-
-
Coarfa, C.1
Mellor-Crummey, J.2
Froyd, N.3
Dotsenko, Y.4
-
96
-
-
26844455510
-
Multidimensional Upwind Methods for Hyperbolic Conservation Laws
-
P. Colella. Multidimensional Upwind Methods for Hyperbolic Conservation Laws. Journal of Computational Physics, 87: 171-200, 1990.
-
(1990)
Journal of Computational Physics
, vol.87
, pp. 171-200
-
-
Colella, P.1
-
98
-
-
33744490657
-
The community climate system model version 3 (CCSM3)
-
W.D. Collins, C.M. Bitz, M.L. Blackmon, G.B. Bonan, C.S. Bretherton, J.A. Carton, P. Chang, S.C. Doney, J.H. Hack, T.B. Henderson, J.T. Kiehl, W.G. Large, D.S. McKenna, B.D. Santer, and R.D. Smith. The community climate system model version 3 (CCSM3). Journal of Climate, 19(11): 2122-2143, 2006.
-
(2006)
Journal of Climate
, vol.19
, Issue.11
, pp. 2122-2143
-
-
Collins, W.D.1
Bitz, C.M.2
Blackmon, M.L.3
Bonan, G.B.4
Bretherton, C.S.5
Carton, J.A.6
Chang, P.7
Doney, S.C.8
Hack, J.H.9
Henderson, T.B.10
Kiehl, J.T.11
Large, W.G.12
McKenna, D.S.13
Santer, B.D.14
Smith, R.D.15
-
99
-
-
1842826742
-
-
NCAR Tech Note NCAR/TN-464+STR, National Center for Atmospheric Research, Boulder, CO 80307
-
W.D. Collins and P.J. Rasch, et al. Description of the NCAR community atmosphere model (CAM 3.0). NCAR Tech Note NCAR/TN-464+STR, National Center for Atmospheric Research, Boulder, CO 80307, 2004.
-
(2004)
Description of the NCAR community atmosphere model (CAM 3.0)
-
-
Collins, W.D.1
Rasch, P.J.2
-
100
-
-
33947636363
-
The formulation and atmospheric simulation of the community atmosphere model: CAM3
-
W.D. Collins, et al. The formulation and atmospheric simulation of the community atmosphere model: CAM3. Journal of Climate, 2005.
-
(2005)
Journal of Climate
-
-
Collins, W.D.1
-
101
-
-
85054469230
-
-
Community Climate System Model. http://www.ccsm.ucar.edu
-
-
-
-
102
-
-
0036679993
-
Adaptive optimizing compilers for the 21st century
-
August
-
K.D. Cooper, D. Subramanian, and L. Torczon. Adaptive optimizing compilers for the 21st century. The Journal of Supercomputing, 23(1): 7-22, August 2002.
-
(2002)
The Journal of Supercomputing
, vol.23
, Issue.1
, pp. 7-22
-
-
Cooper, K.D.1
Subramanian, D.2
Torczon, L.3
-
103
-
-
85117245869
-
Active harmony: Towards automated performance tuning
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
C. Ţăpuş, I-H Chung, and J.K. Hollingsworth. Active harmony: Towards automated performance tuning. In Proceedings of the ACM/IEEE Conference on Supercomputing (SC02), pages 1-11, Los Alamitos, CA, USA, 2002. IEEE Computer Society Press.
-
(2002)
Proceedings of the ACM/IEEE Conference on Supercomputing (SC02)
, pp. 1-11
-
-
Ţăpuş, C.1
Chung, I.-H.2
Hollingsworth, J.K.3
-
106
-
-
0002806690
-
OpenMP: An industry-standard API for shared-memory programming
-
January/March
-
L. Dagum and R. Menon. OpenMP: an industry-standard API for shared-memory programming. IEEE Computational Science and Engineering, 5(1): 46-55, January/March 1998.
-
(1998)
IEEE Computational Science and Engineering
, vol.5
, Issue.1
, pp. 46-55
-
-
Dagum, L.1
Menon, R.2
-
108
-
-
70350771127
-
Stencil computation optimization and autotuning on stateof-the-art multicore architectures
-
K. Datta, M. Murphy, V. Volkov, S. Williams, J. Carter, L. Oliker, D. Patterson, J. Shalf, and K. Yelick. Stencil computation optimization and autotuning on stateof-the-art multicore architectures. In Proceedings of ACM/IEEE Conference on Supercomputing (SC08), 2008.
-
(2008)
Proceedings of ACM/IEEE Conference on Supercomputing (SC08)
-
-
Datta, K.1
Murphy, M.2
Volkov, V.3
Williams, S.4
Carter, J.5
Oliker, L.6
Patterson, D.7
Shalf, J.8
Yelick, K.9
-
109
-
-
34547470812
-
-
K. Davis, A. Hoisie, G. Johnson, D. Kerbyson, M. Lang, S. Pakin, and F. Petrini. A performance and scalability analysis of the bluegene/l architecture.
-
A performance and scalability analysis of the bluegene/l architecture
-
-
Davis, K.1
Hoisie, A.2
Johnson, G.3
Kerbyson, D.4
Lang, M.5
Pakin, S.6
Petrini, F.7
-
110
-
-
0031340339
-
ProfileMe: Hardware support for instruction-level profiling on out-of-order processors
-
Washington, DC, IEEE Computer Society
-
J. Dean, J.E. Hicks, C.A. Waldspurger, W.E. Weihl, and G. Chrysos. ProfileMe: Hardware support for instruction-level profiling on out-of-order processors. In MICRO 30: Proceedings of the 30th annual ACM/IEEE International Symposium on Microarchitecture, pages 292-302, Washington, DC, 1997. IEEE Computer Society.
-
(1997)
MICRO 30: Proceedings of the 30th annual ACM/IEEE International Symposium on Microarchitecture
, pp. 292-302
-
-
Dean, J.1
Hicks, J.E.2
Waldspurger, C.A.3
Weihl, W.E.4
Chrysos, G.5
-
111
-
-
20744452904
-
Self adapting linear algebra algorithms and software
-
Special issue on Program Generation, Optimization, and Adaptation
-
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, C. Whaley, and K. Yelick. Self adapting linear algebra algorithms and software. Proceedings of the IEEE, 93(2), 2005. Special issue on Program Generation, Optimization, and Adaptation.
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
-
-
Demmel, J.1
Dongarra, J.2
Eijkhout, V.3
Fuentes, E.4
Petitet, A.5
Vuduc, R.6
Whaley, C.7
Yelick, K.8
-
113
-
-
0012612903
-
-
Technical Report TR-01-23, Department of Computer Sciences, The University of Texas at Austin
-
R. Desikan, D. Burger, S. Keckler, and T. Austin. Sim-alpha: a validated, executiondriven Alpha 21264 simulator. Technical Report TR-01-23, Department of Computer Sciences, The University of Texas at Austin, 2001.
-
(2001)
Sim-alpha: A validated, executiondriven Alpha 21264 simulator
-
-
Desikan, R.1
Burger, D.2
Keckler, S.3
Austin, T.4
-
114
-
-
31344460981
-
The Community Land Model and its climate statistics as a component of the Climate System Model
-
R.E. Dickinson, K.W. Oleson, G. Bonan, F. Hoffman, P. Thornton, M. Vertenstein, Z-L Yang, and X. Zeng. The Community Land Model and its climate statistics as a component of the Climate System Model. Journal of Climate, 19(11): 2032-2324, 2006.
-
(2006)
Journal of Climate
, vol.19
, Issue.11
, pp. 2032-2324
-
-
Dickinson, R.E.1
Oleson, K.W.2
Bonan, G.3
Hoffman, F.4
Thornton, P.5
Vertenstein, M.6
Yang, Z.-L.7
Zeng, X.8
-
115
-
-
34250840018
-
Optimized high-order derivative and dissipation operators satisfying summation by parts, and applications in threedimensional multi-block evolutions
-
P. Diener, E.N. Dorband, E. Schnetter, and M. Tiglio. Optimized high-order derivative and dissipation operators satisfying summation by parts, and applications in threedimensional multi-block evolutions. Journal of Scientific Computing, 32: 109-145, 2007.
-
(2007)
Journal of Scientific Computing
, vol.32
, pp. 109-145
-
-
Diener, P.1
Dorband, E.N.2
Schnetter, E.3
Tiglio, M.4
-
117
-
-
34547709622
-
A language for the compact representation of multiple program versions
-
October
-
S. Donadio, J. Brodman, T. Roeder, K. Yotov, D. Barthou, A. Cohen, M.J. Garzarán, D. Padua, and K. Pingali. A language for the compact representation of multiple program versions. In Proceedings of the 18th International Workshop on Languages and Compilers for Parallel Computing, October 2005.
-
(2005)
Proceedings of the 18th International Workshop on Languages and Compilers for Parallel Computing
-
-
Donadio, S.1
Brodman, J.2
Roeder, T.3
Yotov, K.4
Barthou, D.5
Cohen, A.6
Garzarán, M.J.7
Padua, D.8
Pingali, K.9
-
118
-
-
28044453637
-
Performance instrumentation and measurement for terascale systems
-
J. Dongarra, A.D. Malony, S. Moore, P. Mucci, and S. Shende. Performance instrumentation and measurement for terascale systems. In Proceedings of the ICCS 2003 Conference (LNCS 2660), pages 53-62, 2003.
-
(2003)
Proceedings of the ICCS 2003 Conference (LNCS 2660)
, pp. 53-62
-
-
Dongarra, J.1
Malony, A.D.2
Moore, S.3
Mucci, P.4
Shende, S.5
-
119
-
-
0004304389
-
PCCM2: A GCM adapted for scalable parallel computer
-
American Meteorological Society, Boston
-
J.B. Drake, I.T. Foster, J.J. Hack, J.G. Michalakes, B.D. Semeraro, B. Tonen, D.L. Williamson, and P.H. Worley. PCCM2: A GCM adapted for scalable parallel computer. In Fifth Symposium on Global Change Studies, pages 91-98. American Meteorological Society, Boston, 1994.
-
(1994)
Fifth Symposium on Global Change Studies
, pp. 91-98
-
-
Drake, J.B.1
Foster, I.T.2
Hack, J.J.3
Michalakes, J.G.4
Semeraro, B.D.5
Tonen, B.6
Williamson, D.L.7
Worley, P.H.8
-
120
-
-
0029389354
-
Design and performance of a scalable parallel community climate model
-
J.B. Drake, I.T. Foster, J.G. Michalakes, B. Toonen, and P.H. Worley. Design and performance of a scalable parallel community climate model. Parallel Computing, 21(10): 1571-1591, 1995.
-
(1995)
Parallel Computing
, vol.21
, Issue.10
, pp. 1571-1591
-
-
Drake, J.B.1
Foster, I.T.2
Michalakes, J.G.3
Toonen, B.4
Worley, P.H.5
-
121
-
-
69949095393
-
Performance tuning and evaluation of a parallel community climate model
-
New York, NY, USA, ACM
-
J.B. Drake, S. Hammond, R. James, and P.H. Worley. Performance tuning and evaluation of a parallel community climate model. In Proceedings of 1999 ACM/IEEE Conference on Supercomputing (SC99), page 34, New York, NY, USA, 1999. ACM.
-
(1999)
Proceedings of 1999 ACM/IEEE Conference on Supercomputing (SC99)
, pp. 34
-
-
Drake, J.B.1
Hammond, S.2
James, R.3
Worley, P.H.4
-
122
-
-
23844488736
-
Overview of the software design of the Community Climate System Model
-
Fall
-
J.B. Drake, P.W. Jones, and G. Carr. Overview of the software design of the Community Climate System Model. International Journal of High Performance Computing Applications, 19(3): 177-186, Fall 2005.
-
(2005)
International Journal of High Performance Computing Applications
, vol.19
, Issue.3
, pp. 177-186
-
-
Drake, J.B.1
Jones, P.W.2
Carr, G.3
-
123
-
-
23844488736
-
Special issue on climate modeling
-
August
-
J.B. Drake, P.W. Jones, and G.R. Carr, Jr. Special issue on climate modeling. International Journal of High Performance Computing Applications, 19(3), August 2005.
-
(2005)
International Journal of High Performance Computing Applications
, vol.19
, Issue.3
-
-
Drake, J.B.1
Jones, P.W.2
Carr, G.R.3
-
124
-
-
79953225178
-
Software design for petascale climate science
-
D.A. Bader, editor, chapter 7, Chapman & Hall/CRC, New York, NY
-
J.B. Drake, P.W. Jones, M. Vertenstein, J.B. White III, and P.H. Worley. Software design for petascale climate science. In D.A. Bader, editor, Petascale Computing: Algorithms and Applications, chapter 7, pages 125-146. Chapman & Hall/CRC, New York, NY, 2008.
-
(2008)
Petascale Computing: Algorithms and Applications
, pp. 125-146
-
-
Drake, J.B.1
Jones, P.W.2
Vertenstein, M.3
White III, J.B.4
Worley, P.H.5
-
126
-
-
67650793277
-
Introduction to FLASH 3.0, with application to supersonic turbulence
-
A. Dubey, L.B. Reid, and R. Fisher. Introduction to FLASH 3.0, with application to supersonic turbulence. Physica Scripta, 132: 014046, 2008.
-
(2008)
Physica Scripta
, vol.132
, pp. 014046
-
-
Dubey, A.1
Reid, L.B.2
Fisher, R.3
-
128
-
-
85054464329
-
-
Octave home page
-
J.W. Eaton. Octave home page. http://www.octave.org
-
-
-
Eaton, J.W.1
-
130
-
-
85170282443
-
A density-based algorithm for discovering clusters in large spatial databases with noise
-
M. Ester, H.P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pages 226-231, 1996.
-
(1996)
Proceedings of the Second International Conference on Knowledge Discovery and Data Mining
, pp. 226-231
-
-
Ester, M.1
Kriegel, H.P.2
Sander, J.3
Xu, X.4
-
131
-
-
85054465022
-
-
FEAP.
-
-
-
-
133
-
-
34548715722
-
The importance of being low power in high-performance computing
-
W. Feng. The importance of being low power in high-performance computing. CTWatch Quarterly, 1(3): 12-20, 2005.
-
(2005)
CTWatch Quarterly
, vol.1
, Issue.3
, pp. 12-20
-
-
Feng, W.1
-
134
-
-
85054425042
-
-
March
-
Solaris memory placement optimization and sun fireservers. http://www.sun.com/software/solaris/performance.jsp, March 2003.
-
(2003)
-
-
-
136
-
-
0346575937
-
Performance of parallel computers for spectral atmospheric models
-
I.T. Foster, B. Toonen, and P.H. Worley. Performance of parallel computers for spectral atmospheric models. Journal of Atmospheric and Oceanic Technology, 13(5): 1031-1045, 1996.
-
(1996)
Journal of Atmospheric and Oceanic Technology
, vol.13
, Issue.5
, pp. 1031-1045
-
-
Foster, I.T.1
Toonen, B.2
Worley, P.H.3
-
138
-
-
35048845536
-
Exploring the predictability of MPI messages
-
F. Freitag, J. Caubet, M. Farreras, T. Cortes, and J. Labarta. Exploring the predictability of MPI messages. In Proceedings of the 17th IEEE International Parallel and Distributed Processing Symposium (IPDPS03), pages 46-55, 2003.
-
(2003)
Proceedings of the 17th IEEE International Parallel and Distributed Processing Symposium (IPDPS03)
, pp. 46-55
-
-
Freitag, F.1
Caubet, J.2
Farreras, M.3
Cortes, T.4
Labarta, J.5
-
142
-
-
0031622953
-
The implementation of the Cilk-5 multithreaded language
-
Montreal, Quebec, Canada, June
-
M. Frigo, C.E. Leiserson, and K.H. Randall. The implementation of the Cilk-5 multithreaded language. In Proceedings of the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 212-223, Montreal, Quebec, Canada, June 1998.
-
(1998)
Proceedings of the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
143
-
-
32844470371
-
Low-overhead call path profiling of unmodified, optimized code
-
New York, NY, ACM Press
-
N. Froyd, J. Mellor-Crummey, and R. Fowler. Low-overhead call path profiling of unmodified, optimized code. In Proceedings of 19th International Conference on Supercomputing, pages 81-90, New York, NY, 2005. ACM Press.
-
(2005)
Proceedings of 19th International Conference on Supercomputing
, pp. 81-90
-
-
Froyd, N.1
Mellor-Crummey, J.2
Fowler, R.3
-
145
-
-
70350755747
-
Scalable loadbalance measurement for SPMD codes
-
Piscataway, NJ, IEEE Press
-
T. Gamblin, B.R. de Supinski, M. Schulz, R. Fowler, and D.A. Reed. Scalable loadbalance measurement for SPMD codes. In Proceedings of ACM/IEEE Conference on Supercomputing (SC08), pages 1-12, Piscataway, NJ, 2008. IEEE Press.
-
(2008)
Proceedings of ACM/IEEE Conference on Supercomputing (SC08)
, pp. 1-12
-
-
Gamblin, T.1
De Supinski, B.R.2
Schulz, M.3
Fowler, R.4
Reed, D.A.5
-
147
-
-
72149119839
-
Scalable collation and presentation of call-path profile data with cube
-
Julich (Germany)
-
M. Geimer, B. Kuhlmann, F. Pulatova, F. Wolf, and B.J.N. Wylie. Scalable collation and presentation of call-path profile data with cube. In Parallel Computing: Architectures, Algorithms and Applications: Proceedings of Parallel Computing (ParCo07), volume 15, pages 645-652, Julich (Germany), 2007.
-
(2007)
Parallel Computing: Architectures, Algorithms and Applications: Proceedings of Parallel Computing (ParCo07)
, vol.15
, pp. 645-652
-
-
Geimer, M.1
Kuhlmann, B.2
Pulatova, F.3
Wolf, F.4
Wylie, B.J.N.5
-
148
-
-
70149102227
-
A generic and configurable sourcecode instrumentation component
-
G. Allen, J. Nabrzyski, E. Seidel, G. van Albada, J. Dongarra, and P. Sloot, editors, Baton Rouge, LA, May, Springer
-
M. Geimer, S. Shende, A. Malony, and F. Wolf. A generic and configurable sourcecode instrumentation component. In G. Allen, J. Nabrzyski, E. Seidel, G. van Albada, J. Dongarra, and P. Sloot, editors, International Conference on Computational Science (ICCS), volume 5545 of Lecture Notes in Computer Science, pages 696-705, Baton Rouge, LA, May 2009. Springer.
-
(2009)
International Conference on Computational Science (ICCS), volume 5545 of Lecture Notes in Computer Science
, pp. 696-705
-
-
Geimer, M.1
Shende, S.2
Malony, A.3
Wolf, F.4
-
149
-
-
33746593747
-
Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies
-
June
-
S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam. Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies. International Journal of Parallel Programming, 34(3): 261-317, June 2006.
-
(2006)
International Journal of Parallel Programming
, vol.34
, Issue.3
, pp. 261-317
-
-
Girbal, S.1
Vasilache, N.2
Bastoul, C.3
Cohen, A.4
Parello, D.5
Sigler, M.6
Temam, O.7
-
151
-
-
0345584934
-
The Cactus framework and toolkit: Design and applications
-
Berlin, Springer
-
T. Goodale, G. Allen, G. Lanfermann, J. Massó, T. Radke, E. Seidel, and J. Shalf. The Cactus framework and toolkit: Design and applications. In Vector and Parallel Processing -VECPAR’2002, 5th International Conference, Lecture Notes in Computer Science, Berlin, 2003. Springer.
-
(2003)
Vector and Parallel Processing -VECPAR’2002, 5th International Conference, Lecture Notes in Computer Science
-
-
Goodale, T.1
Allen, G.2
Lanfermann, G.3
Massó, J.4
Radke, T.5
Seidel, E.6
Shalf, J.7
-
154
-
-
85054436902
-
-
D. Gunter, K. Huck, K. Karavanic, J. May, A. Malony, K. Mohror, S. Moore, A. Morris, S. Shende, V. Taylor, X. Wu, and Y. Zhang. Performance database technology for SciDAC applications. 2007.
-
(2007)
Performance database technology for SciDAC applications
-
-
Gunter, D.1
Huck, K.2
Karavanic, K.3
May, J.4
Malony, A.5
Mohror, K.6
Moore, S.7
Morris, A.8
Shende, S.9
Taylor, V.10
Wu, X.11
Zhang, Y.12
-
155
-
-
40749124008
-
Architecture of Qbox: A scalable first-principles molecular dynamics code
-
January/March
-
F. Gygi. Architecture of Qbox: A scalable first-principles molecular dynamics code. IBM Journal of Research and Development, 52, January/March 2008.
-
(2008)
IBM Journal of Research and Development
, pp. 52
-
-
Gygi, F.1
-
156
-
-
33845422522
-
Large-scale first-principles molecular dynamics simulations on the BlueGene/L platform using the Qbox code
-
F. Gygi, E. Draeger, B.R. de Supinski, R.K. Yates, F. Franchetti, S. Kral, J. Lorenz, C.W. Überhuber, J.A. Gunnels, and J.C. Sexton. Large-scale first-principles molecular dynamics simulations on the BlueGene/L platform using the Qbox code. In Proceedings of ACM/IEEE Conference on Supercomputing (SC05), 2005.
-
(2005)
Proceedings of ACM/IEEE Conference on Supercomputing (SC05)
-
-
Gygi, F.1
Draeger, E.2
De Supinski, B.R.3
Yates, R.K.4
Franchetti, F.5
Kral, S.6
Lorenz, J.7
Überhuber, C.W.8
Gunnels, J.A.9
Sexton, J.C.10
-
157
-
-
34548239117
-
Large-scale electronic structure calculations of high-z metals on the BlueGene/L Platform
-
November
-
F. Gygi, E.W. Draeger, M. Schulz, B.R. de Supinski, J.A. Gunnels, V. Austel, J.C. Sexton, F. Franchetti, S. Kral, J. Lorenz, and C.W. Überhuber. Large-scale electronic structure calculations of high-z metals on the BlueGene/L Platform. In Proceedings of ACM/IEEE Conference on Supercomputing (SC06), November 2006.
-
(2006)
Proceedings of ACM/IEEE Conference on Supercomputing (SC06)
-
-
Gygi, F.1
Draeger, E.W.2
Schulz, M.3
De Supinski, B.R.4
Gunnels, J.A.5
Austel, V.6
Sexton, J.C.7
Franchetti, F.8
Kral, S.9
Lorenz, J.10
Überhuber, C.W.11
-
158
-
-
0003501882
-
-
NCAR Tech. Note NCAR/TN-382+STR, National Center for Atmospheric Research, Boulder, CO
-
J.J. Hack, B.A. Boville, B.P. Briegleb, J.T. Kiehland, P.J. Rasch, and D.L. Williamson. Description of the NCAR community climate model (CCM2). NCAR Tech. Note NCAR/TN-382+STR, National Center for Atmospheric Research, Boulder, CO, 1992.
-
(1992)
Description of the NCAR community climate model (CCM2)
-
-
Hack, J.J.1
Boville, B.A.2
Briegleb, B.P.3
Kiehland, J.T.4
Rasch, P.J.5
Williamson, D.L.6
-
159
-
-
84870211068
-
Loop transformation recipes for code generation and auto-tuning
-
October
-
M. Hall, J. Chame, J. Shin, C. Chen, G. Rudy, and M.M. Khan. Loop transformation recipes for code generation and auto-tuning. In LCPC, October, 2009.
-
(2009)
LCPC
-
-
Hall, M.1
Chame, J.2
Shin, J.3
Chen, C.4
Rudy, G.5
Khan, M.M.6
-
161
-
-
0030380793
-
Maximizing multiprocessor performance with the SUIF compiler
-
December
-
M.W. Hall, J.M. Anderson, S.P. Amarasinghe, B.R. Murphy, S. Liao, E. Bugnion, and M.S. Lam. Maximizing multiprocessor performance with the SUIF compiler. IEEE Computer, 29(12): 84-89, December 1996.
-
(1996)
IEEE Computer
, vol.29
, Issue.12
, pp. 84-89
-
-
Hall, M.W.1
Anderson, J.M.2
Amarasinghe, S.P.3
Murphy, B.R.4
Liao, S.5
Bugnion, E.6
Lam, M.S.7
-
165
-
-
0024903997
-
Evaluating Associativity in CPU Caches
-
M.D. Hill and A.J. Smith. Evaluating Associativity in CPU Caches. IEEE Transactions on Computers, 38(12): 1612-1630, 1989.
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
166
-
-
10644250257
-
Inhomogeneous electron gas
-
P. Hohenberg and W. Kohn. Inhomogeneous electron gas. Physical Review, 136: B864, 1964.
-
(1964)
Physical Review
, vol.136
, pp. B864
-
-
Hohenberg, P.1
Kohn, W.2
-
167
-
-
0034543848
-
Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications
-
A. Hoisie, O. Lubeck, and H. Wasserman. Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications. International Journal of High Performance Computing Applications, 14: 330-346, 2000.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, pp. 330-346
-
-
Hoisie, A.1
Lubeck, O.2
Wasserman, H.3
-
168
-
-
12444335040
-
Prediction and adaptation in Active Harmony
-
J.K. Hollingsworth and P.J. Keleher. Prediction and adaptation in Active Harmony. Cluster Computing, 2(3): 195-205, 1999.
-
(1999)
Cluster Computing
, vol.2
, Issue.3
, pp. 195-205
-
-
Hollingsworth, J.K.1
Keleher, P.J.2
-
169
-
-
0028553216
-
Dynamic program instrumentation for scalable performance tools
-
Knoxville, TN, May
-
J.K. Hollingsworth, B.P. Miller, and J. Cargille. Dynamic program instrumentation for scalable performance tools. In 1994 Scalable High Performance Computing Conference, pages 841-850, Knoxville, TN, May 1994.
-
(1994)
1994 Scalable High Performance Computing Conference
, pp. 841-850
-
-
Hollingsworth, J.K.1
Miller, B.P.2
Cargille, J.3
-
170
-
-
84938447945
-
Direct search solution of numerical and statistical problems
-
R. Hooke and T.A. Jeeves. Direct search solution of numerical and statistical problems. Journal of the ACM, 8(2): 212-229, 1961.
-
(1961)
Journal of the ACM
, vol.8
, Issue.2
, pp. 212-229
-
-
Hooke, R.1
Jeeves, T.A.2
-
171
-
-
85054424049
-
-
HPC challenge benchmark. http://icl.cs.utk.edu/hpcc/index.html
-
-
-
-
173
-
-
48849093309
-
Knowledge Support and Automation for Performance Analysis with PerfExplorer 2.0
-
(special issue on Large-Scale Programming Tools and Environments)
-
K. Huck, A. Malony, S. Shende, and A. Morris. Knowledge Support and Automation for Performance Analysis with PerfExplorer 2.0. The Journal of Scientific Programming, 16(2-3): 123-134, 2008. (special issue on Large-Scale Programming Tools and Environments).
-
(2008)
The Journal of Scientific Programming
, vol.16
, Issue.2-3
, pp. 123-134
-
-
Huck, K.1
Malony, A.2
Shende, S.3
Morris, A.4
-
174
-
-
33745170397
-
Design and implementation of a parallel performance data management framework
-
Washington, DC, USA, IEEE Computer Society
-
K.A. Huck., A.D. Malony, and A. Morris. Design and implementation of a parallel performance data management framework. In Proceedings of the 2005 International Conference on Parallel Processing (ICPP05), pages 473-482, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
Proceedings of the 2005 International Conference on Parallel Processing (ICPP05)
, pp. 473-482
-
-
Huck, K.A.1
Malony, A.D.2
Morris, A.3
-
175
-
-
0001439727
-
An elastic-viscous-plastic model for sea ice dynamics
-
E.C. Hunke and J.K. Dukowicz. An elastic-viscous-plastic model for sea ice dynamics. Journal of Physical Oceanography, 27: 1849-1867, 1997.
-
(1997)
Journal of Physical Oceanography
, vol.27
, pp. 1849-1867
-
-
Hunke, E.C.1
Dukowicz, J.K.2
-
177
-
-
33646765746
-
Kranc: A Mathematica application to generate numerical codes for tensorial evolution equations
-
S. Husa, I. Hinder, and C. Lechner. Kranc: A Mathematica application to generate numerical codes for tensorial evolution equations. Computer Physics Communications, 174: 983-1004, 2006.
-
(2006)
Computer Physics Communications
, vol.174
, pp. 983-1004
-
-
Husa, S.1
Hinder, I.2
Lechner, C.3
-
180
-
-
27144518084
-
An approach to performance prediction for parallel applications
-
E. Ipek, B.R. de Supinski, M. Schulz, and S.A. McKee. An approach to performance prediction for parallel applications. In Euro-Par 2005 Parallel Processing, pages 196-205, 2005.
-
(2005)
Euro-Par 2005 Parallel Processing
, pp. 196-205
-
-
Ipek, E.1
De Supinski, B.R.2
Schulz, M.3
McKee, S.A.4
-
181
-
-
85054431030
-
-
ITER: International thermonuclear experimental reactor.
-
-
-
-
182
-
-
85054461393
-
HPC profiling with the Sun Studio(TM) performance tools
-
Dresden, Germany, September
-
M. Itzkowitz and Y. Maruyama. HPC profiling with the Sun Studio(TM) performance tools. In Third Parallel Tools Workshop, Dresden, Germany, September 2009.
-
(2009)
Third Parallel Tools Workshop
-
-
Itzkowitz, M.1
Maruyama, Y.2
-
183
-
-
0037595554
-
Sheared poloidal flow driven by mode conversion in tokamak plasmas
-
E. Jaeger, L. Berry, and J. Myra, et al. Sheared poloidal flow driven by mode conversion in tokamak plasmas. Physical Review Letters, 90, 2003.
-
(2003)
Physical Review Letters
, pp. 90
-
-
Jaeger, E.1
Berry, L.2
Myra, J.3
-
185
-
-
23244452422
-
Practical performance portability in the Parallel Ocean Program (POP)
-
P.W. Jones, P.H. Worley, Y. Yoshida, J.B. White III, and J. Levesque. Practical performance portability in the Parallel Ocean Program (POP). Concurrency and Computation: Practice and Experience, 17(10): 1317-1327, 2005.
-
(2005)
Concurrency and Computation: Practice and Experience
, vol.17
, Issue.10
, pp. 1317-1327
-
-
Jones, P.W.1
Worley, P.H.2
Yoshida, Y.3
White III, J.B.4
Levesque, J.5
-
187
-
-
78149347218
-
Predictive performance and scalability modeling of a large-scale application
-
New York, NY, USA, ACM
-
D.J. Kerbyson, H.J. Alme, A. Hoisie, F. Petrini, H.J. Wasserman, and M. Gittings. Predictive performance and scalability modeling of a large-scale application. In Proceedings of ACM/IEEE Conference on Supercomputing (SC01), pages 37-37, New York, NY, USA, 2001. ACM.
-
(2001)
Proceedings of ACM/IEEE Conference on Supercomputing (SC01)
, pp. 37
-
-
Kerbyson, D.J.1
Alme, H.J.2
Hoisie, A.3
Petrini, F.4
Wasserman, H.J.5
Gittings, M.6
-
188
-
-
0031718804
-
The National Center for Atmospheric Research Community Climate Model: CCM3
-
J.T. Kiehl, J.J. Hack, G. Bonan, B.A. Boville, D.L. Williamson, and P.J. Rasch. The National Center for Atmospheric Research Community Climate Model: CCM3. Journal of Climate, 11: 1131-1149, 1998.
-
(1998)
Journal of Climate
, vol.11
, pp. 1131-1149
-
-
Kiehl, J.T.1
Hack, J.J.2
Bonan, G.3
Boville, B.A.4
Williamson, D.L.5
Rasch, P.J.6
-
190
-
-
0034512401
-
Combined selection of tile sizes and unroll factors using iterative compilation
-
Washington, DC, USA, IEEE Computer Society
-
T. Kisuki, P.M.W. Knijnenburg, and M.F.P. O’Boyle. Combined selection of tile sizes and unroll factors using iterative compilation. In PACT '00: Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques, Washington, DC, USA, 2000. IEEE Computer Society.
-
(2000)
PACT '00: Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques
-
-
Kisuki, T.1
Knijnenburg, P.M.W.2
O’Boyle, M.F.P.3
-
191
-
-
33746635961
-
Introducing the Open Trace Format (OTF)
-
Reading, UK, May
-
A. Knüpfer, R. Brendel, H. Brunst, H. Mix, and W.E. Nagel. Introducing the Open Trace Format (OTF). In Proceedings of the 6th International Conference on Computational Science, volume 3992 of Springer Lecture Notes in Computer Science, pages 526-533, Reading, UK, May 2006.
-
(2006)
Proceedings of the 6th International Conference on Computational Science, volume 3992 of Springer Lecture Notes in Computer Science
, pp. 526-533
-
-
Knüpfer, A.1
Brendel, R.2
Brunst, H.3
Mix, H.4
Nagel, W.E.5
-
193
-
-
24944580988
-
-
Springer
-
S-H Ko, K.W. Cho, Y.D. Song, Y.G. Kim, J-S Na, and C. Kim. Development of Cactus driver for CFD analyses in the grid computing environment, pages 771-777. Springer, 2005.
-
(2005)
Development of Cactus driver for CFD analyses in the grid computing environment
, pp. 771-777
-
-
Ko, S.-H.1
Cho, K.W.2
Song, Y.D.3
Kim, Y.G.4
Na, J.-S.5
Kim, C.6
-
195
-
-
4043140349
-
Density functional and density matrix method scaling linearly with the number of atoms
-
W. Kohn. Density functional and density matrix method scaling linearly with the number of atoms. Physical Review Letters, 76(17): 3168-3171, 1996.
-
(1996)
Physical Review Letters
, vol.76
, Issue.17
, pp. 3168-3171
-
-
Kohn, W.1
-
196
-
-
0042113153
-
Self-consistent equations including exchange and correlation effects
-
W. Kohn and L.J. Sham. Self-consistent equations including exchange and correlation effects. Physical Review, 140: A1133, 1965.
-
(1965)
Physical Review
, vol.140
, pp. A1133
-
-
Kohn, W.1
Sham, L.J.2
-
197
-
-
0242667172
-
Optimization by direct search: New perspectives on some classical and modern methods
-
T.G. Kolda, R.M. Lewis, and V. Torczon. Optimization by direct search: New perspectives on some classical and modern methods. SIAM Review, 45(3): 385-482, 2004.
-
(2004)
SIAM Review
, vol.45
, Issue.3
, pp. 385-482
-
-
Kolda, T.G.1
Lewis, R.M.2
Torczon, V.3
-
198
-
-
0029359304
-
Comparison of initial value and eigenvalue codes for kinetic toroidal plasma instabilities
-
August
-
M. Kotschenreuther, G. Rewoldt, and W.M. Tang. Comparison of initial value and eigenvalue codes for kinetic toroidal plasma instabilities. Computer Physics Communications, 88: 128-140, August 1995.
-
(1995)
Computer Physics Communications
, vol.88
, pp. 128-140
-
-
Kotschenreuther, M.1
Rewoldt, G.2
Tang, W.M.3
-
199
-
-
85054429973
-
-
Kranc: Automated code generation. http://www.cct.lsu.edu/~eschnett/Kranc
-
-
-
-
200
-
-
65549119644
-
Quantum chromodynamics with advanced computing
-
A.S. Kronfeld. Quantum chromodynamics with advanced computing. Journal of Physics: Conference Series, 125: 012067, 2008.
-
(2008)
Journal of Physics: Conference Series
, vol.125
, pp. 012067
-
-
Kronfeld, A.S.1
-
202
-
-
1442337776
-
Finding effective optimization phase sequences
-
P. Kulkarni, W. Zhao, H. Moon, K. Cho, D. Whalley, J. Davidson, M. Bailey, Y. Paek, and K. Gallivan. Finding effective optimization phase sequences. SIGPLAN Not., 38(7): 12-23, 2003.
-
(2003)
SIGPLAN Not.
, vol.38
, Issue.7
, pp. 12-23
-
-
Kulkarni, P.1
Zhao, W.2
Moon, H.3
Cho, K.4
Whalley, D.5
Davidson, J.6
Bailey, M.7
Paek, Y.8
Gallivan, K.9
-
203
-
-
48849094389
-
Scalability of tracing and visualization tools
-
Malaga
-
J. Labarta, J. Gimenez, E. Martinez, P. Gonzalez, H. Servat, G. Llort, and X. Aguilar. Scalability of tracing and visualization tools. In Parallel Computing 2005, Malaga, 2005.
-
(2005)
Parallel Computing 2005
-
-
Labarta, J.1
Gimenez, J.2
Martinez, E.3
Gonzalez, P.4
Servat, H.5
Llort, G.6
Aguilar, X.7
-
204
-
-
84947944896
-
Dip: A parallel program development environment
-
Lyon (France), August
-
J. Labarta, S. Girona, V. Pillet, T. Cortes, and L. Gregoris. Dip: A parallel program development environment. In Proceedings of 2nd International EuroPar Conference (EuroPar 96), Lyon (France), August 1996.
-
(1996)
Proceedings of 2nd International EuroPar Conference (EuroPar 96)
-
-
Labarta, J.1
Girona, S.2
Pillet, V.3
Cortes, T.4
Gregoris, L.5
-
205
-
-
0032251894
-
Convergence properties of the Nelder-Mead simplex algorithm in low dimensions
-
J.C. Lagarias, J.A. Reeds, M.H. Wright, and P.E. Wright. Convergence properties of the Nelder-Mead simplex algorithm in low dimensions. SIAM Journal on Optimization, 9: 112-147, 1998.
-
(1998)
SIAM Journal on Optimization
, vol.9
, pp. 112-147
-
-
Lagarias, J.C.1
Reeds, J.A.2
Wright, M.H.3
Wright, P.E.4
-
206
-
-
0028380268
-
Rewriting executable files to measure program behavior
-
J.R. Larus and T. Ball. Rewriting executable files to measure program behavior. Software Practice and Experience, 24(2): 197-218, 1994.
-
(1994)
Software Practice and Experience
, vol.24
, Issue.2
, pp. 197-218
-
-
Larus, J.R.1
Ball, T.2
-
207
-
-
0003834102
-
-
Prentice-Hall, Inc., Upper Saddle River, NJ, USA
-
E.D. Lazowska, J. Zahorjan, G.S. Graham, and K.C. Sevcik. Quantitative System Performance: Computer System Analysis Using Queueing Network Models. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1984.
-
(1984)
Quantitative System Performance: Computer System Analysis Using Queueing Network Models
-
-
Lazowska, E.D.1
Zahorjan, J.2
Graham, G.S.3
Sevcik, K.C.4
-
208
-
-
70249083648
-
From tensor equations to numerical code -computer algebra tools for numerical relativity
-
C. Lechner, D. Alic, and S. Husa. From tensor equations to numerical code -computer algebra tools for numerical relativity. In SYNASC 2004 -6th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Timisoara, Romania, 2004.
-
(2004)
SYNASC 2004 -6th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Timisoara, Romania
-
-
Lechner, C.1
Alic, D.2
Husa, S.3
-
209
-
-
34748909426
-
Methods of inference and learning for performance modeling of parallel applications
-
New York, NY, ACM
-
B.C. Lee, D.M. Brooks, B.R. de Supinski, M. Schulz, K. Singh, and S.A. McKee. Methods of inference and learning for performance modeling of parallel applications. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 249-258, New York, NY, 2007. ACM.
-
(2007)
PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 249-258
-
-
Lee, B.C.1
Brooks, D.M.2
De Supinski, B.R.3
Schulz, M.4
Singh, K.5
McKee, S.A.6
-
210
-
-
20844459296
-
Gyrokinetic particle simulation model
-
W.W. Lee. Gyrokinetic particle simulation model. Journal of Computational Physics, 72: 243-269, 1987.
-
(1987)
Journal of Computational Physics
, vol.72
, pp. 243-269
-
-
Lee, W.W.1
-
212
-
-
85054467691
-
Dyninst as a binary rewriter
-
M. Legendre. Dyninst as a binary rewriter. In Paradyn/Dyninst week, 2009. http: //www.dyninst.org/pdWeek09/slides/legendre-binrewriter.pdf
-
(2009)
Paradyn/Dyninst week
-
-
Legendre, M.1
-
214
-
-
77954054624
-
A note on auto-tuning GEMM for GPUs
-
Baton Rouge, LA, May
-
Y. Li, J. Dongarra, and S. Tomov. A note on auto-tuning GEMM for GPUs. In 9th International Conference on Computation Science (ICCS’09), Baton Rouge, LA, May 2009.
-
(2009)
9th International Conference on Computation Science (ICCS’09)
-
-
Li, Y.1
Dongarra, J.2
Tomov, S.3
-
216
-
-
0037071357
-
Size scaling of turbulent transport in magnetically confined plasmas
-
Z. Lin, S. Ethier, T.S. Hahm, and W.M. Tang. Size scaling of turbulent transport in magnetically confined plasmas. Physical Review Letters, 88, 2002.
-
(2002)
Physical Review Letters
, pp. 88
-
-
Lin, Z.1
Ethier, S.2
Hahm, T.S.3
Tang, W.M.4
-
217
-
-
0032544628
-
Turbulent transport reduction by zonal flows: Massively parallel simulations
-
September
-
Z. Lin, T.S. Hahm, W.W. Lee, W.M. Tang, and R.B. White. Turbulent transport reduction by zonal flows: Massively parallel simulations. Science, 281(5384): 1835-1837, September 1998.
-
(1998)
Science
, vol.281
, Issue.5384
, pp. 1835-1837
-
-
Lin, Z.1
Hahm, T.S.2
Lee, W.W.3
Tang, W.M.4
White, R.B.5
-
218
-
-
84862321901
-
A tool framework for static and dynamic analysis of object-oriented software with templates
-
K.A. Lindlan, J. Cuny, A.D. Malony, S. Shende, B. Mohr, R. Rivenburgh, and C. Rasmussen. A tool framework for static and dynamic analysis of object-oriented software with templates. In Proceedings of ACM/IEEE Conference on Supercomputing (SC2000), 2000.
-
(2000)
Proceedings of ACM/IEEE Conference on Supercomputing (SC2000)
-
-
Lindlan, K.A.1
Cuny, J.2
Malony, A.D.3
Shende, S.4
Mohr, B.5
Rivenburgh, R.6
Rasmussen, C.7
-
219
-
-
85054463495
-
High-resolution peripheral quantitative computed tomography can assess microstructural and mechanical properties of human distal tibial bone
-
press
-
X.S. Liu, X.H. Zhang, K.K. Sekhon, M.F. Adam, D.J. McMahon, E. Shane, J.P. Bilezikian, and X.E. Guo. High-resolution peripheral quantitative computed tomography can assess microstructural and mechanical properties of human distal tibial bone. Journal of Bone and Mineral Research, in press.
-
Journal of Bone and Mineral Research
-
-
Liu, X.S.1
Zhang, X.H.2
Sekhon, K.K.3
Adam, M.F.4
McMahon, D.J.5
Shane, E.6
Bilezikian, J.P.7
Guo, X.E.8
-
220
-
-
77954020714
-
On-line detection of large-scale parallel application’s structure
-
April
-
G. Llort, J. Gonzalez, H. Servat, J. Gimenez, and J. Labarta. On-line detection of large-scale parallel application’s structure. In IPDPS 2010, April 2010.
-
(2010)
IPDPS 2010
-
-
Llort, G.1
Gonzalez, J.2
Servat, H.3
Gimenez, J.4
Labarta, J.5
-
221
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
C.K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S.Wallace, V.J. Reddi, and K. Hazelwood. Pin: Building customized program analysis tools with dynamic instrumentation. In Proceedings of Programming Language Design and Implementation (PLDI), pages 191-200, 2005.
-
(2005)
Proceedings of Programming Language Design and Implementation (PLDI)
, pp. 191-200
-
-
Luk, C.K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
222
-
-
0031164889
-
Increasing the efficiency of ideal solar cells by photon induced tansitions at intermediate lavels
-
A. Luque and A. Marti. Increasing the efficiency of ideal solar cells by photon induced tansitions at intermediate lavels. Physical Review Letters, 78: 5014, 1997.
-
(1997)
Physical Review Letters
, vol.78
, pp. 5014
-
-
Luque, A.1
Marti, A.2
-
223
-
-
33645446819
-
Lattice boltzmann model for dissipative MHD
-
Montreux, Switzerland, June 17-21
-
A. Macnab, G. Vahala, L. Vahala, and P. Pavlo. Lattice boltzmann model for dissipative MHD. In 29th EPS Conference on Controlled Fusion and Plasma Physics, volume 26B, Montreux, Switzerland, June 17-21, 2002.
-
(2002)
29th EPS Conference on Controlled Fusion and Plasma Physics
, vol.26B
-
-
Macnab, A.1
Vahala, G.2
Vahala, L.3
Pavlo, P.4
-
224
-
-
33745859061
-
Spatial hypersurfaces in causal set cosmology
-
Jun
-
S. Major, D. Rideout, and S. Surya. Spatial hypersurfaces in causal set cosmology. Classical Quantum Gravity, 23: 4743-4752, Jun 2006.
-
(2006)
Classical Quantum Gravity
, vol.23
, pp. 4743-4752
-
-
Major, S.1
Rideout, D.2
Surya, S.3
-
226
-
-
38049136248
-
Phase-based parallel performance profiling
-
Malaga, Spain, September
-
A. Malony, S. Shende, and A. Morris. Phase-based parallel performance profiling. In ParCo 2005: Parallel Computing 2005, Malaga, Spain, September 2005.
-
(2005)
ParCo 2005: Parallel Computing 2005
-
-
Malony, A.1
Shende, S.2
Morris, A.3
-
229
-
-
70350727348
-
Measuring how fast computers really are
-
September
-
J. Markoff. Measuring how fast computers really are. New York Times, page 14F, September 1991.
-
(1991)
New York Times
, pp. 14
-
-
Markoff, J.1
-
230
-
-
85054455832
-
Performance Measurement of Applications with GPU Acceleration using CUDA
-
to appear
-
S. Mayanglambam, A. Malony, and M. Sottile. Performance Measurement of Applications with GPU Acceleration using CUDA. In Parallel Computing (ParCo), 2009. to appear.
-
(2009)
Parallel Computing (ParCo)
-
-
Mayanglambam, S.1
Malony, A.2
Sottile, M.3
-
232
-
-
0032252855
-
Convergence of the Nelder-Mead simplex method to a nonstationary point
-
K.I.M. McKinnon. Convergence of the Nelder-Mead simplex method to a nonstationary point. SIAM Journal on Optimization, 9(1): 148-158, 1998.
-
(1998)
SIAM Journal on Optimization
, vol.9
, Issue.1
, pp. 148-158
-
-
McKinnon, K.I.M.1
-
233
-
-
85054461446
-
-
a public BSSN code
-
McLachlan, a public BSSN code.
-
-
-
McLachlan1
-
235
-
-
0036679608
-
HPCView: A tool for top-down analysis of node performance
-
J. Mellor-Crummey, R.J. Fowler, G. Marin, and N. Tallent. HPCView: A tool for top-down analysis of node performance. Journal of Supercomputing, 23(1): 81-104, 2002.
-
(2002)
Journal of Supercomputing
, vol.23
, Issue.1
, pp. 81-104
-
-
Mellor-Crummey, J.1
Fowler, R.J.2
Marin, G.3
Tallent, N.4
-
236
-
-
33846529179
-
Performance monitoring on the POWER5 microprocessor
-
L.K. John and L. Eeckhout, CRC PRESS
-
A. Mericas. Performance monitoring on the POWER5 microprocessor. In L.K. John and L. Eeckhout, editors, Performance Evaluation and Benchmarking, pages 247-266. CRC PRESS, 2006.
-
(2006)
Performance Evaluation and Benchmarking
, pp. 247-266
-
-
Mericas, A.1
-
240
-
-
0037146399
-
A Conservative Three-Dimensional Eulerian Method for Coupled Solid-Fluid Shock Capturing
-
G.H. Miller and P. Colella. A Conservative Three-Dimensional Eulerian Method for Coupled Solid-Fluid Shock Capturing. Journal of Computational Physics, 183: 26-82, 2002.
-
(2002)
Journal of Computational Physics
, vol.183
, pp. 26-82
-
-
Miller, G.H.1
Colella, P.2
-
241
-
-
83155177863
-
Coping at the user-level with resource limitations in the Cray message passing poolkit MPI at scale: How not to spend your summer vacation
-
R. Winget and K. Winget, editor, Eagan, MN, Cray User Group, Inc
-
R. Mills, F. Hoffman, P.Worley, K. Perumalla, A. Mirin, G. Hammond, and B. Smith. Coping at the user-level with resource limitations in the Cray message passing poolkit MPI at scale: How not to spend your summer vacation. In R. Winget and K. Winget, editor, Proceedings of the 51st Cray User Group Conference, May 4-7, 2009, Eagan, MN, 2009. Cray User Group, Inc.
-
(2009)
Proceedings of the 51st Cray User Group Conference, May 4-7, 2009
-
-
Mills, R.1
Hoffman, F.2
Worley, P.3
Perumalla, K.4
Mirin, A.5
Hammond, G.6
Smith, B.7
-
242
-
-
35348840289
-
Block structured adaptive mesh and time refinement for hybrid, hyperbolic + n-body systems
-
F. Miniati and P. Colella. Block structured adaptive mesh and time refinement for hybrid, hyperbolic + n-body systems. Journal of Computational Physics, 227: 400-430, 2007.
-
(2007)
Journal of Computational Physics
, vol.227
, pp. 400-430
-
-
Miniati, F.1
Colella, P.2
-
243
-
-
36049045668
-
Extending scalability of the Community Atmosphere Model
-
A. Mirin and P. Worley. Extending scalability of the Community Atmosphere Model. Journal of Physics: Conference Series, 78, 2007. doi: 10.1088/1742-6596/78/1/012082
-
(2007)
Journal of Physics: Conference Series
, pp. 78
-
-
Mirin, A.1
Worley, P.2
-
247
-
-
0000793139
-
Cramming more components onto integrated circuits
-
April
-
G.E. Moore. Cramming more components onto integrated circuits. Electronics, 38(8), April 1965.
-
(1965)
Electronics
, vol.38
, Issue.8
-
-
Moore, G.E.1
-
248
-
-
51849091556
-
Observing performance dynamics using parallel profile snapshots
-
Canary Island, Spain, August, Springer
-
A. Morris, W. Spear, A. Malony, and S. Shende. Observing performance dynamics using parallel profile snapshots. In EuroPar 2008, volume LNCS 5168, pages 162-171, Canary Island, Spain, August 2008. Springer.
-
(2008)
EuroPar 2008, volume LNCS 5168
, pp. 162-171
-
-
Morris, A.1
Spear, W.2
Malony, A.3
Shende, S.4
-
250
-
-
67650844203
-
Producing wrong data without doing anything obviously wrong!
-
New York, NY, USA, ACM
-
T. Mytkowicz, A. Diwan, M. Hauswirth, and P.F. Sweeney. Producing wrong data without doing anything obviously wrong! In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 265-276, New York, NY, USA, 2009. ACM.
-
(2009)
Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 265-276
-
-
Mytkowicz, T.1
Diwan, A.2
Hauswirth, M.3
Sweeney, P.F.4
-
251
-
-
0002438680
-
VAMPIR: Visualization and Analysis of MPI Resources
-
W E. Nagel, A. Arnold, M. Weber, H-C. Hoppe, and K. Solchenbach. VAMPIR: Visualization and Analysis of MPI Resources. Supercomputer, 12(1): 69-80, 1996.
-
(1996)
Supercomputer
, vol.12
, Issue.1
, pp. 69-80
-
-
Nagel, W.E.1
Arnold, A.2
Weber, M.3
Hoppe, H.-C.4
Solchenbach, K.5
-
252
-
-
85054452978
-
VAMPIR: Visualization and analysis of MPI resources
-
W.E. Nagel, A. Arnold, M. Weber, H.C. Hoppe, and K. Solchenbach. VAMPIR: Visualization and analysis of MPI resources. The International Journal of Supercomputer Applications and High Performance Computing, 11(2): 144-159, 1997.
-
(1997)
The International Journal of Supercomputer Applications and High Performance Computing
, vol.11
, Issue.2
, pp. 144-159
-
-
Nagel, W.E.1
Arnold, A.2
Weber, M.3
Hoppe, H.C.4
Solchenbach, K.5
-
253
-
-
77954018807
-
TAUoverMRNet (ToM): A framework for scalable parallel performance monitoring
-
A. Nataraj, A. Malony, A. Morris, D. Arnold, and B. Miller. TAUoverMRNet (ToM): A framework for scalable parallel performance monitoring. In International Workshop on Scalable Tools for High-End Computing (STHEC '08), 2008.
-
(2008)
International Workshop on Scalable Tools for High-End Computing (STHEC '08)
-
-
Nataraj, A.1
Malony, A.2
Morris, A.3
Arnold, D.4
Miller, B.5
-
254
-
-
40449120689
-
Integrated parallel performance views
-
A. Nataraj, A.D. Malony, S. Shende, and A. Morris. Integrated parallel performance views. Cluster Computing, 11(1): 57-73, 2008.
-
(2008)
Cluster Computing
, vol.11
, Issue.1
, pp. 57-73
-
-
Nataraj, A.1
Malony, A.D.2
Shende, S.3
Morris, A.4
-
255
-
-
56749181050
-
The ghost in the machine: Observing the effects of kernel operation on parallel application performance
-
Reno, Nevada, November 10-16
-
A. Nataraj, A. Morris, A.D. Malony, M. Sottile, and P. Beckman. The ghost in the machine: Observing the effects of kernel operation on parallel application performance. In Proceedings of 2007 ACM/IEEE Conference on Supercomputing (SC2007), Reno, Nevada, November 10-16 2007.
-
(2007)
Proceedings of 2007 ACM/IEEE Conference on Supercomputing (SC2007)
-
-
Nataraj, A.1
Morris, A.2
Malony, A.D.3
Sottile, M.4
Beckman, P.5
-
256
-
-
51849107896
-
TAUoverSupermon: Low-overhead online parallel performance monitoring
-
A. Nataraj, M. Sottile, A. Morris, A.D. Malony, and S. Shende. TAUoverSupermon: Low-overhead online parallel performance monitoring. In Europar’07: European Conference on Parallel Processing, 2007.
-
(2007)
Europar’07: European Conference on Parallel Processing
-
-
Nataraj, A.1
Sottile, M.2
Morris, A.3
Malony, A.D.4
Shende, S.5
-
257
-
-
85054443810
-
-
National Center for Supercomputing Applications. Blue Waters hardware. http://www.ncsa.illinois.edu/BlueWaters/hardware.html
-
Blue Waters hardware
-
-
-
258
-
-
0000238336
-
A simplex method for function minimization
-
J.A. Nelder and R. Mead. A simplex method for function minimization. Computer Journal, 7: 308-313, 1965.
-
(1965)
Computer Journal
, vol.7
, pp. 308-313
-
-
Nelder, J.A.1
Mead, R.2
-
259
-
-
51049092126
-
Model-guided performance tuning of parameter values: A case study with molecular dynamics visualization
-
April
-
Y.L. Nelson, B. Bansal, M. Hall, A. Nakano, and K. Lerman. Model-guided performance tuning of parameter values: A case study with molecular dynamics visualization. IEEE International Symposium on Parallel and Distributed Processing (IPDPS 2008), April 2008.
-
(2008)
IEEE International Symposium on Parallel and Distributed Processing (IPDPS 2008)
-
-
Nelson, Y.L.1
Bansal, B.2
Hall, M.3
Nakano, A.4
Lerman, K.5
-
261
-
-
67349187344
-
Scalatrace: Scalable compression and replay of communication traces in high performance computing
-
Aug
-
M. Noeth, P. Ratn, F. Mueller, M. Schulz, and B. de Supinski. Scalatrace: Scalable compression and replay of communication traces in high performance computing. Journal of Parallel and Distributed Computing, 69(8): 969-710, Aug 2009.
-
(2009)
Journal of Parallel and Distributed Computing
, vol.69
, Issue.8
, pp. 710-969
-
-
Noeth, M.1
Ratn, P.2
Mueller, F.3
Schulz, M.4
De Supinski, B.5
-
262
-
-
0002081678
-
Co-Array Fortran for parallel programming
-
R.W. Numrich and J.K. Reid. Co-Array Fortran for parallel programming. ACM Fortran Forum, 17(2): 1-31, 1998.
-
(1998)
ACM Fortran Forum
, vol.17
, Issue.2
, pp. 1-31
-
-
Numrich, R.W.1
Reid, J.K.2
-
264
-
-
84934325826
-
Scientific computations on modern parallel vector systems
-
Washington, DC, USA, IEEE Computer Society
-
L. Oliker, A. Canning, J. Carter, J. Shalf, and S. Ethier. Scientific computations on modern parallel vector systems. In Proceedings of ACM/IEEE Conference on Supercomputing (SC04), page 10, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
Proceedings of ACM/IEEE Conference on Supercomputing (SC04)
, pp. 10
-
-
Oliker, L.1
Canning, A.2
Carter, J.3
Shalf, J.4
Ethier, S.5
-
267
-
-
0031123703
-
From silicon to RNA: The coming of age of first-principle molecular dynamics
-
M. Parrinello. From silicon to RNA: The coming of age of first-principle molecular dynamics. Solid State Communications, 103, 107, 1997.
-
(1997)
Solid State Communications
, vol.103
, pp. 107
-
-
Parrinello, M.1
-
269
-
-
11944256577
-
Iterative minimization techniques for ab initio total-energy calculations: Molecular dynamics and conjugate gradients
-
M.C. Payne, M.P. Teter, D.C. Allan, T.A. Arias, and J.D. Joannopoulos. Iterative minimization techniques for ab initio total-energy calculations: Molecular dynamics and conjugate gradients. Reviews of Modern Physics, 64: 1045, 1992.
-
(1992)
Reviews of Modern Physics
, vol.64
, pp. 1045
-
-
Payne, M.C.1
Teter, M.P.2
Allan, D.C.3
Arias, T.A.4
Joannopoulos, J.D.5
-
270
-
-
27144551353
-
Using simpoint for accurate and efficient simulation
-
E. Perelman, G. Hamerly, M.V. Biesbrouck, T. Sherwood, and B. Calder. Using simpoint for accurate and efficient simulation. ACM SIGMETRICS Performance Evaluation Review, 31: 318-319, 2003.
-
(2003)
ACM SIGMETRICS Performance Evaluation Review
, vol.31
, pp. 318-319
-
-
Perelman, E.1
Hamerly, G.2
Biesbrouck, M.V.3
Sherwood, T.4
Calder, B.5
-
271
-
-
85054451417
-
-
SciDAC Performance Engineering Research Institute (PERI).
-
-
-
-
272
-
-
85054459001
-
-
PETSc: Portable, extensible toolkit for scientific computation.
-
-
-
-
273
-
-
70649090070
-
Victoria Falls: Scaling highly-threaded processor cores
-
S. Phillips. Victoria Falls: Scaling highly-threaded processor cores. In HotChips 19, 2007.
-
(2007)
HotChips 19
-
-
Phillips, S.1
-
274
-
-
0028409163
-
The NX message passing interface
-
April
-
P. Pierce. The NX message passing interface. Parallel Computing, 20(4): 463-480, April 1994.
-
(1994)
Parallel Computing
, vol.20
, Issue.4
, pp. 463-480
-
-
Pierce, P.1
-
275
-
-
33751095034
-
PARAVER: A tool to visualise and analyze parallel code
-
Amsterdam, IOS Press
-
V. Pillet, J. Labarta, T. Cortes, and S. Girona. PARAVER: A tool to visualise and analyze parallel code. In Proceedings of WoTUG-18: Transputer and occam Developments, volume 44, pages 17-31, Amsterdam, 1995. IOS Press.
-
(1995)
Proceedings of WoTUG-18: Transputer and occam Developments
, vol.44
, pp. 17-31
-
-
Pillet, V.1
Labarta, J.2
Cortes, T.3
Girona, S.4
-
276
-
-
33645436979
-
-
Technical Report UPC-CEPBA 95-3, European Center for Parallelism of Barcelona (CEPBA), Universitat Polit`ecnica de Catalunya (UPC)
-
V. Pillet, J. Labarta, T. Cortes, and S. Girona. PARAVER: A tool to visualize and analyze parallel code. Technical Report UPC-CEPBA 95-3, European Center for Parallelism of Barcelona (CEPBA), Universitat Polit`ecnica de Catalunya (UPC), 1995. http://tinyurl.com/paraver95
-
(1995)
PARAVER: A tool to visualize and analyze parallel code
-
-
Pillet, V.1
Labarta, J.2
Cortes, T.3
Girona, S.4
-
278
-
-
85054461952
-
-
PLASMA project. http://icl.cs.utk.edu/plasma
-
-
-
-
279
-
-
84871295761
-
Graphite: Polyhedral analyses and optimizations for gcc
-
S. Pop, A. Cohen, C. Bastoul, S. Girbal, G. Silber, and N. Vasilache. Graphite: Polyhedral analyses and optimizations for gcc. In Proceedings of the 2006 GCC Developers Summit, page 2006, 2006.
-
(2006)
Proceedings of the 2006 GCC Developers Summit
, pp. 2006
-
-
Pop, S.1
Cohen, A.2
Bastoul, C.3
Girbal, S.4
Silber, G.5
Vasilache, N.6
-
282
-
-
85054441534
-
-
Coefficient of determination. mathbits.com/mathbits/tisection/statistics2/correlation.htm
-
-
-
-
283
-
-
57349170105
-
Preserving time in large-scale communication traces
-
June
-
P. Ratn, F. Mueller, M. Schulz, and B. de Supinski. Preserving time in large-scale communication traces. In International Conference on Supercomputing, pages 46-55, June 2008.
-
(2008)
International Conference on Supercomputing
, pp. 46-55
-
-
Ratn, P.1
Mueller, F.2
Schulz, M.3
De Supinski, B.4
-
285
-
-
33846164822
-
Evidence for an entropy bound from fundamentally discrete gravity
-
D. Rideout and S. Zohren. Evidence for an entropy bound from fundamentally discrete gravity. Classical Quantum Gravity, 2006.
-
(2006)
Classical Quantum Gravity
-
-
Rideout, D.1
Zohren, S.2
-
286
-
-
84877034501
-
Mrnet: A software-based multicast/reduction network for scalable tools
-
IEEE Computer Society
-
P.C. Roth, D.C. Arnold, and B.P. Miller. Mrnet: A software-based multicast/reduction network for scalable tools. In International Conference on Supercomputing, pages 21-36. IEEE Computer Society, 2003.
-
(2003)
International Conference on Supercomputing
, pp. 21-36
-
-
Roth, P.C.1
Arnold, D.C.2
Miller, B.P.3
-
288
-
-
84863064747
-
-
Technical Report RC24351 W0709-061, IBM Research Division
-
V. Salapura, K. Ganesan, A. Gara, M. Gschwind, J. Sexton, and R. Walkup. Nextgeneration performance counters: Towards monitoring over a thousand concurrent events. Technical Report RC24351 W0709-061, IBM Research Division, 2007.
-
(2007)
Nextgeneration performance counters: Towards monitoring over a thousand concurrent events
-
-
Salapura, V.1
Ganesan, K.2
Gara, A.3
Gschwind, M.4
Sexton, J.5
Walkup, R.6
-
290
-
-
33746604824
-
A multi-block infrastructure for three-dimensional time-dependent numerical relativity
-
E. Schnetter, P. Diener, E.N. Dorband, and M. Tiglio. A multi-block infrastructure for three-dimensional time-dependent numerical relativity. Classical Quantum Gravity, 23: S553-S578, 2006.
-
(2006)
Classical Quantum Gravity
, vol.23
, pp. S553-S578
-
-
Schnetter, E.1
Diener, P.2
Dorband, E.N.3
Tiglio, M.4
-
291
-
-
1842479966
-
Evolutions in 3D numerical relativity using fixed mesh refinement
-
E. Schnetter, S.H. Hawley, and I. Hawke. Evolutions in 3D numerical relativity using fixed mesh refinement. Classical and Quantum Gravity, 21: 1465-1488, 2004.
-
(2004)
Classical and Quantum Gravity
, vol.21
, pp. 1465-1488
-
-
Schnetter, E.1
Hawley, S.H.2
Hawke, I.3
-
292
-
-
34548192076
-
Optical properties of zno/zns and zno/znte heterostructures for photovoltaic applications
-
J. Schrier, D.O. Demchenko, L.-W. Wang, and A.P. Alivisatos. Optical properties of zno/zns and zno/znte heterostructures for photovoltaic applications. NanoLett., 7: 2377, 2007.
-
(2007)
NanoLett.
, vol.7
, pp. 2377
-
-
Schrier, J.1
Demchenko, D.O.2
Wang, L.-W.3
Alivisatos, A.P.4
-
293
-
-
34547489425
-
A flexible and dynamic infrastructure for MPI tool interoperability
-
M. Schulz and B.R. de Supinski. A flexible and dynamic infrastructure for MPI tool interoperability. In Proceedings of ICPP 2006, pages 193-202, 2006.
-
(2006)
Proceedings of ICPP 2006
, pp. 193-202
-
-
Schulz, M.1
De Supinski, B.R.2
-
294
-
-
56749160395
-
pnMPI tools: A whole lot greater than the sum of their parts
-
M. Schulz and B.R. de Supinski. pnMPI tools: A whole lot greater than the sum of their parts. In Proceedings of SC07, 2007.
-
(2007)
Proceedings of SC07
-
-
Schulz, M.1
De Supinski, B.R.2
-
295
-
-
85054464259
-
-
Report of the High-End Computing Revitalization Task Force (HECRTF)
-
National Science and Technology Council Committee on Technology High-End Computing Revitalization Task Force. Report of the High-End Computing Revitalization Task Force (HECRTF). 2004.
-
(2004)
On Technology High-End Computing Revitalization Task Force
-
-
-
296
-
-
33645982477
-
-
Technical Report ZHR-R-0304, Dresden University of Technology, Center for High-Performance Computing, Nov
-
S. Seidl. VTF3 -A fast Vampir trace file low-level management library. Technical Report ZHR-R-0304, Dresden University of Technology, Center for High-Performance Computing, Nov 2003.
-
(2003)
VTF3 -A fast Vampir trace file low-level management library
-
-
Seidl, S.1
-
299
-
-
38049043035
-
-
Springer
-
S. Shende, A. Malony, and A. Morris. Optimization of Instrumentation in Parallel Performance Evaluation Tools, volume 4699 of LNCS, pages 440-449. Springer, 2008.
-
(2008)
Optimization of Instrumentation in Parallel Performance Evaluation Tools, volume 4699 of LNCS
, pp. 440-449
-
-
Shende, S.1
Malony, A.2
Morris, A.3
-
301
-
-
0031635137
-
Portable Profiling and Tracing for Parallel Scientific Applications using C++
-
S. Shende, A.D. Malony, J. Cuny, K. Lindlan, P. Beckman, and S. Karmesin. Portable Profiling and Tracing for Parallel Scientific Applications using C++. In Proceedings of the SIGMETRICS Symposium onParallel and Distributed Tools, SPDT’98, pages 134-145, 1998.
-
(1998)
Proceedings of the SIGMETRICS Symposium onParallel and Distributed Tools, SPDT’98
, pp. 134-145
-
-
Shende, S.1
Malony, A.D.2
Cuny, J.3
Lindlan, K.4
Beckman, P.5
Karmesin, S.6
-
302
-
-
84947296432
-
A Performance Interface for Component-Based Applications
-
S. Shende, A.D. Malony, C. Rasmussen, and M. Sottile. A Performance Interface for Component-Based Applications. In Proceedings of International Workshop on Performance Modeling, Evaluation and Optimization, International Parallel and Distributed Processing Symposium, 2003.
-
(2003)
Proceedings of International Workshop on Performance Modeling, Evaluation and Optimization, International Parallel and Distributed Processing Symposium
-
-
Shende, S.1
Malony, A.D.2
Rasmussen, C.3
Sottile, M.4
-
303
-
-
85054444397
-
Autotuning and specialization: Speeding up Nek5000 with compiler technology
-
June
-
J. Shin, M.W. Hall, J. Chame, C. Chen, P. Fischer, and P.D. Hovland. Autotuning and specialization: Speeding up Nek5000 with compiler technology. In Proceedings of the International Conference on Supercomputing, June 2010.
-
(2010)
Proceedings of the International Conference on Supercomputing
-
-
Shin, J.1
Hall, M.W.2
Chame, J.3
Chen, C.4
Fischer, P.5
Hovland, P.D.6
-
304
-
-
79958257802
-
Autotuning and specialization: Speeding up matrix multiply for small matrices with compiler technology
-
October
-
J. Shin, M.W. Hall, J. Chame, C. Chen, and P.D. Hovland. Autotuning and specialization: Speeding up matrix multiply for small matrices with compiler technology. In The Fourth International Workshop on Automatic Performance Tuning, October 2009.
-
(2009)
The Fourth International Workshop on Automatic Performance Tuning
-
-
Shin, J.1
Hall, M.W.2
Chame, J.3
Chen, C.4
Hovland, P.D.5
-
306
-
-
35948986416
-
Predicting parallel application performance via machine learning approaches
-
K. Singh, E. Ipek, S.A. McKee, B.R. de Supinski, M. Schulz, and R. Caruana. Predicting parallel application performance via machine learning approaches. Concurrency And Computation: Practice and Experience, 19(17): 2219-2235, 2007.
-
(2007)
Concurrency And Computation: Practice and Experience
, vol.19
, Issue.17
, pp. 2219-2235
-
-
Singh, K.1
Ipek, E.2
McKee, S.A.3
De Supinski, B.R.4
Schulz, M.5
Caruana, R.6
-
308
-
-
22944475131
-
-
Morgan Kaufmann Publishers Inc., San Francisco, CA
-
A. Sloss, D. Symes, and C. Wright. ARM System Developer’s Guide: Designing and Optimizing System Software. Morgan Kaufmann Publishers Inc., San Francisco, CA, 2004.
-
(2004)
ARM System Developer’s Guide: Designing and Optimizing System Software
-
-
Sloss, A.1
Symes, D.2
Wright, C.3
-
309
-
-
0017949328
-
A comparative study of set associative memory mapping algorithms and their use for cache and main memory
-
A.J. Smith. A comparative study of set associative memory mapping algorithms and their use for cache and main memory. IEEE Transactions on Software Engineering, (2): 121-130.
-
IEEE Transactions on Software Engineering
, Issue.2
, pp. 121-130
-
-
Smith, A.J.1
-
310
-
-
44049110107
-
Parallel ocean general circulation modeling
-
R.D. Smith, J.K. Dukowicz, and R.C. Malone. Parallel ocean general circulation modeling. Phys. D, 60(1-4): 38-61, 1992.
-
(1992)
Phys. D
, vol.60
, Issue.1-4
, pp. 38-61
-
-
Smith, R.D.1
Dukowicz, J.K.2
Malone, R.C.3
-
311
-
-
0242505770
-
A framework for application performance modeling and prediction
-
A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia, and A. Purkayastha. A framework for application performance modeling and prediction. In Proceedings of ACM/IEEE Conference on Supercomputing (SC02), 2002.
-
(2002)
Proceedings of ACM/IEEE Conference on Supercomputing (SC02)
-
-
Snavely, A.1
Carrington, L.2
Wolter, N.3
Labarta, J.4
Badia, R.5
Purkayastha, A.6
-
312
-
-
33845456969
-
Performance modeling of HPC applications
-
October
-
A. Snavely, X. Gao, C. Lee, N. Wolter, J. Labarta, J. Gimenez, and P. Jones. Performance modeling of HPC applications. Proceedings of the Parallel Computing Conference 2003, October 2003.
-
(2003)
Proceedings of the Parallel Computing Conference 2003
-
-
Snavely, A.1
Gao, X.2
Lee, C.3
Wolter, N.4
Labarta, J.5
Gimenez, J.6
Jones, P.7
-
313
-
-
10044276950
-
An Algebra for Cross-Experiment Performance Analysis
-
August
-
F. Song, F. Wolf, N. Bhatia, J. Dongarra, and S. Moore. An Algebra for Cross-Experiment Performance Analysis. In Proceedings of International Conference on Parallel Processing (ICPP-04), August 2004.
-
(2004)
Proceedings of International Conference on Parallel Processing (ICPP-04)
-
-
Song, F.1
Wolf, F.2
Bhatia, N.3
Dongarra, J.4
Moore, S.5
-
314
-
-
85054454559
-
-
SPIRAL project. http://www.spiral.net
-
-
-
-
315
-
-
0036652569
-
Pentium 4 performance-monitoring features
-
B. Sprunt. Pentium 4 performance-monitoring features. IEEE Micro, 22(4): 72-82, 2002.
-
(2002)
IEEE Micro
, vol.22
, Issue.4
, pp. 72-82
-
-
Sprunt, B.1
-
317
-
-
85054430208
-
-
STREAM: Sustainable memory bandwidth in high performance computers. http://www.cs.virginia.edu/stream
-
-
-
-
321
-
-
85054452679
-
-
Sun Microsystems. Sun Studio Performance Analyzer. http://developers.sun.com/sunstudio/overview/topics/analyzing.jsp 2009.
-
(2009)
Sun Studio Performance Analyzer
-
-
-
322
-
-
33845443250
-
Parallel Parameter Tuning for Applications with Performance Variability
-
Washington, DC, IEEE Computer Society
-
V. Tabatabaee, A. Tiwari, and J.K. Hollingsworth. Parallel Parameter Tuning for Applications with Performance Variability. In SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, page 57, Washington, DC, 2005. IEEE Computer Society.
-
(2005)
SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing
, pp. 57
-
-
Tabatabaee, V.1
Tiwari, A.2
Hollingsworth, J.K.3
-
324
-
-
74049095154
-
Diagnosing performance bottlenecks in emerging petascale applications
-
New York, NY, USA, ACM
-
N. Tallent, J. Mellor-Crummey, L. Adhianto, M. Fagan, and M. Krentel. Diagnosing performance bottlenecks in emerging petascale applications. In Proceedings of ACM/IEEE Conference on Supercomputing (SC09), pages 1-11, New York, NY, USA, 2009. ACM.
-
(2009)
Proceedings of ACM/IEEE Conference on Supercomputing (SC09)
, pp. 1-11
-
-
Tallent, N.1
Mellor-Crummey, J.2
Adhianto, L.3
Fagan, M.4
Krentel, M.5
-
325
-
-
78650837195
-
Scalable identification of load imbalance in parallel executions using call path profiles
-
New York, NY, November, ACM
-
N.R. Tallent, L. Adhianto, and J. Mellor-Crummey. Scalable identification of load imbalance in parallel executions using call path profiles. In Proceedings of ACM/IEEE Conference on Supercomputing (SC10), New York, NY, November 2010. ACM.
-
(2010)
Proceedings of ACM/IEEE Conference on Supercomputing (SC10)
-
-
Tallent, N.R.1
Adhianto, L.2
Mellor-Crummey, J.3
-
326
-
-
67650034867
-
Effective performance measurement and analysis of multithreaded applications
-
New York, NY, USA, ACM
-
N.R. Tallent and J. Mellor-Crummey. Effective performance measurement and analysis of multithreaded applications. In Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 229-240, New York, NY, USA, 2009. ACM.
-
(2009)
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 229-240
-
-
Tallent, N.R.1
Mellor-Crummey, J.2
-
327
-
-
67650837951
-
Binary analysis for measurement and attribution of program performance
-
New York, NY, USA, ACM
-
N.R. Tallent, J. Mellor-Crummey, and M.W. Fagan. Binary analysis for measurement and attribution of program performance. In Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 441-452, New York, NY, USA, 2009. ACM.
-
(2009)
Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 441-452
-
-
Tallent, N.R.1
Mellor-Crummey, J.2
Fagan, M.W.3
-
329
-
-
77957022710
-
-
Technical Report CCT-TR-2008-5, Louisiana State University
-
J. Tao, G. Allen, I. Hinder, E. Schnetter, and Y. Zlochower. XiRel: Standard benchmarks for numerical relativity codes using Cactus and Carpet. Technical Report CCT-TR-2008-5, Louisiana State University, 2008.
-
(2008)
XiRel: Standard benchmarks for numerical relativity codes using Cactus and Carpet
-
-
Tao, J.1
Allen, G.2
Hinder, I.3
Schnetter, E.4
Zlochower, Y.5
-
330
-
-
23944471086
-
Prophesy: An infrastructure for performance analysis and modeling of parallel and grid applications
-
V. Taylor, X. Wu, and R. Stevens. Prophesy: An infrastructure for performance analysis and modeling of parallel and grid applications. SIGMETRICS Perform. Eval. Rev., 30(4): 13-18, 2003.
-
(2003)
SIGMETRICS Perform. Eval. Rev.
, vol.30
, Issue.4
, pp. 13-18
-
-
Taylor, V.1
Wu, X.2
Stevens, R.3
-
331
-
-
85054432382
-
-
The Parallel Ocean Program. http://climate.lanl.gov/Models/POP
-
-
-
-
334
-
-
0002862950
-
Gravitational Radiation -a New Window Onto the Universe. (Karl Schwarzschild Lecture 1996)
-
K.S. Thorne. Gravitational Radiation -a New Window Onto the Universe. (Karl Schwarzschild Lecture 1996). Reviews of Modern Astronomy, 10: 1-28, 1997.
-
(1997)
Reviews of Modern Astronomy
, vol.10
, pp. 1-28
-
-
Thorne, K.S.1
-
336
-
-
0034375534
-
The accuracy, consistency, and speed of an electronpositron equation of state based on table interpolation of the helmholtz free energy
-
F.X. Timmes and F.D. Swesty. The accuracy, consistency, and speed of an electronpositron equation of state based on table interpolation of the helmholtz free energy. Astrophysical Journal, Supplement, 126: 501-516, 2000.
-
(2000)
Astrophysical Journal, Supplement
, vol.126
, pp. 501-516
-
-
Timmes, F.X.1
Swesty, F.D.2
-
337
-
-
70449844310
-
A scalable autotuning framework for compiler optimization
-
April
-
A. Tiwari, C. Chen, J. Chame, M. Hall, and J.K. Hollingsworth. A scalable autotuning framework for compiler optimization. In Proceedings of the 24th International Parallel and Distributed Processing Symposium, April 2009.
-
(2009)
Proceedings of the 24th International Parallel and Distributed Processing Symposium
-
-
Tiwari, A.1
Chen, C.2
Chame, J.3
Hall, M.4
Hollingsworth, J.K.5
-
340
-
-
85054446997
-
-
University of Oregon. TAU Portal. http://tau.nic.uoregon.edu
-
TAU Portal
-
-
-
341
-
-
0036036949
-
Dynamic statistical profiling of communication activity in distributed applications
-
New York, NY, USA, ACM
-
J. Vetter. Dynamic statistical profiling of communication activity in distributed applications. In Proceedings of the 2002 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, pages 240-250, New York, NY, USA, 2002. ACM.
-
(2002)
Proceedings of the 2002 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
, pp. 240-250
-
-
Vetter, J.1
-
344
-
-
70350771131
-
Benchmarking GPUs to tune dense linear algebra
-
IEEE, to appear
-
V. Volkov and J. Demmel. Benchmarking GPUs to tune dense linear algebra. In Supercomputing 08. IEEE, 2008. to appear.
-
(2008)
Supercomputing 08
-
-
Volkov, V.1
Demmel, J.2
-
348
-
-
84990706303
-
Parallelizing the spectral transform method. Part II
-
October
-
D.W. Walker, P.H. Worley, and J.B. Drake. Parallelizing the spectral transform method. Part II. Concurrency: Practice and Experience, 4(7): 509-531, October 1992.
-
(1992)
Concurrency: Practice and Experience
, vol.4
, Issue.7
, pp. 509-531
-
-
Walker, D.W.1
Worley, P.H.2
Drake, J.B.3
-
350
-
-
70449505546
-
Linearly scaling 3D fragment method for large-scale electronic structure calculations
-
L.-W. Wang, B. Lee, H. Shan, Z. Zhao, J. Meza, E. Strohmaier, and D. Bailey. Linearly scaling 3D fragment method for large-scale electronic structure calculations. Proceedings of ACM/IEEE Conference on Supercomputing (SC08), 2008.
-
(2008)
Proceedings of ACM/IEEE Conference on Supercomputing (SC08)
-
-
Wang, L.-W.1
Lee, B.2
Shan, H.3
Zhao, Z.4
Meza, J.5
Strohmaier, E.6
Bailey, D.7
-
351
-
-
42749103540
-
First-principles thousand-atoms quantum dot calculations
-
L.-W. Wang and J. Li. First-principles thousand-atoms quantum dot calculations. Physical Review B, 69: 153302, 2004.
-
(2004)
Physical Review B
, vol.69
, pp. 153302
-
-
Wang, L.-W.1
Li, J.2
-
352
-
-
42049097273
-
Linear scaling three-dimensional fragment method for large-scale electronic structure calculations
-
L.-W.Wang, Z. Zhao, and J. Meza. Linear scaling three-dimensional fragment method for large-scale electronic structure calculations. Physical Review B, 77: 165113, 2008.
-
(2008)
Physical Review B
, vol.77
, pp. 165113
-
-
Wang, L.-W.1
Zhao, Z.2
Meza, J.3
-
353
-
-
0001604458
-
Solving Schrodinger’s equation around a desired energy: Application to silicon quantum dots
-
L.-W. Wang and A. Zunger. Solving Schrodinger’s equation around a desired energy: Application to silicon quantum dots. Journal of Chemical Physics, 100: 2394, 1994.
-
(1994)
Journal of Chemical Physics
, vol.100
, pp. 2394
-
-
Wang, L.-W.1
Zunger, A.2
-
357
-
-
33845417137
-
Quantifying locality in the memory access patterns of HPC applications
-
Nov
-
J. Weinberg, M.O. McCracken, E. Strohmaier, and A. Snavely. Quantifying locality in the memory access patterns of HPC applications. Proceedings of ACM/IEEE Conference on Supercomputing (SC05), pages 50-61, Nov. 2005.
-
(2005)
Proceedings of ACM/IEEE Conference on Supercomputing (SC05)
, pp. 50-61
-
-
Weinberg, J.1
McCracken, M.O.2
Strohmaier, E.3
Snavely, A.4
-
361
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
R.C. Whaley, A. Petitet, and J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2): 3-35, 2001.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.3
-
363
-
-
51049106193
-
Lattice Boltzmann simulation optimization on leading multicore platforms
-
Miami, FL
-
S. Williams, J. Carter, L. Oliker, J. Shalf, and K. Yelick. Lattice Boltzmann simulation optimization on leading multicore platforms. In Interational Conference on Parallel and Distributed Computing Systems (IPDPS), Miami, FL, 2008.
-
(2008)
Interational Conference on Parallel and Distributed Computing Systems (IPDPS)
-
-
Williams, S.1
Carter, J.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
364
-
-
67650998701
-
Lattice Boltzmann simulation optimization on leading multicore platforms
-
S. Williams, J. Carter, L. Oliker, J. Shalf, and K. Yelick. Lattice Boltzmann simulation optimization on leading multicore platforms. Journal of Parallel and Distributed Computing, 69(9): 762-777, 2009.
-
(2009)
Journal of Parallel and Distributed Computing
, vol.69
, Issue.9
, pp. 762-777
-
-
Williams, S.1
Carter, J.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
365
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In Proceedings of ACM/IEEE Conference on Supercomputing (SC07), 2007.
-
(2007)
Proceedings of ACM/IEEE Conference on Supercomputing (SC07)
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
366
-
-
60949098907
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing -Special Issue on Revolutionary Technologies for Acceleration of Emerging Petascale Applications, 35(3): 178-194, 2008.
-
(2008)
Parallel Computing -Special Issue on Revolutionary Technologies for Acceleration of Emerging Petascale Applications
, vol.35
, Issue.3
, pp. 178-194
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
367
-
-
68949198052
-
The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures
-
August
-
S. Williams, D. Patterson, L. Oliker, J. Shalf, and K. Yelick. The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures. In IEEE HotChips Symposium on High-Performance Chips (HotChips 2008), August 2008.
-
(2008)
IEEE HotChips Symposium on High-Performance Chips (HotChips 2008)
-
-
Williams, S.1
Patterson, D.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
368
-
-
67650797544
-
Roofline: An insightful visual performance model for floating-point programs and multicore architectures
-
April
-
S. Williams, A. Watterman, and D. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Communications of the ACM, April 2009.
-
(2009)
Communications of the ACM
-
-
Williams, S.1
Watterman, A.2
Patterson, D.3
-
369
-
-
0004039521
-
-
NCAR Tech. Note NCAR/TN-210+STR, NTIS PB83 231068, National Center for Atmospheric Research, Boulder, Colo
-
D. L. Williamson. Description of NCAR Community Climate Model (CCM0B). NCAR Tech. Note NCAR/TN-210+STR, NTIS PB83 231068, National Center for Atmospheric Research, Boulder, Colo., 1983.
-
(1983)
Description of NCAR Community Climate Model (CCM0B)
-
-
Williamson, D.L.1
-
370
-
-
0004039521
-
-
NCAR Tech. Note NCAR/TN-285+STR, NTIS PB87-203782/AS, June
-
D.L. Williamson, J.T. Kiehl, V. Ramanathan, R.E. Dickinson, and J.J. Hack. Description of NCAR community climate model (CCM1). NCAR Tech. Note NCAR/TN-285+STR, NTIS PB87-203782/AS, June 1987.
-
(1987)
Description of NCAR community climate model (CCM1)
-
-
Williamson, D.L.1
Kiehl, J.T.2
Ramanathan, V.3
Dickinson, R.E.4
Hack, J.J.5
-
372
-
-
33646137721
-
Efficient pattern search in large traces through successive refinement
-
Springer
-
F. Wolf, B. Mohr, J. Dongarra, and S. Moore. Efficient pattern search in large traces through successive refinement. In Proceedings of the European Conference on Parallel Computing (EuroPar 2004, LNCS 3149), pages 47-54. Springer, 2004.
-
(2004)
Proceedings of the European Conference on Parallel Computing (EuroPar 2004, LNCS 3149)
, pp. 47-54
-
-
Wolf, F.1
Mohr, B.2
Dongarra, J.3
Moore, S.4
-
373
-
-
84885411868
-
Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications
-
Stuttgart, Germany, July, Springer. 978-3-540-68561-6
-
F. Wolf, B. Wylie, E. Ábrahám, D. Becker, W. Frings, K. Fürlinger, M. Geimer, M. Hermanns, B. Mohr, S. Moore, M. Pfeifer, and Z. Szebenyi. Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications. In Proceedings of the 2nd HLRS Parallel Tools Workshop, pages 157-167, Stuttgart, Germany, July 2008. Springer. ISBN 978-3-540-68561-6.
-
(2008)
Proceedings of the 2nd HLRS Parallel Tools Workshop
, pp. 157-167
-
-
Wolf, F.1
Wylie, B.2
Ábrahám, E.3
Becker, D.4
Frings, W.5
Fürlinger, K.6
Geimer, M.7
Hermanns, M.8
Mohr, B.9
Moore, S.10
Pfeifer, M.11
Szebenyi, Z.12
-
374
-
-
85054458024
-
Performance of the Community Atmosphere Model on the Cray X1E and XT3
-
R. Winget and K. Winget, editor, Eagan, MN, Cray User Group, Inc
-
P. Worley. Performance of the Community Atmosphere Model on the Cray X1E and XT3. In R. Winget and K. Winget, editor, Proceedings of the 48th Cray User Group Conference, May 8-11, 2006, Eagan, MN, 2006. Cray User Group, Inc.
-
(2006)
Proceedings of the 48th Cray User Group Conference, May 8-11, 2006
-
-
Worley, P.1
-
375
-
-
85054459662
-
-
June, Poster Presentation at the 13th Annual CCSM Workshop, June 17-19, 2008, Breckenridge, CO
-
P. Worley and A. Mirin. Performance Results for the new CAM Benchmark Suite, June 2008. Poster Presentation at the 13th Annual CCSM Workshop, June 17-19, 2008, Breckenridge, CO.
-
(2008)
Performance Results for the new CAM Benchmark Suite
-
-
Worley, P.1
Mirin, A.2
-
376
-
-
33749065293
-
Performance engineering in the community atmosphere model
-
P. Worley, A. Mirin, J. Drake, and W. Sawyer. Performance engineering in the community atmosphere model. Journal of Physics: Conference Series, 46: 356-362, 2006. doi: 10.1088/1742-6596/46/1/050
-
(2006)
Journal of Physics: Conference Series
, vol.46
, pp. 356-362
-
-
Worley, P.1
Mirin, A.2
Drake, J.3
Sawyer, W.4
-
377
-
-
0346941076
-
MPI performance evaluation and characterization using a compact application benchmark code
-
IEEE Computer Society Press, Los Alamitos, CA
-
P.H. Worley. MPI performance evaluation and characterization using a compact application benchmark code. In Proceedings of the Second MPI Developers Conference and Users’ Meeting, pages 170-177. IEEE Computer Society Press, Los Alamitos, CA, 1996.
-
(1996)
Proceedings of the Second MPI Developers Conference and Users’ Meeting
, pp. 170-177
-
-
Worley, P.H.1
-
379
-
-
84883330114
-
Benchmarking using the Community Atmosphere Model
-
Warrenton, VA, The Standard Performance Evaluation Corp
-
P.H. Worley. Benchmarking using the Community Atmosphere Model. In Proceedings of the 2006 SPEC Benchmark Workshop, January 23, 2006, Warrenton, VA, 2006. The Standard Performance Evaluation Corp.
-
(2006)
Proceedings of the 2006 SPEC Benchmark Workshop, January 23, 2006
-
-
Worley, P.H.1
-
381
-
-
23844503894
-
Performance portability in the physical parameterizations of the Community Atmosphere Model
-
August
-
P.H. Worley and J.B. Drake. Performance portability in the physical parameterizations of the Community Atmosphere Model. International Journal of High Performance Computing Applications, 19(3): 1-15, August 2005.
-
(2005)
International Journal of High Performance Computing Applications
, vol.19
, Issue.3
, pp. 1-15
-
-
Worley, P.H.1
Drake, J.B.2
-
382
-
-
0028565417
-
Parallel spectral transform shallow water model: A runtime-tunable parallel benchmark code
-
J. J. Dongarra and D. W. Walker, editors, IEEE Computer Society Press, Los Alamitos, CA
-
P.H. Worley and I.T. Foster. Parallel spectral transform shallow water model: a runtime-tunable parallel benchmark code. In J. J. Dongarra and D. W. Walker, editors, Proceedings of the Scalable High Performance Computing Conference, pages 207-214. IEEE Computer Society Press, Los Alamitos, CA, 1994.
-
(1994)
Proceedings of the Scalable High Performance Computing Conference
, pp. 207-214
-
-
Worley, P.H.1
Foster, I.T.2
-
383
-
-
0002372482
-
Algorithm comparison and benchmarking using a parallel spectral transform shallow water model
-
G.-R. Hoffman and N. Kreitz, editors, World Scientific Publishing Co. Pte. Ltd., Singapore
-
P.H. Worley, I.T. Foster, and B. Toonen. Algorithm comparison and benchmarking using a parallel spectral transform shallow water model. In G.-R. Hoffman and N. Kreitz, editors, Coming of Age: Proceedings of the Sixth ECMWF Workshop on Use of Parallel Processors in Meteorology, pages 277-289. World Scientific Publishing Co. Pte. Ltd., Singapore, 1995.
-
(1995)
Coming of Age: Proceedings of the Sixth ECMWF Workshop on Use of Parallel Processors in Meteorology
, pp. 277-289
-
-
Worley, P.H.1
Foster, I.T.2
Toonen, B.3
-
384
-
-
25144441529
-
The performance evolution of the Parallel Ocean Program on the Cray X1
-
R. Winget and K. Winget, editor, Eagan, MN, ray User Group, Inc
-
P.H. Worley and J. Levesque. The performance evolution of the Parallel Ocean Program on the Cray X1. In R. Winget and K. Winget, editor, Proceedings of the 46th Cray User Group Conference, May 17-21, 2004, Eagan, MN, 2004. Cray User Group, Inc.
-
(2004)
Proceedings of the 46th Cray User Group Conference, May 17-21, 2004
-
-
Worley, P.H.1
Levesque, J.2
-
385
-
-
85052019260
-
From trace generation to visualization: A performance framework for distributed parallel systems
-
November
-
C.E. Wu, A. Bolmarcich, M. Snir, D. Wootton, F. Parpia, A. Chan, E. Lusk, and W. Gropp. From trace generation to visualization: A performance framework for distributed parallel systems. In Proceedings of ACM/IEEE Conference on Supercomputing (SC00), November 2000.
-
(2000)
Proceedings of ACM/IEEE Conference on Supercomputing (SC00)
-
-
Wu, C.E.1
Bolmarcich, A.2
Snir, M.3
Wootton, D.4
Parpia, F.5
Chan, A.6
Lusk, E.7
Gropp, W.8
-
387
-
-
34548765138
-
POET: Parameterized optimizations for empirical tuning
-
March
-
Q. Yi, K. Seymour, H. You, R. Vuduc, and D. Quinlan. POET: parameterized optimizations for empirical tuning. In Proceedings of the 21st International Parallel and Distributed Processing Symposium, March 2007.
-
(2007)
Proceedings of the 21st International Parallel and Distributed Processing Symposium
-
-
Yi, Q.1
Seymour, K.2
You, H.3
Vuduc, R.4
Quinlan, D.5
-
388
-
-
20744459570
-
Is search really necessary to generate high-performance BLAS?
-
K. Yotov, X. Li, G. Ren, M.J. Garzarán, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proceedings of the IEEE, 93(2): 358-386, 2005.
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
, pp. 358-386
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzarán, M.J.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
-
389
-
-
0942279071
-
Diluted ii-vi oxide semiconductors with multiple band gaps
-
K.M. Yu, W. Walukiewicz, J. Wu, W. Shan, J.W. Beeman, M.A. Scarpulla, O.D. Dubon, and P. Becta. Diluted ii-vi oxide semiconductors with multiple band gaps. Physical Review Letters, 91: 246403, 2003.
-
(2003)
Physical Review Letters
, vol.91
, pp. 246403
-
-
Yu, K.M.1
Walukiewicz, W.2
Wu, J.3
Shan, W.4
Beeman, J.W.5
Scarpulla, M.A.6
Dubon, O.D.7
Becta, P.8
-
390
-
-
47249088157
-
A divide and conquer linear scaling three dimensional fragment method for large scale electronic structure calculations
-
Z. Zhao, J. Meza, and L.-W. Wang. A divide and conquer linear scaling three dimensional fragment method for large scale electronic structure calculations. Journal of Physics: Condensed Matter, 20(294203), 2008.
-
(2008)
Journal of Physics: Condensed Matter
, vol.20
-
-
Zhao, Z.1
Meza, J.2
Wang, L.-W.3
-
392
-
-
44649089660
-
Multipatch methods in general relativistic astrophysics -hydrodynamical flows on fixed backgrounds
-
B. Zink, E. Schnetter, and M. Tiglio. Multipatch methods in general relativistic astrophysics -hydrodynamical flows on fixed backgrounds. Physical Review D, 77: 103015, 2008.
-
(2008)
Physical Review D
, vol.77
, pp. 103015
-
-
Zink, B.1
Schnetter, E.2
Tiglio, M.3
|