-
1
-
-
60449097203
-
The design of OpenMP tasks
-
March
-
E. Ayguade, N. Copty, A. Duran, J. Hoeflinger, Y. Lin, F. Massaioli, X. Teruel, P. Unnikrishnan, and G. Zhang. The design of OpenMP tasks. IEEE Transactions on Parallel and Distributed Systems, 20(3):404-418, March 2009.
-
(2009)
IEEE Transactions on Parallel and Distributed Systems
, vol.20
, Issue.3
, pp. 404-418
-
-
Ayguade, E.1
Copty, N.2
Duran, A.3
Hoeflinger, J.4
Lin, Y.5
Massaioli, F.6
Teruel, X.7
Unnikrishnan, P.8
Zhang, G.9
-
2
-
-
0003660984
-
-
Argonne National Laboratory
-
S. Balay, K. Buschelman, W. D. Gropp, D. Kaushik, M. Knepley, L. C. McInnes, B. F. Smith, and H. Zhang. PETSc Users Manual. Argonne National Laboratory, 2010.
-
(2010)
PETSc Users Manual
-
-
Balay, S.1
Buschelman, K.2
Gropp, W.D.3
Kaushik, D.4
Knepley, M.5
McInnes, L.C.6
Smith, B.F.7
Zhang, H.8
-
3
-
-
83155160985
-
Petaflop biofluidics simulations on a two million-core system
-
New York, NY, USA, ACM
-
M. Bernaschi, M. Bisson, T. Endo, S. Matsuoka, M. Fatica, and S. Melchionna. Petaflop biofluidics simulations on a two million-core system. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11), pages 4:1-4:12, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11)
-
-
Bernaschi, M.1
Bisson, M.2
Endo, T.3
Matsuoka, S.4
Fatica, M.5
Melchionna, S.6
-
4
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
DOI 10.1177/109434200001400303
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. Int. J. High Perform. Comput. Appl., 14(3):189-204, Aug. 2000. (Pubitemid 32025040)
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
5
-
-
70350771127
-
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
-
Piscataway, NJ, USA, IEEE Press
-
K. Datta, M. Murphy, V. Volkov, S. Williams, J. Carter, L. Oliker, D. Patterson, J. Shalf, and K. Yelick. Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures. In Proceedings of the 2008 ACM/IEEE conference on Supercomputing (SC '08), pages 4:1-4:12, Piscataway, NJ, USA, 2008. IEEE Press.
-
(2008)
Proceedings of the 2008 ACM/IEEE Conference on Supercomputing (SC '08)
-
-
Datta, K.1
Murphy, M.2
Volkov, V.3
Williams, S.4
Carter, J.5
Oliker, L.6
Patterson, D.7
Shalf, J.8
Yelick, K.9
-
6
-
-
0035273564
-
Strong stability-preserving high-order time discretization methods
-
DOI 10.1137/S003614450036757X, PII S003614450036757X
-
S. Gottlieb, C.-W. Shu, and E. Tadmor. Strong stability-preserving high-order time discretization methods. SIAM Review, 43(1):89-112, 2001. (Pubitemid 32406893)
-
(2001)
SIAM Review
, vol.43
, Issue.1
, pp. 89-112
-
-
Gottlieb, S.1
Shu, C.-W.2
Tadmor, E.3
-
7
-
-
78650835532
-
190 TFlops astrophysical N-body simulation on a cluster of GPUs
-
Washington, DC, USA, IEEE Computer Society
-
T. Hamada and K. Nitadori. 190 TFlops astrophysical N-body simulation on a cluster of GPUs. In Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC '10), pages 1-9, Washington, DC, USA, 2010. IEEE Computer Society.
-
(2010)
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC '10)
, pp. 1-9
-
-
Hamada, T.1
Nitadori, K.2
-
8
-
-
74049152899
-
42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence
-
New York, NY, USA, ACM
-
T. Hamada, T. Narumi, R. Yokota, K. Yasuoka, K. Nitadori, and M. Taiji. 42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC '09), pages 62:1-62:12, New York, NY, USA, 2009. ACM.
-
(2009)
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC '09)
-
-
Hamada, T.1
Narumi, T.2
Yokota, R.3
Yasuoka, K.4
Nitadori, K.5
Taiji, M.6
-
10
-
-
80055013146
-
Experience applying Fortran GPU compilers to numerical weather prediction
-
T. Henderson, J. Middlecoff, J. Rosinski, M. Govett, and P. Madden. Experience applying Fortran GPU compilers to numerical weather prediction. In Proceedings of 2011 Symposium on Application Accelerators in High Performance Computing (SAAHPC 2011), pages 34-41, 2011.
-
(2011)
Proceedings of 2011 Symposium on Application Accelerators in High Performance Computing (SAAHPC 2011)
, pp. 34-41
-
-
Henderson, T.1
Middlecoff, J.2
Rosinski, J.3
Govett, M.4
Madden, P.5
-
11
-
-
83155160941
-
Scalable fast multipole methods on distributed heterogeneous architectures
-
New York, NY, USA, ACM
-
Q. Hu, N. A. Gumerov, and R. Duraiswami. Scalable fast multipole methods on distributed heterogeneous architectures. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11), pages 36:1-36:12, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11)
-
-
Hu, Q.1
Gumerov, N.A.2
Duraiswami, R.3
-
12
-
-
0001178530
-
Spectral transform solutions to the shallow water test set
-
R. Jakob-Chien, J. J. Hack, and D. L. Williamson. Spectral transform solutions to the shallow water test set. J. Comput. Phys., 119:164-187, 1995.
-
(1995)
J. Comput. Phys.
, vol.119
, pp. 164-187
-
-
Jakob-Chien, R.1
Hack, J.J.2
Williamson, D.L.3
-
13
-
-
12344263563
-
Yin-Yang grid: An overset grid in spherical geometry
-
A. Kageyama and T. Sato. Yin-Yang grid: An overset grid in spherical geometry. Geochem. Geophys. Geosyst., 5, 2004.
-
(2004)
Geochem. Geophys. Geosyst.
, vol.5
-
-
Kageyama, A.1
Sato, T.2
-
16
-
-
37249063076
-
A Madden-Julian oscillation event realistically simulated by a global cloud-resolving model
-
DOI 10.1126/science.1148443
-
H. Miura, M. Satoh, T. Nasuno, A. T. Noda, and K. Oouchi. A Madden-Julian Oscillation event realistically simulated by a global cloud-resolving model. Science, 318:1763-1765, 2007. (Pubitemid 350274383)
-
(2007)
Science
, vol.318
, Issue.5857
, pp. 1763-1765
-
-
Miura, H.1
Satoh, M.2
Nasuno, T.3
Noda, A.T.4
Oouchi, K.5
-
17
-
-
48749145864
-
Upwind schemes and boundary conditions with applications to Euler equations in general geometries
-
S. Osher and S. Chakravarthy. Upwind schemes and boundary conditions with applications to Euler equations in general geometries. J. Comput. Phys., 50:447-481, 1983.
-
(1983)
J. Comput. Phys.
, vol.50
, pp. 447-481
-
-
Osher, S.1
Chakravarthy, S.2
-
19
-
-
80052235743
-
Cloud-system resolving simulations with the NASA Goddard Earth Observing System global atmospheric model (GEOS-5)
-
W. M. Putman and M. Suarez. Cloud-system resolving simulations with the NASA Goddard Earth Observing System global atmospheric model (GEOS-5). Geophys. Res. Lett., 38, 2011.
-
(2011)
Geophys. Res. Lett.
, vol.38
-
-
Putman, W.M.1
Suarez, M.2
-
20
-
-
0030096121
-
The "cubed sphere": A new method for the solution of partial differential equations in spherical geometry
-
DOI 10.1006/jcph.1996.0047
-
C. Ronchi, R. Iacono, and P. S. Paolucci. The "cubed sphere": A new method for the solution of partial differential equations in spherical geometry. J. Comput. Phys., 124:93-114, 1996. (Pubitemid 126160790)
-
(1996)
Journal of Computational Physics
, vol.124
, Issue.1
, pp. 93-114
-
-
Ronchi, C.1
Iacono, R.2
Paolucci, P.S.3
-
21
-
-
32644450000
-
A wave propagation method for hyperbolic systems on the sphere
-
J. A. Rossmanith. A wave propagation method for hyperbolic systems on the sphere. J. Comput. Phys., 213:629-658, 2006.
-
(2006)
J. Comput. Phys.
, vol.213
, pp. 629-658
-
-
Rossmanith, J.A.1
-
22
-
-
0000644762
-
Conservative finite-difference approximations of the primitive equations on quasi-uniform spherical grids
-
R. Sadourny. Conservative finite-difference approximations of the primitive equations on quasi-uniform spherical grids. Mon. Wea. Rev., 100:136-144, 1972.
-
(1972)
Mon. Wea. Rev.
, vol.100
, pp. 136-144
-
-
Sadourny, R.1
-
23
-
-
0000184292
-
Integration of the nondivergent barotropic vorticity equation with an icosahedral-hexagonal grid for the sphere
-
R. Sadourny, A. Arakawa, and Y. Mintz. Integration of the nondivergent barotropic vorticity equation with an icosahedral-hexagonal grid for the sphere. Mon. Wea. Rev., 96:351-356, 1968.
-
(1968)
Mon. Wea. Rev.
, vol.96
, pp. 351-356
-
-
Sadourny, R.1
Arakawa, A.2
Mintz, Y.3
-
24
-
-
78650819651
-
An 80-fold speedup, 15.0 TFlops full GPU acceleration of non-hydrostatic weather model ASUCA production code
-
Washington, DC, USA, IEEE Computer Society
-
T. Shimokawabe, T. Aoki, C. Muroi, J. Ishida, K. Kawano, T. Endo, A. Nukada, N. Maruyama, and S. Matsuoka. An 80-fold speedup, 15.0 TFlops full GPU acceleration of non-hydrostatic weather model ASUCA production code. In Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC '10), pages 1-11, Washington, DC, USA, 2010. IEEE Computer Society.
-
(2010)
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC '10)
, pp. 1-11
-
-
Shimokawabe, T.1
Aoki, T.2
Muroi, C.3
Ishida, J.4
Kawano, K.5
Endo, T.6
Nukada, A.7
Maruyama, N.8
Matsuoka, S.9
-
25
-
-
79958268442
-
145 TFlops performance on 3990 GPUs of TSUBAME 2.0 supercomputer for an operational weather prediction
-
Proceedings of the International Conference on Computational Science (ICCS 2011)
-
T. Shimokawabe, T. Aoki, J. Ishida, K. Kawano, and C. Muroi. 145 TFlops performance on 3990 GPUs of TSUBAME 2.0 supercomputer for an operational weather prediction. Procedia Computer Science, 4:1535-1544, 2011. Proceedings of the International Conference on Computational Science (ICCS 2011).
-
(2011)
Procedia Computer Science
, vol.4
, pp. 1535-1544
-
-
Shimokawabe, T.1
Aoki, T.2
Ishida, J.3
Kawano, K.4
Muroi, C.5
-
26
-
-
83155190228
-
Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer
-
New York, NY, USA, ACM
-
T. Shimokawabe, T. Aoki, T. Takaki, T. Endo, A. Yamanaka, N. Maruyama, A. Nukada, and S. Matsuoka. Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11), pages 3:1-3:11, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11)
-
-
Shimokawabe, T.1
Aoki, T.2
Takaki, T.3
Endo, T.4
Yamanaka, A.5
Maruyama, N.6
Nukada, A.7
Matsuoka, S.8
-
27
-
-
47149090947
-
A 26.58 Tflops global atmospheric simulation with the spectral transform method on the Earth Simulator
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
S. Shingu, H. Takahara, H. Fuchigami, M. Yamada, Y. Tsuda, W. Ohfuchi, Y. Sasaki, K. Kobayashi, T. Hagiwara, S.-i. Habata, M. Yokokawa, H. Itoh, and K. Otsuka. A 26.58 Tflops global atmospheric simulation with the spectral transform method on the Earth Simulator. In Proceedings of the 2002 ACM/IEEE conference on Supercomputing (SC '02), pages 1-19, Los Alamitos, CA, USA, 2002. IEEE Computer Society Press.
-
(2002)
Proceedings of the 2002 ACM/IEEE Conference on Supercomputing (SC '02)
, pp. 1-19
-
-
Shingu, S.1
Takahara, H.2
Fuchigami, H.3
Yamada, M.4
Tsuda, Y.5
Ohfuchi, W.6
Sasaki, Y.7
Kobayashi, K.8
Hagiwara, T.9
Habata, S.-I.10
Yokokawa, M.11
Itoh, H.12
Otsuka, K.13
-
28
-
-
0000762234
-
Integration of the barotropic vorticity equation on a spherical geodesic grid
-
D. L. Williamson. Integration of the barotropic vorticity equation on a spherical geodesic grid. Tellus, 20:642-653, 1968.
-
(1968)
Tellus
, vol.20
, pp. 642-653
-
-
Williamson, D.L.1
-
29
-
-
0001440358
-
A standard test set for numerical approximations to the shallow water equations in spherical geometry
-
D. L. Williamson, J. B. Drake, J. J. Hack, R. Jakob, and P. N. Swarztrauber. A standard test set for numerical approximations to the shallow water equations in spherical geometry. J. Comput. Phys., 102:211-224, 1992.
-
(1992)
J. Comput. Phys.
, vol.102
, pp. 211-224
-
-
Williamson, D.L.1
Drake, J.B.2
Hack, J.J.3
Jakob, R.4
Swarztrauber, P.N.5
-
30
-
-
84863117956
-
The Tianhe-1A interconnect and message passing services
-
M. Xie, Y. Lu, K. Wang, L. Liu, H. Cao, and X. Yang. The Tianhe-1A interconnect and message passing services. IEEE Micro, 1, 2012.
-
(2012)
IEEE Micro
, pp. 1
-
-
Xie, M.1
Lu, Y.2
Wang, K.3
Liu, L.4
Cao, H.5
Yang, X.6
-
31
-
-
79951514320
-
Parallel multilevel methods for implicit solution of shallow water equations with nonsmooth topography on the cubed-sphere
-
C. Yang and X.-C. Cai. Parallel multilevel methods for implicit solution of shallow water equations with nonsmooth topography on the cubed-sphere. J. Comput. Phys., 230:2523-2539, 2011.
-
(2011)
J. Comput. Phys.
, vol.230
, pp. 2523-2539
-
-
Yang, C.1
Cai, X.-C.2
-
32
-
-
79959969892
-
The Tianhe-1A supercomputer: Its hardware and software
-
X.-J. Yang, X.-K. Liao, K. Lu, Q.-F. Hu, J.-Q. Song, and J.-S. Su. The Tianhe-1A supercomputer: Its hardware and software. J. Comput. Sci. Tech., 26, 2011.
-
(2011)
J. Comput. Sci. Tech.
, vol.26
-
-
Yang, X.-J.1
Liao, X.-K.2
Lu, K.3
Hu, Q.-F.4
Song, J.-Q.5
Su, J.-S.6