-
1
-
-
0025404493
-
Executing a program on the mit tagged-token dataflow architecture
-
March
-
Arvind. Executing a program on the mit tagged-token dataflow architecture. IEEE Transactions on Computers, 39(3):300-318, March 1990.
-
(1990)
IEEE Transactions on Computers
, vol.39
, Issue.3
, pp. 300-318
-
-
Arvind1
-
2
-
-
0021458512
-
Parallel processing with large-grain data flow technique
-
July
-
R. G. Babb, II. Parallel processing with large-grain data flow technique. Computer, 17(7):55-61, July 1984.
-
(1984)
Computer
, vol.17
, Issue.7
, pp. 55-61
-
-
Babb II, R.G.1
-
3
-
-
33847103649
-
Optimizing bandwidth limited problems using one-sided communication and overlap
-
Washington, DC, USA, IEEE Computer Society
-
C. Bell, D. Bonachea, R. Nishtala, and K. Yelick. Optimizing bandwidth limited problems using one-sided communication and overlap. In Proceedings of the 20th international conference on Parallel and distributed processing, IPDPS'06, pages 84-84, Washington, DC, USA, 2006. IEEE Computer Society.
-
(2006)
Proceedings of the 20th International Conference on Parallel and Distributed Processing, IPDPS'06
, pp. 84-84
-
-
Bell, C.1
Bonachea, D.2
Nishtala, R.3
Yelick, K.4
-
4
-
-
0035480276
-
Distributed processing of very large datasets with datacutter
-
Dec
-
M. Beynon, T. M. Kurc, U. V. Catalyurek, C. Chang, A. Sussman, and J. H. Saltz. Distributed processing of very large datasets with datacutter. Par. Comput., pages 1457-1478, Dec 2001.
-
(2001)
Par. Comput.
, pp. 1457-1478
-
-
Beynon, M.1
Kurc, T.M.2
Catalyurek, U.V.3
Chang, C.4
Sussman, A.5
Saltz, J.H.6
-
5
-
-
33847094060
-
-
v1.1. Technical Report CSD-02-1207, University of California, Lawrence Berkeley Laboratory, October
-
D. Bonachea. Gasnet specification, v1.1. Technical Report CSD-02-1207, University of California, Lawrence Berkeley Laboratory, October 2002.
-
(2002)
Gasnet Specification
-
-
Bonachea, D.1
-
6
-
-
67650509253
-
Communication-sensitive static dataflow for parallel message passing applications
-
Washington, DC, USA, IEEE Computer Society
-
G. Bronevetsky. Communication-sensitive static dataflow for parallel message passing applications. In Proceedings of the 7th annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO '09, pages 1-12, Washington, DC, USA, 2009. IEEE Computer Society.
-
(2009)
Proceedings of the 7th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO '09
, pp. 1-12
-
-
Bronevetsky, G.1
-
8
-
-
84877715617
-
Large-scale plane-wave-based density functional theory: Formalism, parallelization, and applications
-
J. R. Reimers, editor, John Wiley and Sons, Inc.
-
E. Bylaska, K. Tsemekhman, N. Govind, and M. Valiev. Large-scale plane-wave-based density functional theory: Formalism, parallelization, and applications. In J. R. Reimers, editor, Computational Methods for Large Systems: Electronic Structure Approaches for Biotechnology and Nanotechnology. John Wiley and Sons, Inc., 2011.
-
(2011)
Computational Methods for Large Systems: Electronic Structure Approaches for Biotechnology and Nanotechnology
-
-
Bylaska, E.1
Tsemekhman, K.2
Govind, N.3
Valiev, M.4
-
11
-
-
84860523127
-
Latency hiding and performance tuning with graph-based execution
-
P. Cicotti and S. B. Baden. Latency hiding and performance tuning with graph-based execution. In The Seventh IEEE eScience Conference, Data-Flow Execution Models for Extreme Scale Computing (DFM 2011), Galveston Island, Texas, 2011.
-
The Seventh IEEE eScience Conference, Data-Flow Execution Models for Extreme Scale Computing (DFM 2011), Galveston Island, Texas, 2011
-
-
Cicotti, P.1
Baden, S.B.2
-
13
-
-
0004116989
-
-
MIT Press, Cambridge, MA, USA, 2nd edition
-
T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to algorithms. MIT Press, Cambridge, MA, USA, 2nd edition, 2001.
-
(2001)
Introduction to Algorithms
-
-
Cormen, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
Stein, C.4
-
14
-
-
33845393854
-
Transformations to parallel codes for communication-computation overlap
-
November
-
A. Danalis, K.-Y. Kim, L. Pollock, and M. Swany. Transformations to parallel codes for communication-computation overlap. In Proceedings of the ACM/IEEE SC 2005 Conference, pages 58-68, November 2005.
-
(2005)
Proceedings of the ACM/IEEE SC 2005 Conference
, pp. 58-68
-
-
Danalis, A.1
Kim, K.-Y.2
Pollock, L.3
Swany, M.4
-
15
-
-
0019079721
-
Data flow supercomputers
-
J. Dennis. Data flow supercomputers. IEEE Computer, 13(11):48-56, 1980.
-
(1980)
IEEE Computer
, vol.13
, Issue.11
, pp. 48-56
-
-
Dennis, J.1
-
16
-
-
82155191689
-
Formal analysis of mpi-based parallel programs
-
Dec.
-
G. Gopalakrishnan, R. M. Kirby, S. Siegel, R. Thakur, W. Gropp, E. Lusk, B. R. De Supinski, M. Schulz, and G. Bronevetsky. Formal analysis of mpi-based parallel programs. Commun. ACM, 54(12):82-91, Dec. 2011.
-
(2011)
Commun. ACM
, vol.54
, Issue.12
, pp. 82-91
-
-
Gopalakrishnan, G.1
Kirby, R.M.2
Siegel, S.3
Thakur, R.4
Gropp, W.5
Lusk, E.6
De Supinski, B.R.7
Schulz, M.8
Bronevetsky, G.9
-
17
-
-
17644426739
-
An annotation language for optimizing software libraries
-
January
-
S. Z. Guyer and C. Lin. An annotation language for optimizing software libraries. ACM SIGPLAN Notices, 35(1):39-52, January 2000.
-
(2000)
ACM SIGPLAN Notices
, vol.35
, Issue.1
, pp. 39-52
-
-
Guyer, S.Z.1
Lin, C.2
-
20
-
-
12444310082
-
The virtualization model of parallel programming: Runtime optimizations and the state of art
-
L. V. Kalé. The virtualization model of parallel programming : Runtime optimizations and the state of art. In LACSI 2002, Albuquerque, October 2002.
-
LACSI 2002, Albuquerque, October 2002
-
-
Kalé, L.V.1
-
21
-
-
84976817516
-
Charm++: A portable concurrent object oriented system based on c++
-
New York, NY, USA, ACM
-
L. V. Kale and S. Krishnan. Charm++: a portable concurrent object oriented system based on c++. In Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications, OOPSLA '93, pages 91-108, New York, NY, USA, 1993. ACM.
-
(1993)
Proceedings of the Eighth Annual Conference on Object-oriented Programming Systems, Languages, and Applications, OOPSLA '93
, pp. 91-108
-
-
Kale, L.V.1
Krishnan, S.2
-
22
-
-
20744444866
-
Telescoping languages: A system for automatic generation of domain languages
-
DOI 10.1109/JPROC.2004.840447, Program Generation, Optimization and Platform Adaptation
-
K. Kennedy, B. Broom, A. Chauhan, R. Fowler, J. Garvin, C. Koelbel, C. McCosh, and J. Mellor-Crummey. Telescoping languages: A system for automatic generation of domain languages. Proc. IEEE, 93:387-408, 2005. (Pubitemid 40851231)
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
, pp. 387-408
-
-
Kennedy, K.1
Broom, B.2
Chauhan, A.3
Fowler, R.J.4
Garvin, J.5
Koelbel, C.6
Mccosh, C.7
Mellor-Crummey, J.8
-
24
-
-
77954725202
-
Overlapping communication and computation by using a hybrid mpi/smpss approach
-
V. Marjanović, J. Labarta, E. Ayguadé, and M. Valero. Overlapping communication and computation by using a hybrid mpi/smpss approach. In Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10, pages 5-16, 2010.
-
(2010)
Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10
, pp. 5-16
-
-
Marjanović, V.1
Labarta, J.2
Ayguadé, E.3
Valero, M.4
-
25
-
-
0000323669
-
Ab-initio molecular dynamics: Theory and implementation
-
J. Grotendorst, editor, NIC, chapter 13, i edition, Publicly available at the
-
D. Marx and J. Hutter. Ab-initio molecular dynamics: Theory and implementation. In J. Grotendorst, editor, Modern Methods and Algorithms of Quantum Chemistry, NIC, chapter 13, pages 301-449. Forschungszentrum Jlich, i edition, 2000. Publicly available at the URL: http://www2.fz-juelich.de/nic- series/Volume3/marx.pdf.
-
(2000)
Modern Methods and Algorithms of Quantum Chemistry
, pp. 301-449
-
-
Marx, D.1
Hutter, J.2
-
26
-
-
0001439335
-
MPI:A message-passing interface standard
-
Message Passing Interface Forum
-
Message Passing Interface Forum. MPI:A message-passing interface standard. International Journal of Supercomputing Applications, 8(3/4), 1994.
-
(1994)
International Journal of Supercomputing Applications
, vol.8
, Issue.3-4
-
-
-
27
-
-
57949083229
-
A dependency-aware task-based programming environment for multi-core architectures
-
J. Perez, R. Badia, and J. Labarta. A dependency-aware task-based programming environment for multi-core architectures. In Cluster Computing, 2008 IEEE International Conference on, pages 142-151, 2008.
-
(2008)
Cluster Computing, 2008 IEEE International Conference on
, pp. 142-151
-
-
Perez, J.1
Badia, R.2
Labarta, J.3
-
28
-
-
47749130103
-
Using mpi communication patterns to guide source code transformations
-
Computational Science ICCS 2008, Springer Berlin / Heidelberg
-
R. Preissl, M. Schulz, D. Kranzlmuller, B. de Supinski, and D. Quinlan. Using mpi communication patterns to guide source code transformations. In Computational Science ICCS 2008, volume 5103 of Lecture Notes in Computer Science, pages 253-260. Springer Berlin / Heidelberg, 2008.
-
(2008)
Lecture Notes in Computer Science
, vol.5103
, pp. 253-260
-
-
Preissl, R.1
Schulz, M.2
Kranzlmuller, D.3
De Supinski, B.4
Quinlan, D.5
-
29
-
-
84966549063
-
Treating a user-defined parallel library as a domain-specific language
-
IEEE
-
D. Quinlan, D. Miller, B. Philip, and M. Schordan. Treating a user-defined parallel library as a domain-specific language. In Proceedings of the 16th international Parallel and Distributed Processing Symposium, IPDPS 2002, Los Alamitos, CA, USA, April 2002. IEEE.
-
Proceedings of the 16th International Parallel and Distributed Processing Symposium, IPDPS 2002, Los Alamitos, CA, USA, April 2002
-
-
Quinlan, D.1
Miller, D.2
Philip, B.3
Schordan, M.4
-
30
-
-
33847133764
-
Program flow graph construction for static analysis of mpi programs
-
D. R. Shires, L. L. Pollock, and S. Sprenkle. Program flow graph construction for static analysis of mpi programs. In PDPTA, pages 1847-1853, 1999.
-
(1999)
PDPTA
, pp. 1847-1853
-
-
Shires, D.R.1
Pollock, L.L.2
Sprenkle, S.3
-
31
-
-
80052305141
-
-
Technical Report UCB/EECS-2011-72, EECS Department, University of California, Berkeley, Jun
-
E. Solomonik and J. Demmel. Communication-optimal parallel 2.5d matrix multiplication and lu factorization algorithms. Technical Report UCB/EECS-2011-72, EECS Department, University of California, Berkeley, Jun 2011.
-
(2011)
Communication-optimal Parallel 2.5d Matrix Multiplication and Lu Factorization Algorithms
-
-
Solomonik, E.1
Demmel, J.2
-
32
-
-
68849085832
-
Hiding communication latency with non-spmd, graph-based execution
-
Berlin, Heidelberg, Springer-Verlag
-
J. Sorensen and S. B. Baden. Hiding communication latency with non-spmd, graph-based execution. In Proc. 9th Intl Conf. Computational Sci. (ICCS '09), pages 155-164, Berlin, Heidelberg, 2009. Springer-Verlag.
-
(2009)
Proc. 9th Intl Conf. Computational Sci. (ICCS '09)
, pp. 155-164
-
-
Sorensen, J.1
Baden, S.B.2
-
33
-
-
0033894682
-
A hierarchical partition model for adaptive finite element computation
-
J. D. Teresco, M. W. Beall, J. E. Flaherty, and M. S. Shephard. A hierarchical partition model for adaptive finite element computation. Comput. Methods. Appl. Mech. Engrg., 184:269-285, 2000.
-
(2000)
Comput. Methods. Appl. Mech. Engrg.
, vol.184
, pp. 269-285
-
-
Teresco, J.D.1
Beall, M.W.2
Flaherty, J.E.3
Shephard, M.S.4
-
34
-
-
0025467711
-
A bridging model for parallel computation
-
August
-
L. G. Valiant. A bridging model for parallel computation. Commun. ACM, 33:103-111, August 1990.
-
(1990)
Commun. ACM
, vol.33
, pp. 103-111
-
-
Valiant, L.G.1
|