



Volume , Issue , 2013, Pages 813-824

Cyclops tensor framework: Reducing communication and eliminating load imbalance in massively parallel contractions

Author keywords

communication avoiding algorithms; Coupled Cluster; Cyclops; tensor contractions

Indexed keywords

COUPLED CLUSTERS; CYCLOPS; DOMAIN SPECIFIC LANGUAGES; MASSIVELY PARALLELS; MATRIX MULTIPLICATION; TENSOR CONTRACTION; USER-LEVEL INTERFACE; VIRTUALIZATION LAYERS;

EID: 84884845041     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2013.112     Document Type: Conference Paper
Times cited : (116)

References (36)
  • 1
    • 36849099976
    • On the correlation problem in atomic and molecular systems. Calculation of wavefunction components in Ursell-Type expansion using Quantum-Field theoretical methods
    • J. Čížek, "On the correlation problem in atomic and molecular systems. Calculation of wavefunction components in Ursell-Type expansion using Quantum-Field theoretical methods," The Journal of Chemical Physics, vol. 45, no. 11, pp. 4256-4266, Dec. 1966.
    • (1966) The Journal of Chemical Physics , vol.45 , Issue.11 , pp. 4256-4266
    • Čížek, J.1
  • 2
    • 33847389465
    • Coupled-cluster theory in quantum chemistry
    • R. J. Bartlett and M. Musiał, "Coupled-cluster theory in quantum chemistry," Reviews of Modern Physics, vol. 79, no. 1, pp. 291-352, 2007.
    • (2007) Reviews of Modern Physics , vol.79 , Issue.1 , pp. 291-352
    • Bartlett, R.J.1    Musiał, M.2
  • 4
    • 84855334530
    • An introduction to coupled cluster theory for computational chemists
    • T. D. Crawford and H. F. Schaefer III, "An introduction to coupled cluster theory for computational chemists," Reviews in Computational Chemistry, vol. 14, p. 33, 2000.
    • (2000) Reviews in Computational Chemistry , vol.14 , pp. 33
    • Crawford, T.D.1    Schaefer III, H.F.2
  • 5
    • 0000122016
    • A full coupled-cluster singles and doubles model: The inclusion of disconnected triples
    • G. D. Purvis and R. J. Bartlett, "A full coupled-cluster singles and doubles model: The inclusion of disconnected triples," The Journal of Chemical Physics, vol. 76, no. 4, pp. 1910-1918, Feb. 1982.
    • (1982) The Journal of Chemical Physics , vol.76 , Issue.4 , pp. 1910-1918
    • Purvis, G.D.1    Bartlett, R.J.2
  • 6
    • 36549094083
    • A coupled cluster approach with triple excitations
    • Y. S. Lee, S. A. Kucharski, and R. J. Bartlett, "A coupled cluster approach with triple excitations," Journal of Chemical Physics, vol. 81, no. 12, p. 5906, 1984.
    • (1984) Journal of Chemical Physics , vol.81 , Issue.12 , pp. 5906
    • Lee, Y.S.1    Kucharski, S.A.2    Bartlett, R.J.3
  • 7
    • 36549092221
    • The full CCSDT model for molecular electronic structure
    • J. Noga and R. J. Bartlett, "The full CCSDT model for molecular electronic structure," Journal of Chemical Physics, vol. 86, no. 12, p. 7041, 1987.
    • (1987) Journal of Chemical Physics , vol.86 , Issue.12 , pp. 7041
    • Noga, J.1    Bartlett, R.J.2
  • 8
    • 0000639603
    • Recursive intermediate factorization and complete computational linearization of the coupled-cluster single, double, triple, and quadruple excitation equations
    • S. A. Kucharski and R. J. Bartlett, "Recursive intermediate factorization and complete computational linearization of the coupled-cluster single, double, triple, and quadruple excitation equations," Theoretica Chimica Acta, vol. 80, no. 4-5, pp. 387-405, 1991.
    • (1991) Theoretica Chimica Acta , vol.80 , Issue.4-5 , pp. 387-405
    • Kucharski, S.A.1    Bartlett, R.J.2
  • 9
    • 80052339109
    • Communication-optimal 2.5D matrix multiplication and LU factorization algorithms
    • Euro-Par, Bordeaux, France, Aug
    • E. Solomonik and J. Demmel, "Communication-optimal 2.5D matrix multiplication and LU factorization algorithms," in Lecture Notes in Computer Science, Euro-Par, Bordeaux, France, Aug 2011.
    • (2011) Lecture Notes in Computer Science
    • Solomonik, E.1    Demmel, J.2
  • 12
    • 43949127735
    • Efficient search-space pruning for integrated fusion and tiling transformations
    • Languages and Compilers for Parallel Computing, ser. Springer Berlin / Heidelberg
    • X. Gao, S. Krishnamoorthy, S. Sahoo, C.-C. Lam, G. Baumgartner, J. Ramanujam, and P. Sadayappan, "Efficient search-space pruning for integrated fusion and tiling transformations," in Languages and Compilers for Parallel Computing, ser. Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2006, vol. 4339, pp. 215-229.
    • (2006) Lecture Notes in Computer Science , vol.4339 , pp. 215-229
    • Gao, X.1    Krishnamoorthy, S.2    Sahoo, S.3    Lam, C.-C.4    Baumgartner, G.5    Ramanujam, J.6    Sadayappan, P.7
  • 13
    • 0030157365
    • Global Arrays: A nonuniform memory access programming model for high-performance computers
    • J. Nieplocha, R. J. Harrison, and R. J. Littlefield, "Global arrays: A nonuniform memory access programming model for high-performance computers," The Journal of Supercomputing, vol. 10, pp. 169-189, 1996, 10.1007/BF00130708. (Pubitemid 126723336)
    • (1996) Journal of Supercomputing , vol.10 , Issue.2 , pp. 169-189
    • Nieplocha, J.1    Harrison, R.J.2    Littlefield, R.J.3
  • 14
    • 70449713889
    • An infrastructure for scalable and portable parallel programs for computational chemistry
    • Proceedings of the 23rd international conference on Supercomputing, ser. New York, NY, USA: ACM
    • V. Lotrich, N. Flocke, M. Ponton, B. A. Sanders, E. Deumens, R. J. Bartlett, and A. Perera, "An infrastructure for scalable and portable parallel programs for computational chemistry," in Proceedings of the 23rd international conference on Supercomputing, ser. ICS '09. New York, NY, USA: ACM, 2009, pp. 523-524.
    • (2009) ICS '09 , pp. 523-524
    • Lotrich, V.1    Flocke, N.2    Ponton, M.3    Sanders, B.A.4    Deumens, E.5    Bartlett, R.J.6    Perera, A.7
  • 16
    • 0035880942
    • Higher excitations in coupled-cluster theory
    • M. Kállay and P. R. Surján, "Higher excitations in coupled-cluster theory," The Journal of Chemical Physics, vol. 115, no. 7, p. 2945, 2001.
    • (2001) The Journal of Chemical Physics , vol.115 , Issue.7 , pp. 2945
    • Kállay, M.1    Surján, P.R.2
  • 17
    • 84971853043
    • I/O complexity: The red-blue pebble game
    • Proceedings of the thirteenth annual ACM symposium on Theory of computing, ser. New York, NY, USA: ACM
    • H. Jia-Wei and H. T. Kung, "I/O complexity: The red-blue pebble game," in Proceedings of the thirteenth annual ACM symposium on Theory of computing, ser. STOC '81. New York, NY, USA: ACM, 1981, pp. 326-333.
    • (1981) STOC '81 , pp. 326-333
    • Jia-Wei, H.1    Kung, H.T.2
  • 19
    • 10844258198
    • Communication lower bounds for distributed-memory matrix multiplication
    • DOI 10.1016/j.jpdc.2004.03.021
    • D. Irony, S. Toledo, and A. Tiskin, "Communication lower bounds for distributed-memory matrix multiplication," Journal of Parallel and Distributed Computing, vol. 64, no. 9, pp. 1017-1026, 2004. (Pubitemid 40000755)
    • (2004) Journal of Parallel and Distributed Computing , vol.64 , Issue.9 , pp. 1017-1026
    • Irony, D.1    Toledo, S.2    Tiskin, A.3
  • 21
    • 0029370767
    • A three-dimensional approach to parallel matrix multiplication
    • R. C. Agarwal, S. M. Balle, F. G. Gustavson, M. Joshi, and P. Palkar, "A three-dimensional approach to parallel matrix multiplication," IBM J. Res. Dev., vol. 39, pp. 575-582, September 1995.
    • (1995) IBM J. Res. Dev. , vol.39 , pp. 575-582
    • Agarwal, R.C.1    Balle, S.M.2    Gustavson, F.G.3    Joshi, M.4    Palkar, P.5
  • 22
    • 0031123769
    • SUMMA: Scalable universal matrix multiplication algorithm
    • R. A. Van De Geijn and J. Watts, "SUMMA: scalable universal matrix multiplication algorithm," Concurrency: Practice and Experience, vol. 9, no. 4, pp. 255-274, 1997. (Pubitemid 127679707)
    • (1997) Concurrency Practice and Experience , vol.9 , Issue.4 , pp. 255-274
    • Van De Geijn, R.A.1    Watts, J.2
  • 24
    • 84864146488
    • Brief announcement: Strong scaling of matrix multiplication algorithms and memory-independent communication lower bounds
    • Proceedings of the 24th ACM symposium on Parallelism in algorithms and architectures, ser. New York, NY, USA: ACM, [Online]. Available
    • G. Ballard, J. Demmel, O. Holtz, B. Lipshitz, and O. Schwartz, "Brief announcement: strong scaling of matrix multiplication algorithms and memory-independent communication lower bounds," in Proceedings of the 24th ACM symposium on Parallelism in algorithms and architectures, ser. SPAA '12. New York, NY, USA: ACM, 2012, pp. 77-79. [Online]. Available: http://doi.acm.org/10.1145/2312005.2312021
    • (2012) SPAA '12 , pp. 77-79
    • Ballard, G.1    Demmel, J.2    Holtz, O.3    Lipshitz, B.4    Schwartz, O.5
  • 25
    • 0000456144
    • Parallel matrix and graph algorithms
    • E. Dekel, D. Nassimi, and S. Sahni, "Parallel matrix and graph algorithms," SIAM Journal on Computing, vol. 10, no. 4, pp. 657-675, 1981.
    • (1981) SIAM Journal on Computing , vol.10 , Issue.4 , pp. 657-675
    • Dekel, E.1    Nassimi, D.2    Sahni, S.3
  • 26
    • 0027702512
    • Minimizing the communication time for matrix multiplication on multiprocessors
    • S. L. Johnsson, "Minimizing the communication time for matrix multiplication on multiprocessors," Parallel Comput., vol. 19, pp. 1235-1257, November 1993.
    • (1993) Parallel Comput. , vol.19 , pp. 1235-1257
    • Johnsson, S.L.1
  • 31
    • 84976817516
    • CHARM++: A portable concurrent object oriented system based on C++
    • Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications, ser. New York, NY, USA: ACM
    • L. V. Kale and S. Krishnan, "CHARM++: a portable concurrent object oriented system based on C++," in Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications, ser. OOPSLA '93. New York, NY, USA: ACM, 1993, pp. 91-108.
    • (1993) OOPSLA '93 , pp. 91-108
    • Kale, L.V.1    Krishnan, S.2
  • 35
    • 83155160952
    • The IBM Blue Gene/Q interconnection network and message unit
    • Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, ser. New York, NY, USA: ACM
    • D. Chen, N. A. Eisley, P. Heidelberger, R. M. Senger, Y. Sugawara, S. Kumar, V. Salapura, D. L. Satterfield, B. Steinmacher-Burow, and J. J. Parker, "The IBM Blue Gene/Q interconnection network and message unit," in Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, ser. SC '11. New York, NY, USA: ACM, 2011, pp. 26:1-26:10.
    • (2011) SC '11
    • Chen, D.1    Eisley, N.A.2    Heidelberger, P.3    Senger, R.M.4    Sugawara, Y.5    Kumar, S.6    Salapura, V.7    Satterfield, D.L.8    Steinmacher-Burow, B.9    Parker, J.J.10


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.