메뉴 건너뛰기




Volumn 15-20-November-2015, Issue , 2015, Pages

An input-adaptive and in-place approach to dense tensor-times-matrix multiply

Author keywords

code generation; multilinear algebra; offline autotuning; tensor operation

Indexed keywords

TENSORS;

EID: 84966570056     PISSN: 21674329     EISSN: 21674337     Source Type: Conference Proceeding    
DOI: 10.1145/2807591.2807671     Document Type: Conference Paper
Times cited : (60)

References (45)
  • 1
    • 19044386208 scopus 로고    scopus 로고
    • An updated set of basic linear algebra subprograms (BLAS)
    • June
    • An updated set of Basic Linear Algebra Subprograms (BLAS). ACM Trans. Math. Softw, 28(2):135-151, June 2002.
    • (2002) ACM Trans. Math. Softw , vol.28 , Issue.2 , pp. 135-151
  • 5
    • 31744435977 scopus 로고    scopus 로고
    • Automatic code generation for many-body electronic structure methods: The tensor contrac
    • A. Auer and etc. Automatic code generation for many-body electronic structure methods: the tensor contrac. Molecular Physics, 104(2):211-228, 2006.
    • (2006) Molecular Physics , vol.104 , Issue.2 , pp. 211-228
    • Auer, A.1
  • 7
    • 84900500118 scopus 로고    scopus 로고
    • Communication lower bounds and optimal algorithms for numerical linear algebra
    • G. Ballard, E. Carson, J. Demmel, M. Hoemmen, N. Knight, and O. Schwartz. Communication lower bounds and optimal algorithms for numerical linear algebra. Acta Numerica, 23:pp. 1-155, 2014.
    • (2014) Acta Numerica , vol.23 , pp. 1-155
    • Ballard, G.1    Carson, E.2    Demmel, J.3    Hoemmen, M.4    Knight, N.5    Schwartz, O.6
  • 9
    • 84937955938 scopus 로고    scopus 로고
    • Dfacto: Distributed factorization of tensors
    • Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K. Weinberger, editors. Curran Associates, Inc
    • J. H. Choi and S. Vishwanathan. Dfacto: Distributed factorization of tensors. In Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K. Weinberger, editors, Advances in Neural Information Processing Systems 27, pages 1296-1304. Curran Associates, Inc., 2014.
    • (2014) Advances in Neural Information Processing Systems , vol.27 , pp. 1296-1304
    • Choi, J.H.1    Vishwanathan, S.2
  • 10
    • 84915742301 scopus 로고    scopus 로고
    • Era of big data processing: A new approach via tensor networks and tensor decompositions
    • abs/1403.2048
    • A. Cichocki. Era of big data processing: A new approach via tensor networks and tensor decompositions. CoRR, abs/1403.2048, 2014.
    • (2014) CoRR
    • Cichocki, A.1
  • 11
    • 44249094647 scopus 로고    scopus 로고
    • Anatomy of high-performance matrix multiplication
    • May
    • K. Goto and R. A. V. D. Geijn. Anatomy of high-performance matrix multiplication. ACM Trans. Math. Softw., 34(3):12:1-12:25, May 2008.
    • (2008) ACM Trans. Math. Softw , vol.34 , Issue.3 , pp. 121-1225
    • Goto, K.1    Geijn, R.A.V.D.2
  • 12
    • 77956032667 scopus 로고    scopus 로고
    • Hierarchical singular value decomposition of tensors
    • May
    • L. Grasedyck. Hierarchical singular value decomposition of tensors. SIAM J. Matrix Anal. Appl., 31(4):2029-2054, May 2010.
    • (2010) SIAM J. Matrix Anal. Appl , vol.31 , Issue.4 , pp. 2029-2054
    • Grasedyck, L.1
  • 13
    • 84892949294 scopus 로고    scopus 로고
    • A literature survey of low-rank tensor approximation techniques
    • L. Grasedyck, D. Kressner, and C. Tobler. A literature survey of low-rank tensor approximation techniques. GAMM-Mitteilungen, 36(1):53-78, 2013.
    • (2013) GAMM-Mitteilungen , vol.36 , Issue.1 , pp. 53-78
    • Grasedyck, L.1    Kressner, D.2    Tobler, C.3
  • 16
    • 0024662699 scopus 로고
    • Fi: A parameter to characterize memory and communication bottlenecks
    • R. W. Hockney and I. J. Curington. fi: A parameter to characterize memory and communication bottlenecks. Parallel Computing, 10:277-286, 1989.
    • (1989) Parallel Computing , vol.10 , pp. 277-286
    • Hockney, R.W.1    Curington, I.J.2
  • 17
    • 84872201157 scopus 로고    scopus 로고
    • Intel
    • Intel. Math kernel library. http://developer.intel.com/software/products/mkl/.
    • Math Kernel Library
  • 21
    • 68649096448 scopus 로고    scopus 로고
    • Tensor decompositions and applications
    • T. Kolda and B. Bader. Tensor decompositions and applications. SIAM Review, 51(3):455-500, 2009.
    • (2009) SIAM Review , vol.51 , Issue.3 , pp. 455-500
    • Kolda, T.1    Bader, B.2
  • 27
    • 0024023333 scopus 로고
    • Topographic components model for event-related potentials and some biophysical considerations
    • June
    • J. Mocks. Topographic components model for event-related potentials and some biophysical considerations. Biomedical Engineering, IEEE Transactions on, 35(6):482-484, June 1988.
    • (1988) Biomedical Engineering IEEE Transactions on , vol.35 , Issue.6 , pp. 482-484
    • Mocks, J.1
  • 28
    • 31044456392 scopus 로고    scopus 로고
    • Parallel factor analysis as an exploratory tool for wavelet transformed event-related {EEG}
    • M. Morup, L. K. Hansen, C. S. Herrmann, J. Par-nas, and S. M. Arnfred. Parallel factor analysis as an exploratory tool for wavelet transformed event-related {EEG}. NeuroImage, 29(3):938-947, 2006.
    • (2006) NeuroImage , vol.29 , Issue.3 , pp. 938-947
    • Morup, M.1    Hansen, L.K.2    Herrmann, C.S.3    Par-Nas, J.4    Arnfred, S.M.5
  • 29
    • 32944458698 scopus 로고    scopus 로고
    • Kronecker product approximation for preconditioning in three-dimensional imaging applications
    • March
    • J. Nagy and M. Kilmer. Kronecker product approximation for preconditioning in three-dimensional imaging applications. Image Processing, IEEE Transactions on, 15(3):604-613, March 2006.
    • (2006) Image Processing, IEEE Transactions on , vol.15 , Issue.3 , pp. 604-613
    • Nagy, J.1    Kilmer, M.2
  • 30
    • 80053896203 scopus 로고    scopus 로고
    • Tensor-train decomposition
    • I. V. Oseledets. Tensor-train decomposition. SIAM J. Scientific Computing, 33(5):2295-2317, 2011.
    • (2011) SIAM J. Scientific Computing , vol.33 , Issue.5 , pp. 2295-2317
    • Oseledets, I.V.1
  • 34
    • 33750527967 scopus 로고    scopus 로고
    • Handwritten digit classification using higher order singular value decomposition
    • B. Savas and L. Elden. Handwritten digit classification using higher order singular value decomposition. Pattern recognition, 40(3):993-1003, 2007.
    • (2007) Pattern Recognition , vol.40 , Issue.3 , pp. 993-1003
    • Savas, B.1    Elden, L.2
  • 42
    • 0013953617 scopus 로고
    • Some mathematical notes on three-mode factor analysis
    • L. R. Tucker. Some mathematical notes on three-mode factor analysis. Psychometrika, 31(3):279-311, 1966.
    • (1966) Psychometrika , vol.31 , Issue.3 , pp. 279-311
    • Tucker, L.R.1
  • 44
    • 84944415516 scopus 로고    scopus 로고
    • Multilinear analysis of image ensembles: Tensorfaces
    • Springer
    • M. A. O. Vasilescu and D. Terzopoulos. Multilinear analysis of image ensembles: Tensorfaces. In Computer Vision-ECCV 2002, pages 447-460. Springer, 2002.
    • (2002) Computer Vision-ECCV 2002 , pp. 447-460
    • Vasilescu, M.A.O.1    Terzopoulos, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.