Volume , Issue , 2016, Pages 1225-1237

Transfer learning for speech and language processing

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACT DATA TYPES; BAYESIAN NETWORKS; INFORMATION SCIENCE; MODEL STRUCTURES;

EID: 84986198297     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/APSIPA.2015.7415532     Document Type: Conference Paper
Times cited : (220)

References (150)
  • 3
    • 84876672166
    • Machine learning paradigms for speech recognition: An overview
    • L. Deng and X. Li, "Machine learning paradigms for speech recognition: An overview," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 5, pp. 1060-1089, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.5 , pp. 1060-1089
    • Deng, L.1    Li, X.2
  • 5
    • 68949157375
    • Transfer learning for reinforcement learning domains: A survey
    • M. E. Taylor and P. Stone, "Transfer learning for reinforcement learning domains: A survey," The Journal of Machine Learning Research, vol. 10, pp. 1633-1685, 2009.
    • (2009) The Journal of Machine Learning Research , vol.10 , pp. 1633-1685
    • Taylor, M.E.1    Stone, P.2
  • 6
    • 84904548965
    • Deep learning of representations for unsupervised and transfer learning
    • Y. Bengio, "Deep learning of representations for unsupervised and transfer learning," in ICML Unsupervised and Transfer Learning, 2012.
    • (2012) ICML Unsupervised and Transfer Learning
    • Bengio, Y.1
  • 7
  • 11
    • 0031189914
    • Multitask learning
    • R. Caruana, "Multitask learning," Machine learning, vol. 28, no. 1, pp. 41-75, 1997.
    • (1997) Machine Learning , vol.28 , Issue.1 , pp. 41-75
    • Caruana, R.1
  • 12
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Transactions on Speech and audio processing, vol. 2, no. 2, pp. 291-298, 1994.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 13
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech & Language, vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.2
  • 14
    • 77952642202
    • Incremental induction of decision trees
    • P. E. Utgoff, "Incremental induction of decision trees," Machine learning, vol. 4, no. 2, pp. 161-186, 1989.
    • (1989) Machine Learning , vol.4 , Issue.2 , pp. 161-186
    • Utgoff, P.E.1
  • 18
    • 85072851756
    • Transfer learning by structural analogy
    • H.-Y. Wang and Q. Yang, "Transfer learning by structural analogy," in AAAI. Citeseer, 2011.
    • (2011) AAAI. Citeseer
    • Wang, H.-Y.1    Yang, Q.2
  • 20
    • 57349144443
    • Online learning of Gaussian mixture models-a two-level approach
    • A. Declercq and J. H. Piater, "Online learning of Gaussian mixture models-a two-level approach." in VISAPP (1), 2008, pp. 605-611.
    • (2008) VISAPP , Issue.1 , pp. 605-611
    • Declercq, A.1    Piater, J.H.2
  • 21
    • 33745456231
    • Semi-supervised learning literature survey
    • University of Wisconsin-Madison
    • X. Zhu, "Semi-supervised learning literature survey," Computer Sciences TRP 1530, University of Wisconsin-Madison, 2005.
    • (2005) Computer Sciences TRP 1530
    • Zhu, X.1
  • 27
    • 84986192609
    • Heterogeneous transfer learning with RBMs
    • B. Wei and C. J. Pal, "Heterogeneous transfer learning with RBMs." in AAAI, 2011.
    • (2011) AAAI
    • Wei, B.1    Pal, C.J.2
  • 31
    • 48749148075
    • Structure-mapping: A theoretical framework for analogy
    • D. Gentner, "Structure-mapping: A theoretical framework for analogy," Cognitive science, vol. 7, no. 2, pp. 155-170, 1983.
    • (1983) Cognitive Science , vol.7 , Issue.2 , pp. 155-170
    • Gentner, D.1
  • 32
    • 0030641933
    • Reasoning and learning by analogy: Introduction
    • D. Gentner and K. J. Holyoak, "Reasoning and learning by analogy: Introduction." American Psychologist, vol. 52, no. 1, p. 32, 1997.
    • (1997) American Psychologist , vol.52 , Issue.1 , pp. 32
    • Gentner, D.1    Holyoak, K.J.2
  • 37
    • 14344277592
    • A model of inductive bias learning
    • J. Baxter, "A model of inductive bias learning," J. Artif. Intell. Res.(JAIR), vol. 12, pp. 149-198, 2000.
    • (2000) J. Artif. Intell. Res.(JAIR) , vol.12 , pp. 149-198
    • Baxter, J.1
  • 38
    • 79958787804
    • Estimating variable structure and dependence in multitask learning via gradients
    • J. Guinney, Q. Wu, and S. Mukherjee, "Estimating variable structure and dependence in multitask learning via gradients," Machine Learning, vol. 83, no. 3, pp. 265-287, 2011.
    • (2011) Machine Learning , vol.83 , Issue.3 , pp. 265-287
    • Guinney, J.1    Wu, Q.2    Mukherjee, S.3
  • 41
    • 84903724014
    • Deep learning: Methods and applications
    • L. Deng and D. Yu, "Deep learning: Methods and applications," Foundations and Trends in Signal Processing, vol. 7, no. 3-4, pp. 197-387, 2013. [Online]. Available: http://dx.doi.org/10.1561/2000000039
    • (2013) Foundations and Trends in Signal Processing , vol.7 , Issue.3-4 , pp. 197-387
    • Deng, L.1    Yu, D.2
  • 43
    • 84937806794
    • Advances in natural language processing
    • J. Hirschberg and C. D. Manning, "Advances in natural language processing," Science, vol. 349, no. 6245, pp. 261-266, 2015.
    • (2015) Science , vol.349 , Issue.6245 , pp. 261-266
    • Hirschberg, J.1    Manning, C.D.2
  • 44
    • 33745805403
    • A fast learning algorithm for deep belief nets
    • G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets," Neural computation, vol. 18, no. 7, pp. 1527-1554, 2006.
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 47
    • 79551480483
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion," The Journal of Machine Learning Research, vol. 11, pp. 3371-3408, 2010.
    • (2010) The Journal of Machine Learning Research , vol.11 , pp. 3371-3408
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.-A.5
  • 48
  • 50
    • 80054108245
    • On the expressive power of deep architectures
    • Springer
    • Y. Bengio and O. Delalleau, "On the expressive power of deep architectures," in Algorithmic Learning Theory. Springer, 2011, pp. 18-36.
    • (2011) Algorithmic Learning Theory , pp. 18-36
    • Bengio, Y.1    Delalleau, O.2
  • 51
    • 56449095373
    • A unified architecture for natural language processing: Deep neural networks with multitask learning
    • ACM
    • R. Collobert and J. Weston, "A unified architecture for natural language processing: Deep neural networks with multitask learning," in Proceedings of the 25th international conference on Machine learning. ACM, 2008, pp. 160-167.
    • (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 160-167
    • Collobert, R.1    Weston, J.2
  • 56
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 61
    • 74549123074
    • Zero-data learning of new tasks
    • H. Larochelle, D. Erhan, and Y. Bengio, "Zero-data learning of new tasks." in AAAI, vol. 1, no. 2, 2008, p. 3.
    • (2008) AAAI , vol.1 , Issue.2 , pp. 3
    • Larochelle, H.1    Erhan, D.2    Bengio, Y.3
  • 63
    • 0035426931
    • Language-independent and language-adaptive acoustic modeling for speech recognition
    • T. Schultz and A. Waibel, "Language-independent and language-adaptive acoustic modeling for speech recognition," Speech Communication, vol. 35, no. 1, pp. 31-51, 2001.
    • (2001) Speech Communication , vol.35 , Issue.1 , pp. 31-51
    • Schultz, T.1    Waibel, A.2
  • 72
    • 84910067354
    • Language independent and unsupervised acoustic models for speech recognition and keyword spotting
    • K. M. Knill, M. J. Gales, A. Ragni, and S. P. Rath, "Language independent and unsupervised acoustic models for speech recognition and keyword spotting," in Proc. Interspeech14, 2014.
    • (2014) Proc. Interspeech14
    • Knill, K.M.1    Gales, M.J.2    Ragni, A.3    Rath, S.P.4
  • 76
    • 84986195842
    • Speech recognition with pronunciation vectors
    • Tsinghua University
    • X. Z. Zhiyuan Tang, "Speech recognition with pronunciation vectors," CSLT, Tsinghua University, 2015. [Online]. Available: http://cslt.riit.tsinghua.edu.cn/publications.php?Publication-trp
    • (2015) CSLT
    • Zhiyuan Tang, X.Z.1
  • 80
    • 84986223001
    • Music removal by convolutional denoising autoencoder in speech recognition
    • M. Zhao, D. Wang, Z. Zhang, and X. Zhang, "Music removal by convolutional denoising autoencoder in speech recognition," in Interspeech 2015, 2015.
    • (2015) Interspeech 2015
    • Zhao, M.1    Wang, D.2    Zhang, Z.3    Zhang, X.4
  • 81
    • 84865783757
    • Separating speaker and environmental variability using factored transforms
    • M. L. Seltzer and A. Acero, "Separating speaker and environmental variability using factored transforms." in INTERSPEECH, 2011, pp. 1097-1100.
    • (2011) INTERSPEECH , pp. 1097-1100
    • Seltzer, M.L.1    Acero, A.2
  • 82
    • 79957856980
    • A basis representation of constrained MLLR transforms for robust adaptation
    • D. Povey and K. Yao, "A basis representation of constrained MLLR transforms for robust adaptation," Computer Speech & Language, vol. 26, no. 1, pp. 35-51, 2012.
    • (2012) Computer Speech & Language , vol.26 , Issue.1 , pp. 35-51
    • Povey, D.1    Yao, K.2
  • 84
    • 84866861206
    • Speaker adaptation techniques for automatic speech recognition
    • K. Shinoda, "Speaker adaptation techniques for automatic speech recognition," Proc. APSIPA ASC 2011, 2011.
    • (2011) Proc. APSIPA ASC 2011
    • Shinoda, K.1
  • 86
    • 84906225505
    • Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
    • -, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition." in INTERSPEECH, 2013, pp. 1248-1252.
    • (2013) INTERSPEECH , pp. 1248-1252
  • 89
    • 84910068089
    • Adaptation of deep neural network acoustic models using factorised i-vectors
    • P. Karanasou, Y. Wang, M. J. Gales, and P. C. Woodland, "Adaptation of deep neural network acoustic models using factorised i-vectors," in Proc Interspeech, 2014.
    • (2014) Proc Interspeech
    • Karanasou, P.1    Wang, Y.2    Gales, M.J.3    Woodland, P.C.4
  • 90
    • 84905259138
    • Improving DNN speaker independence with i-vector inputs
    • A. Senior and I. Lopez-Moreno, "Improving DNN speaker independence with i-vector inputs," in Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Senior, A.1    Lopez-Moreno, I.2
  • 95
    • 34548012893
    • Linear hidden transformations for adaptation of hybrid ANN/HMM models
    • R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. De Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models," Speech Communication, vol. 49, no. 10, pp. 827-835, 2007.
    • (2007) Speech Communication , vol.49 , Issue.10 , pp. 827-835
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    De Mori, R.5
  • 96
    • 84881054791
    • Hermitian polynomial for speaker adaptation of connectionist speech recognition systems
    • S. M. Siniscalchi, J. Li, and C.-H. Lee, "Hermitian polynomial for speaker adaptation of connectionist speech recognition systems," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2152-2161, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2152-2161
    • Siniscalchi, S.M.1    Li, J.2    Lee, C.-H.3
  • 97
    • 84983119674
    • Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
    • IEEE
    • P. Swietojanski and S. Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models," in Spoken Language Technology Workshop (SLT), 2014 IEEE. IEEE, 2014, pp. 171-176.
    • (2014) Spoken Language Technology Workshop (SLT), 2014 IEEE , pp. 171-176
    • Swietojanski, P.1    Renals, S.2
  • 98
    • 79959849500
    • Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems
    • B. Li and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems," in Interspeech'10, 2010.
    • (2010) Interspeech'10
    • Li, B.1    Sim, K.C.2
  • 104
    • 84910031119
    • Towards speaker adaptive training of deep neural network acoustic models
    • Y. Miao, H. Zhang, and F. Metze, "Towards speaker adaptive training of deep neural network acoustic models," in Interspeech'14, 2014.
    • (2014) Interspeech'14
    • Miao, Y.1    Zhang, H.2    Metze, F.3
  • 107
    • 70450192740
    • State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis
    • Y.-J. Wu, Y. Nankaku, and K. Tokuda, "State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis." in Interspeech, 2009, pp. 528-531.
    • (2009) Interspeech , pp. 528-531
    • Wu, Y.-J.1    Nankaku, Y.2    Tokuda, K.3
  • 108
    • 33847129573
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE TRANSACTIONS on Information and Systems, vol. 90, no. 2, pp. 533-543, 2007.
    • (2007) IEICE TRANSACTIONS on Information and Systems , vol.90 , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 111
    • 79953289255
    • Unsupervised intralingual and cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction
    • May
    • M. Gibson and W. Byrne, "Unsupervised intralingual and cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 4, pp. 895-904, May 2011.
    • (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.4 , pp. 895-904
    • Gibson, M.1    Byrne, W.2
  • 112
    • 84901237776
    • Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
    • Ling, Z.-H.1    Deng, L.2    Yu, D.3
  • 115
    • 84959166808
    • Preliminary work on speaker adaptation for DNN-based speech synthesis
    • B. Potard, P. Motlicek, and D. Imseng, "Preliminary work on speaker adaptation for DNN-based speech synthesis," Idiap, Tech. Rep., 2015.
    • (2015) Idiap, Tech. Rep.
    • Potard, B.1    Motlicek, P.2    Imseng, D.3
  • 125
    • 84959104706
    • Recognize foreign low-frequency words with similar pairs
    • X. Ma, X. Wang, and D. Wang, "Recognize foreign low-frequency words with similar pairs," in Interspeech 2015, 2015.
    • (2015) Interspeech 2015
    • Ma, X.1    Wang, X.2    Wang, D.3
  • 131
    • 84858779990
    • A scalable hierarchical distributed language model
    • A. Mnih and G. E. Hinton, "A scalable hierarchical distributed language model," in NIPS, 2008, pp. 1081-1088.
    • (2008) NIPS , pp. 1081-1088
    • Mnih, A.1    Hinton, G.E.2
  • 136
    • 84951746837
    • Normalized word embedding and orthogonal transform for bilingual word translation
    • C. Xing, D. Wang, C. Liu, and Y. Lin, "Normalized word embedding and orthogonal transform for bilingual word translation," in NAACL'15, 2015.
    • (2015) NAACL'15
    • Xing, C.1    Wang, D.2    Liu, C.3    Lin, Y.4
  • 138
    • 84876812227
    • Inducing crosslingual distributed representations of words
    • Citeseer
    • A. Klementiev, I. Titov, and B. Bhattarai, "Inducing crosslingual distributed representations of words," in COLING'12. Citeseer, 2012.
    • (2012) COLING'12
    • Klementiev, A.1    Titov, I.2    Bhattarai, B.3
  • 142
    • 84953837109
    • Going beyond text: A hybrid image-text approach for measuring word relatedness
    • C. W. Leong and R. Mihalcea, "Going beyond text: A hybrid image-text approach for measuring word relatedness." in IJCNLP, 2011, pp. 1403-1407.
    • (2011) IJCNLP , pp. 1403-1407
    • Leong, C.W.1    Mihalcea, R.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.