메뉴 건너뛰기




Volumn , Issue , 2013, Pages 356-361

Porting concepts from DNNs back to GMMs

Author keywords

deep neural networks; deep structures; DNN; Gaussian mixture models; GMM; speech recognition

Indexed keywords

DEEP NEURAL NETWORKS; DEEP STRUCTURE; DNN; GAUSSIAN MIXTURE MODEL; GMM;

EID: 84893695632     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2013.6707756     Document Type: Conference Paper
Times cited : (9)

References (41)
  • 2
    • 84055211743 scopus 로고    scopus 로고
    • Acoustic modeling using deep belief networks
    • A.-R. Mohamed, G.E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. on ASLP, vol. 20, no. 1, pp. 14-22, 2012.
    • (2012) IEEE Trans. on ASLP , vol.20 , Issue.1 , pp. 14-22
    • Mohamed, A.-R.1    Dahl, G.E.2    Hinton, G.3
  • 3
    • 84890466217 scopus 로고    scopus 로고
    • Improving neural networks by preventing co-adaptation of feature detectors
    • p. abs/1207.0580
    • G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors, " CoRR, p. abs/1207.0580, 2012.
    • (2012) CoRR
    • Hinton, G.E.1    Srivastava, N.2    Krizhevsky, A.3    Sutskever, I.4    Salakhutdinov, R.5
  • 5
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Comp. Speech and Lang., vol. 12, pp. 75-98, 1998. (Pubitemid 128383747)
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 7
    • 85009113852 scopus 로고    scopus 로고
    • Hmm adaptation using vector taylor series for noisy speech recognition
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition, " in Proc. ICSLP, 2000, pp. 229-232.
    • (2000) Proc. ICSLP , pp. 229-232
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 8
    • 85006734596 scopus 로고    scopus 로고
    • Evaluation of the SPLICE algorithm on the aurora2 database
    • J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the Aurora2 database, " in Proc. EUROSPEECH, 2001, pp. 217-220.
    • (2001) Proc. EUROSPEECH , pp. 217-220
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 9
    • 34547528168 scopus 로고    scopus 로고
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • H. Liao and M.J.F Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data, " in Proc. ICASSP, 2007, pp. 389-392.
    • (2007) Proc. ICASSP , pp. 389-392
    • Liao, H.1    Gales, M.J.F.2
  • 10
    • 0030364003 scopus 로고    scopus 로고
    • Reduced semi-continuous models for large vocabulary continuous speech recognition in dutch
    • K. Demuynck, J. Duchateau, and D. Van Compernolle, "Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch, " in Proc. ICSLP, 1996, vol. IV, pp. 2289-2292.
    • (1996) Proc. ICSLP , vol.4 , pp. 2289-2292
    • Demuynck, K.1    Duchateau, J.2    Van Compernolle, D.3
  • 11
    • 0013344078 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • G.E. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 14, pp. 1771- 1800, 2002.
    • (2002) Neural Computation , vol.14 , pp. 1771-1800
    • Hinton, G.E.1
  • 13
    • 0029228458 scopus 로고
    • Optimal linear feature transformations for semi-continuous hidden markov models
    • G. Schukat-Talamazzini, J. Hornegger, and H. Niemann, "Optimal linear feature transformations for semi-continuous hidden Markov models, " in Proc. ICASSP, 1995, vol. I, pp. 369- 372.
    • (1995) Proc. ICASSP , vol.1 , pp. 369-372
    • Schukat-Talamazzini, G.1    Hornegger, J.2    Niemann, H.3
  • 15
    • 84871612455 scopus 로고    scopus 로고
    • Optimal feature sub-space selection based on discriminant analysis
    • K. Demuynck, J. Duchateau, and D. Van Compernolle, "Optimal feature sub-space selection based on discriminant analysis, " in Proc. EUROSPEECH, 1999, vol. III, pp. 1311-1314.
    • (1999) Proc. EUROSPEECH , vol.3 , pp. 1311-1314
    • Demuynck, K.1    Duchateau, J.2    Van Compernolle, D.3
  • 17
    • 0002235014 scopus 로고    scopus 로고
    • Improved feature decorrelation for hmm-based speech recognition
    • K. Demuynck, J. Duchateau, D. Van Compernolle, and P.Wambacq, "Improved feature decorrelation for HMM-based speech recognition, " in Proc. ICSLP, 1998, vol. VII, pp. 2907- 2910.
    • (1998) Proc. ICSLP , vol.7 , pp. 2907-2910
    • Demuynck, K.1    Duchateau, J.2    Van Compernolle, D.3    Wambacq, P.4
  • 18
    • 4243460174 scopus 로고    scopus 로고
    • Semi-tied covariance matrices
    • M.J.F. Gales, "Semi-tied covariance matrices, " in Proc. ICASSP, 1998, vol. II, pp. 657-660.
    • (1998) Proc. ICASSP , vol.2 , pp. 657-660
    • Gales, M.J.F.1
  • 19
    • 70450191324 scopus 로고    scopus 로고
    • A study of bootstrapping with multiple acoustic features for improved automatic speech recognition
    • X. Cui, J. Xue, B. Xiang, and B. Zhou, "A study of bootstrapping with multiple acoustic features for improved automatic speech recognition, " in Proc. INTERSPEECH, 2009, pp. 240- 243.
    • (2009) Proc. INTERSPEECH , pp. 240-243
    • Cui, X.1    Xue, J.2    Xiang, B.3    Zhou, B.4
  • 20
    • 85083953021 scopus 로고    scopus 로고
    • Feature learning in deep neural networks - studies on speech recognition tasks
    • vol. abs/1301.3605
    • D. Yu, M.L. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Feature learning in deep neural networks - studies on speech recognition tasks, " CoRR, vol. abs/1301.3605, 2013, http://arxiv.org/abs/1301.3605.
    • (2013) CoRR
    • Yu, D.1    Seltzer, M.L.2    Li, J.3    Huang, J.-T.4    Seide, F.5
  • 21
    • 77949426518 scopus 로고    scopus 로고
    • Hidden conditional random fields for phone recognition
    • Y-h. Sung and D. Jurafsky, "Hidden conditional random fields for phone recognition, " in Proc. ASRU, 2009, pp. 107-112.
    • (2009) Proc. ASRU , pp. 107-112
    • Sung, Y.-H.1    Jurafsky, D.2
  • 22
    • 70349208656 scopus 로고    scopus 로고
    • A flat direct model for speech recognition
    • G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition, " in Proc. ICASSP, 2009, pp. 3861-3864.
    • (2009) Proc. ICASSP , pp. 3861-3864
    • Heigold, G.1    Zweig, G.2    Li, X.3    Nguyen, P.4
  • 23
    • 0004158153 scopus 로고    scopus 로고
    • Speech recognition with dynamic bayesian networks
    • G. Zweig and S. Russell, "Speech recognition with dynamic Bayesian networks, " Tech. Rep., Microsoft, 1998.
    • (1998) Tech. Rep., Microsoft
    • Zweig, G.1    Russell, S.2
  • 24
    • 78649308591 scopus 로고    scopus 로고
    • Sequential labeling using deepstructured conditional random fields
    • D. Yu, S.Wang, and L. Deng, "Sequential labeling using deepstructured conditional random fields, " IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 6, pp. 965-973, 2010.
    • (2010) IEEE Journal of Selected Topics in Signal Processing , vol.4 , Issue.6 , pp. 965-973
    • Yu, D.1    Wang, S.2    Deng, L.3
  • 28
    • 84867215798 scopus 로고    scopus 로고
    • Spraak: An open source speech recognition and automatic annotation kit
    • Sept.
    • K. Demuynck, J. Roelens, D. Van Compernolle, and P. Wambacq, "SPRAAK: An open source speech recognition and automatic annotation kit, " in Proc. INTERSPEECH, Sept. 2008, p. 495.
    • (2008) Proc. INTERSPEECH , pp. 495
    • Demuynck, K.1    Roelens, J.2    Van Compernolle, D.3    Wambacq, P.4
  • 30
    • 0031211090 scopus 로고    scopus 로고
    • A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
    • Y. Freund and R.E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting, " Journal of computer and system sciences, vol. 55, no. 1, pp. 119- 139, 1997. (Pubitemid 127433398)
    • (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
    • Freund, Y.1    Schapire, R.E.2
  • 31
    • 0344666609 scopus 로고    scopus 로고
    • Boosting linear discriminant analysis for face recognition
    • J. Lu, K.N. Plataniotis, and A.N. Venetsanopoulos, "Boosting linear discriminant analysis for face recognition, " in Proc. ICIP, 2003, vol. 1, pp. 657-660.
    • (2003) Proc. ICIP , vol.1 , pp. 657-660
    • Lu, J.1    Plataniotis, K.N.2    Venetsanopoulos, A.N.3
  • 32
    • 0030677475 scopus 로고    scopus 로고
    • Speaker adaptive training: A maximum likelihood approach to speaker normalization
    • T. Anastasakos, J. McDonough, and J. Makhoul, "Speaker adaptive training: A maximum likelihood approach to speaker normalization, " in Proc. ICASSP, 1997, vol. 2, pp. 1043-1046.
    • (1997) Proc. ICASSP , vol.2 , pp. 1043-1046
    • Anastasakos, T.1    McDonough, J.2    Makhoul, J.3
  • 33
    • 84867600898 scopus 로고    scopus 로고
    • Latent variable speaker adaptation of gaussian mixture weights and means
    • X. Zhang, K. Demuynck, and H. Van hamme, "latent variable speaker adaptation of gaussian mixture weights and means, " in Proc. ICASSP, 2012, pp. 4349-4352.
    • (2012) Proc. ICASSP , pp. 4349-4352
    • Zhang, X.1    Demuynck, K.2    Van Hamme, H.3
  • 36
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and i-smoothing for improved discriminative training
    • D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training, " in Proc. ICASSP, 2002, vol. 1, pp. 105-108.
    • (2002) Proc. ICASSP , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 37
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • B. Kingsbury, "lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. ICASSP, 2009, pp. 3761-3764.
    • (2009) Proc. ICASSP , pp. 3761-3764
    • Kingsbury, B.1
  • 38
    • 80051622448 scopus 로고    scopus 로고
    • A-functions: A generalization of extended baum-welch transformations to convex optimization
    • D. Kanevsky, D. Nahamoo, T.N. Sainath, B. Ramabhadran, and P.A. Olsen, "A-functions: A generalization of extended Baum-Welch transformations to convex optimization, " in Proc. ICASSP, 2011, pp. 5164-5167.
    • (2011) Proc. ICASSP , pp. 5164-5167
    • Kanevsky, D.1    Nahamoo, D.2    Sainath, T.N.3    Ramabhadran, B.4    Olsen, P.A.5
  • 39
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. INTERSPEECH, 2011, pp. 437-440.
    • (2011) Proc. INTERSPEECH , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 40
    • 84976736715 scopus 로고
    • The hearsay-ii speech understanding system: Integrating knowledge to resolve uncertainty
    • L.D. Erman, F. Hayes-Roth, V.R. Lesser, and D.R. Reddy, "The HEARSAY-II speech understanding system: Integrating knowledge to resolve uncertainty, " Computing Surveys, vol. 12, pp. 213-253, 1980.
    • (1980) Computing Surveys , vol.12 , pp. 213-253
    • Erman, L.D.1    Hayes-Roth, F.2    Lesser, V.R.3    Reddy, D.R.4
  • 41
    • 84893686887 scopus 로고    scopus 로고
    • Attention shift decoding for conversational speech recognition
    • R. Kumaran, J. Bilmes, and K. Kirchhoff, "Attention shift decoding for conversational speech recognition, " in Proc. INTERSPEECH, 2007, pp. 1493-1496.
    • (2007) Proc. INTERSPEECH , pp. 1493-1496
    • Kumaran, R.1    Bilmes, J.2    Kirchhoff, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.