SCOPUS 정보 검색 플랫폼

2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings

Volumn , Issue , 2013, Pages 356-361

Porting concepts from DNNs back to GMMs

(2) Demuynck, Kris a Triefenbach, Fabian a

a GHENT UNIVERSITY (Belgium)

Author keywords

deep neural networks; deep structures; DNN; Gaussian mixture models; GMM; speech recognition

Indexed keywords

DEEP NEURAL NETWORKS; DEEP STRUCTURE; DNN; GAUSSIAN MIXTURE MODEL; GMM;

NEURAL NETWORKS;

SPEECH RECOGNITION;

EID: 84893695632 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2013.6707756 Document Type: Conference Paper

Times cited : (9)

References (41)

1
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G.E. Dahl, A-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T.N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, vol. 29, no. 6, pp. 82- 97, 2012.
- (2012) Signal Processing Magazine , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kingsbury, B.¹¹

2
- 84055211743
- Acoustic modeling using deep belief networks
- A.-R. Mohamed, G.E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. on ASLP, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) IEEE Trans. on ASLP , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.-R.¹ Dahl, G.E.² Hinton, G.³

3
- 84890466217
- Improving neural networks by preventing co-adaptation of feature detectors
- p. abs/1207.0580
- G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors, " CoRR, p. abs/1207.0580, 2012.
- (2012) CoRR
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

4
- 84886714036
- Acoustic modeling with hierarchical reservoirs
- (accepted
- F. Triefenbach, A. Jalalvand, K. Demuynck, and J-P. Martens, "Acoustic modeling with hierarchical reservoirs, " IEEE Trans. on ASLP, 2013, (accepted).
- (2013) IEEE Trans. on ASLP
- Triefenbach, F.¹ Jalalvand, A.² Demuynck, K.³ Martens, J.-P.⁴

5
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Comp. Speech and Lang., vol. 12, pp. 75-98, 1998. (Pubitemid 128383747)
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

6
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- DOI 10.1109/89.876308
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space, " IEEE Trans. on SAP, vol. 8, no. 6, pp. 695-707, 2000. (Pubitemid 32025317)
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

7
- 85009113852
- Hmm adaptation using vector taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition, " in Proc. ICSLP, 2000, pp. 229-232.
- (2000) Proc. ICSLP , pp. 229-232
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

8
- 85006734596
- Evaluation of the SPLICE algorithm on the aurora2 database
- J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the Aurora2 database, " in Proc. EUROSPEECH, 2001, pp. 217-220.
- (2001) Proc. EUROSPEECH , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

9
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- H. Liao and M.J.F Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data, " in Proc. ICASSP, 2007, pp. 389-392.
- (2007) Proc. ICASSP , pp. 389-392
- Liao, H.¹ Gales, M.J.F.²

10
- 0030364003
- Reduced semi-continuous models for large vocabulary continuous speech recognition in dutch
- K. Demuynck, J. Duchateau, and D. Van Compernolle, "Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch, " in Proc. ICSLP, 1996, vol. IV, pp. 2289-2292.
- (1996) Proc. ICSLP , vol.4 , pp. 2289-2292
- Demuynck, K.¹ Duchateau, J.² Van Compernolle, D.³

11
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G.E. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 14, pp. 1771- 1800, 2002.
- (2002) Neural Computation , vol.14 , pp. 1771-1800
- Hinton, G.E.¹

12
- 0003472470
- John Wiley & Sons
- R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis, pp. 114-121, John Wiley & Sons, 1973.
- (1973) Pattern Classification and Scene Analysis , pp. 114-121
- Duda, R.O.¹ Hart, P.E.²

13
- 0029228458
- Optimal linear feature transformations for semi-continuous hidden markov models
- G. Schukat-Talamazzini, J. Hornegger, and H. Niemann, "Optimal linear feature transformations for semi-continuous hidden Markov models, " in Proc. ICASSP, 1995, vol. I, pp. 369- 372.
- (1995) Proc. ICASSP , vol.1 , pp. 369-372
- Schukat-Talamazzini, G.¹ Hornegger, J.² Niemann, H.³

14
- 0003871508
- Ph.D. thesis, John Hopkins Univ
- N. Kumar, Investigation of Silicon-Auditory Models and Generalization of LDA for Improved Speech Recognition, Ph.D. thesis, John Hopkins Univ., 1997.
- (1997) Investigation of Silicon-Auditory Models and Generalization of LDA for Improved Speech Recognition
- Kumar, N.¹

15
- 84871612455
- Optimal feature sub-space selection based on discriminant analysis
- K. Demuynck, J. Duchateau, and D. Van Compernolle, "Optimal feature sub-space selection based on discriminant analysis, " in Proc. EUROSPEECH, 1999, vol. III, pp. 1311-1314.
- (1999) Proc. EUROSPEECH , vol.3 , pp. 1311-1314
- Demuynck, K.¹ Duchateau, J.² Van Compernolle, D.³

16
- 84893656625
- Class definition in discriminant feature analysis
- J. Duchateau, K. Demuynck, D. Van Compernolle, and P. Wambacq, "Class definition in discriminant feature analysis, " in Proc. EUROSPEECH, 2001, vol. III, pp. 1621-1624.
- (2001) Proc. EUROSPEECH , vol.3 , pp. 1621-1624
- Duchateau, J.¹ Demuynck, K.² Van Compernolle, D.³ Wambacq, P.⁴

17
- 0002235014
- Improved feature decorrelation for hmm-based speech recognition
- K. Demuynck, J. Duchateau, D. Van Compernolle, and P.Wambacq, "Improved feature decorrelation for HMM-based speech recognition, " in Proc. ICSLP, 1998, vol. VII, pp. 2907- 2910.
- (1998) Proc. ICSLP , vol.7 , pp. 2907-2910
- Demuynck, K.¹ Duchateau, J.² Van Compernolle, D.³ Wambacq, P.⁴

18
- 4243460174
- Semi-tied covariance matrices
- M.J.F. Gales, "Semi-tied covariance matrices, " in Proc. ICASSP, 1998, vol. II, pp. 657-660.
- (1998) Proc. ICASSP , vol.2 , pp. 657-660
- Gales, M.J.F.¹

19
- 70450191324
- A study of bootstrapping with multiple acoustic features for improved automatic speech recognition
- X. Cui, J. Xue, B. Xiang, and B. Zhou, "A study of bootstrapping with multiple acoustic features for improved automatic speech recognition, " in Proc. INTERSPEECH, 2009, pp. 240- 243.
- (2009) Proc. INTERSPEECH , pp. 240-243
- Cui, X.¹ Xue, J.² Xiang, B.³ Zhou, B.⁴

20
- 85083953021
- Feature learning in deep neural networks - studies on speech recognition tasks
- vol. abs/1301.3605
- D. Yu, M.L. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Feature learning in deep neural networks - studies on speech recognition tasks, " CoRR, vol. abs/1301.3605, 2013, http://arxiv.org/abs/1301.3605.
- (2013) CoRR
- Yu, D.¹ Seltzer, M.L.² Li, J.³ Huang, J.-T.⁴ Seide, F.⁵

21
- 77949426518
- Hidden conditional random fields for phone recognition
- Y-h. Sung and D. Jurafsky, "Hidden conditional random fields for phone recognition, " in Proc. ASRU, 2009, pp. 107-112.
- (2009) Proc. ASRU , pp. 107-112
- Sung, Y.-H.¹ Jurafsky, D.²

22
- 70349208656
- A flat direct model for speech recognition
- G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition, " in Proc. ICASSP, 2009, pp. 3861-3864.
- (2009) Proc. ICASSP , pp. 3861-3864
- Heigold, G.¹ Zweig, G.² Li, X.³ Nguyen, P.⁴

23
- 0004158153
- Speech recognition with dynamic bayesian networks
- G. Zweig and S. Russell, "Speech recognition with dynamic Bayesian networks, " Tech. Rep., Microsoft, 1998.
- (1998) Tech. Rep., Microsoft
- Zweig, G.¹ Russell, S.²

24
- 78649308591
- Sequential labeling using deepstructured conditional random fields
- D. Yu, S.Wang, and L. Deng, "Sequential labeling using deepstructured conditional random fields, " IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 6, pp. 965-973, 2010.
- (2010) IEEE Journal of Selected Topics in Signal Processing , vol.4 , Issue.6 , pp. 965-973
- Yu, D.¹ Wang, S.² Deng, L.³

25
- 0003548585
- The darpa timit acoustic-phonetic continuous speech corpus cd-rom
- J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett, and N. Dahlgren, "The DARPA TIMIT acoustic-phonetic continuous speech corpus CD-rom, " Tech. report, NIST, 1993.
- (1993) Tech. Report, NIST
- Garofolo, J.¹ Lamel, L.² Fisher, W.³ Fiscus, J.⁴ Pallett, D.⁵ Dahlgren, N.⁶

26
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- DOI 10.1109/29.46546
- K. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden Markov models, " IEEE Trans. on ASSP, vol. 37, pp. 1641-1648, 1989. (Pubitemid 20652953)
- (1989) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.-F.¹ Hon, H.-W.²

27
- 80051654263
- Deep belief networks using discriminative features for phone recognition
- A.R. Mohamed, T.N. Sainath, G. Dahl, B. Ramabhadran, G.E. Hinton, and M.A. Picheny, "Deep belief networks using discriminative features for phone recognition, " in Proc. ICASSP, 2011, pp. 5060-5063.
- (2011) Proc. ICASSP , pp. 5060-5063
- Mohamed, A.R.¹ Sainath, T.N.² Dahl, G.³ Ramabhadran, B.⁴ Hinton, G.E.⁵ Picheny, M.A.⁶

28
- 84867215798
- Spraak: An open source speech recognition and automatic annotation kit
- Sept.
- K. Demuynck, J. Roelens, D. Van Compernolle, and P. Wambacq, "SPRAAK: An open source speech recognition and automatic annotation kit, " in Proc. INTERSPEECH, Sept. 2008, p. 495.
- (2008) Proc. INTERSPEECH , pp. 495
- Demuynck, K.¹ Roelens, J.² Van Compernolle, D.³ Wambacq, P.⁴

29
- 44949142593
- A flexible recogniser architecture in a reading tutor for children
- J. Duchateau, M. Wigham, K. Demuynck, and H. Van hamme, "A flexible recogniser architecture in a reading tutor for children, " in Proc. ITRWon Speech Recognition and Intrinsic Variation, 2006, pp. 59-64.
- (2006) Proc. ITRWon Speech Recognition and Intrinsic Variation , pp. 59-64
- Duchateau, J.¹ Wigham, M.² Demuynck, K.³ Van Hamme, H.⁴

30
- 0031211090
- A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
- Y. Freund and R.E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting, " Journal of computer and system sciences, vol. 55, no. 1, pp. 119- 139, 1997. (Pubitemid 127433398)
- (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
- Freund, Y.¹ Schapire, R.E.²

31
- 0344666609
- Boosting linear discriminant analysis for face recognition
- J. Lu, K.N. Plataniotis, and A.N. Venetsanopoulos, "Boosting linear discriminant analysis for face recognition, " in Proc. ICIP, 2003, vol. 1, pp. 657-660.
- (2003) Proc. ICIP , vol.1 , pp. 657-660
- Lu, J.¹ Plataniotis, K.N.² Venetsanopoulos, A.N.³

32
- 0030677475
- Speaker adaptive training: A maximum likelihood approach to speaker normalization
- T. Anastasakos, J. McDonough, and J. Makhoul, "Speaker adaptive training: A maximum likelihood approach to speaker normalization, " in Proc. ICASSP, 1997, vol. 2, pp. 1043-1046.
- (1997) Proc. ICASSP , vol.2 , pp. 1043-1046
- Anastasakos, T.¹ McDonough, J.² Makhoul, J.³

33
- 84867600898
- Latent variable speaker adaptation of gaussian mixture weights and means
- X. Zhang, K. Demuynck, and H. Van hamme, "latent variable speaker adaptation of gaussian mixture weights and means, " in Proc. ICASSP, 2012, pp. 4349-4352.
- (2012) Proc. ICASSP , pp. 4349-4352
- Zhang, X.¹ Demuynck, K.² Van Hamme, H.³

34
- 0023857030
- Phonetic recognition using hidden markov models and maximum mutual information training
- B. Merialdo, "Phonetic recognition using hidden Markov models and maximum mutual information training, " in Proc. ICASSP, 1988, vol. 1, pp. 111-114. (Pubitemid 18665963)
- (1988) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , pp. 111-114
- Merialdo, B.¹

35
- 0034849080
- Improved discriminative training techniques for large vocabulary continuous speech recognition
- D. Povey and P.C. Woodland, "Improved discriminative training techniques for large vocabulary continuous speech recognition, " in Proc. ICASSP, 2001, vol. 1, pp. 45-48. (Pubitemid 32839184)
- (2001) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 45-48
- Povey, D.¹ Woodland, P.C.²

36
- 0036296863
- Minimum phone error and i-smoothing for improved discriminative training
- D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training, " in Proc. ICASSP, 2002, vol. 1, pp. 105-108.
- (2002) Proc. ICASSP , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

37
- 70349213445
- Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
- B. Kingsbury, "lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. ICASSP, 2009, pp. 3761-3764.
- (2009) Proc. ICASSP , pp. 3761-3764
- Kingsbury, B.¹

38
- 80051622448
- A-functions: A generalization of extended baum-welch transformations to convex optimization
- D. Kanevsky, D. Nahamoo, T.N. Sainath, B. Ramabhadran, and P.A. Olsen, "A-functions: A generalization of extended Baum-Welch transformations to convex optimization, " in Proc. ICASSP, 2011, pp. 5164-5167.
- (2011) Proc. ICASSP , pp. 5164-5167
- Kanevsky, D.¹ Nahamoo, D.² Sainath, T.N.³ Ramabhadran, B.⁴ Olsen, P.A.⁵

39
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. INTERSPEECH, 2011, pp. 437-440.
- (2011) Proc. INTERSPEECH , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

40
- 84976736715
- The hearsay-ii speech understanding system: Integrating knowledge to resolve uncertainty
- L.D. Erman, F. Hayes-Roth, V.R. Lesser, and D.R. Reddy, "The HEARSAY-II speech understanding system: Integrating knowledge to resolve uncertainty, " Computing Surveys, vol. 12, pp. 213-253, 1980.
- (1980) Computing Surveys , vol.12 , pp. 213-253
- Erman, L.D.¹ Hayes-Roth, F.² Lesser, V.R.³ Reddy, D.R.⁴

41
- 84893686887
- Attention shift decoding for conversational speech recognition
- R. Kumaran, J. Bilmes, and K. Kirchhoff, "Attention shift decoding for conversational speech recognition, " in Proc. INTERSPEECH, 2007, pp. 1493-1496.
- (2007) Proc. INTERSPEECH , pp. 1493-1496
- Kumaran, R.¹ Bilmes, J.² Kirchhoff, K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.