SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 21, Issue 12, 2013, Pages 2616-2626

Investigations on an EM-Style optimization algorithm for discriminative training of HMMs

(3) Heigold, Georg a,b Ney, Hermann a Schluter, Ralf a

a RWTH AACHEN UNIVERSITY (Germany)

b GOOGLE INC (United States)

Author keywords

discriminative training; Expectation maximization; generalized iterative scaling; hidden Markov model

Indexed keywords

DISCRIMINATIVE TRAINING; EXPECTATION MAXIMIZATION; GENERALIZED ITERATIVE SCALING; GRAPHEME-TO-PHONEME CONVERSION; HIDDEN MARKOV MODELS (HMMS); MAXIMUM MUTUAL INFORMATION; OPTIMIZATION ALGORITHMS; SPEECH RECOGNITION SYSTEMS;

HIDDEN MARKOV MODELS; ITERATIVE METHODS; MAXIMUM PRINCIPLE; OPTIMIZATION; PARAMETER ESTIMATION; SPEECH RECOGNITION; STEREOPHONIC BROADCASTING;

ALGORITHMS;

EID: 84887376734 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2013.2280234 Document Type: Article

Times cited : (7)

References (44)

1
- 0024610919
- A tutorial on hiddenMarkovmodels and selected applications in speech recognition
- Feb.
- L. R. Rabiner, "A tutorial on hiddenMarkovmodels and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

2
- 0003459132
- Ph.D. dissertation McGill Univ., Montreal, QC, Canada
- Y. Normandin, "Hidden Markov models, maximum mutual information, and the speech recognition problem," Ph.D. dissertation, McGill Univ., Montreal, QC, Canada, 1991.
- (1991) Hidden Markov Models, Maximum Mutual Information, and the Speech Recognition Problem
- Normandin, Y.¹

3
- 0036461035
- Large scale discriminative training of hidden Markov models for speech recognition
- P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-48, 2002.
- (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 25-48
- Woodl, P.C.¹ Povey, D.²

4
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- Las Vegas, NV, USA, Apr.
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature-space discriminative training," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Las Vegas, NV, USA, Apr. 2008, pp. 4057-4060.
- (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

5
- 34547522070
- Discriminative training for large vocabulary speech recognition using minimum classification error
- Jan.
- E. McDermott, T. Hazen, J. L. Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 203-223
- McDermott, E.¹ Hazen, T.² Roux, J.L.³ Nakamura, A.⁴ Katagiri, S.⁵

6
- 33745208000
- Investigations on error minimizing training criteria for discriminative training in automatic speech recognition
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- W. Macherey, L. Haferkamp, R. Schlüter, and H. Ney, "Investigations on errorminimizing training criteria for discriminative training in automatic speech recognition," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 2133-2136. (Pubitemid 43908515)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 2133-2136
- Macherey, W.¹ Haferkamp, L.² Schluter, R.³ Ney, H.⁴

7
- 4544265717
- Ph.D. dissertation Cambridge, U.K.
- D. Povey, "Discriminative training for large vocabulary speech recognition," Ph.D. dissertation, Cambridge, U.K., 2004.
- (2004) Discriminative training for large vocabulary speech recognition
- Povey, D.¹

8
- 0026372945
- An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition
- Toronto, ON, Canada May
- Y. Normandin and S. Morgera, "An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition," in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Toronto, ON, Canada, May 1991, pp. 537-540.
- (1991) IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 537-540
- Normandin, Y.¹ Morgera, S.²

9
- 0009643121
- Ph.D. dissertation RWTH Aachen Univ., Aachen, Germany Sep.
- R. Schlüter, "Investigations on discriminative training criteria," Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany, Sep. 2000.
- (2000) Investigations on Discriminative Training Criteria
- Schlüter, R.¹

10
- 33745200532
- Discriminative training with tied covariancematrices
- Jeju Island,Korea, Oct.
- W. Macherey, R. Schlüter, and H. Ney, "Discriminative training with tied covariancematrices," in Proc. Interspeech, Jeju Island,Korea,Oct. 2004, pp. 681-684.
- (2004) Proc. Interspeech , pp. 681-684
- MacHerey, W.¹ Schlüter, R.² Ney, H.³

11
- 70349197696
- Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition systems
- Taipei, Taiwan, Apr.
- R. Hsiao, Y.-C. Tam, and T. Schultz, "Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Taipei, Taiwan, Apr. 2009, pp. 3769-3772.
- (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 3769-3772
- Hsiao, R.¹ Tam, Y.-C.² Schultz, T.³

12
- 0025952278
- An inequality for rational functions with applications to some statistical estimation problems
- Jan.
- P. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
- (1991) IEEE Trans. Inf. Theory , vol.37 , Issue.1 , pp. 107-113
- Gopalakrishnan, P.¹ Kanevsky, D.² Nadas, A.³ Nahamoo, D.⁴

13
- 4544302567
- Extended baum welch transformations for general functions
- Montreal, QC, Canada, May
- D. Kanevsky, "Extended Baum Welch transformations for general functions," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Montreal, QC, Canada, May 2004, pp. 821-824.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 821-824
- Kanevsky, D.¹

14
- 34249656385
- Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
- Jan.
- S. Axelrod, V. Goel, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
- (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.1 , pp. 172-189
- Axelrod, S.¹ Goel, V.² Gopinath, R.³ Olsen, P.⁴ Visweswariah, K.⁵

15
- 84943274699
- A direct adaptivemethod for faster backpropagation learning: The Rprop algorithm
- San Francisco, CA, USA, Mar.-Apr.
- M. Riedmiller and H. Braun, "A direct adaptivemethod for faster backpropagation learning: The Rprop algorithm," in Proc. IEEE Int. Conf. Neural Netw. (ICNN), San Francisco, CA, USA, Mar.-Apr. 1993, pp. 586-591.
- (1993) Proc. IEEE Int. Conf. Neural Netw. (ICNN) , pp. 586-591
- Riedmiller, M.¹ Braun, H.²

16
- 80051640064
- Ph.D. dissertation RWTH Aachen Univ., Aachen, Germany, Jun.
- G. Heigold, "A log-linear discriminative modeling framework for speech recognition," Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany, Jun. 2010.
- (2010) A Log-linear Discriminative Modeling Framework for Speech Recognition
- Heigold, G.¹

17
- 0003982971
- NewYorkNY USA: Springer
- J. Nocedal and S. Wright, Numerical Optimization. NewYork,NY, USA: Springer, 1999.
- (1999) Numerical Optimization
- Nocedal, J.¹ Wright, S.²

18
- 15844401040
- New globally convergent training scheme based on the resilient propagation algorithm
- DOI 10.1016/j.neucom.2004.11.016, PII S0925231204005168
- A. D. Anastasiadis, G. D. Magoulas, and M. N. Vrahatis, "New globally convergent training scheme based on the resilient propagation algorithm," Neurocomputing, vol. 64, pp. 253-270, 2005. (Pubitemid 40425322)
- (2005) Neurocomputing , vol.64 , Issue.1-4 SPEC. ISS. , pp. 253-270
- Anastasiadis, A.D.¹ Magoulas, G.D.² Vrahatis, M.N.³

19
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. B, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.39 , Issue.B , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

20
- 0001573124
- Generalized iterative scaling for log-linear models
- J. Darroch and D. Ratcliff, "Generalized iterative scaling for log-linear models," Ann. Math. Statist., vol. 43, pp. 1470-1480, 1972.
- (1972) Ann. Math. Statist. , vol.43 , pp. 1470-1480
- Darroch, J.¹ Ratcliff, D.²

21
- 0031120321
- Inducing features of random fields
- S. Della Pietra, V. Della Pietra, and J. Lafferty, "Inducing features of random fields," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 4, pp. 380-393, Apr. 1997. (Pubitemid 127762893)
- (1997) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.19 , Issue.4 , pp. 380-393
- Pietra, S.D.¹ Pietra, V.D.² Lafferty, J.³

22
- 83755194417
- Maximum mutual information estimation of acoustic HMM emission densities
- Johns Hopkins Univ., Baltimore, MD, CLSP Research Note
- A. Gunawardana, "Maximum mutual information estimation of acoustic HMM emission densities, Center for Language and Speech Processing," Johns Hopkins Univ., Baltimore, MD, CLSP Research Note No. 40, 2001.
- (2001) Center for Language and Speech Processing , vol.40
- Gunawardana, A.¹

23
- 9444260139
- Ph.D. dissertation Mass. Inst. of Technol., Cambridge, MA, USA
- T. Jebara, "Discriminative, generative, and imitative learning," Ph.D. dissertation, Mass. Inst. of Technol., Cambridge, MA, USA, 2002.
- (2002) Discriminative, Generative, and Imitative Learning
- Jebara, T.¹

24
- 33745198221
- Extended baum-welch reestimation of Gaussian mixture models based on reverse jensen inequality
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- M. Afify, "Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality," in Interspeech, Lisbon, Portugal, Sep. 2005, pp. 1113-1116. (Pubitemid 43908261)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 1113-1116
- Afify, M.¹

25
- 1942514230
- Ph.D. dissertation Univ. Tübingen, Tübingen, Germany
- S. Riezler, "Probabilistic constraint logic programming," Ph.D. dissertation, Univ. Tübingen, Tübingen, Germany, 1998.
- (1998) Probabilistic Constraint Logic Programming
- Riezler, S.¹

26
- 80051653230
- Lexicalized stochastic modeling of constraint-based grammars using log-linear measures and em training
- Hong Kong Oct.
- S. Riezler, J. Kuhn, D. Prescher, and M. Johnson, "Lexicalized stochastic modeling of constraint-based grammars using log-linear measures and EM training," in Proc. Annu. Meeting Assoc. Comput. Linguist. (ACL), Hong Kong, Oct. 2000, pp. 480-487.
- (2000) Proc. Annu. Meeting Assoc. Comput. Linguist. (ACL) , pp. 480-487
- Riezler, S.¹ Kuhn, J.² Prescher, D.³ Johnson, M.⁴

27
- 51449099268
- GIS-like estimation of log-linear models with hidden variables
- Speech, Signal Process. (ICASSP), LasVegas,NV, USA,Apr.
- G. Heigold, T. Deselaers, R. Schlüter, andH. Ney, "GIS-like estimation of log-linear models with hidden variables," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), LasVegas,NV, USA,Apr. 2008, pp. 4045-4048.
- (2008) Proc. IEEE Int. Conf. Acoust. , pp. 4045-4048
- Heigold, G.¹ Deselaers, T.² Schlüter, R.³ Ney, H.⁴

28
- 80051618443
- EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion
- Speech, Signal Process. (ICASSP), Prague, Czech Republic, May
- G. Heigold, S. Hahn, P. Lehnen, and H. Ney, "EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Prague, Czech Republic, May 2011, pp. 4920-4923.
- (2011) Proc. IEEE Int. Conf. Acoust. , pp. 4920-4923
- Heigold, G.¹ Hahn, S.² Lehnen, P.³ Ney, H.⁴

29
- 85008035419
- Equivalence of generative and log-linear models
- Jul.
- G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schlüter, "Equivalence of generative and log-linear models," IEEE Trans. Speech Audio Process., vol. 19, no. 5, pp. 1138-1148, Jul. 2011.
- (2011) IEEE Trans. Speech Audio Process. , vol.19 , Issue.5 , pp. 1138-1148
- Heigold, G.¹ Ney, H.² Lehnen, P.³ Gass, T.⁴ Schlüter, R.⁵

30
- 33846516584
- NewYork NY USA: Springer
- C. Bishop, Pattern Recognition and Machine Learning. NewYork, NY, USA: Springer, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.¹

31
- 0002210265
- On the convergence properties of the em algorithm
- C. Wu, "On the convergence properties of the EM algorithm," Ann. Statist., vol. 11, no. 1, pp. 95-103, 1983.
- (1983) Ann. Statist. , vol.11 , Issue.1 , pp. 95-103
- Wu, C.¹

32
- 0036352806
- The latent maximum entropy principle
- S. Wang, D. Schuurmans, and Y. Zhao, "The latent maximum entropy principle," in Proc. IEEE Int. Symp. Inf. Theory (ISIT), Lausanne, Switzerland, Jun.-Jul. 2002, p. 131. (Pubitemid 34964752)
- (2002) IEEE International Symposium on Information Theory - Proceedings , pp. 131
- Wang, S.¹ Rosenfeld, R.² Zhao, Y.³ Schuurmans, D.⁴

33
- 33947618431
- Hidden conditional random fields for phone classification
- Sep.
- A. Gunawardana, M. Mahajan, A. Acero, and J. Platt, "Hidden conditional random fields for phone classification," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 117-120.
- (2005) Proc. Interspeech, Lisbon, Portugal , pp. 117-120
- Gunawardana, A.¹ Mahajan, M.² Acero, A.³ Platt, J.⁴

34
- 78649262962
- Margin-based discriminative training for string recognition
- Dec.
- G. Heigold, P. Dreuw, S. Hahn, R. Schlüter, and H. Ney, "Margin-based discriminative training for string recognition," IEEE J. Sel. Topics Signal Process., vol. 4, no. 6, pp. 917-925, Dec. 2010.
- (2010) IEEE J. Sel. Topics Signal Process. , vol.4 , Issue.6 , pp. 917-925
- Heigold, G.¹ Dreuw, P.² Hahn, S.³ Schlüter, R.⁴ Ney, H.⁵

35
- 84966275544
- Minimization of functions having Lipschitz continuous first derivatives
- L. Armijo, "Minimization of functions having Lipschitz continuous first derivatives," Pacific J. Math., vol. 16, pp. 1-3, 1966.
- (1966) Pacific J. Math. , vol.16 , pp. 1-3
- Armijo, L.¹

36
- 33947716431
- Beitrag zur Theorie des Ferromagnetismus
- E. Ising, "Beitrag zur Theorie des Ferromagnetismus," Z. Phys., vol. 31, pp. 253-258, 1925.
- (1925) Z. Phys. , vol.31 , pp. 253-258
- Ising, E.¹

37
- 56449091292
- Modified MMI/MPE: A direct evaluation of the margin in speech recognition
- Helsinki, Finland Jul.
- G. Heigold, T. Deselaers, R. Schlüter, and H. Ney, "Modified MMI/MPE: A direct evaluation of the margin in speech recognition," in Proc. Int. Conf. Mach. Learn. (ICML), Helsinki, Finland, Jul. 2008, pp. 384-391.
- (2008) Proc. Int. Conf. Mach. Learn. (ICML) , pp. 384-391
- Heigold, G.¹ Deselaers, T.² Schlüter, R.³ Ney, H.⁴

38
- 70349226871
- Modified MPE/MMI in a transducer-based framework
- Speech, Signal Process. (ICASSP), Taipei, Taiwan, Apr.
- G. Heigold, R. Schlüter, and H. Ney, "Modified MPE/MMI in a transducer-based framework," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Taipei, Taiwan, Apr. 2009, pp. 3749-3752.
- (2009) Proc. IEEE Int. Conf. Acoust. , pp. 3749-3752
- Heigold, G.¹ Schlüter, R.² Ney, H.³

39
- 34447280375
- Deformation models for image recognition
- DOI 10.1109/TPAMI.2007.1153
- D. Keysers, T. Deselaers, C. Gollan, and H. Ney, "Deformation models for image recognition," IEEE Trans. Pattern Anal. Mach. Intell., vol. 29, no. 8, pp. 1422-1435, Aug. 2007. (Pubitemid 47040440)
- (2007) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.29 , Issue.8 , pp. 1422-1435
- Keysers, D.¹ Deselaers, T.² Gollan, C.³ Ney, H.⁴

40
- 0002711083
- Text chunking using transformationbased learning
- Cambridge, MA, USA Jun.
- L. Ramshaw and M. Marcus, "Text chunking using transformationbased learning," in Proc. 3rd Workshop Very Large Corpora, Cambridge, MA, USA, Jun. 1995, pp. 84-94.
- (1995) Proc. 3rd Workshop Very Large Corpora , pp. 84-94
- Ramshaw, L.¹ Marcus, M.²

41
- 0030362755
- A comparative study of linear feature transformation techniques for automatic speech recognition
- Philadelphia, PA, USA Oct.
- T. Eisele, R. Haeb-Umbach, and D. Langmann, "A comparative study of linear feature transformation techniques for automatic speech recognition," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), Philadelphia, PA, USA, Oct. 1996, pp. 252-255.
- (1996) Proc. Int. Conf. Spoken Lang. Process. (ICSLP) , pp. 252-255
- Eisele, T.¹ Haeb-Umbach, R.² Langmann, D.³

42
- 51449100910
- Ph.D. dissertation Faculty of Eng., Univ. of Sheffield, Sheffield, U.K.
- Y. H. Abdel-Haleem, "Conditional random fields for continuous speech recognition," Ph.D. dissertation, Faculty of Eng., Univ. of Sheffield, Sheffield, U.K., 2006.
- (2006) Conditional Random Fields for Continuous Speech Recognition
- Abdel-Haleem, Y.H.¹

43
- 85162533997
- A convergence analysis of log-linear training
- Cambridge, MA, USA: MIT Press, Dec.
- S.Wiesler and H. Ney, "A convergence analysis of log-linear training," in Advances in Neural Information Processing Systems (NIPS). Cambridge, MA, USA: MIT Press, Dec. 2011, pp. 657-665.
- (2011) Advances in Neural Information Processing Systems (NIPS) , pp. 657-665
- Wiesler, S.¹ Ney, H.²

44
- 84887388950
- An empirical study of learning rates in deep neural networks for speech recognition
- Vancouver, BC, Canada, Apr.
- A. Senior, G. Heigold, M. Ranzato, and K. Yang, "An empirical study of learning rates in deep neural networks for speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Vancouver, BC, Canada, Apr. 2013, vol. 1, pp. 6724-6728.
- (2013) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.1 , pp. 6724-6728
- Senior, A.¹ Heigold, G.² Ranzato, M.³ Yang, K.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.