메뉴 건너뛰기




Volumn 21, Issue 12, 2013, Pages 2616-2626

Investigations on an EM-Style optimization algorithm for discriminative training of HMMs

Author keywords

discriminative training; Expectation maximization; generalized iterative scaling; hidden Markov model

Indexed keywords

DISCRIMINATIVE TRAINING; EXPECTATION MAXIMIZATION; GENERALIZED ITERATIVE SCALING; GRAPHEME-TO-PHONEME CONVERSION; HIDDEN MARKOV MODELS (HMMS); MAXIMUM MUTUAL INFORMATION; OPTIMIZATION ALGORITHMS; SPEECH RECOGNITION SYSTEMS;

EID: 84887376734     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2280234     Document Type: Article
Times cited : (7)

References (44)
  • 1
    • 0024610919 scopus 로고
    • A tutorial on hiddenMarkovmodels and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hiddenMarkovmodels and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 3
    • 0036461035 scopus 로고    scopus 로고
    • Large scale discriminative training of hidden Markov models for speech recognition
    • P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-48, 2002.
    • (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 25-48
    • Woodl, P.C.1    Povey, D.2
  • 5
    • 34547522070 scopus 로고    scopus 로고
    • Discriminative training for large vocabulary speech recognition using minimum classification error
    • Jan.
    • E. McDermott, T. Hazen, J. L. Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 203-223
    • McDermott, E.1    Hazen, T.2    Roux, J.L.3    Nakamura, A.4    Katagiri, S.5
  • 6
    • 33745208000 scopus 로고    scopus 로고
    • Investigations on error minimizing training criteria for discriminative training in automatic speech recognition
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • W. Macherey, L. Haferkamp, R. Schlüter, and H. Ney, "Investigations on errorminimizing training criteria for discriminative training in automatic speech recognition," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 2133-2136. (Pubitemid 43908515)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 2133-2136
    • Macherey, W.1    Haferkamp, L.2    Schluter, R.3    Ney, H.4
  • 8
    • 0026372945 scopus 로고
    • An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition
    • Toronto, ON, Canada May
    • Y. Normandin and S. Morgera, "An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition," in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Toronto, ON, Canada, May 1991, pp. 537-540.
    • (1991) IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 537-540
    • Normandin, Y.1    Morgera, S.2
  • 10
    • 33745200532 scopus 로고    scopus 로고
    • Discriminative training with tied covariancematrices
    • Jeju Island,Korea, Oct.
    • W. Macherey, R. Schlüter, and H. Ney, "Discriminative training with tied covariancematrices," in Proc. Interspeech, Jeju Island,Korea,Oct. 2004, pp. 681-684.
    • (2004) Proc. Interspeech , pp. 681-684
    • MacHerey, W.1    Schlüter, R.2    Ney, H.3
  • 11
    • 70349197696 scopus 로고    scopus 로고
    • Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition systems
    • Taipei, Taiwan, Apr.
    • R. Hsiao, Y.-C. Tam, and T. Schultz, "Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Taipei, Taiwan, Apr. 2009, pp. 3769-3772.
    • (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 3769-3772
    • Hsiao, R.1    Tam, Y.-C.2    Schultz, T.3
  • 12
    • 0025952278 scopus 로고
    • An inequality for rational functions with applications to some statistical estimation problems
    • Jan.
    • P. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
    • (1991) IEEE Trans. Inf. Theory , vol.37 , Issue.1 , pp. 107-113
    • Gopalakrishnan, P.1    Kanevsky, D.2    Nadas, A.3    Nahamoo, D.4
  • 13
  • 14
    • 34249656385 scopus 로고    scopus 로고
    • Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
    • Jan.
    • S. Axelrod, V. Goel, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
    • (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.1 , pp. 172-189
    • Axelrod, S.1    Goel, V.2    Gopinath, R.3    Olsen, P.4    Visweswariah, K.5
  • 15
    • 84943274699 scopus 로고
    • A direct adaptivemethod for faster backpropagation learning: The Rprop algorithm
    • San Francisco, CA, USA, Mar.-Apr.
    • M. Riedmiller and H. Braun, "A direct adaptivemethod for faster backpropagation learning: The Rprop algorithm," in Proc. IEEE Int. Conf. Neural Netw. (ICNN), San Francisco, CA, USA, Mar.-Apr. 1993, pp. 586-591.
    • (1993) Proc. IEEE Int. Conf. Neural Netw. (ICNN) , pp. 586-591
    • Riedmiller, M.1    Braun, H.2
  • 18
    • 15844401040 scopus 로고    scopus 로고
    • New globally convergent training scheme based on the resilient propagation algorithm
    • DOI 10.1016/j.neucom.2004.11.016, PII S0925231204005168
    • A. D. Anastasiadis, G. D. Magoulas, and M. N. Vrahatis, "New globally convergent training scheme based on the resilient propagation algorithm," Neurocomputing, vol. 64, pp. 253-270, 2005. (Pubitemid 40425322)
    • (2005) Neurocomputing , vol.64 , Issue.1-4 SPEC. ISS. , pp. 253-270
    • Anastasiadis, A.D.1    Magoulas, G.D.2    Vrahatis, M.N.3
  • 19
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the em algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. B, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.39 , Issue.B , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 20
    • 0001573124 scopus 로고
    • Generalized iterative scaling for log-linear models
    • J. Darroch and D. Ratcliff, "Generalized iterative scaling for log-linear models," Ann. Math. Statist., vol. 43, pp. 1470-1480, 1972.
    • (1972) Ann. Math. Statist. , vol.43 , pp. 1470-1480
    • Darroch, J.1    Ratcliff, D.2
  • 22
    • 83755194417 scopus 로고    scopus 로고
    • Maximum mutual information estimation of acoustic HMM emission densities
    • Johns Hopkins Univ., Baltimore, MD, CLSP Research Note
    • A. Gunawardana, "Maximum mutual information estimation of acoustic HMM emission densities, Center for Language and Speech Processing," Johns Hopkins Univ., Baltimore, MD, CLSP Research Note No. 40, 2001.
    • (2001) Center for Language and Speech Processing , vol.40
    • Gunawardana, A.1
  • 24
    • 33745198221 scopus 로고    scopus 로고
    • Extended baum-welch reestimation of Gaussian mixture models based on reverse jensen inequality
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • M. Afify, "Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality," in Interspeech, Lisbon, Portugal, Sep. 2005, pp. 1113-1116. (Pubitemid 43908261)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 1113-1116
    • Afify, M.1
  • 26
    • 80051653230 scopus 로고    scopus 로고
    • Lexicalized stochastic modeling of constraint-based grammars using log-linear measures and em training
    • Hong Kong Oct.
    • S. Riezler, J. Kuhn, D. Prescher, and M. Johnson, "Lexicalized stochastic modeling of constraint-based grammars using log-linear measures and EM training," in Proc. Annu. Meeting Assoc. Comput. Linguist. (ACL), Hong Kong, Oct. 2000, pp. 480-487.
    • (2000) Proc. Annu. Meeting Assoc. Comput. Linguist. (ACL) , pp. 480-487
    • Riezler, S.1    Kuhn, J.2    Prescher, D.3    Johnson, M.4
  • 27
    • 51449099268 scopus 로고    scopus 로고
    • GIS-like estimation of log-linear models with hidden variables
    • Speech, Signal Process. (ICASSP), LasVegas,NV, USA,Apr.
    • G. Heigold, T. Deselaers, R. Schlüter, andH. Ney, "GIS-like estimation of log-linear models with hidden variables," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), LasVegas,NV, USA,Apr. 2008, pp. 4045-4048.
    • (2008) Proc. IEEE Int. Conf. Acoust. , pp. 4045-4048
    • Heigold, G.1    Deselaers, T.2    Schlüter, R.3    Ney, H.4
  • 28
    • 80051618443 scopus 로고    scopus 로고
    • EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion
    • Speech, Signal Process. (ICASSP), Prague, Czech Republic, May
    • G. Heigold, S. Hahn, P. Lehnen, and H. Ney, "EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Prague, Czech Republic, May 2011, pp. 4920-4923.
    • (2011) Proc. IEEE Int. Conf. Acoust. , pp. 4920-4923
    • Heigold, G.1    Hahn, S.2    Lehnen, P.3    Ney, H.4
  • 31
    • 0002210265 scopus 로고
    • On the convergence properties of the em algorithm
    • C. Wu, "On the convergence properties of the EM algorithm," Ann. Statist., vol. 11, no. 1, pp. 95-103, 1983.
    • (1983) Ann. Statist. , vol.11 , Issue.1 , pp. 95-103
    • Wu, C.1
  • 35
    • 84966275544 scopus 로고
    • Minimization of functions having Lipschitz continuous first derivatives
    • L. Armijo, "Minimization of functions having Lipschitz continuous first derivatives," Pacific J. Math., vol. 16, pp. 1-3, 1966.
    • (1966) Pacific J. Math. , vol.16 , pp. 1-3
    • Armijo, L.1
  • 36
    • 33947716431 scopus 로고
    • Beitrag zur Theorie des Ferromagnetismus
    • E. Ising, "Beitrag zur Theorie des Ferromagnetismus," Z. Phys., vol. 31, pp. 253-258, 1925.
    • (1925) Z. Phys. , vol.31 , pp. 253-258
    • Ising, E.1
  • 37
    • 56449091292 scopus 로고    scopus 로고
    • Modified MMI/MPE: A direct evaluation of the margin in speech recognition
    • Helsinki, Finland Jul.
    • G. Heigold, T. Deselaers, R. Schlüter, and H. Ney, "Modified MMI/MPE: A direct evaluation of the margin in speech recognition," in Proc. Int. Conf. Mach. Learn. (ICML), Helsinki, Finland, Jul. 2008, pp. 384-391.
    • (2008) Proc. Int. Conf. Mach. Learn. (ICML) , pp. 384-391
    • Heigold, G.1    Deselaers, T.2    Schlüter, R.3    Ney, H.4
  • 38
    • 70349226871 scopus 로고    scopus 로고
    • Modified MPE/MMI in a transducer-based framework
    • Speech, Signal Process. (ICASSP), Taipei, Taiwan, Apr.
    • G. Heigold, R. Schlüter, and H. Ney, "Modified MPE/MMI in a transducer-based framework," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Taipei, Taiwan, Apr. 2009, pp. 3749-3752.
    • (2009) Proc. IEEE Int. Conf. Acoust. , pp. 3749-3752
    • Heigold, G.1    Schlüter, R.2    Ney, H.3
  • 40
    • 0002711083 scopus 로고
    • Text chunking using transformationbased learning
    • Cambridge, MA, USA Jun.
    • L. Ramshaw and M. Marcus, "Text chunking using transformationbased learning," in Proc. 3rd Workshop Very Large Corpora, Cambridge, MA, USA, Jun. 1995, pp. 84-94.
    • (1995) Proc. 3rd Workshop Very Large Corpora , pp. 84-94
    • Ramshaw, L.1    Marcus, M.2
  • 41
    • 0030362755 scopus 로고    scopus 로고
    • A comparative study of linear feature transformation techniques for automatic speech recognition
    • Philadelphia, PA, USA Oct.
    • T. Eisele, R. Haeb-Umbach, and D. Langmann, "A comparative study of linear feature transformation techniques for automatic speech recognition," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), Philadelphia, PA, USA, Oct. 1996, pp. 252-255.
    • (1996) Proc. Int. Conf. Spoken Lang. Process. (ICSLP) , pp. 252-255
    • Eisele, T.1    Haeb-Umbach, R.2    Langmann, D.3
  • 43
    • 85162533997 scopus 로고    scopus 로고
    • A convergence analysis of log-linear training
    • Cambridge, MA, USA: MIT Press, Dec.
    • S.Wiesler and H. Ney, "A convergence analysis of log-linear training," in Advances in Neural Information Processing Systems (NIPS). Cambridge, MA, USA: MIT Press, Dec. 2011, pp. 657-665.
    • (2011) Advances in Neural Information Processing Systems (NIPS) , pp. 657-665
    • Wiesler, S.1    Ney, H.2
  • 44
    • 84887388950 scopus 로고    scopus 로고
    • An empirical study of learning rates in deep neural networks for speech recognition
    • Vancouver, BC, Canada, Apr.
    • A. Senior, G. Heigold, M. Ranzato, and K. Yang, "An empirical study of learning rates in deep neural networks for speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Vancouver, BC, Canada, Apr. 2013, vol. 1, pp. 6724-6728.
    • (2013) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.1 , pp. 6724-6728
    • Senior, A.1    Heigold, G.2    Ranzato, M.3    Yang, K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.