메뉴 건너뛰기




Volumn 21, Issue 11, 2013, Pages 2231-2243

Optimization algorithms and applications for speech and language processing

Author keywords

Natural language processing; Optimization methods; Speech processing

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; COMPUTATIONAL PROBLEM; NATURAL LANGUAGE PROCESSING; OPTIMIZATION ALGORITHMS; OPTIMIZATION FORMULATIONS; OPTIMIZATION METHOD; OPTIMIZATION PROCEDURES; OPTIMIZATION TECHNIQUES;

EID: 84887037596     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2283777     Document Type: Article
Times cited : (24)

References (97)
  • 1
    • 0000342467 scopus 로고
    • Statistical inference for probabilistic functions of finite state Markov chains
    • L. E. Baum and T. Petrie, "Statistical inference for probabilistic functions of finite state Markov chains," Ann. Math. Statist., vol. 37, no. 6, pp. 1554-1563, 1966.
    • (1966) Ann. Math. Statist. , vol.37 , Issue.6 , pp. 1554-1563
    • Baum, L.E.1    Petrie, T.2
  • 2
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P.Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput., Speech, Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput., Speech, Lang. , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 4
    • 0036461035 scopus 로고    scopus 로고
    • Large scale discriminative training of hidden markov models for speech recognition
    • P. C. Woodland and D. Povey, "Large scale discriminative training of hidden markov models for speech recognition," Comput. Speech, Lang., pp. 25-47, 2002.
    • (2002) Comput. Speech, Lang. , pp. 25-47
    • Woodland, P.C.1    Povey, D.2
  • 7
    • 85009216465 scopus 로고    scopus 로고
    • A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition
    • W.Macherey and H. Ney, "A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition," in Proc. Eurospeech, 2003, pp. 493-496.
    • (2003) Proc. Eurospeech , pp. 493-496
    • Macherey, W.1    Ney, H.2
  • 8
  • 9
    • 34249656385 scopus 로고    scopus 로고
    • Discriminative estimation of subspace constrained gaussian mixture models for speech recognition
    • Jan.
    • S.Axelrod, V. Goel, R.A.Gopinath, P. A. Olsen, andK.Visweswariah, "Discriminative estimation of subspace constrained gaussian mixture models for speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 172-189
    • Axelrod, S.1    Goel, V.2    Gopinath, R.A.3    Olsen, P.A.4    Visweswariah, K.5
  • 10
    • 85032750905 scopus 로고    scopus 로고
    • Discriminative learning in sequential pattern recognition-A unifying review for optimization-oriented speech recognition
    • Sep.
    • X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition-a unifying review for optimization-oriented speech recognition," IEEE Signal Process. Mag., vol. 25, no. 5, pp. 14-36, Sep. 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.5 , pp. 14-36
    • He, X.1    Deng, L.2    Chou, W.3
  • 13
    • 80051618443 scopus 로고    scopus 로고
    • EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion
    • G. Heigold, S. Hahn, P. Lehnen, and H. Ney, "EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion," in Proc. ICASSP, 2011, pp. 4920-4923.
    • (2011) Proc. ICASSP , pp. 4920-4923
    • Heigold, G.1    Hahn, S.2    Lehnen, P.3    Ney, H.4
  • 14
    • 84877743396 scopus 로고    scopus 로고
    • Optimizing the performance of spoken language recognition with discriminative training
    • Aug.
    • V. Hautamäki, K. A. Lee, T. Kinnunen, B.Ma, and H. Li, "Optimizing the performance of spoken language recognition with discriminative training," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 8, pp. 1622-1631, Aug. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.8 , pp. 1622-1631
    • Hautamäki, V.1    Lee, K.A.2    Kinnunen, T.3    Ma, B.4    Li, H.5
  • 16
    • 29044444825 scopus 로고    scopus 로고
    • Support vector machines for speaker and language recognition
    • DOI 10.1016/j.csl.2005.06.003, PII S0885230805000318, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • W. Campbell, J. Campbell, D. Reynolds, E. Singer, and P. Torres-Carrasquillo, "Support vector machines for speaker and language recognition," Computer Speech Lang., pp. 210-229, Apr. 2006. (Pubitemid 41787537)
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISSUE , pp. 210-229
    • Campbell, W.M.1    Campbell, J.P.2    Reynolds, D.A.3    Singer, E.4    Torres-Carrasquillo, P.A.5
  • 18
    • 0000159105 scopus 로고    scopus 로고
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Aug.
    • C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
    • Lee, C.H.1    Huo, Q.2
  • 20
    • 85032751865 scopus 로고    scopus 로고
    • A geometric perspective of large-margin training of Gaussian models
    • Nov.
    • L. Xiao and L.Deng, "A geometric perspective of large-margin training of Gaussian models," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 118-123, Nov. 2010.
    • (2010) IEEE Signal Process. Mag. , vol.27 , Issue.6 , pp. 118-123
    • Xiao, L.1    Deng, L.2
  • 22
    • 84876669905 scopus 로고    scopus 로고
    • Speech-centric information processing: An optimization-oriented approach
    • May
    • X. He and L. Deng, "Speech-centric information processing: An optimization-oriented approach," Proc. IEEE, vol. 101, no. 5, pp. 1116-1135, May 2013.
    • (2013) Proc. IEEE , vol.101 , Issue.5 , pp. 1116-1135
    • He, X.1    Deng, L.2
  • 26
    • 80053182852 scopus 로고    scopus 로고
    • Trust region-based optimization for maximum mutual information estimation of hmms in speech recognition
    • Nov.
    • C. Liu, Y. Hu, L.-R. Dai, and H. Jiang, "Trust region-based optimization for maximum mutual information estimation of hmms in speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2474-2485, Nov. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.8 , pp. 2474-2485
    • Liu, C.1    Hu, Y.2    Dai, L.-R.3    Jiang, H.4
  • 27
    • 77955783938 scopus 로고    scopus 로고
    • Error approximation and minimum phone error acoustic model estimation
    • Aug.
    • M. Gibson and T. Hain, "Error approximation and minimum phone error acoustic model estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1269-1279, Aug. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1269-1279
    • Gibson, M.1    Hain, T.2
  • 31
    • 0003845417 scopus 로고
    • The present status of automatic translation of languages
    • Y. Bar-Hillel, "The present status of automatic translation of languages," Adv. Comput., pp. 158-163, 1960.
    • (1960) Adv. Comput. , pp. 158-163
    • Bar-Hillel, Y.1
  • 32
    • 85044611587 scopus 로고
    • The mathematics of statistical machine translation: Parameter estimation
    • P. Brown, S. Pietra, V. Pietra, and R.Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol. 19, no. 2, pp. 263-311, 1993.
    • (1993) Comput. Linguist. , vol.19 , Issue.2 , pp. 263-311
    • Brown, P.1    Pietra, S.2    Pietra, V.3    Mercer, R.4
  • 34
    • 84944098666 scopus 로고    scopus 로고
    • Minimum error rate training in statistical machine translation
    • F. Och, "Minimum error rate training in statistical machine translation," in Proc. ACL, 2003.
    • (2003) Proc. ACL
    • Och, F.1
  • 35
    • 85032751114 scopus 로고    scopus 로고
    • Speech recognition, machine translation, and speech translation-A unified discriminative learning paradigm
    • Sep.
    • X. He, L. Deng, and W. Chou, "Speech recognition, machine translation, and speech translation-a unified discriminative learning paradigm," IEEE Signal Process. Mag., vol. 28, no. 5, pp. 126-133, Sep. 2011.
    • (2011) IEEE Signal Process. Mag. , vol.28 , Issue.5 , pp. 126-133
    • He, X.1    Deng, L.2    Chou, W.3
  • 36
    • 80053214216 scopus 로고    scopus 로고
    • A maximum-entropy segmentation model for statistical machine translation
    • Nov.
    • D. Xiong, M. Zhang, and H. Li, "A maximum-entropy segmentation model for statistical machine translation," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2494-2505, Nov. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.8 , pp. 2494-2505
    • Xiong, D.1    Zhang, M.2    Li, H.3
  • 37
    • 84988221520 scopus 로고    scopus 로고
    • Exploiting morphology and local word reordering in english-to-turkish phrase-based statistical machine translation
    • Aug.
    • I. D. El-Kahlout and K. Oflazer, "Exploiting morphology and local word reordering in english-to-turkish phrase-based statistical machine translation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1313-1322, Aug. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1313-1322
    • El-Kahlout, I.D.1    Oflazer, K.2
  • 38
    • 84876693434 scopus 로고    scopus 로고
    • Maximum expected bleu training of phrase and lexicon translation models
    • X. He and L. Deng, "Maximum expected bleu training of phrase and lexicon translation models," in Proc. ACL, Assoc. Comput. Linguist., 2012.
    • (2012) Proc. ACL, Assoc. Comput. Linguist.
    • He, X.1    Deng, L.2
  • 40
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Commun., vol. 52, no. 1, pp. 12-40, 2010.
    • (2010) Speech Commun. , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 43
    • 79953277529 scopus 로고    scopus 로고
    • Using discrete probabilities with bhattacharyya measure for svm-based speaker verification
    • May
    • K. A. Lee, C. H. You, H. Li, T. Kinnunen, and K. C. Sim, "Using discrete probabilities with bhattacharyya measure for svm-based speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 861-870, May 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 861-870
    • Lee, K.A.1    You, C.H.2    Li, H.3    Kinnunen, T.4    Sim, K.C.5
  • 45
    • 84876676725 scopus 로고    scopus 로고
    • Spoken language recognition: From fundamentals to practice
    • May
    • H. Li, B. Ma, and K. A. Lee, "Spoken language recognition: From fundamentals to practice," Proc. IEEE, vol. 101, no. 5, pp. 1136-1159, May 2013.
    • (2013) Proc. IEEE , vol.101 , Issue.5 , pp. 1136-1159
    • Li, H.1    Ma, B.2    Lee, K.A.3
  • 46
    • 84887109920 scopus 로고    scopus 로고
    • Vector-based spoken language classification
    • J. Benesty, M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer
    • H. Li, B. Ma, and C.-H. Lee, "Vector-based spoken language classification," in Springer Handbook of Speech Processing, J. Benesty, M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer, 2007.
    • (2007) Springer Handbook of Speech Processing
    • Li, H.1    Ma, B.2    Lee, C.-H.3
  • 47
    • 34547502608 scopus 로고    scopus 로고
    • A vector space modeling approach to spoken language identification
    • Jan.
    • H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 271-284
    • Li, H.1    Ma, B.2    Lee, C.-H.3
  • 48
    • 29044433376 scopus 로고    scopus 로고
    • Application-independent evaluation of speaker detection
    • DOI 10.1016/j.csl.2005.08.001, PII S0885230805000483, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • N. Brümmer and J. Preez, "Application-independent evaluation of speaker detection," Comput. Speech Lang., vol. 20, no. 2, pp. 230-275, 2006. (Pubitemid 41787538)
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISSUE , pp. 230-275
    • Brummer, N.1    Du Preez, J.2
  • 50
    • 85032751399 scopus 로고    scopus 로고
    • TechWare: Speaker and spoken language recognition resources
    • Nov.
    • H. Li and B.Ma, "TechWare: Speaker and spoken language recognition resources," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 139-142, Nov. 2010.
    • (2010) IEEE Signal Process. Mag. , vol.27 , Issue.6 , pp. 139-142
    • Li, H.1    Ma, B.2
  • 52
    • 37649031157 scopus 로고    scopus 로고
    • The current state of language recognition: NIST 2005 evaluation results
    • A. F. Martin and A. N. Le, "The current state of language recognition: NIST 2005 evaluation results," in Proc. Odyssey: Speaker Lang. Recogn. Workshop, 2006, pp. 1-6.
    • (2006) Proc. Odyssey: Speaker Lang. Recogn. Workshop , pp. 1-6
    • Martin, A.F.1    Le, A.N.2
  • 55
    • 70350444555 scopus 로고    scopus 로고
    • Optimizing the performance of spoken language recognition with discriminative training
    • Nov.
    • D. Zhu, H. Li, B. Ma, and C. H. Lee, "Optimizing the performance of spoken language recognition with discriminative training," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 8, pp. 1642-1653, Nov. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.8 , pp. 1642-1653
    • Zhu, D.1    Li, H.2    Ma, B.3    Lee, C.H.4
  • 56
    • 0031139839 scopus 로고    scopus 로고
    • Minimum classification error rate methods for speech recognition
    • PII S1063667697035937
    • B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997. (Pubitemid 127745998)
    • (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.-H.1    Chou, W.2    Lee, C.-H.3
  • 60
    • 0025952278 scopus 로고
    • An inequality for rational functions with applications to some statistical estimation problems
    • Jan.
    • P. S. Gopalakrishnan, D. Kanevsky, D. Nahamoo, and A. Nadas, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
    • (1991) IEEE Trans. Inf. Theory , vol.37 , Issue.1 , pp. 107-113
    • Gopalakrishnan, P.S.1    Kanevsky, D.2    Nahamoo, D.3    Nadas, A.4
  • 61
    • 0026372945 scopus 로고
    • An improvedMMIE training algorithmfor speaker-independent, small vocabulary, continuous speech recognition
    • Y. Normandin, "An improvedMMIE training algorithmfor speaker-independent, small vocabulary, continuous speech recognition," in Proc. ICASSP, 1991, pp. 537-540.
    • (1991) Proc. ICASSP , pp. 537-540
    • Normandin, Y.1
  • 64
    • 2142684272 scopus 로고    scopus 로고
    • On reversing Jensen's inequality
    • T. Jebara, "On reversing Jensen's inequality," in Proc. NIPS, 2002.
    • (2002) Proc. NIPS
    • Jebara, T.1
  • 65
    • 34249656385 scopus 로고    scopus 로고
    • Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
    • Jan.
    • S. Axelrod, V. Goel, P. Gopinath, R. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 172-189
    • Axelrod, S.1    Goel, V.2    Gopinath, P.3    Olsen, R.4    Visweswariah, K.5
  • 66
    • 70349995255 scopus 로고    scopus 로고
    • Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding
    • D. Kanevsky, T. Sainath, B. Ramabhadran, and D. Nahamoo, "Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding," in Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Kanevsky, D.1    Sainath, T.2    Ramabhadran, B.3    Nahamoo, D.4
  • 67
    • 80051622448 scopus 로고    scopus 로고
    • A-Functions: A generalization of extended Baum-Welch transformations to convex optimization
    • D. Kanevsky, D. Nahamoo, T. N. Sainath, B. Ramabhadran, and P. A. Olsen, "A-Functions: A generalization of extended Baum-Welch transformations to convex optimization," in Proc. ICASSP, 2011, pp. 5164-5167.
    • (2011) Proc. ICASSP , pp. 5164-5167
    • Kanevsky, D.1    Nahamoo, D.2    Sainath, T.N.3    Ramabhadran, B.4    Olsen, P.A.5
  • 68
    • 0035342391 scopus 로고    scopus 로고
    • Comparison of discriminative training criteria and optimization methods for speech recognition
    • DOI 10.1016/S0167-6393(00)00035-2, PII S0167639300000352
    • R. Schlüter, W. Macherey, B. Müller, and H. Ney, "Comparison of discriminative training criteria and optimization methods for speech recognition," Speech Commun., pp. 287-310, 2001. (Pubitemid 32284868)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 287-310
    • Schluter, R.1    Macherey, W.2    Muller, B.3    Ney, H.4
  • 69
    • 34547530690 scopus 로고    scopus 로고
    • Constrained line search optimization for discriminative training in speech recognition
    • C. Liu, P. Liu, H. Jiang, F. Soong, and R. Wang, "Constrained Line Search Optimization for Discriminative Training in Speech Recognition," in Proc. ICASSP, 2007, pp. 329-332.
    • (2007) Proc. ICASSP , pp. 329-332
    • Liu, C.1    Liu, P.2    Jiang, H.3    Soong, F.4    Wang, R.5
  • 70
    • 84865747510 scopus 로고    scopus 로고
    • Generalized Baum-Welch algorithm and its application to new extended Baum-Welch algorithm
    • DR. Hsiao and T. Schultz, "Generalized Baum-Welch algorithm and its application to new extended Baum-Welch algorithm," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Hsiao, D.R.1    Schultz, T.2
  • 71
    • 48849083725 scopus 로고    scopus 로고
    • Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality
    • Lisbon, Portugal, Sep.
    • M. Afify, "Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005.
    • (2005) Proc. Interspeech
    • Afify, M.1
  • 74
    • 0002210265 scopus 로고
    • On the convergence properties of the em algorithm
    • C. F. J. Wu, "On the convergence properties of the EM algorithm," Ann. Statist., vol. 11, no. 1, pp. 95-103, 1983.
    • (1983) Ann. Statist. , vol.11 , Issue.1 , pp. 95-103
    • Wu, C.F.J.1
  • 75
    • 0024610919 scopus 로고
    • Tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. Rabiner, "Tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 76
    • 0028412908 scopus 로고
    • High-performance connected digit recognition using maximum mutual information estimation
    • Apr.
    • Y. Normandin, R. Cardin, and R. Demori, "High-performance connected digit recognition using maximum mutual information estimation," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 299-311, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 299-311
    • Normandin, Y.1    Cardin, R.2    Demori, R.3
  • 78
    • 34547522070 scopus 로고    scopus 로고
    • Discriminative training for large vocabulary speech recognition usingminimumclassification error
    • Jan.
    • E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition usingminimumclassification error," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 203-223
    • McDermott, E.1    Hazen, T.J.2    Le Roux, J.3    Nakamura, A.4    Katagiri, S.5
  • 81
    • 0001573124 scopus 로고
    • Generalized iterative scaling for log-linear models
    • J. Darroch and D. Ratcliff, "Generalized iterative scaling for log-linear models," Ann. Math. Statist., vol. 43, pp. 1470-1480, 1972.
    • (1972) Ann. Math. Statist. , vol.43 , pp. 1470-1480
    • Darroch, J.1    Ratcliff, D.2
  • 83
    • 51449099268 scopus 로고    scopus 로고
    • GIS-like estimation of log-linear models with hidden variables
    • G. Heigold, T. Deselaers, R. Schlüter, andH. Ney, "GIS-like estimation of log-linear models with hidden variables," in Proc. ICASSP, 2008, pp. 4045-4048.
    • (2008) Proc. ICASSP , pp. 4045-4048
    • Heigold, G.1    Deselaers, T.2    Schlüter, R.3    Ney, H.4
  • 85
    • 84876672166 scopus 로고    scopus 로고
    • Machine learning paradigms for speech recognition: An overview
    • May
    • L. Deng and X. Li, "Machine learning paradigms for speech recognition: An overview," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 5, pp. 1060-1089, May 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.5 , pp. 1060-1089
    • Deng, L.1    Li, X.2
  • 86
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
    • B. Kingsbury, T. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed hessian-free optimization," in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Kingsbury, B.1    Sainath, T.2    Soltau, H.3
  • 89
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • Jan.
    • G. Dahl,D.Yu, L.Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 91
    • 84255177123 scopus 로고    scopus 로고
    • Deep and wide: Multiple layers in automatic speech recognition
    • Jan.
    • N. Morgan, "Deep and wide: Multiple layers in automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 7-13, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 7-13
    • Morgan, N.1
  • 92
    • 84875405186 scopus 로고    scopus 로고
    • Exploiting deep neural networks for detection-based speech recognition
    • M. Siniscalchi, L. Deng, D. Yu, and C.-H. Lee, "Exploiting deep neural networks for detection-based speech recognition," Neurocomputing, pp. 148-157, 2013.
    • (2013) Neurocomputing , pp. 148-157
    • Siniscalchi, M.1    Deng, L.2    Yu, D.3    Lee, C.-H.4
  • 93
    • 84865768819 scopus 로고    scopus 로고
    • Deep convex network: A scalable architecture for speech pattern classification
    • L. Deng and D. Yu, "Deep convex network: A scalable architecture for speech pattern classification," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Deng, L.1    Yu, D.2
  • 97
    • 84890526837 scopus 로고    scopus 로고
    • New types of deep neural network learning for speech recognition and related applications: An overview
    • L. Deng, G. E. Hinton, and B. Kingsbury, "New types of deep neural network learning for speech recognition and related applications: An overview," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Deng, L.1    Hinton, G.E.2    Kingsbury, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.