Volume 21, Issue 3, 2013, Pages 544-555

Structured SVMs for automatic speech recognition

Author keywords

large margin; log linear models; Structured support vector machines

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; CONTEXT DEPENDENT; DISCRIMINATIVE MODELS; FEATURE SPACE; GAUSSIANS; GENERATIVE MODEL; LARGE MARGIN; LARGE VOCABULARY SPEECH RECOGNITION; LOGLINEAR MODEL; PARALLELIZATION STRATEGIES; SEQUENCE CLASSIFICATION; STRUCTURED SUPPORTS; TRAINING ALGORITHMS; TRAINING PROCESS;

EID: 84872193462     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2227734     Document Type: Article
Times cited: 36

References (46)
  • 1
    • 84858977944
    • Extending noise robust structured support vector machines to larger vocabulary tasks
    • Waikoloa, Hawaii
    • S.-X. Zhang and M. J. F. Gales, "Extending noise robust structured support vector machines to larger vocabulary tasks," in Proc. ASRU, Waikoloa, Hawaii, 2011.
    • (2011) Proc. ASRU
    • Zhang, S.-X.1    Gales, M.J.F.2
  • 5
    • 70349227947
    • The application of hidden Markov models in speech recognition
    • M. Gales and S. Young, "The application of hidden Markov models in speech recognition," Foundat. Trends Signal Process., 2007.
    • (2007) Foundat. Trends Signal Process
    • Gales, M.1    Young, S.2
  • 6
    • 0036461035
    • Large scale discriminative training of hidden Markov models in speech recognition
    • Jan
    • P. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models in speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-48, Jan. 2002.
    • (2002) Comput. Speech Lang , vol.16 , Issue.1 , pp. 25-48
    • Woodland, P.1    Povey, D.2
  • 7
    • 0026982122
    • Discriminative learning for minimum error classification
    • B.-H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Process., vol. 40, no. 12, p. 3043, 1992.
    • (1992) IEEE Trans. Signal Process , vol.40 , Issue.12 , pp. 3043
    • Juang, B.-H.1    Katagiri, S.2
  • 8
    • 33645766076
    • Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition
    • W. Byrne, "Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition," IEICE Trans., vol. 89-D, no. 3, pp. 900-907, 2006.
    • (2006) IEICE Trans. , vol.89 D , Issue.3 , pp. 900-907
    • Byrne, W.1
  • 9
    • 84864038630
    • Large margin hidden Markov models for automatic speech recognition
    • F. Sha and L. K. Saul, "Large margin hidden Markov models for automatic speech recognition," Neural Inf. Process. Syst., pp. 1249-1256, 2007.
    • (2007) Neural Inf. Process. Syst , pp. 1249-1256
    • Sha, F.1    Saul, L.K.2
  • 11
    • 70349208656
    • A flat direct model for speech recognition
    • G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864.
    • (2009) Proc. ICASSP , pp. 3861-3864
    • Heigold, G.1    Zweig, G.2    Li, X.3    Nguyen, P.4
  • 12
    • 77949370075
    • A segmental CRF approach to large vocabulary continuous speech recognition
    • G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Proc. ASRU, 2009.
    • (2009) Proc. ASRU
    • Zweig, G.1    Nguyen, P.2
  • 13
    • 33947702666
    • Augmented statistical models for speech recognition
    • Toulouse, France
    • M. Layton and M. Gales, "Augmented statistical models for speech recognition," in Proc. ICASSP, Toulouse, France, 2006, pp. 129-132.
    • (2006) Proc. ICASSP , pp. 129-132
    • Layton, M.1    Gales, M.2
  • 14
    • 77957744761
    • Structured log linear models for noise robust speech recognition
    • Nov.
    • S.-X. Zhang, A. Ragni, and M. J. F. Gales, "Structured log linear models for noise robust speech recognition," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 945-948, Nov. 2010.
    • (2010) IEEE Signal Process. Lett , vol.17 , Issue.11 , pp. 945-948
    • Zhang, S.-X.1    Ragni, A.2    Gales, M.J.F.3
  • 16
    • 77950857527
    • Discriminative classifiers with adaptive kernels for noise robust speech recognition
    • M. J. F. Gales and F. Flego, "Discriminative classifiers with adaptive kernels for noise robust speech recognition," Comput. Speech Lang., vol. 24, no. 4, pp. 648-662, 2010.
    • (2010) Comput. Speech Lang , vol.24 , Issue.4 , pp. 648-662
    • Gales, M.J.F.1    Flego, F.2
  • 20
    • 69549111057
    • Cutting-plane training of structural SVMs
    • T. Joachims, T. Finley, and C.-N. J. Yu, "Cutting-plane training of structural SVMs," Mach. Learn., vol. 77, no. 1, pp. 27-59, 2009.
    • (2009) Mach. Learn , vol.77 , Issue.1 , pp. 27-59
    • Joachims, T.1    Finley, T.2    Yu, C.-N.J.3
  • 21
    • 33947666144
    • Isolated-word recognition with penalized logistic regression machines
    • O. Birkenes, T. Matsui, and K. Tanabe, "Isolated-word recognition with penalized logistic regression machines," in Proc. ICASSP, 2006, vol. 1, pp. 405-408.
    • (2006) Proc. ICASSP , vol.1 , pp. 405-408
    • Birkenes, O.1    Matsui, T.2    Tanabe, K.3
  • 22
    • 0142192295
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. Int. Conf. Mach. Learn., 2001.
    • (2001) Proc. Int. Conf. Mach. Learn
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 23
    • 78049375705
    • From flat direct models to segmental CRF models
    • G. Zweig and P. Nguyen, "From flat direct models to segmental CRF models," in Proc. ICASSP, 2010, pp. 5530-5533.
    • (2010) Proc. ICASSP , pp. 5530-5533
    • Zweig, G.1    Nguyen, P.2
  • 24
    • 0010442827
    • On the algorithmic implementation of multiclass kernel-based vector machines
    • K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines," J. Mach. Learn. Res., vol. 2, pp. 265-292, 2002.
    • (2002) J. Mach. Learn. Res , vol.2 , pp. 265-292
    • Crammer, K.1    Singer, Y.2
  • 25
    • 33645775754
    • Support vector machines for segmental minimum Bayes risk decoding of continuous speech
    • V. Venkataramani, S. Chakrabartty, and W. Byrne, "Support vector machines for segmental minimum Bayes risk decoding of continuous speech," in Proc. ASRU, 2003.
    • (2003) Proc. ASRU
    • Venkataramani, V.1    Chakrabartty, S.2    Byrne, W.3
  • 26
    • 84865756340
    • Structured support vector machines for noise robust continuous speech recognition
    • Florence, Italy
    • S.-X. Zhang and M. J. F. Gales, "Structured support vector machines for noise robust continuous speech recognition," in Proc. Interspeech, Florence, Italy, 2011, pp. 989-992.
    • (2011) Proc. Interspeech , pp. 989-992
    • Zhang, S.-X.1    Gales, M.J.F.2
  • 27
    • 84898982939
    • Exploiting generative models in discriminative classifiers
    • Cambridge, MA MIT Press
    • T. S. Jaakkola and D. Haussler, "Exploiting generative models in discriminative classifiers," in Proc. 1998 Conf. Adv. Neural Inf. Process. Syst. II, Cambridge, MA, 1999, pp. 487-493, MIT Press.
    • (1999) Proc. 1998 Conf. Adv. Neural Inf. Process. Syst. II , pp. 487-493
    • Jaakkola, T.S.1    Haussler, D.2
  • 28
    • 84858988048
    • Derivative kernels for noise robust ASR
    • Waikoloa, Hawaii
    • A. Ragni and M. J. F. Gales, "Derivative kernels for noise robust ASR," in Proc. ASRU, Waikoloa, Hawaii, 2011.
    • (2011) Proc. ASRU
    • Ragni, A.1    Gales, M.J.F.2
  • 29
    • 80051634426
    • Structured discriminative models for noise robust continuous speech recognition
    • Prague, Czech Republic
    • A. Ragni and M. J. F. Gales, "Structured discriminative models for noise robust continuous speech recognition," in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 4788-4791.
    • (2011) Proc. ICASSP , pp. 4788-4791
    • Ragni, A.1    Gales, M.J.F.2
  • 30
    • 0031268341
    • Factorial hidden Markov models
    • Z. Ghahramani and M. I. Jordan, "Factorial Hidden Markov models," Mach. Learn., vol. 29, pp. 245-273, 1997. (Pubitemid 127510040)
    • (1997) Machine Learning , vol.29 , Issue.2-3 , pp. 245-273
    • Ghahramani, Z.1    Jordan, M.I.2
  • 31
    • 84898962087
    • Semi-Markov conditional random fields for information extraction
    • S. Sarawagi and W. W. Cohen, "Semi-Markov conditional random fields for information extraction," in Proc. NIPS, 2005.
    • (2005) Proc. NIPS
    • Sarawagi, S.1    Cohen, W.W.2
  • 33
    • 71149086466
    • Learning structural SVMs with latent variables
    • C.-N. Yu and T. Joachims, "Learning structural SVMs with latent variables," in Proc. ICML, 2009.
    • (2009) Proc ICML
    • Yu, C.-N.1    Joachims, T.2
  • 35
    • 34547964973
    • Pegasos: Primal estimated sub-gradient solver for SVM
    • S. Shalev-Shwartz, Y. Singer, and N. Srebro, "Pegasos: Primal estimated sub-gradient solver for SVM," in Proc. ICML, 2007, pp. 807-814.
    • (2007) Proc ICML , pp. 807-814
    • Shalev-Shwartz, S.1    Singer, Y.2    Srebro, N.3
  • 36
    • 34547969126
    • Exponentiated gradient algorithms for log-linear structured prediction
    • A. Globerson, T. Y. Koo, X. Carreras, and M. Collins, "Exponentiated gradient algorithms for log-linear structured prediction," in Proc. ICML, 2007, pp. 305-312.
    • (2007) Proc ICML , pp. 305-312
    • Globerson, A.1    Koo, T.Y.2    Carreras, X.3    Collins, M.4
  • 37
    • 24944537843
    • Large margin methods for structured and interdependent output variables
    • I. Tsochantaridis, T. Joachims, T. Hofmann, and Y. Altun, "Large margin methods for structured and interdependent output variables," J. Mach. Learn. Res., vol. 6, pp. 1453-1484, 2005.
    • (2005) J. Mach. Learn. Res , vol.6 , pp. 1453-1484
    • Tsochantaridis, I.1    Joachims, T.2    Hofmann, T.3    Altun, Y.4
  • 39
    • 64149098818
    • Approximate test risk bound minimization through soft margin estimation
    • Nov
    • J. Li, M. Yuan, and C.-H. Lee, "Approximate test risk bound minimization through soft margin estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2393-2404, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2393-2404
    • Li, J.1    Yuan, M.2    Lee, C.-H.3
  • 42
    • 0030359637
    • Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
    • M. J. F. Gales, D. Pye, and P. Woodland, "Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation," in Proc. ICSLP, 1996, vol. 3, pp. 1832-1835.
    • (1996) Proc. ICSLP , vol.3 , pp. 1832-1835
    • Gales, M.J.F.1    Pye, D.2    Woodland, P.3
  • 43
    • 85009113852
    • HMM Adaptation using vector Taylor series for noisy speech recognition
    • Beijing, China
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM Adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, Beijing, China, 2000.
    • (2000) Proc. ICSLP
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 45
    • 70349194599
    • Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
    • O. Kalinli, M. L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition," in Proc. ICASSP, 2009, pp. 3825-3828.
    • (2009) Proc. ICASSP , pp. 3825-3828
    • Kalinli, O.1    Seltzer, M.L.2    Acero, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.