SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 21, Issue 3, 2013, Pages 544-555

Structured SVMs for automatic speech recognition

(2) Zhang, Shi Xiong a Gales, Mark J F a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

large margin; log linear models; Structured support vector machines

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; CONTEXT DEPENDENT; DISCRIMINATIVE MODELS; FEATURE SPACE; GAUSSIANS; GENERATIVE MODEL; LARGE MARGIN; LARGE VOCABULARY SPEECH RECOGNITION; LOGLINEAR MODEL; PARALLELIZATION STRATEGIES; SEQUENCE CLASSIFICATION; STRUCTURED SUPPORTS; TRAINING ALGORITHMS; TRAINING PROCESS;

HIDDEN MARKOV MODELS; SPEECH RECOGNITION;

SUPPORT VECTOR MACHINES;

EID: 84872193462 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2227734 Document Type: Article

Times cited : (36)

References (46)

1
- 84858977944
- Extending noise robust structured support vector machines to larger vocabulary tasks
- Waikoloa, Hawaii
- S.-X. Zhang and M. J. F. Gales, "Extending noise robust structured support vector machines to larger vocabulary tasks," in Proc. ASRU, Waikoloa, Hawaii, 2011.
- (2011) Proc. ASRU
- Zhang, S.-X.¹ Gales, M.J.F.²

2
- 34047266379
- Progress in the CU-HTK broadcast news transcription system
- DOI 10.1109/TASL.2006.878264
- M. J. F. Gales, D. Kim, P. Woodland, H. Chan, D. Mrva, R. Sinha, and S. Tranter, "Progress in the CU-HTK broadcast news transcription system," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1513-1525, Sep. 2006. (Pubitemid 46547578)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1513-1525
- Gales, M.J.F.¹ Kim, D.Y.² Woodland, P.C.³ Chan, H.Y.⁴ Mrva, D.⁵ Sinha, R.⁶ Tranter, S.E.⁷

3
- 31844442382
- Learning structured prediction models: A large margin approach
- B. Taskar, V. Chatalbashev, D. Koller, and C. Guestrin, "Learning structured prediction models: A large margin approach," in Proc. Int. Conf. Mach. Learn., 2005, pp. 896-903.
- (2005) Proc. Int. Conf. Mach. Learn , pp. 896-903
- Taskar, B.¹ Chatalbashev, V.² Koller, D.³ Guestrin, C.⁴

4
- 36849072723
- Cambridge MA MIT Press
- G. H. Bakir, T. Hofmann, B. Schölkopf, A. J. Smola, B. Taskar, and S. V. N. Vishwanathan, Predicting Structured Data (Neural Information Processing). Cambridge, MA: MIT Press, 2007.
- (2007) Predicting Structured Data (Neural Information Processing)
- Bakir, G.H.¹ Hofmann, T.² Schölkopf, B.³ Smola, A.J.⁴ Taskar, B.⁵ Vishwanathan, S.V.N.⁶

5
- 70349227947
- The application of hidden Markov models in speech recognition
- M. Gales and S. Young, "The application of hidden Markov models in speech recognition," Foundat. Trends Signal Process., 2007.
- (2007) Foundat. Trends Signal Process
- Gales, M.¹ Young, S.²

6
- 0036461035
- Large scale discriminative training of hidden Markov models in speech recognition
- Jan
- P. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models in speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-48, Jan. 2002.
- (2002) Comput. Speech Lang , vol.16 , Issue.1 , pp. 25-48
- Woodland, P.¹ Povey, D.²

7
- 0026982122
- Discriminative learning for minimum error classification
- B.-H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Process., vol. 40, no. 12, p. 3043, 1992.
- (1992) IEEE Trans. Signal Process , vol.40 , Issue.12 , pp. 3043
- Juang, B.-H.¹ Katagiri, S.²

8
- 33645766076
- Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition
- W. Byrne, "Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition," IEICE Trans., vol. 89-D, no. 3, pp. 900-907, 2006.
- (2006) IEICE Trans. , vol.89 D , Issue.3 , pp. 900-907
- Byrne, W.¹

9
- 84864038630
- Large margin hidden Markov models for automatic speech recognition
- F. Sha and L. K. Saul, "Large margin hidden Markov models for automatic speech recognition," Neural Inf. Process. Syst., pp. 1249-1256, 2007.
- (2007) Neural Inf. Process. Syst , pp. 1249-1256
- Sha, F.¹ Saul, L.K.²

10
- 84865788127
- Direct error rate minimization of hidden Markov models
- J. Keshet, C.-C. Cheng, M. Stoehr, and D. A. McAllester, "Direct error rate minimization of hidden Markov models," in Proc. Interspeech, 2011, pp. 449-452.
- (2011) Proc. Interspeech , pp. 449-452
- Keshet, J.¹ Cheng, C.-C.² Stoehr, M.³ McAllester, D.A.⁴

11
- 70349208656
- A flat direct model for speech recognition
- G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864.
- (2009) Proc. ICASSP , pp. 3861-3864
- Heigold, G.¹ Zweig, G.² Li, X.³ Nguyen, P.⁴

12
- 77949370075
- A segmental CRF approach to large vocabulary continuous speech recognition
- G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Proc. ASRU, 2009.
- (2009) Proc. ASRU
- Zweig, G.¹ Nguyen, P.²

13
- 33947702666
- Augmented statistical models for speech recognition
- Toulouse, France
- M. Layton and M. Gales, "Augmented statistical models for speech recognition," in Proc. ICASSP, Toulouse, France, 2006, pp. 129-132.
- (2006) Proc. ICASSP , pp. 129-132
- Layton, M.¹ Gales, M.²

14
- 77957744761
- Structured log linear models for noise robust speech recognition
- Nov.
- S.-X. Zhang, A. Ragni, and M. J. F. Gales, "Structured log linear models for noise robust speech recognition," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 945-948, Nov. 2010.
- (2010) IEEE Signal Process. Lett , vol.17 , Issue.11 , pp. 945-948
- Zhang, S.-X.¹ Ragni, A.² Gales, M.J.F.³

15
- 80051640064
- Ph.D. dissertation RWTH Aachen Univ., Aachen, Germany
- G. Heigold, "A log-linear discriminative modeling framework for speech recognition," Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany, 2010.
- (2010) A Log-linear Discriminative Modeling Framework for Speech Recognition
- Heigold, G.¹

16
- 77950857527
- Discriminative classifiers with adaptive kernels for noise robust speech recognition
- M. J. F. Gales and F. Flego, "Discriminative classifiers with adaptive kernels for noise robust speech recognition," Comput. Speech Lang., vol. 24, no. 4, pp. 648-662, 2010.
- (2010) Comput. Speech Lang , vol.24 , Issue.4 , pp. 648-662
- Gales, M.J.F.¹ Flego, F.²

17
- 31844438834
- Ph.D. dissertation Stanford Univ., Stanford, CA
- B. Taskar, "Learning structured prediction models: A large margin approach," Ph.D. dissertation, Stanford Univ., Stanford, CA, 2005.
- (2005) Learning Structured Prediction Models: A Large Margin Approach
- Taskar, B.¹

18
- 84898948585
- Max-margin Markov networks
- B. Taskar, C. Guestrin, and D. Koller, "Max-margin Markov networks," in Proc. NIPS, 2004.
- (2004) Proc. NIPS
- Taskar, B.¹ Guestrin, C.² Koller, D.³

19
- 4544265717
- Ph.D. dissertation Cambridge Univ
- D. Povey, "Discriminative Training for Large Vocabulary Speech Recognition," Ph.D. dissertation, Cambridge Univ., 2004.
- (2004) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

20
- 69549111057
- Cutting-plane training of structural SVMs
- T. Joachims, T. Finley, and C.-N. J. Yu, "Cutting-plane training of structural SVMs," Mach. Learn., vol. 77, no. 1, pp. 27-59, 2009.
- (2009) Mach. Learn , vol.77 , Issue.1 , pp. 27-59
- Joachims, T.¹ Finley, T.² Yu, C.-N.J.³

21
- 33947666144
- Isolated-word recognition with penalized logistic regression machines
- O. Birkenes, T. Matsui, and K. Tanabe, "Isolated-word recognition with penalized logistic regression machines," in Proc. ICASSP, 2006, vol. 1, pp. 405-408.
- (2006) Proc. ICASSP , vol.1 , pp. 405-408
- Birkenes, O.¹ Matsui, T.² Tanabe, K.³

22
- 0142192295
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. Int. Conf. Mach. Learn., 2001.
- (2001) Proc. Int. Conf. Mach. Learn
- Lafferty, J.¹ McCallum, A.² Pereira, F.³

23
- 78049375705
- From flat direct models to segmental CRF models
- G. Zweig and P. Nguyen, "From flat direct models to segmental CRF models," in Proc. ICASSP, 2010, pp. 5530-5533.
- (2010) Proc. ICASSP , pp. 5530-5533
- Zweig, G.¹ Nguyen, P.²

24
- 0010442827
- On the algorithmic implementation of multiclass kernel-based vector machines
- K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines," J. Mach. Learn. Res., vol. 2, pp. 265-292, 2002.
- (2002) J. Mach. Learn. Res , vol.2 , pp. 265-292
- Crammer, K.¹ Singer, Y.²

25
- 33645775754
- Support vector machines for segmental minimum Bayes risk decoding of continuous speech
- V. Venkataramani, S. Chakrabartty, and W. Byrne, "Support vector machines for segmental minimum Bayes risk decoding of continuous speech," in Proc. ASRU, 2003.
- (2003) Proc. ASRU
- Venkataramani, V.¹ Chakrabartty, S.² Byrne, W.³

26
- 84865756340
- Structured support vector machines for noise robust continuous speech recognition
- Florence, Italy
- S.-X. Zhang and M. J. F. Gales, "Structured support vector machines for noise robust continuous speech recognition," in Proc. Interspeech, Florence, Italy, 2011, pp. 989-992.
- (2011) Proc. Interspeech , pp. 989-992
- Zhang, S.-X.¹ Gales, M.J.F.²

27
- 84898982939
- Exploiting generative models in discriminative classifiers
- Cambridge, MA MIT Press
- T. S. Jaakkola and D. Haussler, "Exploiting generative models in discriminative classifiers," in Proc. 1998 Conf. Adv. Neural Inf. Process. Syst. II, Cambridge, MA, 1999, pp. 487-493, MIT Press.
- (1999) Proc. 1998 Conf. Adv. Neural Inf. Process. Syst. II , pp. 487-493
- Jaakkola, T.S.¹ Haussler, D.²

28
- 84858988048
- Derivative kernels for noise robust ASR
- Waikoloa, Hawaii
- A. Ragni and M. J. F. Gales, "Derivative kernels for noise robust ASR," in Proc. ASRU, Waikoloa, Hawaii, 2011.
- (2011) Proc. ASRU
- Ragni, A.¹ Gales, M.J.F.²

29
- 80051634426
- Structured discriminative models for noise robust continuous speech recognition
- Prague, Czech Republic
- A. Ragni and M. J. F. Gales, "Structured discriminative models for noise robust continuous speech recognition," in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 4788-4791.
- (2011) Proc. ICASSP , pp. 4788-4791
- Ragni, A.¹ Gales, M.J.F.²

30
- 0031268341
- Factorial hidden Markov models
- Z. Ghahramani and M. I. Jordan, "Factorial Hidden Markov models," Mach. Learn., vol. 29, pp. 245-273, 1997. (Pubitemid 127510040)
- (1997) Machine Learning , vol.29 , Issue.2-3 , pp. 245-273
- Ghahramani, Z.¹ Jordan, M.I.²

31
- 84898962087
- Semi-Markov conditional random fields for information extraction
- S. Sarawagi and W. W. Cohen, "Semi-Markov conditional random fields for information extraction," in Proc. NIPS, 2005.
- (2005) Proc. NIPS
- Sarawagi, S.¹ Cohen, W.W.²

32
- 77955422240
- Object detection with discriminatively trained part-based models
- Sep.
- P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan, "Object detection with discriminatively trained part-based models," IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 9, pp. 1627-1645, Sep. 2010.
- (2010) IEEE Trans. Pattern Anal. Mach. Intell , vol.32 , Issue.9 , pp. 1627-1645
- Felzenszwalb, P.F.¹ Girshick, R.B.² McAllester, D.³ Ramanan, D.⁴

33
- 71149086466
- Learning structural SVMs with latent variables
- C.-N. Yu and T. Joachims, "Learning structural SVMs with latent variables," in Proc. ICML, 2009.
- (2009) Proc ICML
- Yu, C.-N.¹ Joachims, T.²

34
- 77951160349
- The concave-convex procedure (CCCP)
- Cambridge, MA MIT Press
- A. Yuille, A. Rangarajan, and A. L. Yuille, "The concave-convex procedure (CCCP)," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2002.
- (2002) Advances in Neural Information Processing Systems
- Yuille, A.¹ Rangarajan, A.² Yuille, A.L.³

35
- 34547964973
- Pegasos: Primal estimated sub-gradient solver for SVM
- Y. Singer and N. Srebro, "Pegasos: Primal estimated sub-gradient solver for SVM," in Proc. ICML, 2007, pp. 807-814.
- (2007) Proc ICML , pp. 807-814
- Singer, Y.¹ Srebro, N.²

36
- 34547969126
- Exponentiated gradient algorithms for log-linear structured prediction
- A. Globerson, T. Y. Koo, X. Carreras, and M. Collins, "Exponentiated gradient algorithms for log-linear structured prediction," in Proc. ICML, 2007, pp. 305-312.
- (2007) Proc ICML , pp. 305-312
- Globerson, A.¹ Koo, T.Y.² Carreras, X.³ Collins, M.⁴

37
- 24944537843
- Large margin methods for structured and interdependent output variables
- I. Tsochantaridis, T. Joachims, T. Hofmann, and Y. Altun, "Large margin methods for structured and interdependent output variables," J. Mach. Learn. Res., vol. 6, pp. 1453-1484, 2005.
- (2005) J. Mach. Learn. Res , vol.6 , pp. 1453-1484
- Tsochantaridis, I.¹ Joachims, T.² Hofmann, T.³ Altun, Y.⁴

38
- 9444260139
- Ph.D. dissertation Media Lab, Mass. Inst. of Technol., Cambridge, MA
- T. Jebara, "Discriminative, generative and imitative learning," Ph.D. dissertation, Media Lab, Mass. Inst. of Technol., Cambridge, MA, 2001.
- (2001) Discriminative, Generative and Imitative Learning
- Jebara, T.¹

39
- 64149098818
- Approximate test risk bound minimization through soft margin estimation
- Nov
- J. Li, M. Yuan, and C.-H. Lee, "Approximate test risk bound minimization through soft margin estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2393-2404, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2393-2404
- Li, J.¹ Yuan, M.² Lee, C.-H.³

40
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature-space discriminative training," in Proc. ICASSP, 2008, pp. 4057-4060.
- (2008) Proc. ICASSP , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

41
- 34547374592
- Ph.D. dissertation Cambridge Univ., Cambridge, U.K
- M. Layton, "Augmented statistical models for classifying sequence data," Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 2006.
- (2006) Augmented Statistical Models for Classifying Sequence Data
- Layton, M.¹

42
- 0030359637
- Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
- M. J. F. Gales, D. Pye, and P. Woodland, "Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation," in Proc. ICSLP, 1996, vol. 3, pp. 1832-1835.
- (1996) Proc. ICSLP , vol.3 , pp. 1832-1835
- Gales, M.J.F.¹ Pye, D.² Woodland, P.³

43
- 85009113852
- HMM Adaptation using vector Taylor series for noisy speech recognition
- Beijing, China
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM Adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, Beijing, China, 2000.
- (2000) Proc. ICSLP
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

44
- 34547537573
- Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR552 Nov
- H. Liao and M. Gales, "Joint uncertainty decoding for robust large vocabulary speech recognition," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR552, Nov. 2006.
- (2006) Joint Uncertainty Decoding for Robust Large Vocabulary Speech Recognition
- Liao, H.¹ Gales, M.²

45
- 70349194599
- Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
- O. Kalinli, M. L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition," in Proc. ICASSP, 2009, pp. 3825-3828.
- (2009) Proc. ICASSP , pp. 3825-3828
- Kalinli, O.¹ Seltzer, M.L.² Acero, A.³

46
- 84858961235
- Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR653 [Online]
- F. Flego and M. Gales, "Factor analysis based VTS and JUD noise estimation and compensation," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR653, 2011 [Online]. Available: http://mi.eng.cam.ac.uk/~mjfg
- (2011) Factor Analysis Based VTS and JUD Noise Estimation and Compensation
- Flego, F.¹ Gales, M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.