SCOPUS 정보 검색 플랫폼

IEEE Signal Processing Magazine

Volumn 29, Issue 6, 2012, Pages 18-33

Large-vocabulary continuous speech recognition systems: A look at some recent advances

(2) Saon, George a Chien, Jen Tzung b

a IBM T J WATSON RESEARCH CENTER (United States)

b NATIONAL CHENG KUNG UNIVERSITY (Taiwan)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC TELEPHONE SYSTEMS; AUTOMATION; CONTINUOUS SPEECH RECOGNITION; DEEP NEURAL NETWORKS; SPEECH; SPEECH PROCESSING; SPEECH TRANSMISSION; TRANSCRIPTION; VOCABULARY CONTROL;

BROADCAST NEWS TRANSCRIPTIONS; CHANNEL DISTORTIONS; CONVERSATIONAL TELEPHONE SPEECH; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; LARGE VOCABULARY SPEECH RECOGNITION; RECOGNITION ERROR; SPEAKER DEPENDENTS; SPEAKER INDEPENDENTS;

SPEECH RECOGNITION;

EID: 85032751472 PISSN: 10535888 EISSN: None Source Type: Journal
DOI: 10.1109/MSP.2012.2197156 Document Type: Article

Times cited : (95)

References (118)

1
- 34047266376
- Advances in speech transcription at ibm under the darpa ears program
- S. Chen, B. Kingsbury, L. Mangu, D. Povey, G. Saon, H. Soltau, and G. Zweig, "Advances in speech transcription at IBM under the DARPA EARS program," IEEE Trans. Speech Audio Processing, vol. 14, no. 5, pp. 1596-1608, 2006.
- (2006) IEEE Trans. Speech Audio Processing , vol.14 , Issue.5 , pp. 1596-1608
- Chen, S.¹ Kingsbury, B.² Mangu, L.³ Povey, D.⁴ Saon, G.⁵ Soltau, H.⁶ Zweig, G.⁷

2
- 4544236272
- Development of the 2003 cu-htk conversational telephone speech transcription system
- G. Evermann, H. Y. Chan, M. J. F. Gales, T. Hain, X. Liu, D. Mrva, L. Wang, and P. C. Woodland, "Development of the 2003 CU-HTK conversational telephone speech transcription system," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2004, pp. 249-252.
- (2004) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 249-252
- Evermann, G.¹ Chan, H.Y.² Gales, M.J.F.³ Hain, T.⁴ Liu, X.⁵ Mrva, D.⁶ Wang, L.⁷ Woodland, P.C.⁸

3
- 34147119672
- Advances in transcription of broadcast news and conversational telephone speech within the combined ears bbn/limsi system
- S. Matsoukas, J.-L. Gauvain, G. Adda, T. Colthurst, C. Kao, O. Kimball, L. Lamel, F. Lefevre, J. Ma, J. Makhoul, L. Nguyen, R. Prasad, R. Schwartz, H. Schwenk, and B. Xiang, "Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system," IEEE Trans. Speech Audio Processing, vol. 14, no. 5, pp. 1541-1556, 2006.
- (2006) IEEE Trans. Speech Audio Processing , vol.14 , Issue.5 , pp. 1541-1556
- Matsoukas, S.¹ Gauvain, J.-L.² Adda, G.³ Colthurst, T.⁴ Kao, C.⁵ Kimball, O.⁶ Lamel, L.⁷ Lefevre, F.⁸ Ma, J.⁹ Makhoul, J.¹⁰ Nguyen, L.¹¹ Prasad, R.¹² Schwartz, R.¹³ Schwenk, H.¹⁴ Xiang, B.¹⁵

4
- 34047270914
- Recent innovations in speech-to-text transcription at sri/icsi/uw
- A. Stolcke, B. Chen, H. Franco, V. Gadde, M. Graciarena, M. Hwang, A. Mandal, N. Morgan, X. Lei, T. Ng, M. Ostendorf, K. Sonmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu, "Recent innovations in speech-to-text transcription at SRI/ICSI/UW," IEEE Trans. Audio Speech Lang. Processing, vol. 14, no. 4, pp. 1729-1744, 2006.
- (2006) IEEE Trans. Audio Speech Lang. Processing , vol.14 , Issue.4 , pp. 1729-1744
- Stolcke, A.¹ Chen, B.² Franco, H.³ Gadde, V.⁴ Graciarena, M.⁵ Hwang, M.⁶ Mandal, A.⁷ Morgan, N.⁸ Lei, X.⁹ Ng, T.¹⁰ Ostendorf, M.¹¹ Sonmez, K.¹² Venkataraman, A.¹³ Vergyri, D.¹⁴ Wang, W.¹⁵ Zheng, J.¹⁶ Zhu, Q.¹⁷

5
- 34047266379
- Progress in the CU-HTK broadcast news transcription system
- DOI 10.1109/TASL.2006.878264
- M. J. F. Gales, D. Y. Kim, P. C. Woodland, H. Y. Chan, D. Mrva, R. Sinha, and S. E. Tranter, "Progress in the CU-HTK broadcast news transcription system," IEEE Trans. Speech Audio Processing, vol. 14, no. 5, pp. 1513-1525, 2006. (Pubitemid 46547578)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1513-1525
- Gales, M.J.F.¹ Kim, D.Y.² Woodland, P.C.³ Chan, H.Y.⁴ Mrva, D.⁵ Sinha, R.⁶ Tranter, S.E.⁷

6
- 56149087174
- Improved acoustic modeling for transcribing arabic broadcast data
- L. Lamel, A. Messaoudi, and J.-L. Gauvain, "Improved acoustic modeling for transcribing Arabic broadcast data," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2007, pp. 2077-2080.
- (2007) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 2077-2080
- Lamel, L.¹ Messaoudi, A.² Gauvain, J.-L.³

7
- 70349225980
- Improved morphological decomposition for Arabic broadcast news transcription
- T. Ng, K. Nguyen, R. Zbib, and L. Nguyen, "Improved morphological decomposition for Arabic broadcast news transcription," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2009, pp. 4309-4312.
- (2009) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 4309-4312
- Ng, T.¹ Nguyen, K.² Zbib, R.³ Nguyen, L.⁴

8
- 78649306132
- Advances in the cmu/interact arabic gale transcription system
- M. Noamany, T. Schaaf, and T. Schultz, "Advances in the CMU/InterACT Arabic GALE transcription system," in Proc. North American Chapter of the Association for Computational Linguistics-Human Language Technologies (NAACL-HLT), 2007, pp. 129-132.
- (2007) Proc. North American Chapter of the Association for Computational Linguistics-Human Language Technologies (NAACL-HLT) , pp. 129-132
- Noamany, M.¹ Schaaf, T.² Schultz, T.³

9
- 85008006725
- Advances in arabic speech transcription at ibm under the darpa gale program
- H. Soltau, G. Saon, B. Kingsbury, H.-K. Kuo, L. Mangu, D. Povey, and A. Emami, "Advances in arabic speech transcription at IBM under the DARPA GALE program," IEEE Trans. Audio Speech Lang. Processing, vol. 17, no. 5, pp. 884-894, 2009.
- (2009) IEEE Trans. Audio Speech Lang. Processing , vol.17 , Issue.5 , pp. 884-894
- Soltau, H.¹ Saon, G.² Kingsbury, B.³ Kuo, H.-K.⁴ Mangu, L.⁵ Povey, D.⁶ Emami, A.⁷

10
- 84867206086
- Development of the sri/nightingale arabic asr system
- D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlueter, K. Kirchhoff, A. Faria, and N. Morgan, "Development of the SRI/Nightingale Arabic ASR system," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2008, pp. 1437-1440.
- (2008) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 1437-1440
- Vergyri, D.¹ Mandal, A.² Wang, W.³ Stolcke, A.⁴ Zheng, J.⁵ Graciarena, M.⁶ Rybach, D.⁷ Gollan, C.⁸ Schlueter, R.⁹ Kirchhoff, K.¹⁰ Faria, A.¹¹ Morgan, N.¹²

11
- 78049384511
- The 2009 ibm gale mandarin broadcast transcription system
- S. M. Chu, D. Povey, H.-K. Kuo, L. Mangu, S. Zhang, Q. Shi, and Y. Qin, "The 2009 IBM GALE Mandarin broadcast transcription system," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2010, pp. 4374-4377.
- (2010) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 4374-4377
- Chu, S.M.¹ Povey, D.² Kuo, H.-K.³ Mangu, L.⁴ Zhang, S.⁵ Shi, Q.⁶ Qin, Y.⁷

12
- 70450187943
- Development of the gale 2008 mandarin lvcsr system
- C. Plahl, B. Hoffmeister, G. Heigold, J. Loof, R. Schlueter, and H. Ney, "Development of the GALE 2008 Mandarin LVCSR system," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2009, pp. 2307-2311.
- (2009) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 2307-2311
- Plahl, C.¹ Hoffmeister, B.² Heigold, G.³ Loof, J.⁴ Schlueter, R.⁵ Ney, H.⁶

13
- 33947703664
- The cu-htk mandarin broadcast news transcription system
- R. Sinha, M. J. F. Gales, D. Y. Kim, X. Liu, K. C. Sim, and P. C. Woodland, "The CU-HTK Mandarin broadcast news transcription system," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2006, pp. 14-19.
- (2006) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 14-19
- Sinha, R.¹ Gales, M.J.F.² Kim, D.Y.³ Liu, X.⁴ Sim, K.C.⁵ Woodland, P.C.⁶

14
- 0030244826
- A review of large-vocabulary continuous-speech recognition
- S. Young, "A review of large-vocabulary continuous-speech recognition," IEEE Signal Processing Mag., vol. 13, no. 5, pp. 45-57, 1996.
- (1996) IEEE Signal Processing Mag. , vol.13 , Issue.5 , pp. 45-57
- Young, S.¹

15
- 77956781007
- Advances in large vocabulary continuous speech recognition
- G. Zweig and M. Picheny, "Advances in large vocabulary continuous speech recognition," Adv. Comput., vol. 60, pp. 249-291, 2004.
- (2004) Adv. Comput. , vol.60 , pp. 249-291
- Zweig, G.¹ Picheny, M.²

16
- 85032751593
- Developments and directions in speech recognition and understanding-part 1
- J. M. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, and D. O'Shaughnessy, "Developments and directions in speech recognition and understanding-Part 1," IEEE Signal Processing Mag., vol. 26, no. 3, pp. 75-80, 2009.
- (2009) IEEE Signal Processing Mag. , vol.26 , Issue.3 , pp. 75-80
- Baker, J.M.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ O'Shaughnessy, D.⁶

17
- 85032759066
- Updated minds report on speech recognition and understanding-part 2
- J. M. Baker, L. Deng, S. Khudanpur, C.-H. Lee, J. R. Glass, N. Morgan, and D. O'Shaughnessy, "Updated minds report on speech recognition and understanding-Part 2," IEEE Signal Processing Mag., vol. 26, no. 4, pp. 78-85, 2009.
- (2009) IEEE Signal Processing Mag. , vol.26 , Issue.4 , pp. 78-85
- Baker, J.M.¹ Deng, L.² Khudanpur, S.³ Lee, C.-H.⁴ Glass, J.R.⁵ Morgan, N.⁶ O'Shaughnessy, D.⁷

18
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- DOI 10.1121/1.399423
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Am., vol. 87, no. 4, pp. 1738-1752, 1990. (Pubitemid 20256470)
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

19
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust., Speech, Signal Processing, vol. 34, no. 1, pp. 52-59, 1986. (Pubitemid 16575387)
- (1986) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-34 , Issue.1 , pp. 52-59
- Furui Sadaoki¹

20
- 0033677121
- Maximum likelihood discriminant feature spaces
- G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2000, pp. 1129-1132.
- (2000) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 1129-1132
- Saon, G.¹ Padmanabhan, M.² Gopinath, R.³ Chen, S.⁴

21
- 0036475982
- Maximum likelihood multiple subspace projections for hidden Markov models
- DOI 10.1109/89.985541, PII S1063667602015213
- M. J. F. Gales, "Maximum likelihood multiple subspace projections for hidden Markov models," IEEE Trans. Speech Audio Processing, vol. 10, no. 2, pp. 37-47, 2002. (Pubitemid 34295263)
- (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.2 , pp. 37-47
- Gales, M.J.F.¹

22
- 0032289099
- Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
- PII S0167639398000612
- N. Kumar and A. G. Andreou, "Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition," Speech Commun., vol. 26, no. 4, pp. 283-297, 1998. (Pubitemid 128425471)
- (1998) Speech Communication , vol.26 , Issue.4 , pp. 283-297
- Kumar, N.¹ Andreou, A.G.²

23
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, 1998. (Pubitemid 128383747)
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

24
- 0002960982
- Recent advances in robust speech recognition
- S. Furui, "Recent advances in robust speech recognition," in Proc. Workshop Robust Speech Recognition for Unknown Communication Channels, 1997, pp. 11-20.
- (1997) Proc. Workshop Robust Speech Recognition for Unknown Communication Channels , pp. 11-20
- Furui, S.¹

25
- 85009070292
- Large-vocabulary speech recognition under adverse acoustic environments
- L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2000, pp. 806-809.
- (2000) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 806-809
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.⁴

26
- 40249103761
- Issues with uncertainty decoding for noise robust automatic speech recognition
- H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol. 50, no. 4, pp. 265-277, 2008.
- (2008) Speech Commun. , vol.50 , Issue.4 , pp. 265-277
- Liao, H.¹ Gales, M.J.F.²

27
- 34047249084
- Quantile based histogram equalization for noise robust large vocabulary speech recognition
- DOI 10.1109/TSA.2005.857792
- F. Hilger and H. Ney, "Quantile based histogram equalization for noise robust large vocabulary speech recognition," IEEE Trans. Audio Speech Lang. Processing, vol. 14, no. 3, pp. 845-854, 2006. (Pubitemid 46547647)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 845-854
- Hilger, F.¹ Ney, H.²

28
- 0031647824
- A frequency warping approach to speaker normalization
- PII S1063667698000960
- L. Lee and R. Rose, "A frequency warping approach to speaker normalization," IEEE Trans. Speech Audio Processing, vol. 6, no. 1, pp. 49-60, 1998. (Pubitemid 128720631)
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.1 , pp. 49-60
- Lee, L.¹ Rose, R.²

29
- 0029764708
- Speaker normalization on conversational telephone speech
- S. Wegmann, D. McAllaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 1996, pp. 339-341.
- (1996) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 339-341
- Wegmann, S.¹ McAllaster, D.² Orloff, J.³ Peskin, B.⁴

30
- 4544324811
- Feature space Gaussianization
- G. Saon, S. Dharanipragada, and D. Povey, "Feature space Gaussianization," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2004, pp. 329-332.
- (2004) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 329-332
- Saon, G.¹ Dharanipragada, S.² Povey, D.³

31
- 84858990289
- The IBM 2011 GALE Arabic speech transcription system
- L. Mangu, H.-K. Kuo, S. Chu, B. Kingsbury, G. Saon, H. Soltau, and F. Biadsy, "The IBM 2011 GALE Arabic speech transcription system," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2011, pp. 272-277.
- (2011) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 272-277
- Mangu, L.¹ Kuo, H.-K.² Chu, S.³ Kingsbury, B.⁴ Saon, G.⁵ Soltau, H.⁶ Biadsy, F.⁷

32
- 33646788786
- FMPE: Discriminatively trained features for speech recognition
- D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig, "fMPE: Discriminatively trained features for speech recognition," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2005, pp. 961-964.
- (2005) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 961-964
- Povey, D.¹ Kingsbury, B.² Mangu, L.³ Saon, G.⁴ Soltau, H.⁵ Zweig, G.⁶

33
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature-space discriminative training," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2008, pp. 4057-4060.
- (2008) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

34
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2000, pp. 1635-1638.
- (2000) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.P.W.² Sharma, S.³

35
- 34547548235
- Probabilistic and bottleneck features for LVCSR of meetings
- F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottleneck features for LVCSR of meetings," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2007, pp. 757-760.
- (2007) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 757-760
- Grezl, F.¹ Karafiat, M.² Kontar, S.³ Cernocky, J.⁴

36
- 80051608179
- The IBM 2009 GALE Arabic speech transcription system
- B. Kingsbury, H. Soltau, G. Saon, S. Chu, H. K. Kuo, L. Mangu, S. Ravuri, N. Morgan, and A. Janin, "The IBM 2009 GALE Arabic speech transcription system," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2011, pp. 4672-4675.
- (2011) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 4672-4675
- Kingsbury, B.¹ Soltau, H.² Saon, G.³ Chu, S.⁴ Kuo, H.K.⁵ Mangu, L.⁶ Ravuri, S.⁷ Morgan, N.⁸ Janin, A.⁹

37
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

38
- 85009289957
- Modeling with a subspace constraint on inverse covariance matrices
- S. Axelrod, R. Gopinath, and P. Olsen, "Modeling with a subspace constraint on inverse covariance matrices," in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2002, pp. 2177-2180.
- (2002) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 2177-2180
- Axelrod, S.¹ Gopinath, R.² Olsen, P.³

39
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Stat. Soc. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

40
- 0031139839
- Minimum classification error rate methods for speech recognition
- PII S1063667697035937
- B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error methods for speech recognition," IEEE Trans. Speech Audio Processing, vol. 5, no. 3, pp. 257-265, 1997. (Pubitemid 127745998)
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.3 , pp. 257-265
- Juang, B.-H.¹ Chou, W.² Lee, C.-H.³

41
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- L. R. Bahl, P. F. Brown, P. V. de Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 1986, pp. 49-52.
- (1986) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 49-52
- Bahl, L.R.¹ Brown, P.F.² De Souza, P.V.³ Mercer, R.L.⁴

42
- 0036460908
- Lightly supervised and unsupervised acoustic model training
- L. Lamel, J.-L. Gauvain, and G. Adda, "Lightly supervised and unsupervised acoustic model training," Comput. Speech Lang., vol. 16, no. 1, pp. 115-129, 2002.
- (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 115-129
- Lamel, L.¹ Gauvain, J.-L.² Adda, G.³

43
- 85135149386
- Discriminative training for continuous speech recognition
- W. Reichl and G. Ruske, "Discriminative training for continuous speech recognition," in Proc. European Conf. Speech Communication and Technology (EUROSPEECH), 1995, pp. 537-540.
- (1995) Proc. European Conf. Speech Communication and Technology (EUROSPEECH) , pp. 537-540
- Reichl, W.¹ Ruske, G.²

44
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2002, pp. 105-108.
- (2002) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

45
- 33745186926
- Anatomy of an extremely fast LVCSR decoder
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- G. Saon, D. Povey, and G. Zweig, "Anatomy of an extremely fast LVCSR decoder," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2005, pp. 549-552. (Pubitemid 43908121)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 549-552
- Saon, G.¹ Povey, D.² Zweig, G.³

46
- 33745203493
- Improved discriminative training using phone lattices
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- J. Zheng and A. Stolcke, "Improved discriminative training using phone lattices," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2005, pp. 2125-2128. (Pubitemid 43908513)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 2125-2128
- Zheng, J.¹ Stolcke, A.²

47
- 33947615252
- Discriminatively trained region dependent feature transforms for speech recognition
- B. Zhang and S. Matsoukas and R. Schwartz, "Discriminatively trained region dependent feature transforms for speech recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2006, pp. 313-316.
- (2006) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 313-316
- Zhang, B.¹ Matsoukas, S.² Schwartz, R.³

48
- 84867211272
- Penalty function maximization for large margin HMM training
- G. Saon and D. Povey, "Penalty function maximization for large margin HMM training," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2008, pp. 920-923.
- (2008) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 920-923
- Saon, G.¹ Povey, D.²

49
- 70349220993
- Large margin semi-tied covariance transforms for discriminative training
- G. Saon, D. Povey, and H. Soltau, "Large margin semi-tied covariance transforms for discriminative training," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2009, pp. 3753-3756.
- (2009) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 3753-3756
- Saon, G.¹ Povey, D.² Soltau, H.³

50
- 34547522370
- Comparison of large margin training to other discriminative methods for phonetic recognition by hidden Markov models
- F. Sha and L. K. Saul, "Comparison of large margin training to other discriminative methods for phonetic recognition by hidden Markov models," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 2007, pp. 313-316.
- (2007) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 313-316
- Sha, F.¹ Saul, L.K.²

51
- 33846516584
- New York: Springer-Verlag
- C. M. Bishop, Pattern Recognition and Machine Learning. New York: Springer-Verlag, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

52
- 64149098818
- Approximate test risk bound minimization through soft margin estimation
- J. Li, M. Yuan, and C.-H. Lee, "Approximate test risk bound minimization through soft margin estimation," IEEE Trans. Audio Speech Lang. Processing, vol. 15, no. 8, pp. 2393-2404, 2007.
- (2007) IEEE Trans. Audio Speech Lang. Processing , vol.15 , Issue.8 , pp. 2393-2404
- Li, J.¹ Yuan, M.² Lee, C.-H.³

53
- 34047115134
- Large margin hidden Markov models for speech recognition
- DOI 10.1109/TASL.2006.879805
- H. Jiang, X. Li, and C. Liu, "Large margin hidden Markov models for speech recognition," IEEE Trans. Audio Speech Lang. Processing, vol. 14, no. 5, pp. 1584-1595, 2006. (Pubitemid 46552926)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1584-1595
- Jiang, H.¹ Li, X.² Liu, C.³

54
- 70349226870
- Bayesian large margin hidden Markov models for speech recognition
- .J.-C. Chen and J.-T. Chien, "Bayesian large margin hidden Markov models for speech recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2009, pp. 3765-3768.
- (2009) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 3765-3768
- Chen, J.-C.¹ Chien, J.-T.²

55
- 85032750905
- Discriminative learning in sequential pattern recognition-A unifying review for optimization-oriented speech recognition
- X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition-A unifying review for optimization-oriented speech recognition," IEEE Signal Processing Mag., vol. 25, no. 5, pp. 14-36, 2008.
- (2008) IEEE Signal Processing Mag. , vol.25 , Issue.5 , pp. 14-36
- He, X.¹ Deng, L.² Chou, W.³

56
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

57
- 0003778679
- Lattice-based unsupervised MLLR for speaker adaptation
- M. Padmanabhan, G. Saon, and G. Zweig, "Lattice-based unsupervised MLLR for speaker adaptation," in Proc. ITRI ASR2000: ASR Challenges for the New Millennium, 2000, pp. 128-132.
- (2000) Proc. ITRI ASR2000: ASR Challenges for the New Millennium , pp. 128-132
- Padmanabhan, M.¹ Saon, G.² Zweig, G.³

58
- 44949193111
- Feature and model space speaker adaptation with full covariance Gaussians
- D. Povey and G. Saon, "Feature and model space speaker adaptation with full covariance Gaussians," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2006, pp. 1145-1148.
- (2006) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 1145-1148
- Povey, D.¹ Saon, G.²

59
- 78049368835
- The IBM 2008 GALE Arabic speech transcription system
- G. Saon, H. Soltau, U. Chaudhari, S. Chu, B. Kingsbury, H.-K. Kuo, L. Mangu, and D. Povey, "The IBM 2008 GALE Arabic speech transcription system," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2010, pp. 4378-4381.
- (2010) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 4378-4381
- Saon, G.¹ Soltau, H.² Chaudhari, U.³ Chu, S.⁴ Kingsbury, B.⁵ Kuo, H.-K.⁶ Mangu, L.⁷ Povey, D.⁸

60
- 79951796005
- The IBM Attila speech recognition toolkit
- H. Soltau, G. Saon, and B. Kingsbury, "The IBM Attila speech recognition toolkit," in Proc. IEEE Workshop Spoken Language Technology (SLT), 2010, pp. 97-102.
- (2010) Proc. IEEE Workshop Spoken Language Technology (SLT) , pp. 97-102
- Soltau, H.¹ Saon, G.² Kingsbury, B.³

61
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- DOI 10.1109/89.876308
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space," IEEE Trans. Audio Speech Lang. Processing, vol. 8, no. 4, pp. 695-707, 2000. (Pubitemid 32025317)
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

62
- 85009097035
- Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
- K. Chen, W. Liau, H. Wang, and L.-S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2000, pp. 742-745.
- (2000) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 742-745
- Chen, K.¹ Liau, W.² Wang, H.³ Lee, L.-S.⁴

63
- 34047257854
- Aggregate a posteriori linear regression adaptation
- DOI 10.1109/TSA.2005.860847
- J.-T. Chien and C.-H. Huang, "Aggregate a posteriori linear regression adaptation," IEEE Trans. Audio Speech Lang. Processing, vol. 14, no. 3, pp. 797-807, 2006. (Pubitemid 46547644)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 797-807
- Chien, J.-T.¹ Huang, C.-H.²

64
- 85009119467
- Discriminative speaker adaptation with conditional maximum likelihood linear regression
- A. Gunawardana and W. Byrne, "Discriminative speaker adaptation with conditional maximum likelihood linear regression," in Proc. European Conf. Speech Communication and Technology (EUROSPEECH), 2001, pp. 1203-1206.
- (2001) Proc. European Conf. Speech Communication and Technology (EUROSPEECH) , pp. 1203-1206
- Gunawardana, A.¹ Byrne, W.²

65
- 40149091397
- MPE-based discriminative linear transforms for speaker adaptation
- DOI 10.1016/j.csl.2007.09.001, PII S0885230807000563
- L. Wang and P. C. Woodland, "MPE-based discriminative linear transforms for speaker adaptation," Comput. Speech Lang., vol. 22, no. 3, pp. 256-272, 2008. (Pubitemid 351329452)
- (2008) Computer Speech and Language , vol.22 , Issue.3 , pp. 256-272
- Wang, L.¹ Woodland, P.C.²

66
- 58349123022
- A study of minimum classification error (MCE) linear regression for supervised adaptation of MCE-trained continuous-density hidden Markov models
- J. Wu and Q. Huo, "A study of minimum classification error (MCE) linear regression for supervised adaptation of MCE-trained continuous-density hidden Markov models," IEEE Trans. Audio Speech Lang. Processing, vol. 15, no. 2, pp. 478-488, 2007.
- (2007) IEEE Trans. Audio Speech Lang. Processing , vol.15 , Issue.2 , pp. 478-488
- Wu, J.¹ Huo, Q.²

67
- 0030245128
- Robust continuous speech recognition using parallel model combination
- PII S1063667696067120
- M. J. F. Gales, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Processing, vol. 4, no. 5, pp. 352-359, 1996. (Pubitemid 126753023)
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

68
- 85009113852
- HMM adaptation using vector Taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2000, pp. 869-872.
- (2000) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 869-872
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

69
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio Speech Lang. Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Trans. Audio Speech Lang. Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

70
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, X. Chen, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2011, pp. 437-440.
- (2011) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 437-440
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

71
- 33745805403
- A fast learning algorithm for deep belief nets
- DOI 10.1162/neco.2006.18.7.1527
- G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, 2006. (Pubitemid 44024729)
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.-W.³

72
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. Audio Speech Lang. Processing, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) IEEE Trans. Audio Speech Lang. Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

73
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Mag. vol. 29, no. 6, pp. 82-97, 2012.
- (2012) IEEE Signal Processing Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Vanhoucke, V.⁷ Nguyen, P.⁸ Sainath, T.⁹ Kingsbury, B.¹⁰

74
- 0033329799
- Empirical study of smoothing techniques for language modeling
- DOI 10.1006/csla.1999.0128
- S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Comput. Speech Lang., vol. 13, no. 4, pp. 359-394, 1999. (Pubitemid 30518216)
- (1999) Computer Speech and Language , vol.13 , Issue.4 , pp. 359-394
- Chen, S.F.¹ Goodman, J.²

75
- 0028996876
- Improved backing-off for m-gram language modeling
- R. Kneser and H. Ney, "Improved backing-off for m-gram language modeling," in Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), 1995, pp. 181-184.
- (1995) Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP) , pp. 181-184
- Kneser, R.¹ Ney, H.²

76
- 38049151407
- A hierarchical Bayesian language model based on Pitman-Yor processes
- Y. W. Teh, "A hierarchical Bayesian language model based on Pitman-Yor processes," in Proc. Annu. Meeting of the Association for Computational Linguistics, 2006, pp. 985-992.
- (2006) Proc. Annu. Meeting of the Association for Computational Linguistics , pp. 985-992
- Teh, Y.W.¹

77
- 77956280276
- Hierarchical Bayesian language models for conversational speech recognition
- S. Huang and S. Renals, "Hierarchical Bayesian language models for conversational speech recognition," IEEE Trans. Audio Speech Lang. Processing, vol. 18, no. 8, pp. 1941-1954, 2010.
- (2010) IEEE Trans. Audio Speech Lang. Processing , vol.18 , Issue.8 , pp. 1941-1954
- Huang, S.¹ Renals, S.²

78
- 0000274403
- Exploiting latent semantic information in statistical language modeling
- J. Bellegarda, "Exploiting latent semantic information in statistical language modeling," Proc. IEEE, vol. 88, no. 8, pp. 1279-1296, 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1279-1296
- Bellegarda, J.¹

79
- 85121030057
- Topic-based language models using em
- D. Gildea and T. Hofmann, "Topic-based language models using EM," in Proc. European Conf. Speech Communication and Technology (EUROSPEECH), 1999, pp. 2167-2170.
- (1999) Proc. European Conf. Speech Communication and Technology (EUROSPEECH) , pp. 2167-2170
- Gildea, D.¹ Hofmann, T.²

80
- 0141607824
- Latent Dirichlet allocation
- D. M. Blei, A. Y. Ng, and M. I. Jordan, "Latent Dirichlet allocation," J. Mach. Learn. Res., vol. 3, no. 1, pp. 993-1022, 2003.
- (2003) J. Mach. Learn. Res. , vol.3 , Issue.1 , pp. 993-1022
- Blei, D.M.¹ Ng, A.Y.² Jordan, M.I.³

81
- 33745203547
- Dynamic language model adaptation using variational bayes inference
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Y. C. Tam and T. Schultz, "Dynamic language model adaptation using variational Bayes inference," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2005, pp. 5-8. (Pubitemid 43907987)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 5-8
- Tam, Y.-C.¹ Schultz, T.²

82
- 78149256857
- Dirichlet class language models for speech recognition
- J.-T. Chien and C.-H. Chueh, "Dirichlet class language models for speech recognition," IEEE Trans. Audio Speech Lang. Processing, vol. 19, no. 3, pp. 482-495, 2011.
- (2011) IEEE Trans. Audio Speech Lang. Processing , vol.19 , Issue.3 , pp. 482-495
- Chien, J.-T.¹ Chueh, C.-H.²

83
- 85022919385
- Class-based n-gram models of natural language
- P. Brown, V. Della Pietra, P. D. Souza, J. Lai, and R. Mercer, "Class-based n-gram models of natural language," Comput. Linguist., vol. 18, no. 4, pp. 467-179, 1992.
- (1992) Comput. Linguist. , vol.18 , Issue.4 , pp. 467-179
- Brown, P.¹ Della Pietra, V.² Souza, P.D.³ Lai, J.⁴ Mercer, R.⁵

84
- 0030181951
- A maximum entropy approach to adaptive statistical language modeling
- R. Rosenfeld, "A maximum entropy approach to adaptive statistical language modeling," Comput. Speech Lang., vol. 10, no. 3, pp. 187-228, 1996.
- (1996) Comput. Speech Lang. , vol.10 , Issue.3 , pp. 187-228
- Rosenfeld, R.¹

85
- 34147179506
- Association pattern language modeling
- DOI 10.1109/TSA.2005.858551
- J.-T. Chien, "Association pattern language modeling," IEEE Trans. Audio Speech Lang. Processing, vol. 14, no. 5, pp. 1719-1728, 2006. (Pubitemid 46552927)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1719-1728
- Chien, J.-T.¹

86
- 73649117102
- Joint acoustic and language modeling for speech recognition
- J.-T. Chien and C.-H. Chueh, "Joint acoustic and language modeling for speech recognition," Speech Commun., vol. 52, no. 3, pp. 223-235, 2010.
- (2010) Speech Commun. , vol.52 , Issue.3 , pp. 223-235
- Chien, J.-T.¹ Chueh, C.-H.²

87
- 84863387613
- Shrinking exponential language models
- S. F. Chen, "Shrinking exponential language models," in Proc. North American Chapter of the Association for Computational Linguistics-Human Language Technologies (NAACL-HLT), 2009, pp. 468-476.
- (2009) Proc. North American Chapter of the Association for Computational Linguistics-Human Language Technologies (NAACL-HLT) , pp. 468-476
- Chen, S.F.¹

88
- 0142166851
- A neural probabilistic language model
- Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin, "A neural probabilistic language model," J. Mach. Learn. Res., vol. 3, no. 2, pp. 1137-1155, 2003.
- (2003) J. Mach. Learn. Res. , vol.3 , Issue.2 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³ Jauvin, C.⁴

89
- 33847610331
- Continuous space language models
- DOI 10.1016/j.csl.2006.09.003, PII S0885230806000325
- H. Schwenk, "Continuous space language models," Comput. Speech Lang., vol. 21, no. 3, pp. 492-518, 2007. (Pubitemid 46367510)
- (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 492-518
- Schwenk, H.¹

90
- 79959829092
- Recurrent neural network based language model
- T. Mikolov, M. Karafiat, L. Burget, J. Cernocky, and S. Khudanpur, "Recurrent neural network based language model," in Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH), 2010, pp. 1045-1048.
- (2010) Proc. Annu. Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 1045-1048
- Mikolov, T.¹ Karafiat, M.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

91
- 0034295822
- Structured language modeling
- C. Chelba and F. Jelinek, "Structured language modeling," Comput. Speech Lang., vol. 14, no. 4, pp. 283-332, 2000.
- (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 283-332
- Chelba, C.¹ Jelinek, F.²

92
- 77949369404
- Syntactic features for Arabic speech recognition
- H.-K. Kuo, L. Mangu, A. Emami, I. Zitouni, and Y.-S. Lee, "Syntactic features for Arabic speech recognition," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2009, pp. 327-332.
- (2009) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 327-332
- Kuo, H.-K.¹ Mangu, L.² Emami, A.³ Zitouni, I.⁴ Lee, Y.-S.⁵

93
- 85009192356
- An architecture for rapid decoding of large vocabulary conversational speech
- G. Saon, G. Zweig, B. Kingsbury, L. Mangu, and U. Chaudhari, "An architecture for rapid decoding of large vocabulary conversational speech," in Proc. European Conf. Speech Communication and Technology (EUROSPEECH), 2003, pp. 1977-1980.
- (2003) Proc. European Conf. Speech Communication and Technology (EUROSPEECH) , pp. 1977-1980
- Saon, G.¹ Zweig, G.² Kingsbury, B.³ Mangu, L.⁴ Chaudhari, U.⁵

94
- 0036460907
- Weighted finite state transducers in speech recognition
- M. Mohri, F. Perreira, and M. Riley, "Weighted finite state transducers in speech recognition," Comput. Speech. Lang., vol. 16, no. 1, pp. 69-88, 2002.
- (2002) Comput. Speech. Lang. , vol.16 , Issue.1 , pp. 69-88
- Mohri, M.¹ Perreira, F.² Riley, M.³

95
- 80051634911
- A comparison of two LVR search optimization techniques
- S. Kanthak, H. Ney, M. Riley, and M. Mohri, "A comparison of two LVR search optimization techniques," in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2002, pp. 1309-1312.
- (2002) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 1309-1312
- Kanthak, S.¹ Ney, H.² Riley, M.³ Mohri, M.⁴

96
- 77949347726
- Dynamic network decoding revisited
- H. Soltau and G. Saon, "Dynamic network decoding revisited," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2009, pp. 276-281.
- (2009) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 276-281
- Soltau, H.¹ Saon, G.²

97
- 85009209206
- Compiling large-context decision trees into finite-state transducers
- S. Chen, "Compiling large-context decision trees into finite-state transducers," in Proc. European Conf. Speech Communication and Technology (EUROSPEECH), 2003, pp. 1169-1172.
- (2003) Proc. European Conf. Speech Communication and Technology (EUROSPEECH) , pp. 1169-1172
- Chen, S.¹

98
- 0030638031
- A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
- J. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1997, pp. 347-354.
- (1997) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 347-354
- Fiscus, J.¹

99
- 4544253834
- Posterior probability decoding, confidence estimation and system combination
- G. Evermann and P. Woodland, "Posterior probability decoding, confidence estimation and system combination," in Proc. Speech Transcription Workshop, 2000.
- (2000) Proc. Speech Transcription Workshop
- Evermann, G.¹ Woodland, P.²

100
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: Word error minimization and other applications of confusion networks," Comput. Speech Lang., vol. 14, no. 4, pp. 373-400, 2000.
- (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

101
- 0141700312
- The AT&T LVCSR 2000 system
- A. Ljolje, D. Hindle, M. Riley, and R. Sproat, "The AT&T LVCSR 2000 system," in Proc. NIST LVCSR W orkshop, 2000.
- (2000) Proc. NIST LVCSR W Orkshop
- Ljolje, A.¹ Hindle, D.² Riley, M.³ Sproat, R.⁴

102
- 77949370075
- A segmental CRF approach to large vocabulary continuous speech recognitio n
- G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognitio n," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2009, pp. 152-157.
- (2009) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 152-157
- Zweig, G.¹ Nguyen, P.²

103
- 33646818291
- Constructing ensembles of ASR systems using randomiz ed decision trees
- O. Siohan, B. Ramabhadran, and B. Kingsbury, "Constructing ensembles of ASR systems using randomiz ed decision trees," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2005, pp. 197-200.
- (2005) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 197-200
- Siohan, O.¹ Ramabhadran, B.² Kingsbury, B.³

104
- 80055092534
- Boosting systems for large vocabulary continuous speech recognition
- G. Saon and H. Soltau, "Boosting systems for large vocabulary continuous speech recognition," Spee ch Commun., vol. 54, no. 2, pp. 212-128, 2012.
- (2012) Spee Ch Commun. , vol.54 , Issue.2 , pp. 212-128
- Saon, G.¹ Soltau, H.²

105
- 78049502526
- The subspace Gaussian mixture models-A structured model for speech recognition
- D. Povey, L. Burget, M. Agarwal, P. Akyazi, F. Kai, A. Ghoshal, O. Glembek, N. Goel, M. Karafiat, A. Rastrow, R. C. Rose, P. Schwartz, and S. Thomas, "The subspace Gaussian mixture models-A structured model for speech recognition," Comput. Speech Lang., vol. 25, no. 2, pp. 404-439, 2011.
- (2011) Comput. Speech Lang. , vol.25 , Issue.2 , pp. 404-439
- Povey, D.¹ Burget, L.² Agarwal, M.³ Akyazi, P.⁴ Kai, F.⁵ Ghoshal, A.⁶ Glembek, O.⁷ Goel, N.⁸ Karafiat, M.⁹ Rastrow, A.¹⁰ Rose, R.C.¹¹ Schwartz, P.¹² Thomas, S.¹³

106
- 79959841827
- Canonical state models for automatic speech recognition
- M. J. F. Gales and K. Yu, "Canonical state models for automatic speech recognition," in Proc. Annu . Conf. Int. Speech Communication Association (INTERSPEECH), 2010, pp. 58-61.
- (2010) Proc. Annu . Conf. Int. Speech Communication Association (INTERSPEECH) , pp. 58-61
- Gales, M.J.F.¹ Yu, K.²

107
- 0742272654
- Modeling inverse covariance matrices by basis expansion
- P. A. Olsen and R. A. Gopinath, "Modeling inverse covariance matrices by basis expansion," IEEE Tr ans. Speech Audio Processing, vol. 12, no. 1, pp. 37-46, 2004.
- (2004) IEEE Tr Ans. Speech Audio Processing , vol.12 , Issue.1 , pp. 37-46
- Olsen, P.A.¹ Gopinath, R.A.²

108
- 80053610626
- Exemplar-based sparse rep resentation features: From TIMIT to LVCSR
- T. N. Sainath, B. Ramabhadran, M. Picheny, D. Nahamoo, and D. Kanevsky, "Exemplar-based sparse rep resentation features: from TIMIT to LVCSR," IEEE Trans. Audio Speech Lang. Processing, vol. 19, no. 8, pp. 2598-2613, 2011.
- (2011) IEEE Trans. Audio Speech Lang. Processing , vol.19 , Issue.8 , pp. 2598-2613
- Sainath, T.N.¹ Ramabhadran, B.² Picheny, M.³ Nahamoo, D.⁴ Kanevsky, D.⁵

109
- 80051625262
- Bayesian sensing hidden Markov models for speech recognition
- G. Saon and J.-T. Chien, "Bayesian sensing hidden Markov models for speech recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2011, pp. 5056-5059.
- (2011) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 5056-5059
- Saon, G.¹ Chien, J.-T.²

110
- 0001224048
- Sparse bayesian learning and the relevance vector machine
- DOI 10.1162/15324430152748236
- M. E. Tipping, "Sparse Bayesian learning and the relevance vector machine," J. Mach. Learn. Res., vol. 1, no. 6, pp. 211-244, 2001. (Pubitemid 33687203)
- (2001) Journal of Machine Learning Research , vol.1 , Issue.3 , pp. 211-244
- Tipping, M.E.¹

111
- 84858979102
- Some properties of Bayesian sensing hidden Markov models
- G. Saon and J.-T. Chien, "Some properties of Bayesian sensing hidden Markov models," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2011, pp. 65-70.
- (2011) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 65-70
- Saon, G.¹ Chien, J.-T.²

112
- 80051622210
- Discriminative training for Bayesian sensing hidden Markov models
- G. Saon and J.-T. Chien, "Discriminative training for Bayesian sensing hidden Markov models," in Proc. Int. Conf. Acoutics, Speech, and Signal Processing (ICASSP), 2011, pp. 5316-5319.
- (2011) Proc. Int. Conf. Acoutics, Speech, and Signal Processing (ICASSP) , pp. 5316-5319
- Saon, G.¹ Chien, J.-T.²

113
- 84055217796
- Bayesian sensing hidden Markov models
- G. Saon and J.-T. Chien, "Bayesian sensing hidden Markov models," IEEE Trans. Audio Speech Lang. Processing, vol. 20, no. 1 , pp. 43-54, 2012.
- (2012) IEEE Trans. Audio Speech Lang. Processing , vol.20 , Issue.1 , pp. 43-54
- Saon, G.¹ Chien, J.-T.²

114
- 3042741069
- Variational Bayesian estimation and clustering for speech recognition
- S. Watanabe, Y. Minami, A. Nakamura, and N. Ueda, "Variational Bayesian estimation and clustering for speech recognition," IEEE Trans. Speech Audio Processing, vol. 12, no. 4, pp. 365-381, 2004.
- (2004) IEEE Trans. Speech Audio Processing , vol.12 , Issue.4 , pp. 365-381
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

115
- 70349205593
- An evidence framework for Bayesian learning of conti nuous-density hidden Markov models
- .Y. Zhang, P. Liu, J.-T. Chien, and F. Soong, "An evidence framework for Bayesian learning of conti nuous-density hidden Markov models," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), 2009, pp. 3857-3860.
- (2009) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 3857-3860
- Zhang, Y.¹ Liu, P.² Chien, J.-T.³ Soong, F.⁴

116
- 18744376902
- Predictive hidden Markov model selection for speech recognition
- DOI 10.1109/TSA.2005.845810
- J.-T. Chien and S. Furui, "Predictive hidden markov model selection for speech recognition," IEEE Trans. Speech Audio Processing, vol. 13, no. 3, pp. 377-387, 2005. (Pubitemid 40666172)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.3 , pp. 377-387
- Chien, J.-T.¹ Furui, S.²

117
- 76849117578
- The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies
- article 7
- D. M. Blei, T. L. Griffiths, and M. I. Jordan, "The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies," J. ACM, vol. 57, no. 2, p. article 7, 2010.
- (2010) J. ACM , vol.57 , Issue.2
- Blei, D.M.¹ Griffiths, T.L.² Jordan, M.I.³

118
- 85032752250
- Bayesian nonparametric methods for learning Ma rkov switching processes
- E. Fox, E. Sudderth, M. I. Jordan, and A. Willsky, "Bayesian nonparametric methods for learning Ma rkov switching processes," IEEE Signal Processing Mag., vol. 27, no. 6, pp. 43-54, 2010.
- (2010) IEEE Signal Processing Mag. , vol.27 , Issue.6 , pp. 43-54
- Fox, E.¹ Sudderth, E.² Jordan, M.I.³ Willsky, A.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.