SCOPUS 정보 검색 플랫폼

Handbook of Natural Language Processing, Second Edition

Volumn , Issue , 2010, Pages 339-366

An overview of modern speech recognition

(2) Huang, Xuedong a Deng, Li a

a MICROSOFT (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER AIDED INSTRUCTION; HIDDEN MARKOV MODELS; REVIEWS; SPEECH; SPEECH COMMUNICATION;

COMMAND AND CONTROL; COMPUTER AIDED LANGUAGE LEARNING; DEEP INTEGRATIONS; EMERGING APPLICATIONS; HIDDEN MARKOV MODELS (HMMS); NATURAL COMMUNICATION; SPEECH RECOGNITION MODULES; TYPICAL APPLICATION;

SPEECH RECOGNITION;

EID: 85019175281 PISSN: None EISSN: None Source Type: Book
DOI: None Document Type: Chapter

Times cited : (34)

References (67)

1
- 0030677475
- Speaker adaptive training: Amaximumlikelihood approach to speaker normalization
- Munich, Germany
- Anastasakos, T., J. McDonough, and J. Makhoul (1997). Speaker adaptive training:Amaximumlikelihood approach to speaker normalization, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1043-1046, Munich, Germany.
- (1997) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 1043-1046
- Anastasakos, T.¹ McDonough, J.² Makhoul, J.³

2
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- Tokyo, Japan
- Bahl, L., P. Brown, P. de Souza, and R. Mercer (1986). Maximum mutual information estimation of hidden Markov model parameters for speech recognition, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 49-52, Tokyo, Japan.
- (1986) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 49-52
- Bahl, L.¹ Brown, P.² De Souza, P.³ Mercer, R.⁴

3
- 0040856612
- Stochastic modeling for automatic speech recognition
- D. R. Reddy, (ed.), Academic Press, New York
- Baker, J. (1975). Stochastic modeling for automatic speech recognition, in D. R. Reddy, (ed.), Speech Recognition, Academic Press, New York.
- (1975) Speech Recognition
- Baker, J.¹

4
- 85054435217
- Baker, J., L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, and N. Morgan (2007). MINDS report: Historical development and future directions in speech recognition and understanding. http://wwwnlpir. nist.gov/MINDS/FINAL/speech.web.pdf.
- (2007) MINDS report: Historical development and future directions in speech recognition and understanding
- Baker, J.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ Morgan, N.⁶

5
- 85032751593
- Updated MINDS report on speech recognition and understanding part I
- Baker, J., L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O’Shaughnessy (2009a). Updated MINDS report on speech recognition and understanding part I,. IEEE Signal Processing Magazine, 26(3), 75-80.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.3 , pp. 75-80
- Baker, J.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ Morgan, N.⁶ O’Shaughnessy, D.⁷

6
- 85032759066
- Updated MINDS report on speech recognition and understanding part II
- Baker, J., L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O’Shaughnessy (2009b). Updated MINDS report on speech recognition and understanding part II,. IEEE Signal Processing Magazine, 26(4), 78-85.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.4 , pp. 78-85
- Baker, J.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ Morgan, N.⁶ O’Shaughnessy, D.⁷

7
- 0001862769
- An inequality and associated maximization technique occurring in statistical estimation for probabilistic functions of a Markov process
- Baum, L. (1972). An inequality and associated maximization technique occurring in statistical estimation for probabilistic functions of a Markov process, Inequalities, III, 1-8.
- (1972) Inequalities , vol.3 , pp. 1-8
- Baum, L.¹

8
- 0003802343
- Wadsworth & Brooks, Pacific Grove, CA
- Breiman, L., J. Friedman, R. Olshen, and C. Stone (1984). Classification and Regression Trees, Wadsworth & Brooks, Pacific Grove, CA.
- (1984) Classification and Regression Trees
- Breiman, L.¹ Friedman, J.² Olshen, R.³ Stone, C.⁴

9
- 0034295822
- Structured language modeling
- Chelba C. and F. Jelinek (2000). Structured language modeling, Computer Speech and Language, 14, 283-332.
- (2000) Computer Speech and Language , vol.14 , pp. 283-332
- Chelba, C.¹ Jelinek, F.²

10
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis, S. and P. Mermelstein (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4), 357-366.
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

11
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- Dempster, A., N. Laird, and D. Rubin (1977). Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, 39(1), 1-21.
- (1977) Journal of the Royal Statistical Society , vol.39 , Issue.1 , pp. 1-21
- Dempster, A.¹ Laird, N.² Rubin, D.³

12
- 0027678649
- A stochastic model of speech incorporating hierarchical nonstationarity
- Deng, L. (1993). A stochastic model of speech incorporating hierarchical nonstationarity, IEEE Transactions on Speech and Audio Processing, 1(4), 471-475.
- (1993) IEEE Transactions on Speech and Audio Processing , vol.1 , Issue.4 , pp. 471-475
- Deng, L.¹

13
- 4243109553
- Challenges in adopting speech recognition
- Deng, L. and X. D. Huang (2004). Challenges in adopting speech recognition, Communications of the ACM, 47(1), 11-13.
- (2004) Communications of the ACM , vol.47 , Issue.1 , pp. 11-13
- Deng, L.¹ Huang, X.D.²

14
- 4243117872
- Marcel Dekker Inc., New York
- Deng, L. and D. O’Shaughnessy (2003). Speech Processing-A Dynamic and Optimization-Oriented Approach, Marcel Dekker Inc., New York.
- (2003) Speech Processing-A Dynamic and Optimization-Oriented Approach
- Deng, L.¹ O’Shaughnessy, D.²

15
- 0030190520
- Transitional speech units and their representation by the regressive Markov states: Applications to speech recognition
- Deng, L. and H. Sameti (1996). Transitional speech units and their representation by the regressive Markov states: Applications to speech recognition, IEEE Transactions on Speech and Audio Processing, 4(4), 301-306.
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.4 , pp. 301-306
- Deng, L.¹ Sameti, H.²

16
- 56249141845
- Structure-based and template-based automatic speech recognition-Comparing parametric and non-parametric approaches
- Antwerp, Belgium
- Deng, L. and H. Strik (2007). Structure-based and template-based automatic speech recognition-Comparing parametric and non-parametric approaches, in Proceedings of the 8th Annual Conference of the International Speech Communication Association Interspeech, Antwerp, Belgium.
- (2007) Proceedings of the 8th Annual Conference of the International Speech Communication Association Interspeech
- Deng, L.¹ Strik, H.²

17
- 0028234947
- A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
- Deng, L. and D. Sun (1994). A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features, Journal of the Acoustical Society of America, 85(5), 2702-2719.
- (1994) Journal of the Acoustical Society of America , vol.85 , Issue.5 , pp. 2702-2719
- Deng, L.¹ Sun, D.²

18
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- Deng, L., M. Aksmanovic, D. Sun, and J. Wu (1994). Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states, IEEE Transactions on Speech and Audio Processing, 2, 507-520.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, D.³ Wu, J.⁴

19
- 2442551863
- Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features
- Deng, L., J. Droppo, and A. Acero (2004). Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features, IEEE Transactions on Speech and Audio Processing, 12(3), 218-233.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.3 , pp. 218-233
- Deng, L.¹ Droppo, J.² Acero, A.³

20
- 34047266395
- Structured speech modeling
- Deng, L., D. Yu, and A. Acero (2006). Structured speech modeling, IEEE Transactions on Audio, Speech and Language Processing (Special Issue on Rich Transcription), 14(5), 1492-1504.
- (2006) IEEE Transactions on Audio, Speech and Language Processing (Special Issue on Rich Transcription) , vol.14 , Issue.5 , pp. 1492-1504
- Deng, L.¹ Yu, D.² Acero, A.³

21
- 84901773892
- Environmental robustness
- Springer-Verlag, Berlin, Germany
- Droppo, J. and A. Acero (2008). Environmental robustness, in Handbook of Speech Processing, pp. 653-680, Springer-Verlag, Berlin, Germany.
- (2008) Handbook of Speech Processing , pp. 653-680
- Droppo, J.¹ Acero, A.²

22
- 0029725604
- A parametric approach to vocal tract length normalization
- Atlanta, GA
- Eide E. and H. Gish (1996). A parametric approach to vocal tract length normalization, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. 346-349, Atlanta, GA.
- (1996) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing , pp. 346-349
- Eide, E.¹ Gish, H.²

23
- 0030638031
- A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
- Santa Barbara, CA
- Fiscus, J. (1997). A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER), IEEE Automatic Speech Recognition and Understanding Workshop, pp. 3477-3482, Santa Barbara, CA.
- (1997) IEEE Automatic Speech Recognition and Understanding Workshop , pp. 3477-3482
- Fiscus, J.¹

24
- 34547525365
- ALGONQUIN-learningdynamic noise models from noisy speech for robust speech recognition
- Frey, B., T. T. Kristjansson, L. Deng, and A. Acero (2001). ALGONQUIN-learningdynamic noise models from noisy speech for robust speech recognition, in Proceedings of Neural Information Processing Systems, pp. 100-107.
- (2001) Proceedings of Neural Information Processing Systems , pp. 100-107
- Frey, B.¹ Kristjansson, T.T.² Deng, L.³ Acero, A.⁴

25
- 0004072715
- (2nd Ed.), Marcel Dekker Inc., New York
- Furui, S. (2001). Digital Speech Processing, Synthesis and Recognition (2nd Ed.), Marcel Dekker Inc., New York.
- (2001) Digital Speech Processing, Synthesis and Recognition
- Furui, S.¹

26
- 34047246149
- Maximum entropy direct models for speech recognition
- Gao Y. and J. Kuo (2006). Maximum entropy direct models for speech recognition, IEEE Transactions on Speech and Audio Processing, 14(3), 873-881.
- (2006) IEEE Transactions on Speech and Audio Processing , vol.14 , Issue.3 , pp. 873-881
- Gao, Y.¹ Kuo, J.²

27
- 85032775863
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Gauvain, J.-L. and C.-H. Lee (1997). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, 7, 711-720.
- (1997) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 711-720
- Gauvain, J.-L.¹ Lee, C.-H.²

28
- 0038359548
- A probabilistic framework for segment-based speech recognition
- M. Russell and J. Bilmes (eds.), Special issue
- Glass, J. (2003). A probabilistic framework for segment-based speech recognition, in M. Russell and J. Bilmes (eds.), New Computational Paradigms for Acoustic Modeling in Speech Recognition, Computer, Speech and Language (Special issue), 17(2-3), 137-152.
- (2003) New Computational Paradigms for Acoustic Modeling in Speech Recognition, Computer, Speech and Language , vol.17 , Issue.2-3 , pp. 137-152
- Glass, J.¹

29
- 0003901864
- John Wiley & Sons, New York
- Gold, B. and N. Morgan (2000). Speech and Audio Signal Processing, John Wiley & Sons, New York.
- (2000) Speech and Audio Signal Processing
- Gold, B.¹ Morgan, N.²

30
- 85009119467
- Discriminative speaker adaptation with conditional maximum likelihood linear regression
- Aalborg, Denmark
- Gunawardana, A. and W. Byrne (2001). Discriminative speaker adaptation with conditional maximum likelihood linear regression, Proceedings of the EUROSPEECH, Aalborg, Denmark.
- (2001) Proceedings of the EUROSPEECH
- Gunawardana, A.¹ Byrne, W.²

31
- 33745185781
- Hidden conditional random fields for phone classification
- Gunawardana, A., M. Mahajan, A. Acero, and J. C. Platt (2005). Hidden conditional random fields for phone classification, in Proceedings of the International Conference on Speech Communication and Technology, pp. 1117-1120.
- (2005) Proceedings of the International Conference on Speech Communication and Technology , pp. 1117-1120
- Gunawardana, A.¹ Mahajan, M.² Acero, A.³ Platt, J.C.⁴

32
- 85032750905
- Discriminative learning in sequential pattern recognition
- He, X., L. Deng, C. Wu (2008). Discriminative learning in sequential pattern recognition, IEEE Signal Processing Magazine, 25(5), 14-36.
- (2008) IEEE Signal Processing Magazine , vol.25 , Issue.5 , pp. 14-36
- He, X.¹ Deng, L.² Wu, C.³

33
- 0025041264
- Perceptual linear predictive analysis of speech
- Hermansky, H. (1990). Perceptual linear predictive analysis of speech, Journal of the Acoustical Society of America, 87(4), 1738-1752.
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

34
- 0028517164
- RASTA processing of speech
- Hermansky H. and N. Morgan (1994). RASTA processing of speech, IEEE Transactions on Speech and Audio Processing, 2(4), 578-589.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

35
- 85054456426
- Leading a start-up in an enterprise: Lessons learned in creating Microsoft response point
- Huang, X. D. (2009). Leading a start-up in an enterprise: Lessons learned in creating Microsoft response point, IEEE Signal Processing Magazine, 26(2), 135-138.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.2 , pp. 135-138
- Huang, X.D.¹

36
- 0027578837
- On speaker-independent, speaker-dependent and speaker adaptive speech recognition
- Huang, X. D. and K.-F. Lee (1993), On speaker-independent, speaker-dependent and speaker adaptive speech recognition, IEEE Transactions on Speech and Audio Processing, 1(2), 150-157.
- (1993) IEEE Transactions on Speech and Audio Processing , vol.1 , Issue.2 , pp. 150-157
- Huang, X.D.¹ Lee, K.-F.²

37
- 0004056285
- Prentice Hall, Upper Saddle River, NJ
- Huang, X. D., A. Acero, and H. Hon (2001). Spoken Language Processing-A Guide to Theory, Algorithms, and System Development, Prentice Hall, Upper Saddle River, NJ.
- (2001) Spoken Language Processing-A Guide to Theory, Algorithms, and System Development
- Huang, X.D.¹ Acero, A.² Hon, H.³

38
- 0014602879
- A fast sequential decoding algorithm using a stack
- Jelinek, F. (1969) A fast sequential decoding algorithm using a stack, IBM Journal of Research and Development, 13, 675-685.
- (1969) IBM Journal of Research and Development , vol.13 , pp. 675-685
- Jelinek, F.¹

39
- 0016939124
- Continuous speech recognition by statistical methods
- Jelinek, F. (1976). Continuous speech recognition by statistical methods, Proceedings of the IEEE, 64(4), 532-557.
- (1976) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 532-557
- Jelinek, F.¹

40
- 0003786003
- MIT Press, Cambridge, MA
- Jelinek, F. (1997). Statistical Methods for Speech Recognition, MIT Press, Cambridge, MA.
- (1997) Statistical Methods for Speech Recognition
- Jelinek, F.¹

41
- 15844399848
- Vocabulary-independent word confidence measure using subword features
- Sydney, NSW
- Jiang, L. and X. D. Huang (1998). Vocabulary-independent word confidence measure using subword features, in Proceedings of the International Conference on Spoken Language Processing, pp. 401-404, Sydney, NSW.
- (1998) Proceedings of the International Conference on Spoken Language Processing , pp. 401-404
- Jiang, L.¹ Huang, X.D.²

42
- 0003847769
- Prentice Hall, Upper Saddle River, NJ
- Jurafsky D. and J. Martin (2000). Speech and Language Processing-An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Prentice Hall, Upper Saddle River, NJ.
- (2000) Speech and Language Processing-An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
- Jurafsky, D.¹ Martin, J.²

43
- 0032289099
- Heteroscedastic analysis and reduced rank HMMs for improved speech recognition
- Kumar, N. and A. Andreou (1998). Heteroscedastic analysis and reduced rank HMMs for improved speech recognition, Speech Communication, 26, 283-297.
- (1998) Speech Communication , vol.26 , pp. 283-297
- Kumar, N.¹ Andreou, A.²

44
- 0003770715
- Springer-Verlag, Berlin, Germany
- Lee, K. F. (1988). Automatic Speech Recognition: The Development of the Sphinx Recognition System, Springer-Verlag, Berlin, Germany.
- (1988) Automatic Speech Recognition: The Development of the Sphinx Recognition System
- Lee, K.F.¹

45
- 0003770711
- Kluwer Academic, Norwell, MA
- Lee, C., F. Soong, and K. Paliwal (eds.) (1996). Automatic Speech and Speaker Recognition-Advanced Topics, Kluwer Academic, Norwell, MA.
- (1996) Automatic Speech and Speaker Recognition-Advanced Topics
- Lee, C.¹ Soong, F.² Paliwal, K.³

46
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- Leggetter C. and P. Woodland (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech and Language, 9, 171-185.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

47
- 0023331258
- An introduction to computing with neural nets
- Lippman, R. (1987). An introduction to computing with neural nets, IEEE ASSP Magazine, 4(2), 4-22.
- (1987) IEEE ASSP Magazine , vol.4 , Issue.2 , pp. 4-22
- Lippman, R.¹

48
- 33745208000
- Investigations on error minimizing training criteria for discriminative training in automatic speech recognition
- Lisbon, Portugal
- Macherey, M., L. Haferkamp, R. Schlüter, and H. Ney (2005). Investigations on error minimizing training criteria for discriminative training in automatic speech recognition, in Proceedings of Interspeech, pp. 2133-2136, Lisbon, Portugal.
- (2005) Proceedings of Interspeech , pp. 2133-2136
- Macherey, M.¹ Haferkamp, L.² Schlüter, R.³ Ney, H.⁴

49
- 85032751546
- Pushing the envelope-Aside
- Morgan, N., Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos (2005). Pushing the envelope-Aside, IEEE Signal Processing Magazine, 22, 81-88.
- (2005) IEEE Signal Processing Magazine , vol.22 , pp. 81-88
- Morgan, N.¹ Zhu, Q.² Stolcke, A.³ Sonmez, K.⁴ Sivadas, S.⁵ Shinozaki, T.⁶ Ostendorf, M.⁷ Jain, P.⁸ Hermansky, H.⁹ Ellis, D.¹⁰ Doddington, G.¹¹ Chen, B.¹² Cetin, O.¹³ Bourlard, H.¹⁴ Athineos, M.¹⁵

50
- 0021406359
- The use of a one-stage dynamic programming algorithm for connected word recognition
- Ney, H. (1984). The use of a one-stage dynamic programming algorithm for connected word recognition, IEEE Transactions on ASSP, 32, 263-271.
- (1984) IEEE Transactions on ASSP , vol.32 , pp. 263-271
- Ney, H.¹

51
- 0030245363
- From HMMs to segment models: A unified view of stochastic modeling for speech recognition
- Ostendorf, M., V. Digalakis, and J. Rohlicek (1996). From HMMs to segment models: A unified view of stochastic modeling for speech recognition, IEEE Transactions on Speech and Audio Processing, 4, 360-378.
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.² Rohlicek, J.³

52
- 0023834849
- Hidden Markov models: A guided tour
- Seattle, WA
- Poritz, A. (1988). Hidden Markov models: A guided tour, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 1-4, Seattle, WA.
- (1988) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 1-4
- Poritz, A.¹

53
- 33646788786
- FMPE: Discriminatively trained features for speech recognition
- Philadelphia, PA
- Povey, B., Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig (2005). FMPE: Discriminatively trained features for speech recognition, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA.
- (2005) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
- Povey, B.¹ Kingsbury, L.M.² Saon, G.³ Soltau, H.⁴ Zweig, G.⁵

54
- 0004244302
- Prentice Hall, Englewood Cliffs, NJ
- Rabiner, L. and B. Juang (1993). Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.²

55
- 0344611009
- Academic Press, New York
- Reddy, D. R. (ed.) (1975). Speech Recognition, Academic Press, New York.
- (1975) Speech Recognition
- Reddy, D.R.¹

56
- 84881675408
- Cepstral channel normalization techniques for HMMbased speaker verification
- Adelaide, SA
- Rosenberg, A., C. H. Lee, and F. K. Soong (1994). Cepstral channel normalization techniques for HMMbased speaker verification, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. 1835-1838, Adelaide, SA.
- (1994) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing , pp. 1835-1838
- Rosenberg, A.¹ Lee, C.H.² Soong, F.K.³

57
- 0005670423
- A dynamic programming approach to continuous speech recognition
- Budapest, Hungary
- Sakoe, S. and S. Chiba (1971). A dynamic programming approach to continuous speech recognition, in Proceedings of the 7th International Congress on Acoustics, Vol. 3, pp. 65-69, Budapest, Hungary.
- (1971) Proceedings of the 7th International Congress on Acoustics , vol.3 , pp. 65-69
- Sakoe, S.¹ Chiba, S.²

58
- 0010727514
- Speech discrimination by dynamic programming
- Vintsyuk, T. (1968). Speech discrimination by dynamic programming, Kibernetika, 4(2), 81-88.
- (1968) Kibernetika , vol.4 , Issue.2 , pp. 81-88
- Vintsyuk, T.¹

59
- 84935113569
- Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
- Viterbi, A. (1967). Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, in IEEE Transactions on Information Theory, IT-13(2), 260-269.
- (1967) IEEE Transactions on Information Theory , vol.13 IT , Issue.2 , pp. 260-269
- Viterbi, A.¹

60
- 0025627406
- The N-best algorithm: An efficient and exact procedure for finding the N most likely sentence hypotheses
- Albuquerque, NM
- Schwartz, R. and Y. Chow (1990). The N-best algorithm: An efficient and exact procedure for finding the N most likely sentence hypotheses, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM.
- (1990) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
- Schwartz, R.¹ Chow, Y.²

61
- 0036165806
- An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition
- Sun, J. and L. Deng (2002). An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition, Journal of the Acoustical Society of America, 111(2), 1086-1101.
- (2002) Journal of the Acoustical Society of America , vol.111 , Issue.2 , pp. 1086-1101
- Sun, J.¹ Deng, L.²

62
- 70349209414
- Discriminative pronunciation learning using phonetic decoder and minimum-classification-error criterion
- Taipei, Taiwan
- Vinyals, O., L. Deng, D. Yu, and A. Acero (2009) Discriminative pronunciation learning using phonetic decoder and minimum-classification-error criterion, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan.
- (2009) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
- Vinyals, O.¹ Deng, L.² Yu, D.³ Acero, A.⁴

63
- 0033677002
- A unified context-free grammar and n-gram model for spoken language processing
- Istanbul, Turkey
- Wang, Y., M. Mahajan, and X. Huang (2000). A unified context-free grammar and n-gram model for spoken language processing, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey.
- (2000) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
- Wang, Y.¹ Mahajan, M.² Huang, X.³

64
- 85032751364
- An introduction to voice search
- Wang, Y., D. Yu, Y. Ju, and A. Acero (2008). An introduction to voice search, IEEE Signal Processing Magazine (Special Issue on Spoken Language Technology), 25(3), 29-38.
- (2008) IEEE Signal Processing Magazine (Special Issue on Spoken Language Technology) , vol.25 , Issue.3 , pp. 29-38
- Wang, Y.¹ Yu, D.² Ju, Y.³ Acero, A.⁴

65
- 85009227403
- Data-driven example based continuous speech recognition
- Geneva, Switzerland
- Wachter, M., K. Demuynck, D. Van Compernolle, and P. Wambacq (2003). Data-driven example based continuous speech recognition, in Proceedings of the EUROSPEECH, pp. 1133-1136, Geneva, Switzerland.
- (2003) Proceedings of the EUROSPEECH , pp. 1133-1136
- Wachter, M.¹ Demuynck, K.² Van Compernolle, D.³ Wambacq, P.⁴

66
- 66149085249
- An integrative and discriminative technique for spoken utterance classification
- Yaman, S., L. Deng, D. Yu, Y. Wang, and A. Acero (2008). An integrative and discriminative technique for spoken utterance classification, IEEE Transactions on Audio, Speech, and Language Processing, 16(6), 1207-1214.
- (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.16 , Issue.6 , pp. 1207-1214
- Yaman, S.¹ Deng, L.² Yu, D.³ Wang, Y.⁴ Acero, A.⁵

67
- 66149101303
- Robust speech recognition using cepstral minimum-mean-square-error noise suppressor
- Yu, D., L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero (2008). Robust speech recognition using cepstral minimum-mean-square-error noise suppressor, IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1061-1070.
- (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.16 , Issue.5 , pp. 1061-1070
- Yu, D.¹ Deng, L.² Droppo, J.³ Wu, J.⁴ Gong, Y.⁵ Acero, A.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.