-
1
-
-
0030677475
-
Speaker adaptive training: Amaximumlikelihood approach to speaker normalization
-
Munich, Germany
-
Anastasakos, T., J. McDonough, and J. Makhoul (1997). Speaker adaptive training:Amaximumlikelihood approach to speaker normalization, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1043-1046, Munich, Germany.
-
(1997)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 1043-1046
-
-
Anastasakos, T.1
McDonough, J.2
Makhoul, J.3
-
2
-
-
0022890536
-
Maximum mutual information estimation of hidden Markov model parameters for speech recognition
-
Tokyo, Japan
-
Bahl, L., P. Brown, P. de Souza, and R. Mercer (1986). Maximum mutual information estimation of hidden Markov model parameters for speech recognition, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 49-52, Tokyo, Japan.
-
(1986)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 49-52
-
-
Bahl, L.1
Brown, P.2
De Souza, P.3
Mercer, R.4
-
3
-
-
0040856612
-
Stochastic modeling for automatic speech recognition
-
D. R. Reddy, (ed.), Academic Press, New York
-
Baker, J. (1975). Stochastic modeling for automatic speech recognition, in D. R. Reddy, (ed.), Speech Recognition, Academic Press, New York.
-
(1975)
Speech Recognition
-
-
Baker, J.1
-
4
-
-
85054435217
-
-
Baker, J., L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, and N. Morgan (2007). MINDS report: Historical development and future directions in speech recognition and understanding. http://wwwnlpir. nist.gov/MINDS/FINAL/speech.web.pdf.
-
(2007)
MINDS report: Historical development and future directions in speech recognition and understanding
-
-
Baker, J.1
Deng, L.2
Glass, J.3
Khudanpur, S.4
Lee, C.-H.5
Morgan, N.6
-
5
-
-
85032751593
-
Updated MINDS report on speech recognition and understanding part I
-
Baker, J., L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O’Shaughnessy (2009a). Updated MINDS report on speech recognition and understanding part I,. IEEE Signal Processing Magazine, 26(3), 75-80.
-
(2009)
IEEE Signal Processing Magazine
, vol.26
, Issue.3
, pp. 75-80
-
-
Baker, J.1
Deng, L.2
Glass, J.3
Khudanpur, S.4
Lee, C.-H.5
Morgan, N.6
O’Shaughnessy, D.7
-
6
-
-
85032759066
-
Updated MINDS report on speech recognition and understanding part II
-
Baker, J., L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O’Shaughnessy (2009b). Updated MINDS report on speech recognition and understanding part II,. IEEE Signal Processing Magazine, 26(4), 78-85.
-
(2009)
IEEE Signal Processing Magazine
, vol.26
, Issue.4
, pp. 78-85
-
-
Baker, J.1
Deng, L.2
Glass, J.3
Khudanpur, S.4
Lee, C.-H.5
Morgan, N.6
O’Shaughnessy, D.7
-
7
-
-
0001862769
-
An inequality and associated maximization technique occurring in statistical estimation for probabilistic functions of a Markov process
-
Baum, L. (1972). An inequality and associated maximization technique occurring in statistical estimation for probabilistic functions of a Markov process, Inequalities, III, 1-8.
-
(1972)
Inequalities
, vol.3
, pp. 1-8
-
-
Baum, L.1
-
8
-
-
0003802343
-
-
Wadsworth & Brooks, Pacific Grove, CA
-
Breiman, L., J. Friedman, R. Olshen, and C. Stone (1984). Classification and Regression Trees, Wadsworth & Brooks, Pacific Grove, CA.
-
(1984)
Classification and Regression Trees
-
-
Breiman, L.1
Friedman, J.2
Olshen, R.3
Stone, C.4
-
10
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Davis, S. and P. Mermelstein (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4), 357-366.
-
(1980)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
11
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
Dempster, A., N. Laird, and D. Rubin (1977). Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, 39(1), 1-21.
-
(1977)
Journal of the Royal Statistical Society
, vol.39
, Issue.1
, pp. 1-21
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
12
-
-
0027678649
-
A stochastic model of speech incorporating hierarchical nonstationarity
-
Deng, L. (1993). A stochastic model of speech incorporating hierarchical nonstationarity, IEEE Transactions on Speech and Audio Processing, 1(4), 471-475.
-
(1993)
IEEE Transactions on Speech and Audio Processing
, vol.1
, Issue.4
, pp. 471-475
-
-
Deng, L.1
-
13
-
-
4243109553
-
Challenges in adopting speech recognition
-
Deng, L. and X. D. Huang (2004). Challenges in adopting speech recognition, Communications of the ACM, 47(1), 11-13.
-
(2004)
Communications of the ACM
, vol.47
, Issue.1
, pp. 11-13
-
-
Deng, L.1
Huang, X.D.2
-
15
-
-
0030190520
-
Transitional speech units and their representation by the regressive Markov states: Applications to speech recognition
-
Deng, L. and H. Sameti (1996). Transitional speech units and their representation by the regressive Markov states: Applications to speech recognition, IEEE Transactions on Speech and Audio Processing, 4(4), 301-306.
-
(1996)
IEEE Transactions on Speech and Audio Processing
, vol.4
, Issue.4
, pp. 301-306
-
-
Deng, L.1
Sameti, H.2
-
17
-
-
0028234947
-
A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
-
Deng, L. and D. Sun (1994). A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features, Journal of the Acoustical Society of America, 85(5), 2702-2719.
-
(1994)
Journal of the Acoustical Society of America
, vol.85
, Issue.5
, pp. 2702-2719
-
-
Deng, L.1
Sun, D.2
-
18
-
-
0028516022
-
Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
-
Deng, L., M. Aksmanovic, D. Sun, and J. Wu (1994). Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states, IEEE Transactions on Speech and Audio Processing, 2, 507-520.
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, pp. 507-520
-
-
Deng, L.1
Aksmanovic, M.2
Sun, D.3
Wu, J.4
-
19
-
-
2442551863
-
Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features
-
Deng, L., J. Droppo, and A. Acero (2004). Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features, IEEE Transactions on Speech and Audio Processing, 12(3), 218-233.
-
(2004)
IEEE Transactions on Speech and Audio Processing
, vol.12
, Issue.3
, pp. 218-233
-
-
Deng, L.1
Droppo, J.2
Acero, A.3
-
20
-
-
34047266395
-
Structured speech modeling
-
Deng, L., D. Yu, and A. Acero (2006). Structured speech modeling, IEEE Transactions on Audio, Speech and Language Processing (Special Issue on Rich Transcription), 14(5), 1492-1504.
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing (Special Issue on Rich Transcription)
, vol.14
, Issue.5
, pp. 1492-1504
-
-
Deng, L.1
Yu, D.2
Acero, A.3
-
21
-
-
84901773892
-
Environmental robustness
-
Springer-Verlag, Berlin, Germany
-
Droppo, J. and A. Acero (2008). Environmental robustness, in Handbook of Speech Processing, pp. 653-680, Springer-Verlag, Berlin, Germany.
-
(2008)
Handbook of Speech Processing
, pp. 653-680
-
-
Droppo, J.1
Acero, A.2
-
22
-
-
0029725604
-
A parametric approach to vocal tract length normalization
-
Atlanta, GA
-
Eide E. and H. Gish (1996). A parametric approach to vocal tract length normalization, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. 346-349, Atlanta, GA.
-
(1996)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
, pp. 346-349
-
-
Eide, E.1
Gish, H.2
-
23
-
-
0030638031
-
A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
-
Santa Barbara, CA
-
Fiscus, J. (1997). A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER), IEEE Automatic Speech Recognition and Understanding Workshop, pp. 3477-3482, Santa Barbara, CA.
-
(1997)
IEEE Automatic Speech Recognition and Understanding Workshop
, pp. 3477-3482
-
-
Fiscus, J.1
-
24
-
-
34547525365
-
ALGONQUIN-learningdynamic noise models from noisy speech for robust speech recognition
-
Frey, B., T. T. Kristjansson, L. Deng, and A. Acero (2001). ALGONQUIN-learningdynamic noise models from noisy speech for robust speech recognition, in Proceedings of Neural Information Processing Systems, pp. 100-107.
-
(2001)
Proceedings of Neural Information Processing Systems
, pp. 100-107
-
-
Frey, B.1
Kristjansson, T.T.2
Deng, L.3
Acero, A.4
-
27
-
-
85032775863
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Gauvain, J.-L. and C.-H. Lee (1997). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, 7, 711-720.
-
(1997)
IEEE Transactions on Speech and Audio Processing
, vol.7
, pp. 711-720
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
28
-
-
0038359548
-
A probabilistic framework for segment-based speech recognition
-
M. Russell and J. Bilmes (eds.), Special issue
-
Glass, J. (2003). A probabilistic framework for segment-based speech recognition, in M. Russell and J. Bilmes (eds.), New Computational Paradigms for Acoustic Modeling in Speech Recognition, Computer, Speech and Language (Special issue), 17(2-3), 137-152.
-
(2003)
New Computational Paradigms for Acoustic Modeling in Speech Recognition, Computer, Speech and Language
, vol.17
, Issue.2-3
, pp. 137-152
-
-
Glass, J.1
-
30
-
-
85009119467
-
Discriminative speaker adaptation with conditional maximum likelihood linear regression
-
Aalborg, Denmark
-
Gunawardana, A. and W. Byrne (2001). Discriminative speaker adaptation with conditional maximum likelihood linear regression, Proceedings of the EUROSPEECH, Aalborg, Denmark.
-
(2001)
Proceedings of the EUROSPEECH
-
-
Gunawardana, A.1
Byrne, W.2
-
31
-
-
33745185781
-
Hidden conditional random fields for phone classification
-
Gunawardana, A., M. Mahajan, A. Acero, and J. C. Platt (2005). Hidden conditional random fields for phone classification, in Proceedings of the International Conference on Speech Communication and Technology, pp. 1117-1120.
-
(2005)
Proceedings of the International Conference on Speech Communication and Technology
, pp. 1117-1120
-
-
Gunawardana, A.1
Mahajan, M.2
Acero, A.3
Platt, J.C.4
-
32
-
-
85032750905
-
Discriminative learning in sequential pattern recognition
-
He, X., L. Deng, C. Wu (2008). Discriminative learning in sequential pattern recognition, IEEE Signal Processing Magazine, 25(5), 14-36.
-
(2008)
IEEE Signal Processing Magazine
, vol.25
, Issue.5
, pp. 14-36
-
-
He, X.1
Deng, L.2
Wu, C.3
-
33
-
-
0025041264
-
Perceptual linear predictive analysis of speech
-
Hermansky, H. (1990). Perceptual linear predictive analysis of speech, Journal of the Acoustical Society of America, 87(4), 1738-1752.
-
(1990)
Journal of the Acoustical Society of America
, vol.87
, Issue.4
, pp. 1738-1752
-
-
Hermansky, H.1
-
35
-
-
85054456426
-
Leading a start-up in an enterprise: Lessons learned in creating Microsoft response point
-
Huang, X. D. (2009). Leading a start-up in an enterprise: Lessons learned in creating Microsoft response point, IEEE Signal Processing Magazine, 26(2), 135-138.
-
(2009)
IEEE Signal Processing Magazine
, vol.26
, Issue.2
, pp. 135-138
-
-
Huang, X.D.1
-
36
-
-
0027578837
-
On speaker-independent, speaker-dependent and speaker adaptive speech recognition
-
Huang, X. D. and K.-F. Lee (1993), On speaker-independent, speaker-dependent and speaker adaptive speech recognition, IEEE Transactions on Speech and Audio Processing, 1(2), 150-157.
-
(1993)
IEEE Transactions on Speech and Audio Processing
, vol.1
, Issue.2
, pp. 150-157
-
-
Huang, X.D.1
Lee, K.-F.2
-
37
-
-
0004056285
-
-
Prentice Hall, Upper Saddle River, NJ
-
Huang, X. D., A. Acero, and H. Hon (2001). Spoken Language Processing-A Guide to Theory, Algorithms, and System Development, Prentice Hall, Upper Saddle River, NJ.
-
(2001)
Spoken Language Processing-A Guide to Theory, Algorithms, and System Development
-
-
Huang, X.D.1
Acero, A.2
Hon, H.3
-
38
-
-
0014602879
-
A fast sequential decoding algorithm using a stack
-
Jelinek, F. (1969) A fast sequential decoding algorithm using a stack, IBM Journal of Research and Development, 13, 675-685.
-
(1969)
IBM Journal of Research and Development
, vol.13
, pp. 675-685
-
-
Jelinek, F.1
-
39
-
-
0016939124
-
Continuous speech recognition by statistical methods
-
Jelinek, F. (1976). Continuous speech recognition by statistical methods, Proceedings of the IEEE, 64(4), 532-557.
-
(1976)
Proceedings of the IEEE
, vol.64
, Issue.4
, pp. 532-557
-
-
Jelinek, F.1
-
42
-
-
0003847769
-
-
Prentice Hall, Upper Saddle River, NJ
-
Jurafsky D. and J. Martin (2000). Speech and Language Processing-An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Prentice Hall, Upper Saddle River, NJ.
-
(2000)
Speech and Language Processing-An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
-
-
Jurafsky, D.1
Martin, J.2
-
43
-
-
0032289099
-
Heteroscedastic analysis and reduced rank HMMs for improved speech recognition
-
Kumar, N. and A. Andreou (1998). Heteroscedastic analysis and reduced rank HMMs for improved speech recognition, Speech Communication, 26, 283-297.
-
(1998)
Speech Communication
, vol.26
, pp. 283-297
-
-
Kumar, N.1
Andreou, A.2
-
45
-
-
0003770711
-
-
Kluwer Academic, Norwell, MA
-
Lee, C., F. Soong, and K. Paliwal (eds.) (1996). Automatic Speech and Speaker Recognition-Advanced Topics, Kluwer Academic, Norwell, MA.
-
(1996)
Automatic Speech and Speaker Recognition-Advanced Topics
-
-
Lee, C.1
Soong, F.2
Paliwal, K.3
-
46
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
Leggetter C. and P. Woodland (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech and Language, 9, 171-185.
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.1
Woodland, P.2
-
47
-
-
0023331258
-
An introduction to computing with neural nets
-
Lippman, R. (1987). An introduction to computing with neural nets, IEEE ASSP Magazine, 4(2), 4-22.
-
(1987)
IEEE ASSP Magazine
, vol.4
, Issue.2
, pp. 4-22
-
-
Lippman, R.1
-
48
-
-
33745208000
-
Investigations on error minimizing training criteria for discriminative training in automatic speech recognition
-
Lisbon, Portugal
-
Macherey, M., L. Haferkamp, R. Schlüter, and H. Ney (2005). Investigations on error minimizing training criteria for discriminative training in automatic speech recognition, in Proceedings of Interspeech, pp. 2133-2136, Lisbon, Portugal.
-
(2005)
Proceedings of Interspeech
, pp. 2133-2136
-
-
Macherey, M.1
Haferkamp, L.2
Schlüter, R.3
Ney, H.4
-
49
-
-
85032751546
-
Pushing the envelope-Aside
-
Morgan, N., Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos (2005). Pushing the envelope-Aside, IEEE Signal Processing Magazine, 22, 81-88.
-
(2005)
IEEE Signal Processing Magazine
, vol.22
, pp. 81-88
-
-
Morgan, N.1
Zhu, Q.2
Stolcke, A.3
Sonmez, K.4
Sivadas, S.5
Shinozaki, T.6
Ostendorf, M.7
Jain, P.8
Hermansky, H.9
Ellis, D.10
Doddington, G.11
Chen, B.12
Cetin, O.13
Bourlard, H.14
Athineos, M.15
-
50
-
-
0021406359
-
The use of a one-stage dynamic programming algorithm for connected word recognition
-
Ney, H. (1984). The use of a one-stage dynamic programming algorithm for connected word recognition, IEEE Transactions on ASSP, 32, 263-271.
-
(1984)
IEEE Transactions on ASSP
, vol.32
, pp. 263-271
-
-
Ney, H.1
-
51
-
-
0030245363
-
From HMMs to segment models: A unified view of stochastic modeling for speech recognition
-
Ostendorf, M., V. Digalakis, and J. Rohlicek (1996). From HMMs to segment models: A unified view of stochastic modeling for speech recognition, IEEE Transactions on Speech and Audio Processing, 4, 360-378.
-
(1996)
IEEE Transactions on Speech and Audio Processing
, vol.4
, pp. 360-378
-
-
Ostendorf, M.1
Digalakis, V.2
Rohlicek, J.3
-
52
-
-
0023834849
-
Hidden Markov models: A guided tour
-
Seattle, WA
-
Poritz, A. (1988). Hidden Markov models: A guided tour, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 1-4, Seattle, WA.
-
(1988)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
, vol.1
, pp. 1-4
-
-
Poritz, A.1
-
53
-
-
33646788786
-
FMPE: Discriminatively trained features for speech recognition
-
Philadelphia, PA
-
Povey, B., Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig (2005). FMPE: Discriminatively trained features for speech recognition, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA.
-
(2005)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
-
-
Povey, B.1
Kingsbury, L.M.2
Saon, G.3
Soltau, H.4
Zweig, G.5
-
56
-
-
84881675408
-
Cepstral channel normalization techniques for HMMbased speaker verification
-
Adelaide, SA
-
Rosenberg, A., C. H. Lee, and F. K. Soong (1994). Cepstral channel normalization techniques for HMMbased speaker verification, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. 1835-1838, Adelaide, SA.
-
(1994)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
, pp. 1835-1838
-
-
Rosenberg, A.1
Lee, C.H.2
Soong, F.K.3
-
57
-
-
0005670423
-
A dynamic programming approach to continuous speech recognition
-
Budapest, Hungary
-
Sakoe, S. and S. Chiba (1971). A dynamic programming approach to continuous speech recognition, in Proceedings of the 7th International Congress on Acoustics, Vol. 3, pp. 65-69, Budapest, Hungary.
-
(1971)
Proceedings of the 7th International Congress on Acoustics
, vol.3
, pp. 65-69
-
-
Sakoe, S.1
Chiba, S.2
-
58
-
-
0010727514
-
Speech discrimination by dynamic programming
-
Vintsyuk, T. (1968). Speech discrimination by dynamic programming, Kibernetika, 4(2), 81-88.
-
(1968)
Kibernetika
, vol.4
, Issue.2
, pp. 81-88
-
-
Vintsyuk, T.1
-
59
-
-
84935113569
-
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
-
Viterbi, A. (1967). Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, in IEEE Transactions on Information Theory, IT-13(2), 260-269.
-
(1967)
IEEE Transactions on Information Theory
, vol.13 IT
, Issue.2
, pp. 260-269
-
-
Viterbi, A.1
-
60
-
-
0025627406
-
The N-best algorithm: An efficient and exact procedure for finding the N most likely sentence hypotheses
-
Albuquerque, NM
-
Schwartz, R. and Y. Chow (1990). The N-best algorithm: An efficient and exact procedure for finding the N most likely sentence hypotheses, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM.
-
(1990)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
-
-
Schwartz, R.1
Chow, Y.2
-
61
-
-
0036165806
-
An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition
-
Sun, J. and L. Deng (2002). An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition, Journal of the Acoustical Society of America, 111(2), 1086-1101.
-
(2002)
Journal of the Acoustical Society of America
, vol.111
, Issue.2
, pp. 1086-1101
-
-
Sun, J.1
Deng, L.2
-
62
-
-
70349209414
-
Discriminative pronunciation learning using phonetic decoder and minimum-classification-error criterion
-
Taipei, Taiwan
-
Vinyals, O., L. Deng, D. Yu, and A. Acero (2009) Discriminative pronunciation learning using phonetic decoder and minimum-classification-error criterion, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan.
-
(2009)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
-
-
Vinyals, O.1
Deng, L.2
Yu, D.3
Acero, A.4
-
63
-
-
0033677002
-
A unified context-free grammar and n-gram model for spoken language processing
-
Istanbul, Turkey
-
Wang, Y., M. Mahajan, and X. Huang (2000). A unified context-free grammar and n-gram model for spoken language processing, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey.
-
(2000)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
-
-
Wang, Y.1
Mahajan, M.2
Huang, X.3
-
64
-
-
85032751364
-
An introduction to voice search
-
Wang, Y., D. Yu, Y. Ju, and A. Acero (2008). An introduction to voice search, IEEE Signal Processing Magazine (Special Issue on Spoken Language Technology), 25(3), 29-38.
-
(2008)
IEEE Signal Processing Magazine (Special Issue on Spoken Language Technology)
, vol.25
, Issue.3
, pp. 29-38
-
-
Wang, Y.1
Yu, D.2
Ju, Y.3
Acero, A.4
-
65
-
-
85009227403
-
Data-driven example based continuous speech recognition
-
Geneva, Switzerland
-
Wachter, M., K. Demuynck, D. Van Compernolle, and P. Wambacq (2003). Data-driven example based continuous speech recognition, in Proceedings of the EUROSPEECH, pp. 1133-1136, Geneva, Switzerland.
-
(2003)
Proceedings of the EUROSPEECH
, pp. 1133-1136
-
-
Wachter, M.1
Demuynck, K.2
Van Compernolle, D.3
Wambacq, P.4
-
66
-
-
66149085249
-
An integrative and discriminative technique for spoken utterance classification
-
Yaman, S., L. Deng, D. Yu, Y. Wang, and A. Acero (2008). An integrative and discriminative technique for spoken utterance classification, IEEE Transactions on Audio, Speech, and Language Processing, 16(6), 1207-1214.
-
(2008)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.16
, Issue.6
, pp. 1207-1214
-
-
Yaman, S.1
Deng, L.2
Yu, D.3
Wang, Y.4
Acero, A.5
-
67
-
-
66149101303
-
Robust speech recognition using cepstral minimum-mean-square-error noise suppressor
-
Yu, D., L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero (2008). Robust speech recognition using cepstral minimum-mean-square-error noise suppressor, IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1061-1070.
-
(2008)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.16
, Issue.5
, pp. 1061-1070
-
-
Yu, D.1
Deng, L.2
Droppo, J.3
Wu, J.4
Gong, Y.5
Acero, A.6
|