SCOPUS Information Retrieval Platform

EURASIP Journal on Audio, Speech, and Music Processing

Volume 2009, 2009

Recognition of noisy speech: A comparative survey of robust model architecture and feature enhancement

(4) Schuller, Björn a Wöllmer, Martin a Moosmayr, Tobias b Rigoll, Gerhard a

a TECHNICAL UNIVERSITY OF MUNICH (Germany)

b BMW GROUP (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 67650135931 PISSN: 16874714 EISSN: 16874722 Source Type: Journal
DOI: 10.1155/2009/942617 Document Type: Article

Times cited : (38)

References (73)

1
- 65549153550
- Pittsburgh, Pa, USA Carnegie Mellon University
- Moreno P. J., Speech recognition in noisy environments, Ph.D. thesis 1996 Pittsburgh, Pa, USA Carnegie Mellon University
- (1996) Speech Recognition in Noisy Environments, Ph.D. Thesis
- Moreno, P.J.¹

2
- 0032785783
- Auditory processing of speech signals for robust speech recognition in real-world noisy environments
- Kim D.-S., Lee S.-Y., Kil R. M., Auditory processing of speech signals for robust speech recognition in real-world noisy environments IEEE Transactions on Speech and Audio Processing 1999 7 1 55 69
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.1 , pp. 55-69
- Kim, D.-S.¹ Lee, S.-Y.² Kil, R.M.³

3
- 67650206594
- Proceedings of ISCA Workshop on Robustness in Conversational Interaction (Robust 04) August 2004 Norwich, UK
- Rose R. C., Environmental robustness in automatic speech recognition Proceedings of ISCA Workshop on Robustness in Conversational Interaction (Robust 04) August 2004 Norwich, UK
- Environmental robustness in automatic speech recognition
- Rose, R.C.¹

4
- 85009154305
- Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 00) October 2000 Beijing, China
- de la Torre A., Fohr D., Haton J. P., Compensation of noise effects for robust speech recognition in car environments 3 Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 00) October 2000 Beijing, China 730 733
- Compensation of noise effects for robust speech recognition in car environments , vol.3 , pp. 730-733
- De La Torre, A.¹ Fohr, D.² Haton, J.P.³

5
- 85009091200
- Proceedings of the 5th European Conference on Speech Communication and Technology (Eurospeech 97) September 1997 Rhodes, Greece
- Langmann D., Fischer A., Wuppermann F., Haeb-Umbach R., Eisele T., Acoustic front ends for speaker-independent digit recognition in car environments Proceedings of the 5th European Conference on Speech Communication and Technology (Eurospeech 97) September 1997 Rhodes, Greece 2571 2574
- Acoustic front ends for speaker-independent digit recognition in car environments , pp. 2571-2574
- Langmann, D.¹ Fischer, A.² Wuppermann, F.³ Haeb-Umbach, R.⁴ Eisele, T.⁵

6
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- DOI 10.1121/1.399423
- Hermansky H., Perceptual linear predictive (PLP) analysis of speech The Journal of the Acoustical Society of America 1990 87 4 1738 1752 (Pubitemid 20256470)
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

7
- 0030127017
- Signal conditioning techniques for robust speech recognition
- PII S1070990896032531
- Rahim M. G., Juang B.-H., Chou W., Buhrke E., Signal conditioning techniques for robust speech recognition IEEE Signal Processing Letters 1996 3 4 107 109 (Pubitemid 126518955)
- (1996) IEEE Signal Processing Letters , vol.3 , Issue.4 , pp. 107-109
- Rahim, M.G.¹ Juang, B.-H.² Chou, W.³ Buhrke, E.⁴

8
- 0032141206
- Cepstral domain segmental feature vector normalization for noise robust speech recognition
- PII S0167639398000338
- Viikki O., Laurila K., Cepstral domain segmental feature vector normalization for noise robust speech recognition Speech Communication 1998 25 1-3 133 147 (Pubitemid 128413638)
- (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 133-147
- Viikki, O.¹ Laurila, K.²

9
- 18744371585
- Histogram equalization of speech representation for robust speech recognition
- de La Torre A., Peinado A. M., Segura J. C., Pérez-Córdoba J. L., Benítez M. C., Rubio A. J., Histogram equalization of speech representation for robust speech recognition IEEE Transactions on Speech and Audio Processing 2005 13 3 355 366
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.3 , pp. 355-366
- De La Torre, A.¹ Peinado, A.M.² Segura, J.C.³ Pérez-Córdoba, J.L.⁴ Benítez, M.C.⁵ Rubio, A.J.⁶

10
- 4544236840
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 04) May 2004 Montreal, Canada
- Droppo J., Acero A., Noise robust speech recognition with a switching linear dynamic model 1 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 04) May 2004 Montreal, Canada 953 956
- Noise robust speech recognition with a switching linear dynamic model , vol.1 , pp. 953-956
- Droppo, J.¹ Acero, A.²

11
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Rabiner L. R., A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE 1989 77 2 257 286
- (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

12
- 33745185781
- Hidden conditional random fields for phone classification
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Gunawardana A., Mahajan M., Acero A., Platt J. C., Hidden conditional random fields for phone classification Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 05) September 2005 Lisbon, Portugal 1117 1120 (Pubitemid 43908262)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 1117-1120
- Gunawardana, A.¹ Mahajan, M.² Acero, A.³ Platt, J.C.⁴

13
- 13244265597
- Revisiting autoregressive hidden Markov modeling of speech signals
- DOI 10.1109/LSP.2004.840914
- Ephraim Y., Roberts W. J. J., Revisiting autoregressive hidden Markov modeling of speech signals IEEE Signal Processing Letters 2005 12 2 166 169 (Pubitemid 40181881)
- (2005) IEEE Signal Processing Letters , vol.12 , Issue.2 , pp. 166-169
- Ephraim, Y.¹ Roberts, W.J.J.²

14
- 54349106040
- Switching linear dynamical systems for noise robust speech recognition
- Mesot B., Barber D., Switching linear dynamical systems for noise robust speech recognition IEEE Transactions on Audio, Speech, and Language Processing 2007 15 6 1850 1858
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.6 , pp. 1850-1858
- Mesot, B.¹ Barber, D.²

15
- 67650202047
- Proceedings of the DARPA of CSR Workshop February 1996 Ardenhouse, NY, USA
- Sankar A., Stolcke A., Chung T., Noise-resistant feature extraction and model training for robust speech recognition Proceedings of the DARPA of CSR Workshop February 1996 Ardenhouse, NY, USA 117 122
- Noise-resistant feature extraction and model training for robust speech recognition , pp. 117-122
- Sankar, A.¹ Stolcke, A.² Chung, T.³

16
- 0032027527
- Nonstationary environment compensation based on sequential estimation
- Kim N. S., Nonstationary environment compensation based on sequential estimation IEEE Signal Processing Letters 1998 5 3 57 59 (Pubitemid 128556794)
- (1998) IEEE Signal Processing Letters , vol.5 , Issue.3 , pp. 57-59
- Kim, N.S.¹

17
- 33846254193
- Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 05) November 2005 San Juan, Puerto Rico, USA
- Lathoud G., Magimai-Doss M., Mesot B., Bourlard H., Unsupervised spectral subtraction for noise-robust ASR Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 05) November 2005 San Juan, Puerto Rico, USA 343 348
- Unsupervised spectral subtraction for noise-robust ASR , pp. 343-348
- Lathoud, G.¹ Magimai-Doss, M.² Mesot, B.³ Bourlard, H.⁴

18
- 33745225168
- Comb filter decomposition for robust ASR
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Szymanski L., Bouchard M., Comb filter decomposition for robust ASR Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 05) September 2005 Lisbon, Portugal 2645 2648 (Pubitemid 43908639)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 2645-2648
- Szymanski, L.¹ Bouchard, M.²

19
- 34547531458
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07) April 2007 Honolulu, Hawaii, USA
- Rifkin R., Schutte K., Saad M., Bouvrie J., Glass J., Noise robust phonetic classification with linear regularized least squares and second-order features 4 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07) April 2007 Honolulu, Hawaii, USA 881 884
- Noise robust phonetic classification with linear regularized least squares and second-order features , vol.4 , pp. 881-884
- Rifkin, R.¹ Schutte, K.² Saad, M.³ Bouvrie, J.⁴ Glass, J.⁵

20
- 36248982228
- An FFT-based companding front end for noise-robust automatic speech recognition
- Raj B., Turicchia L., Schmidt-Nielsen B., Sarpeshkar R., An FFT-based companding front end for noise-robust automatic speech recognition EURASIP Journal on Audio, Speech, and Music Processing 2007 2007 13
- (2007) EURASIP Journal on Audio, Speech, and Music Processing , vol.2007 , pp. 13
- Raj, B.¹ Turicchia, L.² Schmidt-Nielsen, B.³ Sarpeshkar, R.⁴

21
- 67650211649
- Proceedings of the International Workshop on Automatic Speech Recognition: Challenges for the Next Millennium (ISCA ITRW ASR 00) September 2000 Paris, France
- Hirsch H. G., Pearce D., The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions Proceedings of the International Workshop on Automatic Speech Recognition: Challenges for the Next Millennium (ISCA ITRW ASR 00) September 2000 Paris, France
- The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Hirsch, H.G.¹ Pearce, D.²

22
- 0442317754
- ETSI ES 202 050 V1.1.5, Speech processing, transmission and quality aspects (STQ), distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms 2007
- (2007) ETSI ES 202 050 V1.1.5, Speech processing, transmission and quality aspects (STQ), distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms

23
- 67650182031
- Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 05) November 2005 San Juan, Puerto Rico, USA
- Lathoud G., Doss M. M., Bourlard H., Channel normalization for unsupervised spectral subtraction Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 05) November 2005 San Juan, Puerto Rico, USA
- Channel normalization for unsupervised spectral subtraction
- Lathoud, G.¹ Doss, M.M.² Bourlard, H.³

24
- 0030779363
- Noise compensation methods for hidden Markov model speech recognition in adverse environments
- PII S1063667697007670
- Vaseghi S. V., Milner B. P., Noise compensation methods for hidden Markov model speech recognition in adverse environments IEEE Transactions on Speech and Audio Processing 1997 5 1 11 21 (Pubitemid 127746030)
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.1 , pp. 11-21
- Vaseghi, S.V.¹ Milner, B.P.²

25
- 0003071809
- Evaluation and optimization of perceptually-based ASR front-end
- Junqua J.-C., Wakita H., Hermansky H., Evaluation and optimization of perceptually-based ASR front-end IEEE Transactions on Speech and Audio Processing 1993 1 1 39 48
- (1993) IEEE Transactions on Speech and Audio Processing , vol.1 , Issue.1 , pp. 39-48
- Junqua, J.-C.¹ Wakita, H.² Hermansky, H.³

26
- 0347968277
- Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
- Deng L., Droppo J., Acero A., Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition IEEE Transactions on Speech and Audio Processing 2003 11 6 568 580
- (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , Issue.6 , pp. 568-580
- Deng, L.¹ Droppo, J.² Acero, A.³

27
- 0033690878
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 00) June 2000 Istanbul, Turkey
- Zhu Q., Alwan A., On the use of variable frame rate analysis in speech recognition 3 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 00) June 2000 Istanbul, Turkey 1783 1786
- On the use of variable frame rate analysis in speech recognition , vol.3 , pp. 1783-1786
- Zhu, Q.¹ Alwan, A.²

28
- 34547549142
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07) April 2007 Honolulu, Hawaii, USA
- Schuller B., Seppi D., Batliner A., Maier A., Steidl S., Towards more reality in the recognition of emotional speech 4 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07) April 2007 Honolulu, Hawaii, USA 941 944
- Towards more reality in the recognition of emotional speech , vol.4 , pp. 941-944
- Schuller, B.¹ Seppi, D.² Batliner, A.³ Maier, A.⁴ Steidl, S.⁵

29
- 84946730259
- Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 03) November-December 2003 St. Thomas, Virgin Islands, USA
- Hermansky H., TRAP-TANDEM: data-driven extraction of temporal features from speech Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 03) November-December 2003 St. Thomas, Virgin Islands, USA 255 260
- TRAP-TANDEM: Data-driven extraction of temporal features from speech , pp. 255-260
- Hermansky, H.¹

30
- 0031619381
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 98) May 1998 Seattle, Wash, USA
- Bilmes J. A., Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling 1 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 98) May 1998 Seattle, Wash, USA 469 472
- Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling , vol.1 , pp. 469-472
- Bilmes, J.A.¹

31
- 0002915083
- Relevance of time-frequency features for phonetic and speaker-channel classification
- Yang H. H., van Vuuren S., Sharma S., Hermansky H., Relevance of time-frequency features for phonetic and speaker-channel classification Speech Communication 2000 31 1 35 50
- (2000) Speech Communication , vol.31 , Issue.1 , pp. 35-50
- Yang, H.H.¹ Van Vuuren, S.² Sharma, S.³ Hermansky, H.⁴

32
- 0032676337
- On the relative importance of various components of the modulation spectrum for automatic speech recognition
- Kanedera N., Arai T., Hermansky H., Pavel M., On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication 1999 28 1 43 55
- (1999) Speech Communication , vol.28 , Issue.1 , pp. 43-55
- Kanedera, N.¹ Arai, T.² Hermansky, H.³ Pavel, M.⁴

33
- 85016663198
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 92) March 1992 San Francisco, Calif, USA
- Hermansky H., Morgan N., Bayya A., Kohn P., RASTA-PLP speech analysis technique 1 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 92) March 1992 San Francisco, Calif, USA 121 124
- RASTA-PLP speech analysis technique , vol.1 , pp. 121-124
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

34
- 0032136330
- Robust speech recognition using the modulation spectrogram
- PII S0167639398000326
- Kingsbury B. E. D., Morgan N., Greenberg S., Robust speech recognition using the modulation spectrogram Speech Communication 1998 25 1-3 117 132 (Pubitemid 128413637)
- (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 117-132
- Kingsbury, B.E.D.¹ Morgan, N.² Greenberg, S.³

35
- 0029725301
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 96) May 1996 Atlanta, Ga, USA
- Moreno P. J., Raj B., Stern R. M., A vector Taylor series approach for environment-independent speech recognition 2 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 96) May 1996 Atlanta, Ga, USA 733 736
- A vector Taylor series approach for environment-independent speech recognition , vol.2 , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

36
- 0035309967
- An advanced contrast enhancement using partially overlapped sub-block histogram equalization
- DOI 10.1109/76.915354, PII S1051821501030117
- Kim J.-Y., Kim L.-S., Hwang S.-H., An advanced contrast enhancement using partially overlapped sub-block histogram equalization IEEE Transactions on Circuits and Systems for Video Technology 2001 11 4 475 484 (Pubitemid 32407181)
- (2001) IEEE Transactions on Circuits and Systems for Video Technology , vol.11 , Issue.4 , pp. 475-484
- Kim, J.-Y.¹ Kim, L.-S.² Hwang, S.-H.³

37
- 0042362207
- Cepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments
- Kim H. K., Rose R. C., Cepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments IEEE Transactions on Speech and Audio Processing 2003 11 5 435 446
- (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , Issue.5 , pp. 435-446
- Kim, H.K.¹ Rose, R.C.²

38
- 54349096464
- Noisy speech feature estimation on the Aurora2 database using a switching linear dynamic model
- Deng J., Bouchard M., Yeap T. H., Noisy speech feature estimation on the Aurora2 database using a switching linear dynamic model Journal of Multimedia 2007 2 2 47 52
- (2007) Journal of Multimedia , vol.2 , Issue.2 , pp. 47-52
- Deng, J.¹ Bouchard, M.² Yeap, T.H.³

39
- 51449117724
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 08) April 2008 Las Vegas, Nev, USA
- Windmann S., Haeb-Umbach R., Modeling the dynamics of speech and noise for speech feature enhancement in ASR Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 08) April 2008 Las Vegas, Nev, USA 4409 4412
- Modeling the dynamics of speech and noise for speech feature enhancement in ASR , pp. 4409-4412
- Windmann, S.¹ Haeb-Umbach, R.²

40
- 84969845150
- Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 02) September 2002 Denver, Colo, USA
- Li Y., Erdogan H., Gao Y., Marcheret E., Incremental online feature space MLLR adaptation for telephony speech recognition Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 02) September 2002 Denver, Colo, USA 1417 1420
- Incremental online feature space MLLR adaptation for telephony speech recognition , pp. 1417-1420
- Li, Y.¹ Erdogan, H.² Gao, Y.³ Marcheret, E.⁴

41
- 3543087978
- Applications of support vector machines to speech recognition
- Ganapathiraju A., Hamaker J. E., Picone J., Applications of support vector machines to speech recognition IEEE Transactions on Signal Processing 2004 52 8 2348 2355
- (2004) IEEE Transactions on Signal Processing , vol.52 , Issue.8 , pp. 2348-2355
- Ganapathiraju, A.¹ Hamaker, J.E.² Picone, J.³

42
- 0142192295
- Proceedings of the 18th International Conference on Machine Learning (ICML 01) June-July 2001 Williamstown, Mass, USA
- Lafferty J. D., McCallum A., Pereira F. C. N., Conditional random fields: probabilistic models for segmenting and labeling sequence data Proceedings of the 18th International Conference on Machine Learning (ICML 01) June-July 2001 Williamstown, Mass, USA 282 289
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data , pp. 282-289
- Lafferty, J.D.¹ McCallum, A.² Pereira, F.C.N.³

43
- 0031573117
- Long Short-Term Memory
- Hochreiter S., Schmidhuber J., Long short-term memory Neural Computation 1997 9 8 1735 1780 (Pubitemid 127462305)
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

44
- 38149014113
- Proceedings of the 17th International Conference on Artificial Neural Networks (ICANN 07) September 2007 Porto, Portugal Lecture Notes in Computer Science
- Fernández S., Graves A., Schmidhuber J., An application of recurrent neural networks to discriminative keyword spotting 4669 Proceedings of the 17th International Conference on Artificial Neural Networks (ICANN 07) September 2007 Porto, Portugal 220 229 Lecture Notes in Computer Science
- An application of recurrent neural networks to discriminative keyword spotting , vol.4669 , pp. 220-229
- Fernández, S.¹ Graves, A.² Schmidhuber, J.³

45
- 51449116065
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 08) April 2008 Las Vegas, Nev, USA
- de Andrade Bresolin A., Neto A. D. D., Alsina P. J., Digit recognition using wavelet and SVM in Brazilian Portuguese Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 08) April 2008 Las Vegas, Nev, USA 1545 1548
- Digit recognition using wavelet and SVM in Brazilian Portuguese , pp. 1545-1548
- De Andrade Bresolin, A.¹ Neto, A.D.D.² Alsina, P.J.³

46
- 85009242725
- Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 02) September 2002 Denver, Colo, USA
- Macho D., Mauuary L., Noe B., Evaluation of a noise-robust DSR front-end on Aurora database Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 02) September 2002 Denver, Colo, USA 17 20
- Evaluation of a noise-robust DSR front-end on Aurora database , pp. 17-20
- Macho, D.¹ Mauuary, L.² Noe, B.³

47
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Gauvain J.-L., Lee C.-H., Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Transactions on Speech and Audio Processing 1994 2 2 291 298
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

48
- 0141814617
- Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 03) April 2003 Hong Kong
- Wang Z., Schultz T., Waibel A., Comparison of acoustic model adaptation techniques on non-native speech 1 Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 03) April 2003 Hong Kong 540 543
- Comparison of acoustic model adaptation techniques on non-native speech , vol.1 , pp. 540-543
- Wang, Z.¹ Schultz, T.² Waibel, A.³

49
- 84908476421
- Proceedings of IEEE International Conference on Multimedia Expo (ICME 03) July 2003 Baltimore, Md, USA
- He X., Chou W., Minimum classification error linear regression for acoustic model adaptation of continuous density HMMs 1 Proceedings of IEEE International Conference on Multimedia Expo (ICME 03) July 2003 Baltimore, Md, USA 397 400
- Minimum classification error linear regression for acoustic model adaptation of continuous density HMMs , vol.1 , pp. 397-400
- He, X.¹ Chou, W.²

50
- 22944459276
- Proceedings of the International Workshop on Acoustic Echo and Noise Control (IWAENC 03) September 2003 Kyoto, Japan
- Martin R., Breithaupt C., Speech enhancement in the DFT domain using Laplacian speech priors Proceedings of the International Workshop on Acoustic Echo and Noise Control (IWAENC 03) September 2003 Kyoto, Japan 87 90
- Speech enhancement in the DFT domain using Laplacian speech priors , pp. 87-90
- Martin, R.¹ Breithaupt, C.²

51
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- Ephraim Y., Malah D., Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator IEEE Transactions on Acoustics, Speech, and Signal Processing 1984 32 6 1109 1121
- (1984) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

52
- 0010974144
- Providence, RI, USA American Mathematical Society
- Grinstead C. M., Snell J. L., Introduction to Probability 1997 Providence, RI, USA American Mathematical Society
- (1997) Introduction to Probability
- Grinstead, C.M.¹ Snell, J.L.²

53
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- Dempster A. P., Laird N. M., Rubin D. B., Maximum likelihood from incomplete data via the EM algorithm Journal of the Royal Statistical Society. Series B 1977 39 1 1 38
- (1977) Journal of the Royal Statistical Society. Series B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

54
- 0029345416
- Comparison of signal processing front ends for automatic word recognition
- Jankowski C. R. Jr., Vo H.-D. H., Lippmann R. P., Comparison of signal processing front ends for automatic word recognition IEEE Transactions on Speech and Audio Processing 1995 3 4 286 293
- (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , Issue.4 , pp. 286-293
- Jankowski Jr., C.R.¹ Vo, H.-D.H.² Lippmann, R.P.³

55
- 34047249084
- Quantile based histogram equalization for noise robust large vocabulary speech recognition
- DOI 10.1109/TSA.2005.857792
- Hilger F., Ney H., Quantile based histogram equalization for noise robust large vocabulary speech recognition IEEE Transactions on Audio, Speech and Language Processing 2006 14 3 845 854 (Pubitemid 46547647)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 845-854
- Hilger, F.¹ Ney, H.²

56
- 54349123450
- Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 03) September 2003 Geneva, Switzerland
- Droppo J., Deng L., Acero A., A comparison of three non-linear observation models for noisy speech features 2 Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 03) September 2003 Geneva, Switzerland 681 684
- A comparison of three non-linear observation models for noisy speech features , vol.2 , pp. 681-684
- Droppo, J.¹ Deng, L.² Acero, A.³

57
- 0003834266
- Norwood, Mass, USA Artech House
- Bar-Shalom Y., Li X. R., Estimation and Tracking: Principles, Techniques, and Software 1993 Norwood, Mass, USA Artech House
- (1993) Estimation and Tracking: Principles, Techniques, and Software
- Bar-Shalom, Y.¹ Li, X.R.²

58
- 33749253818
- Conditional random fields for object recognition
- Cambridge, Mass, USA MIT Press
- Quattoni A., Collins M., Darrell T., Conditional random fields for object recognition Advances in Neural Information Processing Systems 17 2005 Cambridge, Mass, USA MIT Press 1097 1104
- (2005) Advances in Neural Information Processing Systems 17 , pp. 1097-1104
- Quattoni, A.¹ Collins, M.² Darrell, T.³

59
- 33646436650
- Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology May-June 2003 Edmonton, Canada
- Sha F., Pereira F., Shallow parsing with conditional random fields 1 Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology May-June 2003 Edmonton, Canada 134 141
- Shallow parsing with conditional random fields , vol.1 , pp. 134-141
- Sha, F.¹ Pereira, F.²

60
- 1542287488
- Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 03) July-August 2003 Toronto, Canada
- Pinto D., McCallum A., Wei X., Croft W. B., Table extraction using conditional random fields Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 03) July-August 2003 Toronto, Canada 235 242
- Table extraction using conditional random fields , pp. 235-242
- Pinto, D.¹ McCallum, A.² Wei, X.³ Croft, W.B.⁴

61
- 67650183443
- Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics (ACL 04) July 2004 Barcelona, Spain
- Roark B., Saraclar M., Collins M., Johnson M., Discriminative language modeling with conditional random fields and the perceptron algorithm Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics (ACL 04) July 2004 Barcelona, Spain 48 55
- Discriminative language modeling with conditional random fields and the perceptron algorithm , pp. 48-55
- Roark, B.¹ Saraclar, M.² Collins, M.³ Johnson, M.⁴

62
- 33845593205
- Hidden conditional random fields for gesture recognition
- DOI 10.1109/CVPR.2006.132, 1640937, Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
- Wang S. B., Quattoni A., Morency L.-P., Demirdjian D., Darrell T., Hidden conditional random fields for gesture recognition 2 Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 06) June 2006 New York, NY, USA 1521 1527 (Pubitemid 44931500)
- (2006) Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006 , vol.2 , pp. 1521-1527
- Wang, S.B.¹ Quattoni, A.² Morency, L.-P.³ Demirdjian, D.⁴ Darrell, T.⁵

63
- 46449135503
- Proceedings of IEEE International Conference on Multimedia Expo (ICME 07) July 2007 Beijing, China
- Reiter S., Schuller B., Rigoll G., Hidden conditional random fields for meeting segmentation Proceedings of IEEE International Conference on Multimedia Expo (ICME 07) July 2007 Beijing, China 639 642
- Hidden conditional random fields for meeting segmentation , pp. 639-642
- Reiter, S.¹ Schuller, B.² Rigoll, G.³

64
- 48249106592
- Proceedings of the 4th IEEE Tutorial and Research Workshop on Perception and Interactive Technologies for Speech-Based Systems: Perception in Multimodal Dialogue Systems (PIT 08) June 2008 Kloster Irsee, Germany
- Schuller B., Eyben F., Rigoll G., Static and dynamic modelling for the recognition of non-verbal vocalisations in conversational speech Proceedings of the 4th IEEE Tutorial and Research Workshop on Perception and Interactive Technologies for Speech-Based Systems: Perception in Multimodal Dialogue Systems (PIT 08) June 2008 Kloster Irsee, Germany 99 110
- Static and dynamic modelling for the recognition of non-verbal vocalisations in conversational speech , pp. 99-110
- Schuller, B.¹ Eyben, F.² Rigoll, G.³

65
- 21644483999
- Maximum likelihood estimates of linear dynamic systems
- Rauch H. E., Tung G., Striebel C. T., Maximum likelihood estimates of linear dynamic systems AIAA Journal 1965 3 8 1445 1450
- (1965) AIAA Journal , vol.3 , Issue.8 , pp. 1445-1450
- Rauch, H.E.¹ Tung, G.² Striebel, C.T.³

66
- 33845270980
- Expectation correction for smoothed inference in switching linear dynamical systems
- Barber D., Expectation correction for smoothed inference in switching linear dynamical systems Journal of Machine Learning Research 2006 7 2515 2540 (Pubitemid 44866739)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 2515-2540
- Barber, D.¹

67
- 0019612337
- Speech recognition: Turning theory to practice
- Doddington G. R., Schalk T. B., Speech recognition: turning theory to practice IEEE Spectrum 1981 18 9 26 32
- (1981) IEEE Spectrum , vol.18 , Issue.9 , pp. 26-32
- Doddington, G.R.¹ Schalk, T.B.²

68
- 38049036813
- Proceedings of the 2nd International Conference on Affective Computing and Intelligent Interaction (ACII 07) September 2007 Lisbon, Portugal
- Grimm M., Kroschel K., Harris H., On the necessity and feasibility of detecting a driver's emotional state while driving Proceedings of the 2nd International Conference on Affective Computing and Intelligent Interaction (ACII 07) September 2007 Lisbon, Portugal 126 138
- On the necessity and feasibility of detecting a driver's emotional state while driving , pp. 126-138
- Grimm, M.¹ Kroschel, K.² Harris, H.³

69
- 67650197312
- Proceedings of Interspeech September 2008 Brisbane, Australia
- Cooke M., Scharenborg O., The Interspeech 2008 consonant challenge Proceedings of Interspeech September 2008 Brisbane, Australia 1 4
- The Interspeech 2008 consonant challenge , pp. 1-4
- Cooke, M.¹ Scharenborg, O.²

70
- 84867196386
- Proceedings of Interspeech September 2008 Brisbane, Australia
- Borgström B. J., Alwan A., HMM-based estimation of unreliable spectral components for noise robust speech recognition Proceedings of Interspeech September 2008 Brisbane, Australia 1769 1772
- HMM-based estimation of unreliable spectral components for noise robust speech recognition , pp. 1769-1772
- Borgström, B.J.¹ Alwan, A.²

71
- 79958841880
- Proceedings of Interspeech September 2008 Brisbane, Australia
- Jancovic P., Münevver K., On the mask modeling and feature representation in the missing-feature ASR: evaluation on the consonant challenge Proceedings of Interspeech September 2008 Brisbane, Australia 1777 1780
- On the mask modeling and feature representation in the missing-feature ASR: Evaluation on the consonant challenge , pp. 1777-1780
- Jancovic, P.¹ Münevver, K.²

72
- 84867227925
- Proceedings of Interspeech September 2008 Brisbane, Australia
- Gemmeke J. F., Cranen B., Noise reduction through compressed sensing Proceedings of Interspeech September 2008 Brisbane, Australia 1785 1788
- Noise reduction through compressed sensing , pp. 1785-1788
- Gemmeke, J.F.¹ Cranen, B.²

73
- 84867210964
- Proceedings of Interspeech September 2008 Brisbane, Australia
- Schuller B., Wöllmer M., Moosmayr T., Rigoll G., Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement Proceedings of Interspeech September 2008 Brisbane, Australia
- Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement
- Schuller, B.¹ Wöllmer, M.² Moosmayr, T.³ Rigoll, G.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.