SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 15, Issue 5, 2007, Pages 1711-1723

Robust speaker recognition in noisy conditions

(4) Ming, Ji (a); Hazen, Timothy J (b); Glass, James R (c); Reynolds, Douglas A (b)

a QUEEN'S UNIVERSITY BELFAST (United Kingdom)

b MIT LINCOLN LABORATORY (United States)

c MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Missing feature theory; Multicondition training; Noise compensation; Noise modeling; Speaker recognition

Indexed keywords

MISSING-FEATURE THEORY; MULTICONDITION TRAINING; NOISE COMPENSATION; NOISE MODELING; SPEAKER RECOGNITION; ACOUSTIC NOISE; BIOMETRICS; LOUDSPEAKERS; POSITION CONTROL; SPEECH RECOGNITION

EID: 63249107289 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.899278 Document Type: Article

Times cited : (234)

References (51)

1
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
- (1974) J. Acoust. Soc. Amer , vol.55 , pp. 1304-1312
- Atal, B.S.¹

2
- 0028517164
- RASTA processing of speech
- Oct
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

3
- 0028515984
- Experimental evaluation of features for robust speaker identification
- Oct
- D. A. Reynolds, "Experimental evaluation of features for robust speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639-643, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 639-643
- Reynolds, D.A.¹

4
- 0030247355
- Robust speaker recognition: A feature-based approach
- Sep
- R. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition: A feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sep. 1996.
- (1996) IEEE Signal Process. Mag , vol.13 , Issue.5 , pp. 58-71
- Mammone, R.¹ Zhang, X.² Ramachandran, R.P.³

5
- 0030353333
- Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch
- Philadelphia, PA
- S. van Vuuren, "Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch," in Proc. ICSLP'96, Philadelphia, PA, 1996, pp. 1788-1791.
- (1996) Proc. ICSLP'96 , pp. 1788-1791
- van Vuuren, S.¹

6
- 0026835134
- Global optimization of a neural network-hidden Markov model hybrid
- Mar
- Y. Bengio, R. De Mori, G. Flammia, and R. Kompe, "Global optimization of a neural network-hidden Markov model hybrid," IEEE Trans. Neural Netw., vol. 3, no. 2, pp. 252-259, Mar. 1992.
- (1992) IEEE Trans. Neural Netw , vol.3 , Issue.2 , pp. 252-259
- Bengio, Y.¹ De Mori, R.² Flammia, G.³ Kompe, R.⁴

7
- 33747684554
- Integrated optimization of feature transformation for speech recognition
- Madrid, Spain
- S. Euler, "Integrated optimization of feature transformation for speech recognition," in Proc. Eurospeech'95, Madrid, Spain, 1995, pp. 109-112.
- (1995) Proc. Eurospeech'95 , pp. 109-112
- Euler, S.¹

8
- 85135185331
- Discriminative feature and model design for automatic speech recognition
- Rhodes, Greece
- M. Rahim, Y. Bengio, and Y. Lecun, "Discriminative feature and model design for automatic speech recognition," in Proc. Eurospeech'97, Rhodes, Greece, 1997, pp. 75-78.
- (1997) Proc. Eurospeech'97 , pp. 75-78
- Rahim, M.¹ Bengio, Y.² Lecun, Y.³

9
- 0033746018
- Robustness to telephone handset distortion in speaker recognition by discriminative feature design
- L. P. Heck, Y. Konig, M. K. Sonmez, and M. Weintraub, "Robustness to telephone handset distortion in speaker recognition by discriminative feature design," Speech Commun., vol. 31, pp. 181-192, 2000.
- (2000) Speech Commun , vol.31 , pp. 181-192
- Heck, L.P.¹ Konig, Y.² Sonmez, M.K.³ Weintraub, M.⁴

10
- 0030247355
- Robust speaker recognition-A feature-based approach
- Sep
- R. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition-A feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sep. 1996.
- (1996) IEEE Signal Process. Mag , vol.13 , Issue.5 , pp. 58-71
- Mammone, R.¹ Zhang, X.² Ramachandran, R.P.³

11
- 84892149368
- Magnitude-only estimation of handset nonlinearity with application to speaker recognition
- Seattle, WA
- T. F. Quatieri, D. A. Reynolds, and G. C. O'Leary, "Magnitude-only estimation of handset nonlinearity with application to speaker recognition," in Proc. ICASSP'98, Seattle, WA, 1998, pp. 745-748.
- (1998) Proc. ICASSP'98 , pp. 745-748
- Quatieri, T.F.¹ Reynolds, D.A.² O'Leary, G.C.³

12
- 85073258179
- Feature warping for robust speaker verification
- Crete, Greece
- J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in Proc. A Speaker Odyssey-The Speaker Recognition Workshop, Crete, Greece, 2001, pp. 213-218.
- (2001) Proc. A Speaker Odyssey-The Speaker Recognition Workshop , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

13
- 0036298114
- Short-time Gaussianization for robust speaker verification
- Orlando, FL
- B. Xiang, U. Chaudhari, J. Navratil, G. Ramaswamy, and R. Gopinath, "Short-time Gaussianization for robust speaker verification," in Proc. ICASSP'02, Orlando, FL, 2002, pp. 681-684.
- (2002) Proc. ICASSP'02 , pp. 681-684
- Xiang, B.¹ Chaudhari, U.² Navratil, J.³ Ramaswamy, G.⁴ Gopinath, R.⁵

14
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
- (2000) Digital Signal Process , vol.10 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

15
- 0141702107
- Feature and score normalization for speaker verification of cellular data
- Hong Kong, China
- C. Barras and J. L. Gauvain, "Feature and score normalization for speaker verification of cellular data," in Proc. ICASSP'03, Hong Kong, China, 2003, pp. 49-52.
- (2003) Proc. ICASSP'03 , pp. 49-52
- Barras, C.¹ Gauvain, J.L.²

16
- 0033884857
- Score normalization for text-independent speaker verification systems
- R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
- (2000) Digital Signal Process , vol.10 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Thomas, H.³

17
- 0032595177
- Robust text-independent speaker identification over telephone channels
- Sep
- H. A. Murthy, F. Beaufays, L. P. Heck, and M. Weintraub, "Robust text-independent speaker identification over telephone channels," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 554-568, Sep. 1999.
- (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.5 , pp. 554-568
- Murthy, H.A.¹ Beaufays, F.² Heck, L.P.³ Weintraub, M.⁴

18
- 84988224855
- A model-based transformational approach to robust speaker recognition
- Beijing, China
- R. Teunen, B. Shahshahani, and L. P. Heck, "A model-based transformational approach to robust speaker recognition," in Proc. ICSLP'00, Beijing, China, 2000, pp. 495-498.
- (2000) Proc. ICSLP'00 , pp. 495-498
- Teunen, R.¹ Shahshahani, B.² Heck, L.P.³

19
- 0033748244
- Speaker verification over the telephone
- L. F. Lamel and J. L. Gauvain, "Speaker verification over the telephone," Speech Commun., vol. 31, pp. 141-154, 2000.
- (2000) Speech Commun , vol.31 , pp. 141-154
- Lamel, L.F.¹ Gauvain, J.L.²

20
- 85009167959
- Environment adaptation for robust speaker verification
- Geneva, Switzerland
- K. K. Yiu, M. W. Mak, and S. Y. Kung, "Environment adaptation for robust speaker verification," in Proc. Eurospeech'03, Geneva, Switzerland, 2003, pp. 2973-2976.
- (2003) Proc. Eurospeech'03 , pp. 2973-2976
- Yiu, K.K.¹ Mak, M.W.² Kung, S.Y.³

21
- 0030371776
- Overview of speaker enhancement techniques for automatic speaker recognition
- Philadelphia, PA
- J. Ortega-Garcia and L. Gonzalez-Rodriguez, "Overview of speaker enhancement techniques for automatic speaker recognition," in Proc. ICSLP'96, Philadelphia, PA, 1996, pp. 929-932.
- (1996) Proc. ICSLP'96 , pp. 929-932
- Ortega-Garcia, J.¹ Gonzalez-Rodriguez, L.²

22
- 33646772812
- An evaluation of VTS and IMM for speaker verification in noise
- Geneva, Switzerland
- Suhadi, S. Stan, T. Fingscheidt, and C. Beaugeant, "An evaluation of VTS and IMM for speaker verification in noise," in Proc. Eurospeech'03, Geneva, Switzerland, 2003, pp. 1669-1672.
- (2003) Proc. Eurospeech'03 , pp. 1669-1672
- Suhadi¹ Stan, S.² Fingscheidt, T.³ Beaugeant, C.⁴

23
- 85135375893
- HMM recognition in noise using parallel model combination
- Berlin, Germany
- M. J. F. Gales and S. Young, "HMM recognition in noise using parallel model combination," in Proc. Eurospeech'93, Berlin, Germany, 1993, pp. 837-840.
- (1993) Proc. Eurospeech'93 , pp. 837-840
- Gales, M.J.F.¹ Young, S.²

24
- 0030125219
- Speaker recognition using HMM composition in noisy environments
- T. Matsui, T. Kanno, and S. Furui, "Speaker recognition using HMM composition in noisy environments," Comput. Speech Lang., vol. 10, pp. 107-116, 1996.
- (1996) Comput. Speech Lang , vol.10 , pp. 107-116
- Matsui, T.¹ Kanno, T.² Furui, S.³

25
- 0034848879
- Text-dependent speaker verification under noisy conditions using parallel model combination
- Salt Lake City, UT
- L. P. Wong and M. Russell, "Text-dependent speaker verification under noisy conditions using parallel model combination," in Proc. ICASSP'01, Salt Lake City, UT, 2001, pp. 457-460.
- (2001) Proc. ICASSP'01 , pp. 457-460
- Wong, L.P.¹ Russell, M.²

26
- 0030649027
- Jacobian approach to fast acoustic model adaptation
- Munich, Germany
- S. Sagayama, Y. Yamaguchi, S. Takahashi, and J. Takahashi, "Jacobian approach to fast acoustic model adaptation," in Proc. ICASSP'97, Munich, Germany, 1997, pp. 835-838.
- (1997) Proc. ICASSP'97 , pp. 835-838
- Sagayama, S.¹ Yamaguchi, Y.² Takahashi, S.³ Takahashi, J.⁴

27
- 0347899510
- α-Jacobian environmental adaptation
- C. Cerisara, L. Rigazio, and J.-C. Junqua, "α-Jacobian environmental adaptation," Speech Commun., vol. 42, pp. 25-41, 2004.
- (2004) Speech Commun , vol.42 , pp. 25-41
- Cerisara, C.¹ Rigazio, L.² Junqua, J.-C.³

28
- 0030647921
- Robust speaker recognition through acoustic array processing and spectral normalization
- Munich, Germany
- L. Gonzalez-Rodriguez and J. Ortega-Garcia, "Robust speaker recognition through acoustic array processing and spectral normalization," in Proc. ICASSP'97, Munich, Germany, 1997, pp. 1103-1106.
- (1997) Proc. ICASSP'97 , pp. 1103-1106
- Gonzalez-Rodriguez, L.¹ Ortega-Garcia, J.²

29
- 85009270343
- Robust speaker recognition using microphone arrays
- Crete, Greece
- I. McCowan, J. Pelecanos, and S. Sridharan, "Robust speaker recognition using microphone arrays," in Proc. A Speaker Odyssey-The Speaker Recognition Workshop, Crete, Greece, 2001, pp. 101-106.
- (2001) Proc. A Speaker Odyssey-The Speaker Recognition Workshop , pp. 101-106
- McCowan, I.¹ Pelecanos, J.² Sridharan, S.³

30
- 0031619912
- Speaker verification in noisy environment with combined spectral subtraction and missing data theory
- Seattle, WA
- A. Drygajlo and M. El-Maliki, "Speaker verification in noisy environment with combined spectral subtraction and missing data theory," in Proc. ICASSP'98, Seattle, WA, 1998, pp. 121-124.
- (1998) Proc. ICASSP'98 , pp. 121-124
- Drygajlo, A.¹ El-Maliki, M.²

31
- 0033748161
- Localization and selection of speaker-specific information with statistical modelling
- L. Besacier, J. F. Bonastre, and C. Fredouille, "Localization and selection of speaker-specific information with statistical modelling," Speech Commun., vol. 31, pp. 89-106, 2000.
- (2000) Speech Commun , vol.31 , pp. 89-106
- Besacier, L.¹ Bonastre, J.F.² Fredouille, C.³

32
- 4544349444
- Universal compensation-An approach to noisy speech recognition assuming no knowledge of noise
- Montreal, QC, Canada
- J. Ming, "Universal compensation-An approach to noisy speech recognition assuming no knowledge of noise," in Proc. ICASSP'04, Montreal, QC, Canada, 2004, pp. I.961-I.964.
- (2004) Proc. ICASSP'04
- Ming, J.¹

33
- 33646782289
- Speaker identification in unknown noisy conditions-A universal compensation approach
- Philadelphia, PA
- J. Ming, D. Stewart, and S. Vaseghi, "Speaker identification in unknown noisy conditions-A universal compensation approach," in Proc. ICASSP'05, Philadelphia, PA, 2005, pp. 617-620.
- (2005) Proc. ICASSP'05 , pp. 617-620
- Ming, J.¹ Stewart, D.² Vaseghi, S.³

34
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Philadelphia, PA
- H. Bourlard and S. Dupont, "A new ASR approach based on independent processing and recombination of partial frequency bands," in Proc. ICSLP'96, Philadelphia, PA, 1996, pp. 426-429.
- (1996) Proc. ICSLP'96 , pp. 426-429
- Bourlard, H.¹ Dupont, S.²

35
- 0030365517
- Towards ASR on partially corrupted speech
- Philadelphia, PA
- H. Hermansky, S. Tibrewala, and M. Pavel, "Towards ASR on partially corrupted speech," in Proc. ICSLP'96, Philadelphia, PA, 1996, pp. 462-465.
- (1996) Proc. ICSLP'96 , pp. 462-465
- Hermansky, H.¹ Tibrewala, S.² Pavel, M.³

36
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- Dallas, TX
- R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition," in Proc. ICASSP'87, Dallas, TX, 1987, pp. 705-708.
- (1987) Proc. ICASSP'87 , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

37
- 85009070292
- Large-vocabulary speech recognition under adverse acoustic environments
- Beijing, China
- L. Deng, A. Acero, M. Plumpe, and X.-D. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP'00, Beijing, China, 2000, pp. 806-809.
- (2000) Proc. ICSLP'00 , pp. 806-809
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.-D.⁴

38
- 0036754943
- Robust speech recognition using probabilistic union models
- Sep
- J. Ming, P. Jancovic, and F. J. Smith, "Robust speech recognition using probabilistic union models," IEEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 403-414, Sep. 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.6 , pp. 403-414
- Ming, J.¹ Jancovic, P.² Smith, F.J.³

39
- 33646410695
- A posterior union model with applications to robust speech and speaker recognition
- Article ID 75390
- J. Ming, J. Lin, and F. J. Smith, "A posterior union model with applications to robust speech and speaker recognition," EURASIP J. Appl. Signal Process., vol. 2006, pp. 1-12, 2006, Article ID 75390.
- (2006) EURASIP J. Appl. Signal Process , vol.2006 , pp. 1-12
- Ming, J.¹ Lin, J.² Smith, F.J.³

40
- 0030682302
- HTIMIT and LLHDB: Speech corpora for the study of handset transducer effects
- Munich, Germany
- D. A. Reynolds, "HTIMIT and LLHDB: Speech corpora for the study of handset transducer effects," in Proc. ICASSP'97, Munich, Germany, 1997, pp. 1535-1538.
- (1997) Proc. ICASSP'97 , pp. 1535-1538
- Reynolds, D.A.¹

41
- 0029355999
- Speaker identification and verification using Gaussian mixture speaker models
- D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, pp. 91-108, 1995.
- (1995) Speech Commun , vol.17 , pp. 91-108
- Reynolds, D.A.¹

42
- 0028996867
- CTIMIT: A speech corpus for the cellular environment with applications to automatic speech recognition
- Detroit, MI
- K. L. Brown and E. B. George, "CTIMIT: A speech corpus for the cellular environment with applications to automatic speech recognition," in Proc. ICASSP'95, Detroit, MI, 1995, pp. 105-108.
- (1995) Proc. ICASSP'95 , pp. 105-108
- Brown, K.L.¹ George, E.B.²

43
- 0025680225
- NTIMIT: A phonetically balanced, continuous speech telephone bandwidth speech database
- Albuquerque, NM
- C. Jankowski, A. Kalyanswamy, S. Basson, and J. Spitz, "NTIMIT: A phonetically balanced, continuous speech telephone bandwidth speech database," in Proc. ICASSP'90, Albuquerque, NM, 1990, pp. 109-112.
- (1990) Proc. ICASSP'90 , pp. 109-112
- Jankowski, C.¹ Kalyanswamy, A.² Basson, S.³ Spitz, J.⁴

44
- 0032091375
- Text-independent speaker recognition using non-linear frame likelihood transformation
- K. P. Markov and S. Nakagawa, "Text-independent speaker recognition using non-linear frame likelihood transformation," Speech Commun., vol. 24, pp. 193-209, 1998.
- (1998) Speech Commun , vol.24 , pp. 193-209
- Markov, K.P.¹ Nakagawa, S.²

45
- 85135144525
- On the decorrelation of the filter-bank energies in speech recognition
- Madrid, Spain
- C. Nadeu, J. Hernando, and M. Gorricho, "On the decorrelation of the filter-bank energies in speech recognition," in Proc. Eurospeech'95, Madrid, Spain, 1995, pp. 1381-1384.
- (1995) Proc. Eurospeech'95 , pp. 1381-1384
- Nadeu, C.¹ Hernando, J.² Gorricho, M.³

46
- 0038338247
- Decorrelated and liftered filter-bank energies for robust speech recognition
- Budapest, Hungary
- K. K. Paliwal, "Decorrelated and liftered filter-bank energies for robust speech recognition," in Proc. Eurospeech'99, Budapest, Hungary, 1999, pp. 85-88.
- (1999) Proc. Eurospeech'99 , pp. 85-88
- Paliwal, K.K.¹

47
- 0027465491
- The Lombard reflex and its role on human listeners and automatic speech recognizer
- J.-C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizer," J. Acoust. Soc. Amer., vol. 93, pp. 510-524, 1993.
- (1993) J. Acoust. Soc. Amer , vol.93 , pp. 510-524
- Junqua, J.-C.¹

48
- 0030283741
- Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
- J. H. L. Hansen, "Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition," Speech Commun., vol. 20, pp. 151-173, 1996.
- (1996) Speech Commun , vol.20 , pp. 151-173
- Hansen, J.H.L.¹

49
- 0030365546
- Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMM adaptation
- Trento, Italy
- D. Giuliani, M. Omologo, and P. Svaizer, "Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMM adaptation," in Proc. ICSLP'96, Trento, Italy, 1996, pp. 1329-1332.
- (1996) Proc. ICSLP'96 , pp. 1329-1332
- Giuliani, D.¹ Omologo, M.² Svaizer, P.³

50
- 42749101361
- The MIT mobile device speaker verification corpus: Data collection and preliminary experiments
- San Juan, Puerto Rico
- R. Woo, A. Park, and T. J. Hazen, "The MIT mobile device speaker verification corpus: Data collection and preliminary experiments," in Proc. IEEE Odyssey 2006-The Speaker and Language Recognition Workshop, San Juan, Puerto Rico, 2006, pp. 1-6. [Online]. Available: http://groups.csail.mit.edu/sls/mdsvc
- (2006) Proc. IEEE Odyssey 2006-The Speaker and Language Recognition Workshop , pp. 1-6
- Woo, R.¹ Park, A.² Hazen, T.J.³

51
- 33947622673
- Speaker verification over handheld devices with realistic noisy speech data
- Toulouse, France
- J. Ming, T. J. Hazen, and J. R. Glass, "Speaker verification over handheld devices with realistic noisy speech data," in Proc. ICASSP'06, Toulouse, France, 2006, pp. 637-640.
- (2006) Proc. ICASSP'06 , pp. 637-640
- Ming, J.¹ Hazen, T.J.² Glass, J.R.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.