SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 18, Issue 1, 2010, Pages 90-100

Modulation spectral features for robust far-field speaker identification

(2) Falk, Tiago H. a; Chan, Wai Yip b

a UNIVERSITY OF TORONTO (Canada)

b QUEEN'S UNIVERSITY (Canada)

Author keywords

Gaussian mixture model (GMM); Modulation spectrum; Reverberation; Reverberation time; Speaker identification

Indexed keywords

ADAPTIVE CHANNEL SELECTION; BASELINE SYSTEMS; CHANNEL MODULATION; CLEAN SPEECH; FAR-FIELD; FILTER OUTPUT; GAUSSIAN MIXTURE MODEL; GAUSSIAN MIXTURE MODEL (GMM); MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODULATION FREQUENCIES; MODULATION SPECTRUM; MULTI-CHANNEL; REVERBERATION TIME; SIMULATION RESULT; SPEAKER IDENTIFICATION; SPECTRAL FEATURE; SPECTRAL SIGNAL; SPEECH SIGNALS; TEMPORAL ENVELOPES; TRAINING AND TESTING;

ARCHITECTURAL ACOUSTICS; COMMUNICATION CHANNELS (INFORMATION THEORY); COMPUTER VISION; FREQUENCY BANDS; LOUDSPEAKERS; MAGNETOSTRICTIVE DEVICES; MIXTURES; MODULATION; OBJECT RECOGNITION; SIGNAL PROCESSING; SIMULATORS; SPEECH RECOGNITION;

REVERBERATION;

EID: 70449360175 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2023679 Document Type: Article

Times cited : (113)

References (41)

1
- 84902053085
- The effects of room acoustics on MFCC speech parameter
- Oct.
- Y. Pan and A. Waibel, "The effects of room acoustics on MFCC speech parameter," in Proc. Int. Conf. Spoken Lang. Process., Oct. 2000, pp. 129-132.
- (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 129-132
- Pan, Y.¹ Waibel, A.²

2
- 0029725847
- Speaker recognition in reverberant enclosures
- May
- P. Castellano, S. Sridharan, and D. Cole, "Speaker recognition in reverberant enclosures," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 1996, vol.1, pp. 117-120.
- (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 117-120
- Castellano, P.¹ Sridharan, S.² Cole, D.³

3
- 58349102016
- Analysis of feature extraction and channel compensation in a GMM speaker recognition system
- Sep.
- L. Burget, P. Matejka, P. Schwarz, O. Glembek, and J. Cernocky, "Analysis of feature extraction and channel compensation in a GMM speaker recognition system," IEEE Trans. Audio, Speech Lang. Process., vol.15, no.7, pp. 1979-1986, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech Lang. Process , vol.15 , Issue.7 , pp. 1979-1986
- Burget, L.¹ Matejka, P.² Schwarz, P.³ Glembek, O.⁴ Cernocky, J.⁵

4
- 0028518091
- Microphone arrays and speaker identification
- Oct.
- Q. Lin, E. Jan, and J. Flanagan, "Microphone arrays and speaker identification," IEEE Trans. Speech Audio Process., vol.2, no.4, pp. 622-629, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 622-629
- Lin, Q.¹ Jan, E.² Flanagan, J.³

5
- 0030371776
- Overview of speech enhancement techniques for automatic speaker recognition
- J. Ortega-Garcia and J. Gonzalez-Rodriguez, "Overview of speech enhancement techniques for automatic speaker recognition," in Proc. Int. Conf. Spoken Lang. Process., 1996, vol.2, pp. 929-932.
- (1996) Proc. Int. Conf. Spoken Lang. Process. , vol.2 , pp. 929-932
- Ortega-Garcia, J.¹ Gonzalez-Rodriguez, J.²

6
- 0030371792
- Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays
- J. Gonzalez-Rodriguez, J. Ortega-Garcia, C. Martin, and L. Hernandez, "Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays," in Proc. Int. Conf. Spoken Lang. Process., 1996.
- (1996) Proc. Int. Conf. Spoken Lang. Process
- Gonzalez-Rodriguez, J.¹ Ortega-Garcia, J.² Martin, C.³ Hernandez, L.⁴

7
- 50449087648
- Far-field speaker recognition
- Sep.
- Q. Jin, T. Schultz, and A. Waibel, "Far-field speaker recognition," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.7, pp. 2023-2032, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.7 , pp. 2023-2032
- Jin, Q.¹ Schultz, T.² Waibel, A.³

8
- 33746376208
- Combating reverberation in speaker verification
- May
- J. Gammal and R. Goubran, "Combating reverberation in speaker verification," in Proc. IEEE Conf. Instrum. Meas. Technol., May 2005, pp. 687-690.
- (2005) Proc. IEEE Conf. Instrum. Meas. Technol. , pp. 687-690
- Gammal, J.¹ Goubran, R.²

9
- 48349111776
- Talker identification using reverberation sensing system
- Oct.
- A. Abu-El-Quran, J. Gammal, R. Goubran, and A. Chan, "Talker identification using reverberation sensing system," in Proc. IEEE Conf. Sens., Oct. 2007, pp. 970-973.
- (2007) Proc. IEEE Conf. Sens. , pp. 970-973
- Abu-El-Quran, A.¹ Gammal, J.² Goubran, R.³ Chan, A.⁴

10
- 84863762570
- Compensation for room reverberation in speaker identification
- Aug.
- A. Akula and P. de Leon, "Compensation for room reverberation in speaker identification," in Proc. Eur. Signal Process. Conf., Aug. 2008.
- (2008) Proc. Eur. Signal Process. Conf.
- Akula, A.¹ De Leon, P.²

11
- 63649152839
- Speaker identification in the presence of room reverberation
- Sep.
- P. De Leon and A. Trevizo, "Speaker identification in the presence of room reverberation," in Proc. IEEE Biometrics Symp., Sep. 2007, pp. 1-6.
- (2007) Proc. IEEE Biometrics Symp. , pp. 1-6
- De Leon, P.¹ Trevizo, A.²

12
- 85032751546
- Pushing the envelope-aside
- Sep.
- N. Morgan et al., "Pushing the envelope-aside," IEEE Signal Process. Mag., vol.22, no.5, pp. 81-88, Sep. 2005.
- (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 81-88
- Morgan, N.¹

13
- 84867199388
- Spectro-temporal features for robust farfield speaker identification
- Sep.
- T. H. Falk and W.-Y. Chan, "Spectro-temporal features for robust farfield speaker identification," in Proc. Int. Conf. Spoken Lang. Process., Sep. 2008, pp. 634-637.
- (2008) Proc. Int. Conf. Spoken Lang. Process , pp. 634-637
- Falk, T.H.¹ Chan, W.-Y.²

14
- 0003870155
- 4th ed. : Elsevier
- H. Kuttruff, Room Acoustics, 4th ed. : Elsevier, 2000.
- (2000) Room Acoustics
- Kuttruff, H.¹

15
- 0003879142
- Cambridge, MA: Harvard Univ. Press
- W. Sabine, Collected Papers on Acoustics. Cambridge, MA: Harvard Univ. Press, 1922.
- (1922) Collected Papers on Acoustics
- Sabine, W.¹

16
- 84953653955
- New method of measuring reverberation time
- Mar.
- M. Schroeder, "New method of measuring reverberation time," J. Acoust. Soc. Amer., vol.37, no.3, pp. 409-412, Mar. 1965.
- (1965) J. Acoust. Soc. Amer. , vol.37 , Issue.3 , pp. 409-412
- Schroeder, M.¹

17
- 56149101743
- The simulation of realistic acoustic input scenarios for speech recognition systems
- H. Hirsch and H. Finster, "The simulation of realistic acoustic input scenarios for speech recognition systems," Proc. Interspeech, 2005.
- (2005) Proc. Interspeech
- Hirsch, H.¹ Finster, H.²

18
- 33845777262
- ITU-T Rec. G.191, Int. Telecom. Union
- ITU-T Rec. G.191, Software Tools for Speech and Audio Coding Standardization, Int. Telecom. Union, 2005.
- (2005) Software Tools for Speech and Audio Coding Standardization

19
- 0003629316
- ITU-T P.56, Int. Telecom. Union
- ITU-T P.56, Objective Measurement of Active Speech Level, Int. Telecom. Union, 1993.
- (1993) Objective Measurement of Active Speech Level

20
- 34250856781
- Multimicrophone speech dereverberation: Experimental validation
- K. Eneman and M. Moonen, "Multimicrophone speech dereverberation: Experimental validation," EURASIP J. Audio, Speech, Music Process., p. 19, 2007.
- (2007) EURASIP J. Audio, Speech, Music Process , pp. 19
- Eneman, K.¹ Moonen, M.²

21
- 84890497820
- Temporal dynamics for blind measurement of room acoustical parameters
- to be published
- T. H. Falk and W.-Y. Chan, "Temporal dynamics for blind measurement of room acoustical parameters," IEEE Trans. Instrum. Meas., 2009, to be published.
- (2009) IEEE Trans. Instrum. Meas.
- Falk, T.H.¹ Chan, W.-Y.²

22
- 0003913694
- An efficient implementation of the Patterson-Holdsworth auditory filterbank
- Perception Group
- M. Slaney, "An Efficient Implementation of the Patterson-Holdsworth Auditory Filterbank," Apple Computer, Perception Group, 1993.
- (1993) Apple Computer
- Slaney, M.¹

23
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- DOI 10.1016/0378-5955(90)90170-T
- B. Glasberg and B. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol.47, no.1-2, pp. 103-138, 1990. (Pubitemid 20244652)
- (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

24
- 0029952425
- A quantitative model of the effective signal processing in the auditory system. I-model structure
- T. Dau, D. Puschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I-model structure," J. Acoust. Soc. Amer., vol.99, no.6, pp. 3615-3622, 1996.
- (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3615-3622
- Dau, T.¹ Puschel, D.² Kohlrausch, A.³

25
- 0027957839
- Effect of temporal envelope smearing on speech reception
- R. Drullman, J. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech reception," J. Acoust. Soc. Amer., vol.95, no.2, pp. 1053-1064, Feb. 1994. (Pubitemid 24056370)
- (1994) Journal of the Acoustical Society of America , vol.95 , Issue.2 , pp. 1053-1064
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

26
- 0028287770
- Effect of reducing slow temporal modulations on speech reception
- DOI 10.1121/1.409836
- R. Drullman, J. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Amer., vol.95, no.5, pp. 2670-2680, May 1994. (Pubitemid 24152861)
- (1994) Journal of the Acoustical Society of America , vol.95 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

27
- 0030369532
- Intelligibility of speech with filtered time trajectories of spectral envelopes
- Oct.
- T. Arai, M. Pavel, H. Hermansky, and C. Avendano, "Intelligibility of speech with filtered time trajectories of spectral envelopes," in Proc. Int. Conf. Speech Lang. Process., Oct. 1996, pp. 2490-2493.
- (1996) Proc. Int. Conf. Speech Lang. Process , pp. 2490-2493
- Arai, T.¹ Pavel, M.² Hermansky, H.³ Avendano, C.⁴

28
- 0242609086
- Blind estimation of reverberation time
- Nov.
- R. Ratnam, D. Jones, B. Wheeler, W. O'Brien, C. Lansing, and A. Feng, "Blind estimation of reverberation time," J. Acoust. Soc. Amer., vol.114, no.5, pp. 2877-2892, Nov. 2003.
- (2003) J. Acoust. Soc. Amer. , vol.114 , Issue.5 , pp. 2877-2892
- Ratnam, R.¹ Jones, D.² Wheeler, B.³ O'Brien, W.⁴ Lansing, C.⁵ Feng, A.⁶

29
- 0037034899
- Chimaeric sounds reveal dichotomies in auditory perception
- Mar.
- Z. Smith, B. Delgutte, and A. Oxenham, "Chimaeric sounds reveal dichotomies in auditory perception," Lett. Nature, vol.416, pp. 87-90, Mar. 2002.
- (2002) Lett. Nature , vol.416 , pp. 87-90
- Smith, Z.¹ Delgutte, B.² Oxenham, A.³

30
- 0029355999
- Speaker identification and verification using Gaussian mixture speaker models
- Aug.
- D. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," in Speech Commun., Aug. 1995, vol.17, pp. 91-108.
- (1995) Speech Commun. , vol.17 , pp. 91-108
- Reynolds, D.¹

31
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol.39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

32
- 63249107289
- Robust speaker recognition in noisy conditions
- Jul.
- J. Ming, T. Hazen, J. Glass, and D. Reynolds, "Robust speaker recognition in noisy conditions," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.5, pp. 1711-1723, Jul. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.5 , pp. 1711-1723
- Ming, J.¹ Hazen, T.² Glass, J.³ Reynolds, D.⁴

33
- 70449427270
- Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech
- Sep.
- T. H. Falk, H. Yuan, and W.-Y. Chan, "Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech," in Proc. Int. Conf. Spoken Lang. Process., Sep. 2007, pp. 514-517.
- (2007) Proc. Int. Conf. Spoken Lang. Process , pp. 514-517
- Falk, T.H.¹ Yuan, H.² Chan, W.-Y.³

34
- 0033708744
- Modulation enhancement of speech as a preprocessing for reverberant chambers with the hearing-impaired
- A. Kusumoto, T. Arai, T. Kitamura, M. Takahashi, and Y. Murahara, "Modulation enhancement of speech as a preprocessing for reverberant chambers with the hearing-impaired," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2000, vol.II, pp. 853-856.
- (2000) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 853-856
- Kusumoto, A.¹ Arai, T.² Kitamura, T.³ Takahashi, M.⁴ Murahara, Y.⁵

35
- 57649245616
- The CHAINS corpus: Characterizing individual speakers
- F. Cummins, M. Grimaldi, T. Leonard, and J. Simko, "The CHAINS corpus: Characterizing individual speakers," in Proc. Int. Conf. Speech Comput., 2006.
- (2006) Proc. Int. Conf. Speech Comput.
- Cummins, F.¹ Grimaldi, M.² Leonard, T.³ Simko, J.⁴

36
- 66149120614
- Speaker identification using instantaneous frequencies
- Aug.
- M. Grimaldi and F. Cummins, "Speaker identification using instantaneous frequencies," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.6, pp. 1097-1111, Aug. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process , vol.16 , Issue.6 , pp. 1097-1111
- Grimaldi, M.¹ Cummins, F.²

37
- 0030172028
- A microphone array processing technique for speech enhancement in a reverberant space
- Q.-G. Liu, B. Champagne, and P. Kabal, "A microphone array processing technique for speech enhancement in a reverberant space," Speech Commun., vol.18, no.4, pp. 317-334, 1996.
- (1996) Speech Commun. , vol.18 , Issue.4 , pp. 317-334
- Liu, Q.-G.¹ Champagne, B.² Kabal, P.³

38
- 0242263684
- Subspace methods for multimicrophone speech dereverberation
- Sep.
- S. Gannot and M. Moonen, "Subspace methods for multimicrophone speech dereverberation," in Proc. Int. Workshop Acoust. Echo and Noise Control, Sep. 2001, pp. 47-50.
- (2001) Proc. Int. Workshop Acoust. Echo and Noise Control , pp. 47-50
- Gannot, S.¹ Moonen, M.²

39
- 70449620235
- A non-intrusive quality measure of dereverberated speech
- Sep.
- T. H. Falk and W.-Y. Chan, "A non-intrusive quality measure of dereverberated speech," in Proc. Int. Workshop Acoust. Echo Noise Control, Sep. 2008.
- (2008) Proc. Int. Workshop Acoust. Echo Noise Control
- Falk, T.H.¹ Chan, W.-Y.²

40
- 0141814662
- The ICSI meeting corpus
- Apr.
- A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters, "The ICSI meeting corpus," in Proc. Int. Conf. Acoust., Speech, Signal Process., Apr. 2003, vol.I, pp. 364-367.
- (2003) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 364-367
- Janin, A.¹ Baron, D.² Edwards, J.³ Ellis, D.⁴ Gelbart, D.⁵ Morgan, N.⁶ Peskin, B.⁷ Pfau, T.⁸ Shriberg, E.⁹ Stolcke, A.¹⁰ Wooters, C.¹¹

41
- 4143108524
- 3GPP2 C.S0014-0
- Enhanced Variable Rate Codec (EVRC), 3GPP2 C.S0014-0, 1999.
- (1999) Enhanced Variable Rate Codec (EVRC)

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.