



Volume 18, Issue 1, 2010, Pages 90-100

Modulation spectral features for robust far-field speaker identification

Author keywords

Gaussian mixture model (GMM); Modulation spectrum; Reverberation; Reverberation time; Speaker identification

Indexed keywords

ADAPTIVE CHANNEL SELECTION; BASELINE SYSTEMS; CHANNEL MODULATION; CLEAN SPEECH; FAR-FIELD; FILTER OUTPUT; GAUSSIAN MIXTURE MODEL; GAUSSIAN MIXTURE MODEL (GMM); MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODULATION FREQUENCIES; MODULATION SPECTRUM; MULTI-CHANNEL; REVERBERATION TIME; SIMULATION RESULT; SPEAKER IDENTIFICATION; SPECTRAL FEATURE; SPECTRAL SIGNAL; SPEECH SIGNALS; TEMPORAL ENVELOPES; TRAINING AND TESTING;

EID: 70449360175     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2023679     Document Type: Article
Times cited: 113

References (41)
  • 1
    • 84902053085
    • The effects of room acoustics on MFCC speech parameter
    • Oct.
    • Y. Pan and A. Waibel, "The effects of room acoustics on MFCC speech parameter," in Proc. Int. Conf. Spoken Lang. Process., Oct. 2000, pp. 129-132.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 129-132
    • Pan, Y.1    Waibel, A.2
  • 3
    • 58349102016
    • Analysis of feature extraction and channel compensation in a GMM speaker recognition system
    • Sep.
    • L. Burget, P. Matejka, P. Schwarz, O. Glembek, and J. Cernocky, "Analysis of feature extraction and channel compensation in a GMM speaker recognition system," IEEE Trans. Audio, Speech Lang. Process., vol.15, no.7, pp. 1979-1986, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech Lang. Process , vol.15 , Issue.7 , pp. 1979-1986
    • Burget, L.1    Matejka, P.2    Schwarz, P.3    Glembek, O.4    Cernocky, J.5
  • 4
    • 0028518091
    • Microphone arrays and speaker identification
    • Oct.
    • Q. Lin, E. Jan, and J. Flanagan, "Microphone arrays and speaker identification," IEEE Trans. Speech Audio Process., vol.2, no.4, pp. 622-629, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 622-629
    • Lin, Q.1    Jan, E.2    Flanagan, J.3
  • 6
    • 0030371792
    • Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays
    • J. Gonzalez-Rodriguez, J. Ortega-Garcia, C. Martin, and L. Hernandez, "Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays," in Proc. Int. Conf. Spoken Lang. Process., 1996.
    • (1996) Proc. Int. Conf. Spoken Lang. Process
    • Gonzalez-Rodriguez, J.1    Ortega-Garcia, J.2    Martin, C.3    Hernandez, L.4
  • 10
    • 84863762570
    • Compensation for room reverberation in speaker identification
    • Aug.
    • A. Akula and P. de Leon, "Compensation for room reverberation in speaker identification," in Proc. Eur. Signal Process. Conf., Aug. 2008.
    • (2008) Proc. Eur. Signal Process. Conf.
    • Akula, A.1    De Leon, P.2
  • 11
    • 63649152839
    • Speaker identification in the presence of room reverberation
    • Sep.
    • P. De Leon and A. Trevizo, "Speaker identification in the presence of room reverberation," in Proc. IEEE Biometrics Symp., Sep. 2007, pp. 1-6.
    • (2007) Proc. IEEE Biometrics Symp. , pp. 1-6
    • De Leon, P.1    Trevizo, A.2
  • 12
    • 85032751546
    • Pushing the envelope-aside
    • Sep.
    • N. Morgan et al., "Pushing the envelope-aside," IEEE Signal Process. Mag., vol.22, no.5, pp. 81-88, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 81-88
    • Morgan, N.1
  • 13
    • 84867199388
    • Spectro-temporal features for robust farfield speaker identification
    • Sep.
    • T. H. Falk and W.-Y. Chan, "Spectro-temporal features for robust farfield speaker identification," in Proc. Int. Conf. Spoken Lang. Process., Sep. 2008, pp. 634-637.
    • (2008) Proc. Int. Conf. Spoken Lang. Process , pp. 634-637
    • Falk, T.H.1    Chan, W.-Y.2
  • 16
    • 84953653955
    • New method of measuring reverberation time
    • Mar.
    • M. Schroeder, "New method of measuring reverberation time," J. Acoust. Soc. Amer., vol.37, no.3, pp. 409-412, Mar. 1965.
    • (1965) J. Acoust. Soc. Amer. , vol.37 , Issue.3 , pp. 409-412
    • Schroeder, M.1
  • 17
    • 56149101743
    • The simulation of realistic acoustic input scenarios for speech recognition systems
    • H. Hirsch and H. Finster, "The simulation of realistic acoustic input scenarios for speech recognition systems," Proc. Interspeech, 2005.
    • (2005) Proc. Interspeech
    • Hirsch, H.1    Finster, H.2
  • 20
    • 34250856781
    • Multimicrophone speech dereverberation: Experimental validation
    • K. Eneman and M. Moonen, "Multimicrophone speech dereverberation: Experimental validation," EURASIP J. Audio, Speech, Music Process., p. 19, 2007.
    • (2007) EURASIP J. Audio, Speech, Music Process , pp. 19
    • Eneman, K.1    Moonen, M.2
  • 21
    • 84890497820
    • Temporal dynamics for blind measurement of room acoustical parameters
    • to be published
    • T. H. Falk and W.-Y. Chan, "Temporal dynamics for blind measurement of room acoustical parameters," IEEE Trans. Instrum. Meas., 2009, to be published.
    • (2009) IEEE Trans. Instrum. Meas.
    • Falk, T.H.1    Chan, W.-Y.2
  • 22
    • 0003913694
    • An efficient implementation of the Patterson-Holdsworth auditory filterbank
    • Perception Group
    • M. Slaney, "An Efficient Implementation of the Patterson-Holdsworth Auditory Filterbank," Apple Computer, Perception Group, 1993.
    • (1993) Apple Computer
    • Slaney, M.1
  • 23
    • 0025110885
    • Derivation of auditory filter shapes from notched-noise data
    • DOI 10.1016/0378-5955(90)90170-T
    • B. Glasberg and B. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol.47, no.1-2, pp. 103-138, 1990. (Pubitemid 20244652)
    • (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 24
    • 0029952425
    • A quantitative model of the effective signal processing in the auditory system. I-model structure
    • T. Dau, D. Puschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I-model structure," J. Acoust. Soc. Amer., vol.99, no.6, pp. 3615-3622, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Puschel, D.2    Kohlrausch, A.3
  • 26
    • 0028287770
    • Effect of reducing slow temporal modulations on speech reception
    • DOI 10.1121/1.409836
    • R. Drullman, J. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Amer., vol.95, no.5, pp. 2670-2680, May 1994. (Pubitemid 24152861)
    • (1994) Journal of the Acoustical Society of America , vol.95 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 27
    • 0030369532
    • Intelligibility of speech with filtered time trajectories of spectral envelopes
    • Oct.
    • T. Arai, M. Pavel, H. Hermansky, and C. Avendano, "Intelligibility of speech with filtered time trajectories of spectral envelopes," in Proc. Int. Conf. Speech Lang. Process., Oct. 1996, pp. 2490-2493.
    • (1996) Proc. Int. Conf. Speech Lang. Process , pp. 2490-2493
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4
  • 29
    • 0037034899
    • Chimaeric sounds reveal dichotomies in auditory perception
    • Mar.
    • Z. Smith, B. Delgutte, and A. Oxenham, "Chimaeric sounds reveal dichotomies in auditory perception," Lett. Nature, vol.416, pp. 87-90, Mar. 2002.
    • (2002) Lett. Nature , vol.416 , pp. 87-90
    • Smith, Z.1    Delgutte, B.2    Oxenham, A.3
  • 30
    • 0029355999
    • Speaker identification and verification using Gaussian mixture speaker models
    • Aug.
    • D. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," in Speech Commun., Aug. 1995, vol.17, pp. 91-108.
    • (1995) Speech Commun. , vol.17 , pp. 91-108
    • Reynolds, D.1
  • 31
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol.39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 33
    • 70449427270
    • Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech
    • Sep.
    • T. H. Falk, H. Yuan, and W.-Y. Chan, "Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech," in Proc. Int. Conf. Spoken Lang. Process., Sep. 2007, pp. 514-517.
    • (2007) Proc. Int. Conf. Spoken Lang. Process , pp. 514-517
    • Falk, T.H.1    Yuan, H.2    Chan, W.-Y.3
  • 36
    • 66149120614
    • Speaker identification using instantaneous frequencies
    • Aug.
    • M. Grimaldi and F. Cummins, "Speaker identification using instantaneous frequencies," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.6, pp. 1097-1111, Aug. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process , vol.16 , Issue.6 , pp. 1097-1111
    • Grimaldi, M.1    Cummins, F.2
  • 37
    • 0030172028
    • A microphone array processing technique for speech enhancement in a reverberant space
    • Q.-G. Liu, B. Champagne, and P. Kabal, "A microphone array processing technique for speech enhancement in a reverberant space," Speech Commun., vol.18, no.4, pp. 317-334, 1996.
    • (1996) Speech Commun. , vol.18 , Issue.4 , pp. 317-334
    • Liu, Q.-G.1    Champagne, B.2    Kabal, P.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.