메뉴 건너뛰기




Volumn 16, Issue 6, 2008, Pages 1097-1111

Speaker identification using instantaneous frequencies

(2)  Grimaldi, Marco a   Cummins, Fred a  

a NONE

Author keywords

AM FM representation; Instantaneous frequency; Speaker identification; Speaker recognition.

Indexed keywords

AM-FM REPRESENTATION; BANDWIDTH SCALING; CEPSTRAL COEFFICIENTS; CLASSIFICATION SYSTEM; EXPERIMENTAL EVALUATION; FORMANT TRACKING; FREQUENCY RANGES; GAUSSIAN MIXTURE MODEL; INSTANTANEOUS FREQUENCY; LIMITING CASE; NEW PARAMETERS; PARAMETRIZATION; REFERENCE SYSTEMS; SPEAKER IDENTIFICATION; SPEAKER RECOGNITION.; SPECTROGRAPHIC ANALYSIS; SPEECH DATA; SPEECH SIGNALS; TESTING MATERIALS; TEXT-INDEPENDENT SPEAKER IDENTIFICATION; VOICED SPEECH; WHISPERED SPEECH;

EID: 66149120614     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2001109     Document Type: Article
Times cited : (123)

References (42)
  • 1
    • 33745222458 scopus 로고    scopus 로고
    • Forensic speaker identification, A likelihood ratio-based approach using vowel formants
    • Munich, Germany: LINCOM
    • T. B. Alderman, Forensic Speaker Identification, A Likelihood Ratio-Based Approach Using Vowel Formants, ser. Lincom Studies in Phonetics. Munich, Germany: LINCOM, 2005.
    • (2005) ser. lincom studies in phonetics
    • Alderman, T.B.1
  • 2
    • 0015112070 scopus 로고
    • "Speech analysis and synthesis by linear prediction of the speech wave,"
    • B. S. Atal and S. L. Hanauer, "Speech analysis and synthesis by linear prediction of the speech wave," J. Acoust. Soc. Amer., vol. 50, pp. 637-655, 1971.
    • (1971) J. Acoust. Soc. Amer. , vol.50 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 4
    • 84937035392 scopus 로고
    • "Estimating and interpreting the instanteneous frequency of a signal-Part 1: Fundamentals,"
    • Apr.
    • B. Boashash, "Estimating and interpreting the instanteneous frequency of a signal-Part 1: Fundamentals," Proc. IEEE, vol. 80, no. 4, pp. 519-538, Apr. 1992.
    • (1992) Proc. IEEE , vol.80 , Issue.4 , pp. 519-538
    • Boashash, B.1
  • 5
    • 4444257069 scopus 로고    scopus 로고
    • "Praat, a system for doing phonetics by computer,"
    • P. Boersma, "Praat, a system for doing phonetics by computer," Glot Int., vol. 5, no. 9/10, pp. 341-345, 2001.
    • (2001) Glot Int. , vol.5 , Issue.9-10 , pp. 341-345
    • Boersma, P.1
  • 7
    • 0031233424 scopus 로고    scopus 로고
    • "Speaker recognition: A tutorial,"
    • Sep.
    • J. P. Campbell, Jr., "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
    • (1997) Proc. IEEE , vol.5 , Issue.9 , pp. 1437-1462
    • Campbell Jr., J.P.1
  • 8
    • 0000291808 scopus 로고    scopus 로고
    • Methods of combining multiple classifiers wtth different features and their applications to text-independent speaker identification
    • K. Chen, L. Wang, and H. Chi, "Methods of combining multiple clas-sifiers with different features and their applications to text-independent speaker identification," Int. J. Pattern Recognition Artif. Intell., vol. 11, no. 3, pp. 417-445, 1997. (Pubitemid 127623791)
    • (1997) International Journal of Pattern Recognition and Artificial Intelligence , vol.11 , Issue.3 , pp. 417-445
    • Chen, K.1    Wang, L.2    Chi, H.3
  • 9
    • 57649245616 scopus 로고    scopus 로고
    • "The chains corpus: Characterizing individual speakers,"
    • St. Petersburg, Russia
    • F. Cummins, M. Grimaldi, T. Leonard, and J. Simko, "The chains corpus: Characterizing individual speakers," in Proc. SPECOM'06, St. Petersburg, Russia, 2006, pp. 431-435.
    • (2006) In Proc. SPECOM'06 , pp. 431-435
    • Cummins, F.1    Grimaldi, M.2    Leonard, T.3    Simko, J.4
  • 10
    • 85009224932 scopus 로고    scopus 로고
    • "Robust energy demodulation based on continuous models with application to speech recognition,"
    • D. Dimitriadis and P. Maragos, "Robust energy demodulation based on continuous models with application to speech recognition," in Proc. Eurospeech'03, 2003, pp. 2853-2856.
    • (2003) In Proc. Eurospeech'03 , pp. 2853-2856
    • Dimitriadis, D.1    Maragos, P.2
  • 13
    • 0019555090 scopus 로고
    • CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION.
    • S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 2, pp. 254-272, Apr. 1981. (Pubitemid 11495877)
    • (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.2 , pp. 254-272
    • Furui Sadaoki1
  • 14
    • 0000293183 scopus 로고
    • "Theory of communication,"
    • Nov.
    • D. Gabor, "Theory of communication," JIEE, vol. 93, no. 3, pp. 429-457, Nov. 1946.
    • (1946) JIEE , vol.93 , Issue.3 , pp. 429-457
    • Gabor, D.1
  • 16
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky, "Perceptual linear prediction (PLP) analysis for speech," J. Acoust. Soc. Amer., pp. 1738-1752, 1990. (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 20
    • 0345940399 scopus 로고
    • "On Teager's energy algorithm and its generalization to continuos signals,"
    • New York, CD-ROM
    • J. K. Keiser, "On Teager's energy algorithm and its generalization to continuos signals," in Proc. IEEE DSP Workshop, New York, 1990, CD-ROM.
    • (1990) In Proc. IEEE DSP Workshop
    • Keiser, J.K.1
  • 21
    • 0005415015 scopus 로고
    • "Voiceprint identification,"
    • L. G. Kersta, "Voiceprint identification," Nature, vol. 196, pp. 1253-1257, 1962.
    • (1962) Nature , vol.196 , pp. 1253-1257
    • Kersta, L.G.1
  • 22
    • 2142809668 scopus 로고    scopus 로고
    • "Signal representation based on instantaneous amplitude models with application to speech synthesis,"
    • May
    • G. Li, L. Qiu, and L. K. Ng, "Signal representation based on instantaneous amplitude models with application to speech synthesis," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 353-357, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 353-357
    • Li, G.1    Qiu, L.2    Ng, L.K.3
  • 23
    • 0016495091 scopus 로고
    • "Linear prediction: A tutorial review,"
    • Apr.
    • J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 4, pp. 561-580, Apr. 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-582
    • Makhoul, J.1
  • 24
    • 29444456613 scopus 로고    scopus 로고
    • Speaker recognition by location in the space of reference speakers
    • DOI 10.1016/j.specom.2005.06.014, PII S016763930500169X
    • Y. Mami and D. Charlet, "Speaker recognition by location in the space of reference speakers," Speech Commun., vol. 48, no. 2, pp. 127-141, 2006. (Pubitemid 43012027)
    • (2006) Speech Communication , vol.48 , Issue.2 , pp. 127-141
    • Mami, Y.1    Charlet, D.2
  • 25
    • 0027676955 scopus 로고
    • "Energy separation in signal modulations with application to speech analysis,"
    • Oct.
    • P. Maragos, J. F. Kaiser, and T. F. Quatieri, "Energy separation in signal modulations with application to speech analysis," IEEE Trans. Signal Process., vol. 41, no. 10, pp. 3024-3051, Oct. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.10 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.F.2    Quatieri, T.F.3
  • 27
    • 0032595177 scopus 로고    scopus 로고
    • "Robust text-independent speaker identification over telephone channels,"
    • Sep.
    • H. A. Murthy, F. Beaufays, L. P. Heck, and M. Weintraub, "Robust text-independent speaker identification over telephone channels," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 554-568, Sep. 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 554-568
    • Murthy, H.A.1    Beaufays, F.2    Heck, L.P.3    Weintraub, M.4
  • 29
    • 85009192384 scopus 로고    scopus 로고
    • "Frequency-related representation of . speech,"
    • Sep.
    • K. K. Paliwal and B. S. Atal, "Frequency-related representation of . speech," in Proc. Eurospeech'03, Sep. 2003, pp. 65-68.
    • (2003) In Proc. Eurospeech'03 , pp. 65-68
    • Paliwal, K.K.1    Atal, B.S.2
  • 30
    • 0030008906 scopus 로고    scopus 로고
    • Speech formant frequency and bandwidth tracking using multiband energy demodulation
    • DOI 10.1121/1.414997
    • A. Potamianos and P, Maragos, "Speech formant frequency and bandwidth tracking using multiband energy demodulation," J. Acoust. Soc. Amer., vol. 99, pp. 3795-3806, 1996. (Pubitemid 26190269)
    • (1996) Journal of the Acoustical Society of America , vol.99 , Issue.6 , pp. 3795-3806
    • Potamianos, A.1    Maragos, P.2
  • 31
    • 0035278964 scopus 로고    scopus 로고
    • "Time-frequency distributions for automatic speech recognition,"
    • Mar.
    • A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 196-200, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 196-200
    • Potamianos, A.1    Maragos, P.2
  • 33
    • 0000330384 scopus 로고    scopus 로고
    • "On decomposing speech into modulated components,"
    • may
    • A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 240-254
    • Rao, A.1    Kumaresan, R.2
  • 34
    • 0028515984 scopus 로고    scopus 로고
    • "Experimental evaluation of features for robust speaker identification,"
    • Oct.
    • D. A. Reynolds, "Experimental evaluation of features for robust speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639-643, Oct. 1994.
    • (2000) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 639-643
    • Reynolds, D.A.1
  • 36
    • 0029209272 scopus 로고
    • "Robust text-independent speaker identification using gaussian mixture speaker models,"
    • Jan.
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 37
    • 84874479055 scopus 로고    scopus 로고
    • "Computer recognition of speakers who disguise their voice,"
    • CD-ROM
    • R. D. Rodman, "Computer recognition of speakers who disguise their voice," in Proc ICSPAT'00, 2000, CD-ROM.
    • (2000) In Proc ICSPAT'00
    • Rodman, R.D.1
  • 38
    • 66149095995 scopus 로고    scopus 로고
    • Forensic speaker indentification
    • New York: Taylor and Francis
    • P. Rose, Forensic Speaker Indentification, ser. Forensic Science. New York: Taylor and Francis, 2002.
    • (2002) ser. Forensic Science
    • Rose, P.1
  • 39
    • 0003236089 scopus 로고
    • "Evidence for nonlinear sound production mechanisms in the vocal tract,"
    • ser. NATO Advanced Study Institute Series D, W. J. Hard-castle and A. Marchal, Eds. Bonas, France: Kluwer, Jul.
    • H. M. Teager and S. M. Teager, "Evidence for nonlinear sound production mechanisms in the vocal tract," in Speech Production and Speech Modelling, ser. NATO Advanced Study Institute Series D, W. J. Hard-castle and A. Marchal, Eds. Bonas, France: Kluwer, Jul. 1989, vol. 55.
    • (1989) In Speech Production and Speech Modelling , vol.55
    • Teager, H.M.1    Teager, S.M.2
  • 41
    • 14644412368 scopus 로고    scopus 로고
    • "Speaker verification using sequence discriminant support vector machines,"
    • Mar.
    • V. Wan and S. Renals, "Speaker verification using sequence discriminant support vector machines," IEEE Trans. Speech Audio Process., vol. 13, no. 2, pp. 203-210, Mar. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 203-210
    • Wan, V.1    Renals, S.2
  • 42
    • 0041360472 scopus 로고    scopus 로고
    • "Efficient text-independent speaker verification with structural Gaussian mixture models and neural network,"
    • Sep.
    • B. Xiang and T. Berger, "Efficient text-independent speaker verification with structural Gaussian mixture models and neural network," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 447-456, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 447-456
    • Xiang, B.1    Berger, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.