SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 16, Issue 6, 2008, Pages 1097-1111

Speaker identification using instantaneous frequencies

(2) Grimaldi, Marco a Cummins, Fred a

a NONE

Author keywords

AM FM representation; Instantaneous frequency; Speaker identification; Speaker recognition.

Indexed keywords

AM-FM REPRESENTATION; BANDWIDTH SCALING; CEPSTRAL COEFFICIENTS; CLASSIFICATION SYSTEM; EXPERIMENTAL EVALUATION; FORMANT TRACKING; FREQUENCY RANGES; GAUSSIAN MIXTURE MODEL; INSTANTANEOUS FREQUENCY; LIMITING CASE; NEW PARAMETERS; PARAMETRIZATION; REFERENCE SYSTEMS; SPEAKER IDENTIFICATION; SPEAKER RECOGNITION.; SPECTROGRAPHIC ANALYSIS; SPEECH DATA; SPEECH SIGNALS; TESTING MATERIALS; TEXT-INDEPENDENT SPEAKER IDENTIFICATION; VOICED SPEECH; WHISPERED SPEECH;

AMPLITUDE MODULATION; ELECTRIC FREQUENCY MEASUREMENT; LOUDSPEAKERS; SPEECH; SPEECH RECOGNITION;

IDENTIFICATION (CONTROL SYSTEMS);

EID: 66149120614 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.2001109 Document Type: Article

Times cited : (123)

References (42)

1
- 33745222458
- Forensic speaker identification, A likelihood ratio-based approach using vowel formants
- Munich, Germany: LINCOM
- T. B. Alderman, Forensic Speaker Identification, A Likelihood Ratio-Based Approach Using Vowel Formants, ser. Lincom Studies in Phonetics. Munich, Germany: LINCOM, 2005.
- (2005) ser. lincom studies in phonetics
- Alderman, T.B.¹

2
- 0015112070
- "Speech analysis and synthesis by linear prediction of the speech wave,"
- B. S. Atal and S. L. Hanauer, "Speech analysis and synthesis by linear prediction of the speech wave," J. Acoust. Soc. Amer., vol. 50, pp. 637-655, 1971.
- (1971) J. Acoust. Soc. Amer. , vol.50 , pp. 637-655
- Atal, B.S.¹ Hanauer, S.L.²

3
- 2942594475
- "A tutorial on text-inde-pendent speaker verification,"
- F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Ma-grin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-Garcfa, D. Petrovska-Delacretaz, and D. A. Reynolds, "A tutorial on text-inde-pendent speaker verification," EURASIP J. Appl. Signal Process., vol. 2004, no. 1, pp. 430-451, 2004.
- (2004) EURASIP J. Appl. Signal Process. , vol.2004 , Issue.1 , pp. 430-451
- Bimbot, F.¹ Bonastre, J.-F.² Fredouille, C.³ Gravier, G.⁴ Magrin-Chagnolleau, I.⁵ Meignier⁶ Merlin, T.⁷ Ortega-Garcfa, J.⁸ Petrovska-Delacretaz, D.⁹ Reynolds, D.A.¹⁰

4
- 84937035392
- "Estimating and interpreting the instanteneous frequency of a signal-Part 1: Fundamentals,"
- Apr.
- B. Boashash, "Estimating and interpreting the instanteneous frequency of a signal-Part 1: Fundamentals," Proc. IEEE, vol. 80, no. 4, pp. 519-538, Apr. 1992.
- (1992) Proc. IEEE , vol.80 , Issue.4 , pp. 519-538
- Boashash, B.¹

5
- 4444257069
- "Praat, a system for doing phonetics by computer,"
- P. Boersma, "Praat, a system for doing phonetics by computer," Glot Int., vol. 5, no. 9/10, pp. 341-345, 2001.
- (2001) Glot Int. , vol.5 , Issue.9-10 , pp. 341-345
- Boersma, P.¹

6
- 85009224958
- "Person authentication by voice: A need for caution,"
- Genoa, Italy, Sep.
- J. F. Bonastre, F. Bimbot, L. J. Boe, J. P. Campbell, D. A. Reynolds, and I. Magrin-Chagnolleau, "Person authentication by voice: A need for caution," in Proc. Eurospeech 2003, Genoa, Italy, Sep. 2003, pp. 33-36.
- (2003) In Proc. Eurospeech 2003 , pp. 33-36
- Bonastre, J.F.¹ Bimbot, F.² Boe, L.J.³ Campbell, J.P.⁴ Reynolds, D.A.⁵ Magrin-Chagnolleau, I.⁶

7
- 0031233424
- "Speaker recognition: A tutorial,"
- Sep.
- J. P. Campbell, Jr., "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
- (1997) Proc. IEEE , vol.5 , Issue.9 , pp. 1437-1462
- Campbell Jr., J.P.¹

8
- 0000291808
- Methods of combining multiple classifiers wtth different features and their applications to text-independent speaker identification
- K. Chen, L. Wang, and H. Chi, "Methods of combining multiple clas-sifiers with different features and their applications to text-independent speaker identification," Int. J. Pattern Recognition Artif. Intell., vol. 11, no. 3, pp. 417-445, 1997. (Pubitemid 127623791)
- (1997) International Journal of Pattern Recognition and Artificial Intelligence , vol.11 , Issue.3 , pp. 417-445
- Chen, K.¹ Wang, L.² Chi, H.³

9
- 57649245616
- "The chains corpus: Characterizing individual speakers,"
- St. Petersburg, Russia
- F. Cummins, M. Grimaldi, T. Leonard, and J. Simko, "The chains corpus: Characterizing individual speakers," in Proc. SPECOM'06, St. Petersburg, Russia, 2006, pp. 431-435.
- (2006) In Proc. SPECOM'06 , pp. 431-435
- Cummins, F.¹ Grimaldi, M.² Leonard, T.³ Simko, J.⁴

10
- 85009224932
- "Robust energy demodulation based on continuous models with application to speech recognition,"
- D. Dimitriadis and P. Maragos, "Robust energy demodulation based on continuous models with application to speech recognition," in Proc. Eurospeech'03, 2003, pp. 2853-2856.
- (2003) In Proc. Eurospeech'03 , pp. 2853-2856
- Dimitriadis, D.¹ Maragos, P.²

11
- 27644455860
- "Robust AM-FM features for speech recognition,"
- D. V. Dimitriadis, P. Maragos, and A. Potamianos, "Robust AM-FM features for speech recognition," IEEE Signal Process. Lett., vol. 12, no. 9, pp. 621-624, Sep. 2005.
- (2005) IEEE Signal Process. Lett. , vol.12 , Issue.9 , pp. 621
- Dimitriadis, D.V.¹ Maragos, P.² Potamianos, A.³

12
- 0036311784
- "Nonlinear speech processing: Overview and applications,"
- M. Foundez-Zanuy, S. McLaughlin, A. Esposito, A. Hussain, J. Schoentgen, G. Kubin, W. B. Kleijn, and P. Maragos, "Nonlinear speech processing: Overview and applications," Control Intell. Syst., vol. 30, pp. 1-10, 2002.
- (2002) Control Intell. Syst. , vol.30 , pp. 1-10
- Foundez-Zanuy, M.¹ McLaughlin, S.² Esposito, A.³ Hussain, A.⁴ Schoentgen, J.⁵ Kubin, G.⁶ Kleijn, W.B.⁷ Maragos, P.⁸

13
- 0019555090
- CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION.
- S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 2, pp. 254-272, Apr. 1981. (Pubitemid 11495877)
- (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.2 , pp. 254-272
- Furui Sadaoki¹

14
- 0000293183
- "Theory of communication,"
- Nov.
- D. Gabor, "Theory of communication," JIEE, vol. 93, no. 3, pp. 429-457, Nov. 1946.
- (1946) JIEE , vol.93 , Issue.3 , pp. 429-457
- Gabor, D.¹

15
- 0042615361
- "Public databases for speaker recognition and verification,"
- Apr.
- J. Godfrey, D. Graff, and A. Martin, "Public databases for speaker recognition and verification," in Proc. ESCA Workshop Automatic Speaker Recognition, Identification, Verification, Martigny, Switzerland, Apr. 1994, pp. 39-12.
- (1994) In Proc. ESCA Workshop Automatic Speaker Recognition, Identification, Verification , pp. 39-42
- Godfrey, J.¹ Graff, D.² Martin, A.³

16
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- DOI 10.1121/1.399423
- H. Hermansky, "Perceptual linear prediction (PLP) analysis for speech," J. Acoust. Soc. Amer., pp. 1738-1752, 1990. (Pubitemid 20256470)
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

17
- 85016663198
- "RASTA-PLP speech analysis technique,"
- Mar. 23-26
- H. Hermansky, N. Morgan, A. Bayya, and P. Kohn, "RASTA-PLP speech analysis technique," in Proc. IEEE Int. Con}. Acoust., Speech, Signal Process. (ICASSP'92), Mar. 23-26, 1992, vol. 1, pp. 121-124.
- (1992) In Proc. IEEE Int. Con}. Acoust., Speech, Signal Process. (ICASSP'92) , vol.1 , pp. 121-124
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

18
- 66149119135
- New York: Academic
- H. Hollien, Forensic Voice Indentification. New York: Academic, 2002.
- (2002) Forensic Voice Indentification
- Hollien, H.¹

19
- 0028996918
- "Measuring fine structure in speech: Application to speaker identification,"
- C. R. Jankowski, Jr, T. F. Quatieri, and D. A. Reynolds, "Measuring fine structure in speech: Application to speaker identification," in IEEE Int. Conf. Acoustics, Speech. Signal Process. (1CASSP'95), 1995, pp. 325-328.
- (1995) In IEEE Int. Conf. Acoustics, Speech. Signal Process. (1CASSP'95) , pp. 325-328
- Jankowski, C.R.¹ Quatieri Jr., T.F.² Reynolds, D.A.³

20
- 0345940399
- "On Teager's energy algorithm and its generalization to continuos signals,"
- New York, CD-ROM
- J. K. Keiser, "On Teager's energy algorithm and its generalization to continuos signals," in Proc. IEEE DSP Workshop, New York, 1990, CD-ROM.
- (1990) In Proc. IEEE DSP Workshop
- Keiser, J.K.¹

21
- 0005415015
- "Voiceprint identification,"
- L. G. Kersta, "Voiceprint identification," Nature, vol. 196, pp. 1253-1257, 1962.
- (1962) Nature , vol.196 , pp. 1253-1257
- Kersta, L.G.¹

22
- 2142809668
- "Signal representation based on instantaneous amplitude models with application to speech synthesis,"
- May
- G. Li, L. Qiu, and L. K. Ng, "Signal representation based on instantaneous amplitude models with application to speech synthesis," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 353-357, May 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 353-357
- Li, G.¹ Qiu, L.² Ng, L.K.³

23
- 0016495091
- "Linear prediction: A tutorial review,"
- Apr.
- J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 4, pp. 561-580, Apr. 1975.
- (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-582
- Makhoul, J.¹

24
- 29444456613
- Speaker recognition by location in the space of reference speakers
- DOI 10.1016/j.specom.2005.06.014, PII S016763930500169X
- Y. Mami and D. Charlet, "Speaker recognition by location in the space of reference speakers," Speech Commun., vol. 48, no. 2, pp. 127-141, 2006. (Pubitemid 43012027)
- (2006) Speech Communication , vol.48 , Issue.2 , pp. 127-141
- Mami, Y.¹ Charlet, D.²

25
- 0027676955
- "Energy separation in signal modulations with application to speech analysis,"
- Oct.
- P. Maragos, J. F. Kaiser, and T. F. Quatieri, "Energy separation in signal modulations with application to speech analysis," IEEE Trans. Signal Process., vol. 41, no. 10, pp. 3024-3051, Oct. 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , Issue.10 , pp. 3024-3051
- Maragos, P.¹ Kaiser, J.F.² Quatieri, T.F.³

26
- 0004154016
- New York: Marcel Dekker
- G. McLachlan and K. E. Basford, Mixture Models. New York: Marcel Dekker, 1987.
- (1987) Mixture Models
- McLachlan, G.¹ Basford, K.E.²

27
- 0032595177
- "Robust text-independent speaker identification over telephone channels,"
- Sep.
- H. A. Murthy, F. Beaufays, L. P. Heck, and M. Weintraub, "Robust text-independent speaker identification over telephone channels," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 554-568, Sep. 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 554-568
- Murthy, H.A.¹ Beaufays, F.² Heck, L.P.³ Weintraub, M.⁴

28
- 4544326619
- "Usefulness of phase in speech processing,"
- Gifu, Japan
- K. K. Paliwal, "Usefulness of phase in speech processing," in Proc. IPSJ Spoken Lang. Process. Workshop, Gifu, Japan, 2003, pp. 1-6.
- (2003) In Proc. IPSJ Spoken Lang. Process. Workshop , pp. 1-6
- Paliwal, K.K.¹

29
- 85009192384
- "Frequency-related representation of . speech,"
- Sep.
- K. K. Paliwal and B. S. Atal, "Frequency-related representation of . speech," in Proc. Eurospeech'03, Sep. 2003, pp. 65-68.
- (2003) In Proc. Eurospeech'03 , pp. 65-68
- Paliwal, K.K.¹ Atal, B.S.²

30
- 0030008906
- Speech formant frequency and bandwidth tracking using multiband energy demodulation
- DOI 10.1121/1.414997
- A. Potamianos and P, Maragos, "Speech formant frequency and bandwidth tracking using multiband energy demodulation," J. Acoust. Soc. Amer., vol. 99, pp. 3795-3806, 1996. (Pubitemid 26190269)
- (1996) Journal of the Acoustical Society of America , vol.99 , Issue.6 , pp. 3795-3806
- Potamianos, A.¹ Maragos, P.²

31
- 0035278964
- "Time-frequency distributions for automatic speech recognition,"
- Mar.
- A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 196-200, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 196-200
- Potamianos, A.¹ Maragos, P.²

32
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and R. W. Shafer, Digital Signal Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1989.
- (1989) Digital Signal Processing of Speech Signals
- Rabiner, L.R.¹ Shafer, R.W.²

33
- 0000330384
- "On decomposing speech into modulated components,"
- may
- A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 240-254
- Rao, A.¹ Kumaresan, R.²

34
- 0028515984
- "Experimental evaluation of features for robust speaker identification,"
- Oct.
- D. A. Reynolds, "Experimental evaluation of features for robust speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639-643, Oct. 1994.
- (2000) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 639-643
- Reynolds, D.A.¹

35
- 0036293830
- An overview of automatic speaker recognition technology
- D. A. Reynolds, "An overview of automatic speaker recognition technology," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP '02), 2002, pp. IV-4072-IV-4075. (Pubitemid 34711225)
- (2002) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.4
- Reynolds, D.A.¹

36
- 0029209272
- "Robust text-independent speaker identification using gaussian mixture speaker models,"
- Jan.
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

37
- 84874479055
- "Computer recognition of speakers who disguise their voice,"
- CD-ROM
- R. D. Rodman, "Computer recognition of speakers who disguise their voice," in Proc ICSPAT'00, 2000, CD-ROM.
- (2000) In Proc ICSPAT'00
- Rodman, R.D.¹

38
- 66149095995
- Forensic speaker indentification
- New York: Taylor and Francis
- P. Rose, Forensic Speaker Indentification, ser. Forensic Science. New York: Taylor and Francis, 2002.
- (2002) ser. Forensic Science
- Rose, P.¹

39
- 0003236089
- "Evidence for nonlinear sound production mechanisms in the vocal tract,"
- ser. NATO Advanced Study Institute Series D, W. J. Hard-castle and A. Marchal, Eds. Bonas, France: Kluwer, Jul.
- H. M. Teager and S. M. Teager, "Evidence for nonlinear sound production mechanisms in the vocal tract," in Speech Production and Speech Modelling, ser. NATO Advanced Study Institute Series D, W. J. Hard-castle and A. Marchal, Eds. Bonas, France: Kluwer, Jul. 1989, vol. 55.
- (1989) In Speech Production and Speech Modelling , vol.55
- Teager, H.M.¹ Teager, S.M.²

40
- 0036298128
- Evaluation of kernel methods for speaker verification and identification
- V. Wan and S. Renals, "Evaluation of kernel methods for speaker verification and identification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'02), 2002, vol. 1, pp. 669-672. (Pubitemid 34710379)
- (2002) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1
- Wan, V.¹ Renals, S.²

41
- 14644412368
- "Speaker verification using sequence discriminant support vector machines,"
- Mar.
- V. Wan and S. Renals, "Speaker verification using sequence discriminant support vector machines," IEEE Trans. Speech Audio Process., vol. 13, no. 2, pp. 203-210, Mar. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 203-210
- Wan, V.¹ Renals, S.²

42
- 0041360472
- "Efficient text-independent speaker verification with structural Gaussian mixture models and neural network,"
- Sep.
- B. Xiang and T. Berger, "Efficient text-independent speaker verification with structural Gaussian mixture models and neural network," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 447-456, Sep. 2003.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 447-456
- Xiang, B.¹ Berger, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.