Volume 10, Issue 6, 2002, Pages 371-378

Application of time-frequency principal component analysis to text-independent speaker identification

Author keywords

Closed set speaker identification; Contextual covariance matrix; Contextual principal components (CPC); POLYCOST database; Speaker recognition; Speech analysis; Speech representation; Time frequency principal components (TFPC); Vector filtering of spectral trajectories

Indexed keywords

CLOSED-SET SPEAKER IDENTIFICATION; CONTEXTUAL COVARIANCE MATRIX; SPEECH REPRESENTATION; TEXT-INDEPENDENT SPEAKER IDENTIFICATION; TIME-FREQUENCY PRINCIPAL COMPONENT ANALYSIS; VECTOR FILTERING OF SPECTRAL TRAJECTORIES;

EID: 0036754056     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2002.800557     Document Type: Article
Times cited: 15

References (40)
  • 1
    • 0002161311
    • The quefrency alanysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking
    • M. Rosenblatt, Ed. New York: Wiley; ch. 15
    • B. P. Bogert, M. J. R. Healy, and J. W. Tukey, "The quefrency alanysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking," in Proceedings of the Symposium on Time Series Analysis, M. Rosenblatt, Ed. New York: Wiley, 1963, ch. 15, pp. 209-243.
    • (1963) Proceedings of the Symposium on Time Series Analysis , pp. 209-243
    • Bogert, B.P.1    Healy, M.J.R.2    Tukey, J.W.3
  • 3
    • 24844445229
    • Perceptually-based features for speaker identification
    • Ph.D. dissertation, Univ. College Swansea, Univ. Wales, Wales, U.K., Feb.
    • L. Xu, "Perceptually-based features for speaker identification," Ph.D. dissertation, Univ. College Swansea, Univ. Wales, Wales, U.K., Feb. 1992.
    • (1992)
    • Xu, L.1
  • 4
    • 0011179636
    • Combining features via LDA in speaker recognition
    • Berlin, Germany, Sept.
    • Z. P. Sun and J. S. Mason, "Combining features via LDA in speaker recognition," in Proc. EUROSPEECH 93, vol. 3, Berlin, Germany, Sept. 1993, pp. 2287-2290.
    • (1993) Proc. Eurospeech 93 , vol.3 , pp. 2287-2290
    • Sun, Z.P.1    Mason, J.S.2
  • 5
    • 0011242106
    • Within class optimization of cepstra for speaker recognition
    • Berlin, Germany, Sept.
    • J. Thompson and J. S. Mason, "Within class optimization of cepstra for speaker recognition," in Proc. EUROSPEECH 93, vol. 1, Berlin, Germany, Sept. 1993, pp. 165-168.
    • (1993) Proc. Eurospeech 93 , vol.1 , pp. 165-168
    • Thompson, J.1    Mason, J.S.2
  • 6
    • 0011188681
    • Text-dependent speaker verification using recurrent time delay neural networks for feature extraction
    • Beijing, China, Oct.
    • X. Wang and G. Zhao, "Text-dependent speaker verification using recurrent time delay neural networks for feature extraction," in Proc. ICSP 93, vol. 1, Beijing, China, Oct. 1993, pp. 674-677.
    • (1993) Proc. ICSP 93 , vol.1 , pp. 674-677
    • Wang, X.1    Zhao, G.2
  • 8
    • 0011232176
    • Analysis of acoustic features affecting speaker identification
    • N. Higuchi and M. Hashimoto, "Analysis of acoustic features affecting speaker identification," in Proc. EUROSPEECH 95, vol. 1, 1995, pp. 435-438.
    • (1995) Proc. Eurospeech 95 , vol.1 , pp. 435-438
    • Higuchi, N.1    Hashimoto, M.2
  • 9
    • 0029726518
    • Fine structure features for speaker identification
    • Atlanta, GA, May
    • C. R. Jankowski, Jr., T. F. Quatieri, and D. A. Reynolds, "Fine structure features for speaker identification," in Proc. ICASSP 96, vol. 2, Atlanta, GA, May 1996, pp. 689-692.
    • (1996) Proc. ICASSP 96 , vol.2 , pp. 689-692
    • Jankowski C.R., Jr.1    Quatieri, T.F.2    Reynolds, D.A.3
  • 10
    • 0030371145
    • Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network
    • T. Kitamura and S. Takei, "Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network," in Proc. ICSLP 96, 1996.
    • Proc. ICSLP 96, 1996
    • Kitamura, T.1    Takei, S.2
  • 11
    • 0029725529
    • A general framework of feature extraction: Application to speaker recognition
    • Atlanta, GA, May
    • C.-S. Liu, "A general framework of feature extraction: Application to speaker recognition," in Proc. ICASSP 96, vol. 2, Atlanta, GA, May 1996, pp. 669-672.
    • (1996) Proc. ICASSP 96 , vol.2 , pp. 669-672
    • Liu, C.-S.1
  • 13
    • 0019583902
    • Comparison of speaker recognition methods using static features and dynamic features
    • June
    • S. Furui, "Comparison of speaker recognition methods using static features and dynamic features," IEEE Trans. Acoust., Speech, Signal Processing, vol. 29, no. 3, pp. 342-350, June 1981.
    • (1981) IEEE Trans. Acoust., Speech, Signal Processing , vol.29 , Issue.3 , pp. 342-350
    • Furui, S.1
  • 14
    • 0024035182
    • On the use of instantaneous and transitional spectral information in speaker recognition
    • June
    • F. K. Soong and A. E. Rosenberg, "On the use of instantaneous and transitional spectral information in speaker recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 36, pp. 871-879, June 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Processing , vol.36 , pp. 871-879
    • Soong, F.K.1    Rosenberg, A.E.2
  • 15
    • 0011242107
    • Utilisation de la prédiction linéaire en reconnaissance et adaptation au locuteur
    • Strasbourg, France, May
    • Y. Grenier, "Utilisation de la prédiction linéaire en reconnaissance et adaptation au locuteur," in XIèmes Journées d'Etude sur la Parole, Strasbourg, France, May 1980, pp. 163-171.
    • (1980) XIèmes Journées d'Etude sur la Parole , pp. 163-171
    • Grenier, Y.1
  • 16
    • 84953669587
    • Standard and target-driven AR-vector models for speech analysis and speaker recognition
    • San Francisco, CA, Mar.
    • F. Bimbot, L. Mathan, A. De Lima, and G. Chollet, "Standard and target-driven AR-vector models for speech analysis and speaker recognition," in Proc. ICASSP 92, vol. 2, San Francisco, CA, Mar. 1992, pp. II.5-II.8.
    • (1992) Proc. ICASSP 92 , vol.2
    • Bimbot, F.1    Mathan, L.2    De Lima, A.3    Chollet, G.4
  • 17
    • 58049084980
    • Cinematic techniques for speech processing: Temporal decomposition and multivariate linear prediction
    • San Francisco, CA, Mar.
    • C. Montacié, P. Deléglise, F. Bimbot, and M.-J. Caraty, "Cinematic techniques for speech processing: Temporal decomposition and multivariate linear prediction," in Proc. ICASSP 92, vol. 1, San Francisco, CA, Mar. 1992, pp. 153-156.
    • (1992) Proc. ICASSP 92 , vol.1 , pp. 153-156
    • Montacié, C.1    Deléglise, P.2    Bimbot, F.3    Caraty, M.-J.4
  • 18
    • 85135133863
    • AR-vector models for free-text speaker recognition
    • Banff, AB, Canada, Oct.
    • C. Montacié and J.-L. LeFloch, "AR-vector models for free-text speaker recognition," in Proc. ICSLP 92, vol. 1, Banff, AB, Canada, Oct. 1992, pp. 611-614.
    • (1992) Proc. ICSLP 92 , vol.1 , pp. 611-614
    • Montacié, C.1    Lefloch, J.-L.2
  • 19
    • 0011226706
    • Approches statistiques et filtrage vectoriel de trajectoires spectrales pour l'identification du locuteur indépendante du texte
    • Ph.D. dissertation, École Nat. Supérieure Télécommun., Jan.
    • I. Magrin-Chagnolleau, "Approches statistiques et filtrage vectoriel de trajectoires spectrales pour l'identification du locuteur indépendante du texte," Ph.D. dissertation, École Nat. Supérieure Télécommun., Jan. 1997.
    • (1997)
    • Magrin-Chagnolleau, I.1
  • 21
    • 33750899716
    • Sous-espaces de projection de séquences de trames acoustiques pour l'analyse et la reconnaissance de parole
    • Avignon, France
    • F. Bimbot, E. Bocchieri, and B. Atal, "Sous-espaces de projection de séquences de trames acoustiques pour l'analyse et la reconnaissance de parole," in XXIèmes Journées d'Etude sur la Parole, Avignon, France, 1996.
    • (1996) XXIèmes Journées d'Etude sur la Parole
    • Bimbot, F.1    Bocchieri, E.2    Atal, B.3
  • 22
    • 0030374905
    • Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum
    • J.-L. Shen, W.-L. Wang, and L.-S. Lee, "Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum," in Proc. ICSLP 96, 1996.
    • Proc. ICSLP 96, 1996
    • Shen, J.-L.1    Wang, W.-L.2    Lee, L.-S.3
  • 24
    • 0030372606
    • Noise robust estimate of speech dynamics for speaker recognition
    • J. P. Openshaw and J. S. Mason, "Noise robust estimate of speech dynamics for speaker recognition," in Proc. ICSLP 96, 1996.
    • Proc. ICSLP 96, 1996
    • Openshaw, J.P.1    Mason, J.S.2
  • 26
    • 0001692019
    • An analysis of cepstral-time matrices for noise and channel robust speech recognition
    • B. P. Milner and S. V. Vaseghi, "An analysis of cepstral-time matrices for noise and channel robust speech recognition," in Proc. EUROSPEECH 95, vol. 1, 1995, pp. 519-522.
    • (1995) Proc. Eurospeech 95 , vol.1 , pp. 519-522
    • Milner, B.P.1    Vaseghi, S.V.2
  • 27
    • 0030369274
    • Inclusion of temporal information into features for speech recognition
    • B. Milner, "Inclusion of temporal information into features for speech recognition," in Proc. ICSLP 96, 1996.
    • Proc. ICSLP 96, 1996
    • Milner, B.1
  • 31
    • 84953683778
    • Efficient acoustic parameters for speaker recognition
    • J. J. Wolf, "Efficient acoustic parameters for speaker recognition," J. Acoust. Soc. Amer., pt. 2, vol. 51, no. 6, pp. 2044-2056, 1972.
    • (1972) J. Acoust. Soc. Amer. , vol.51 , Issue.6 PART 2 , pp. 2044-2056
    • Wolf, J.J.1
  • 33
    • 0016494495
    • Selection of acoustic features for speaker identification
    • Apr.
    • M. R. Sambur, "Selection of acoustic features for speaker identification," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, pp. 176-182, Apr. 1975.
    • (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , pp. 176-182
    • Sambur, M.R.1
  • 36
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan.
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Processing, vol. 3, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 37
    • 84866588704
    • Eur. COST 250 Action. [Online]
    • (1998) Speaker Recognition in Telephony. Eur. COST 250 Action. [Online]. Available: http://circhp.epfl.ch/polycost.
    • (1998) Speaker Recognition in Telephony
  • 38
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 39
    • 0018918171
    • An algorithm for vector quantization design
    • Jan.
    • Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantization design," IEEE Trans. Commun., vol. 28, pp. 84-95, Jan. 1980.
    • (1980) IEEE Trans. Commun. , vol.28 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.