SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 10, Issue 6, 2002, Pages 371-378

Application of time-frequency principal component analysis to text-independent speaker identification

(3) Magrin Chagnolleau, Ivan a,b Durou, Geoffrey c Bimbot, Frédéric d

a IEEE (France)

b CNRS (France)

c FACULTÉ POLYTECHNIQUE DE MONS (Belgium)

d INRIA (France)

Author keywords

Closed set speaker identification; Contextual covariance matrix; Contextual principal components (CPC); POLYCOST database; Speaker recognition; Speech analysis; Speech representation; Time frequency principal components (TFPC); Vector filtering of spectral trajectories

Indexed keywords

CLOSED-SET SPEAKER IDENTIFICATION; CONTEXTUAL COVARIANCE MATRIX; SPEECH REPRESENTATION; TEXT-INDEPENDENT SPEAKER IDENTIFICATION; TIME-FREQUENCY PRINCIPAL COMPONENT ANALYSIS; VECTOR FILTERING OF SPECTRAL TRAJECTORIES;

COSINE TRANSFORMS; EIGENVALUES AND EIGENFUNCTIONS; FOURIER TRANSFORMS; MATRIX ALGEBRA; PRINCIPAL COMPONENT ANALYSIS; SPEECH ANALYSIS; SPEECH PROCESSING; SPEECH SYNTHESIS; VECTORS;

SPEECH RECOGNITION;

EID: 0036754056 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2002.800557 Document Type: Article

Times cited : (15)

References (40)

1
- 0002161311
- The quefrency alanysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking
- M. Rosenblatt, Ed. New York: Wiley; ch. 15
- B. P. Bogert, M. J. R. Healy, and J. W. Tukey, "The quefrency alanysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking," in Proceedings of the Symposium on Time Series Analysis, M. Rosenblatt, Ed. New York: Wiley, 1963, ch. 15, pp. 209-243.
- (1963) Proceedings of the Symposium on Time Series Analysis , pp. 209-243
- Bogert, B.P.¹ Healy, M.J.R.² Tukey, J.W.³

2
- 0000250293
- Homomorphic analysis of speech
- June
- A. V. Oppenheim and R. W. Schafer, "Homomorphic analysis of speech," IEEE Tran. Audio Electroacoust., vol. AE-16, pp. 221-226, June 1968.
- (1968) IEEE Tran. Audio Electroacoust. , vol.AE-16 , pp. 221-226
- Oppenheim, A.V.¹ Schafer, R.W.²

3
- 24844445229
- Perceptually-based features for speaker identification
- Ph.D. dissertation, Univ. College Swansea, Univ. Wales, Wales, U.K., Feb.
- L. Xu, "Perceptually-based features for speaker identification," Ph.D. dissertation, Univ. College Swansea, Univ. Wales, Wales, U.K., Feb. 1992.
- (1992)
- Xu, L.¹

4
- 0011179636
- Combining features via LDA in speaker recognition
- Berlin, Germany, Sept.
- Z. P. Sun and J. S. Mason, "Combining features via LDA in speaker recognition," in Proc. EUROSPEECH 93, vol. 3, Berlin, Germany, Sept. 1993, pp. 2287-2290.
- (1993) Proc. Eurospeech 93 , vol.3 , pp. 2287-2290
- Sun, Z.P.¹ Mason, J.S.²

5
- 0011242106
- Within class optimization of cepstra for speaker recognition
- Berlin, Germany, Sept.
- J. Thompson and J. S. Mason, "Within class optimization of cepstra for speaker recognition," in Proc. EUROSPEECH 93, vol. 1, Berlin, Germany, Sept. 1993, pp. 165-168.
- (1993) Proc. Eurospeech 93 , vol.1 , pp. 165-168
- Thompson, J.¹ Mason, J.S.²

6
- 0011188681
- Text-dependent speaker verification using recurrent time delay neural networks for feature extraction
- Beijing, China, Oct.
- X. Wang and G. Zhao, "Text-dependent speaker verification using recurrent time delay neural networks for feature extraction," in Proc. ICSP 93, vol. 1, Beijing, China, Oct. 1993, pp. 674-677.
- (1993) Proc. ICSP 93 , vol.1 , pp. 674-677
- Wang, X.¹ Zhao, G.²

7
- 0028517164
- RASTA processing of speech
- Oct.
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Processing, vol. 2, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

8
- 0011232176
- Analysis of acoustic features affecting speaker identification
- N. Higuchi and M. Hashimoto, "Analysis of acoustic features affecting speaker identification," in Proc. EUROSPEECH 95, vol. 1, 1995, pp. 435-438.
- (1995) Proc. Eurospeech 95 , vol.1 , pp. 435-438
- Higuchi, N.¹ Hashimoto, M.²

9
- 0029726518
- Fine structure features for speaker identification
- Atlanta, GA, May
- C. R. Jankowski, Jr., T. F. Quatieri, and D. A. Reynolds, "Fine structure features for speaker identification," in Proc. ICASSP 96 vol. 2, Atlanta, GA, May 1996, pp. 689-692.
- (1996) Proc. ICASSP 96 , vol.2 , pp. 689-692
- Jankowski C.R., Jr.¹ Quatieri, T.F.² Reynolds, D.A.³

10
- 0030371145
- Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network
- T. Kitamura and S. Takei, "Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Kitamura, T.¹ Takei, S.²

11
- 0029725529
- A general framework of feature extraction: Application to speaker recognition
- Atlanta, GA, May
- C.-S. Liu, "A general framework of feature extraction: Application to speaker recognition," in Proc. ICASSP 96, vol. 2, Atlanta, GA, May 1996, pp. 669-672.
- (1996) Proc. ICASSP 96 , vol.2 , pp. 669-672
- Liu, C.-S.¹

12
- 0003014045
- Subband approach for automatic speaker recognition: Optimal division of the frequency domain
- L. Besacier and J.-F. Bonastre, "Subband approach for automatic speaker recognition: Optimal division of the frequency domain," in Proc. Workshop Audio and Video Biometric Person Authentification, Craus-Montana, Switzerland, 1997, pp. 195-202.
- Proc. Workshop Audio and Video Biometric Person Authentification, Craus-Montana, Switzerland, 1997 , pp. 195-202
- Besacier, L.¹ Bonastre, J.-F.²

13
- 0019583902
- Comparison of speaker recognition methods using static features and dynamic features
- June
- S. Furui, "Comparison of speaker recognition methods using static features and dynamic features," IEEE Trans. Acoust., Speech, Signal Processing, vol. 29, no. 3, pp. 342-350, June 1981.
- (1981) IEEE Trans. Acoust., Speech, Signal Processing , vol.29 , Issue.3 , pp. 342-350
- Furui, S.¹

14
- 0024035182
- On the use of instantaneous and transitional spectral information in speaker recognition
- June
- F. K. Soong and A. E. Rosenberg, "On the use of instantaneous and transitional spectral information in speaker recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 36, pp. 871-879, June 1988.
- (1988) IEEE Trans. Acoust., Speech, Signal Processing , vol.36 , pp. 871-879
- Soong, F.K.¹ Rosenberg, A.E.²

15
- 0011242107
- Utilization de la prédiction linéaire en reconnaissance et adaptation au locuteur
- Strasbourg, France, May
- Y. Grenier, "Utilization de la prédiction linéaire en reconnaissance et adaptation au locuteur," in XIèmes Journées d'Etude sur la Parole, Strasbourg, France, May 1980, pp. 163-171.
- (1980) XIèmes Journées d'Etude sur la Parole , pp. 163-171
- Grenier, Y.¹

16
- 84953669587
- Standard and target-driven AR-vector models for speech analysis and speaker recognition
- San Francisco, CA, Mar.
- F. Bimbot, L. Mathan, A. De Lima, and G. Chollet, "Standard and target-driven AR-vector models for speech analysis and speaker recognition," in Proc. ICASSP 92, vol. 2, San Francisco, CA, Mar. 1992, p. II.5-II.8.
- (1992) Proc. ICASSP 92 , vol.2
- Bimbot, F.¹ Mathan, L.² De Lima, A.³ Chollet, G.⁴

17
- 58049084980
- Cinematic techniques for speech processing: Temporal decomposition and multivariate linear prediction
- San Francisco, CA, Mar.
- C. Montacié, P. Deléglise, F. Bimbot, and M.-J. Caraty, "Cinematic techniques for speech processing: Temporal decomposition and multivariate linear prediction," in Proc. ICASSP 92, vol. 1, San Francisco, CA, Mar. 1992, pp. 153-156.
- (1992) Proc. ICASSP 92 , vol.1 , pp. 153-156
- Montacié, C.¹ Deléglise, P.² Bimbot, F.³ Caraty, M.-J.⁴

18
- 85135133863
- AR-vector models for free-text speaker recognition
- Banff, AB, Canada, Oct.
- C. Montacié and J.-L. LeFloch, "AR-vector models for free-text speaker recognition," in Proc. ICSLP 92, vol. 1, Banff, AB, Canada, Oct. 1992, pp. 611-614.
- (1992) Proc. ICSLP 92 , vol.1 , pp. 611-614
- Montacié, C.¹ Lefloch, J.-L.²

19
- 0011226706
- Approches statistiques et filtrage vectoriel de trajectoires spectrales pour l'identification du locuteur indépendante du texte
- Ph.D. dissertation, École Nat, Supérieure Télécommun., Jan.
- I. Magrin-Chagnolleau, "Approches statistiques et filtrage vectoriel de trajectoires spectrales pour l'identification du locuteur indépendante du texte," Ph.D. dissertation, École Nat, Supérieure Télécommun., Jan. 1997.
- (1997)
- Magrin-Chagnolleau, I.¹

20
- 84928752996
- Time-frequency principal components of speech: Application to speaker identification
- I. Magrin-Chagnolleau and G. Durou, "Time-frequency principal components of speech: Application to speaker identification," in Proc. EUROSPEECH 99, Budapest, Hungary, Sept. 1999, pp. 759-762.
- Proc. Eurospeech 99, Budapest, Hungary, Sept. 1999 , pp. 759-762
- Magrin-Chagnolleau, I.¹ Durou, G.²

21
- 33750899716
- Sous-espaces de projection de séquences de trames acoustiques pour l'analyze et la reconnaissance de parole
- Avignon, France
- F. Bimbot, E. Bocchieri, and B. Atal, "Sous-espaces de projection de séquences de trames acoustiques pour l'analyze et la reconnaissance de parole," in XXIèmes Journées d'Etude sur la Parole, Avignon, France, 1996.
- (1996) XXIèmes Journées d'Etude sur la Parole
- Bimbot, F.¹ Bocchieri, E.² Atal, B.³

22
- 0030374905
- Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum
- J.-L. Shen, W.-L. Wang, and L.-S. Lee, "Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Shen, J.-L.¹ Wang, W.-L.² Lee, L.-S.³

23
- 0030371121
- Frequency and time filtering of filter-bank energies for HMM speech recognition
- C. Nadeu, J. B. Marino, J. Hernando, and A. Nogueiras, "Frequency and time filtering of filter-bank energies for HMM speech recognition," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Nadeu, C.¹ Marino, J.B.² Hernando, J.³ Nogueiras, A.⁴

24
- 0030372606
- Noise robust estimate of speech dynamics for speaker recognition
- J. P. Openshaw and J. S. Mason, "Noise robust estimate of speech dynamics for speaker recognition," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Openshaw, J.P.¹ Mason, J.S.²

25
- 0030355950
- Sub-band adaptive filtering applied to speech enhancement
- D. J. Darlington and D. R. Campbell, "Sub-band adaptive filtering applied to speech enhancement," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Darlington, D.J.¹ Campbell, D.R.²

26
- 0001692019
- An analysis of cepstral-time matrices for noise and channel robust speech recognition
- B. P. Milner and S. V. Vaseghi, "An analysis of cepstral-time matrices for noise and channel robust speech recognition," in Proc. EUROSPEECH 95, vol. 1, 1995, pp. 519-522.
- (1995) Proc. Eurospeech 95 , vol.1 , pp. 519-522
- Milner, B.P.¹ Vaseghi, S.V.²

27
- 0030369274
- Inclusion of temporal information into features for speech recognition
- B. Milner, "Inclusion of temporal information into features for speech recognition," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Milner, B.¹

28
- 0030376355
- Dynamic features for segmental speech recognition
- N. Harte, S. Vaseghi, and B. Milner, "Dynamic features for segmental speech recognition," in Proc. ICSLP 96, 1996.
- Proc. ICSLP 96, 1996
- Harte, N.¹ Vaseghi, S.² Milner, B.³

29
- 0003946510
- Berlin, Germany: Springer-Verlag
- I. T. Jolliffe, Principal Component Analysis. Berlin, Germany: Springer-Verlag, 1986.
- (1986) Principal Component Analysis
- Jolliffe, I.T.¹

30
- 0003543769
- Paris, France: Éditions Technip
- G. Saporta, Probabilités, analyse des données et statistique. Paris, France: Éditions Technip, 1990.
- (1990) Probabilités, Analyse des Données et Statistique
- Saporta, G.¹

31
- 84953683778
- Efficient acoustic parameters for speaker recognition
- J. J. Wolf, "Efficient acoustic parameters for speaker recognition," J. Acoust. Soc. Amer., pt. 2, vol. 51, no. 6, pp. 2044-2056, 1972.
- (1972) J. Acoust. Soc. Amer. , vol.51 , Issue.6 PART 2 , pp. 2044-2056
- Wolf, J.J.¹

32
- 0003686431
- London, U.K.: Griffin
- S. M. Kendall and A. Stuart, The Advanced Theory of Statistics. London, U.K.: Griffin, 1977.
- (1977) The Advanced Theory of Statistics
- Kendall, S.M.¹ Stuart, A.²

33
- 0016494495
- Selection of acoustic features for speaker identification
- Apr.
- M. R. Sambur, "Selection of acoustic features for speaker identification," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, pp. 176-182, Apr. 1975.
- (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , pp. 176-182
- Sambur, M.R.¹

34
- 0003747605
- New York: Wiley
- D. M. Titterington, A. F. M. Smith, and U. E. Makov, Statistical Analysis of Finite Mixture Distributions. New York: Wiley, 1985.
- (1985) Statistical Analysis of Finite Mixture Distributions
- Titterington, D.M.¹ Smith, A.F.M.² Makov, U.E.³

35
- 0003891734
- London, U.K.: Marcel Dekker
- G. J. McLachlan and K. E. Basford, Mixture Models: Inference and Applications to Clustering. London, U.K.: Marcel Dekker, 1988.
- (1988) Mixture Models: Inference and Applications to Clustering
- McLachlan, G.J.¹ Basford, K.E.²

36
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan.
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Processing, vol. 3, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

37
- 84866588704
- Eur. COST 250 Action. [Online]
- (1998) Speaker Recognition in Telephony. Eur. COST 250 Action. [Online]. Available: http://circhp.epfl.ch/polycost.
- (1998) Speaker Recognition in Telephony

38
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 6 no. 39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.6 , Issue.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

39
- 0018918171
- An algorithm for vector quantization design
- Jan.
- Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantization design," IEEE Trans. Commun., vol. 28, pp. 84-95, Jan. 1980.
- (1980) IEEE Trans. Commun. , vol.28 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

40
- 0004285644
- New York: Wiley
- T. H. Wonnacott and R. J. Wonnacott, Introductory Statistics for Business and Economics. New York: Wiley, 1990.
- (1990) Introductory Statistics for Business and Economics
- Wonnacott, T.H.¹ Wonnacott, R.J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.