SCOPUS Information Search Platform

IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews

Volume 35, Issue 3, 2005, Pages 301-314

On the use of different speech representations for speaker modeling

a UNIVERSITY OF MANCHESTER (United Kingdom)

Author keywords

Different speech representations; Expectation maximization (EM) algorithm; Generalized Gaussian mixture model (GGMM); KING speech corpus; Soft competition; Speaker modeling; Speaker recognition; Speaker specific information

Indexed keywords

LEARNING ALGORITHMS; MATHEMATICAL MODELS; PARAMETER ESTIMATION; PROBABILITY DISTRIBUTIONS; STATISTICAL METHODS;

DIFFERENT SPEECH REPRESENTATIONS; EXPECTATION-MAXIMIZATION (EM) ALGORITHM; GENERALIZED GAUSSIAN MIXTURE MODEL (GGMM); SOFT COMPETITION; SPEAKER RECOGNITION;

SPEECH RECOGNITION;

EID: 23944498183 PISSN: 10946977 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCC.2005.848166 Document Type: Article

Times cited : (22)

References (44)

1
- 0015476226
- "Automatic speaker recognition based on pitch contours"
- B. S. Atal, "Automatic speaker recognition based on pitch contours," J. Acoust. Soc. Amer., vol. 52, no. 6, pp. 1687-1697, 1972.
- (1972) J. Acoust. Soc. Amer. , vol.52 , Issue.6 , pp. 1687-1697
- Atal, B.S.¹

2
- 0023671793
- "A TMS32020-based real time, text-independent, automatic speaker verification system"
- J. Attili, M. Savic, and J. Campbell, "A TMS32020-based real time, text-independent, automatic speaker verification system," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1988, pp. 599-602.
- (1988) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 599-602
- Attili, J.¹ Savic, M.² Campbell, J.³

3
- 0033316261
- "Adaptive weighting of pattern features during learning"
- Y. Bennani, "Adaptive weighting of pattern features during learning," in Proc. Int. Joint Conf. Neural Networks, 1999, pp. 3008-3013.
- (1999) Proc. Int. Joint Conf. Neural Networks , pp. 3008-3013
- Bennani, Y.¹

4
- 84892142432
- "Frame pruning for speaker recognition"
- L. Besacier and J. F. Bonastre, "Frame pruning for speaker recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1998, pp. 765-768.
- (1998) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 765-768
- Besacier, L.¹ Bonastre, J.F.²

5
- 0031233424
- "Speaker recognition: A tutorial"
- J. P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, 1997.
- (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
- Campbell, J.P.¹

6
- 85073330884
- "Corpus design for speaker recognition"
- Martigny, Switzerland
- A. D. Carlo, M. Falcone, and A. Paoloni, "Corpus design for speaker recognition," in Proc. ESCA Workshop on Auto. Speaker Recog. Identifi. Verification, Martigny, Switzerland, 1994, pp. 47-50.
- (1994) Proc. ESCA Workshop on Auto. Speaker Recog. Identifi. Verification , pp. 47-50
- Carlo, A.D.¹ Falcone, M.² Paoloni, A.³

7
- 0029762782
- "Cohort selection and word grammar effects for speaker recognition"
- J. Colombi, D. Ruck, S. Rogers, M. Oxley, and T. Anderson, "Cohort selection and word grammar effects for speaker recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1996, pp. 85-88.
- (1996) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 85-88
- Colombi, J.¹ Ruck, D.² Rogers, S.³ Oxley, M.⁴ Anderson, T.⁵

8
- 0032066455
- "A connectionist method for pattern classification with diverse features"
- K. Chen, "A connectionist method for pattern classification with diverse features," Pattern Recognit. Lett., vol. 19, no. 7, pp. 545-558, 1998.
- (1998) Pattern Recognit. Lett. , vol.19 , Issue.7 , pp. 545-558
- Chen, K.¹

9
- 0033253694
- "A modular neural network architecture for pattern classification based on different feature sets"
- K. Chen and H. Chi, "A modular neural network architecture for pattern classification based on different feature sets," Int. J. Neural Syst., vol. 9, no. 6, pp. 563-581, 1999.
- (1999) Int. J. Neural Syst. , vol.9 , Issue.6 , pp. 563-581
- Chen, K.¹ Chi, H.²

10
- 0000291808
- "Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification"
- K. Chen, L. Wang, and H. Chi, "Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification," Int. J. Pattern Recognit. Artific. Intell., vol. 11, no. 3, pp. 417-445, 1997.
- (1997) Int. J. Pattern Recognit. Artific. Intell. , vol.11 , Issue.3 , pp. 417-445
- Chen, K.¹ Wang, L.² Chi, H.³

11
- 0344497799
- "Speaker identification based on hierarchical mixture of experts"
- Washington, DC
- K. Chen, D. Xie, and H. Chi, "Speaker identification based on hierarchical mixture of experts," in Proc. World Cong. Neural Networks, Washington, DC, 1995, pp. 1493-1496.
- (1995) Proc. World Cong. Neural Networks , pp. 1493-1496
- Chen, K.¹ Xie, D.² Chi, H.³

12
- 0030244499
- "A modified HME architecture for text-dependent speaker identification"
- Sep. 6
- K. Chen, D. Xie, and H. Chi, "A modified HME architecture for text-dependent speaker identification," IEEE Trans. Neural Netw., vol. 7, no. 5, pp. 1309-1313, Sep. 1996.
- (1996) IEEE Trans. Neural Netw. , vol.7 , Issue.5 , pp. 1309-1313
- Chen, K.¹ Xie, D.² Chi, H.³

13
- 0030157020
- "Text-dependent speaker identification based upon input/output HMMs: An empirical study"
- K. Chen, D. Xie, and H. Chi, "Text-dependent speaker identification based upon input/output HMMs: An empirical study," in Neural Proc. Lett., vol. 3, 1996, pp. 81-89.
- (1996) Neural Proc. Lett. , vol.3 , pp. 81-89
- Chen, K.¹ Xie, D.² Chi, H.³

14
- 0030093848
- "Speaker identification using time-delay HMEs"
- K. Chen, D. Xie, and H. Chi, "Speaker identification using time-delay HMEs," Int. J. Neural Syst., vol. 7, no. 1, pp. 29-43, 1996.
- (1996) Int. J. Neural Syst. , vol.7 , Issue.1 , pp. 29-43
- Chen, K.¹ Xie, D.² Chi, H.³

15
- 0032876594
- "Improved learning algorithm for mixture of experts in multiclass classification"
- K. Chen, L. Xu, and H. Chi, "Improved learning algorithm for mixture of experts in multiclass classification," Neural Netw., vol. 12, no. 9, pp. 1229-1252, 1999.
- (1999) Neural Netw. , vol.12 , Issue.9 , pp. 1229-1252
- Chen, K.¹ Xu, L.² Chi, H.³

16
- 0002629270
- "Maximum likelihood from incomplete data via the EM algorithm"
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Roy. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. Roy. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

17
- 0028748949
- "Growing cell structures: A self-organizing network for unsupervised and supervised learning"
- B. Fritzke, "Growing cell structures: A self-organizing network for unsupervised and supervised learning," Neural Netw., vol. 7, no. 9, pp. 1441-1460, 1994.
- (1994) Neural Netw. , vol.7 , Issue.9 , pp. 1441-1460
- Fritzke, B.¹

18
- 0031223555
- "Recent advances in speaker identification"
- S. Furui, "Recent advances in speaker identification," Pattern Recognit. Lett., vol. 18, no. 9, pp. 859-872, 1997.
- (1997) Pattern Recognit. Lett. , vol.18 , Issue.9 , pp. 859-872
- Furui, S.¹

19
- 0032495298
- "Speaker identification through use of features selected using genetic algorithm"
- A. Haydar, M. Demirekler, and M. K. Yurtseven, "Speaker identification through use of features selected using genetic algorithm," Electron. Lett., vol. 34, no. 1, pp. 39-40, 1998.
- (1998) Electron. Lett. , vol.34 , Issue.1 , pp. 39-40
- Haydar, A.¹ Demirekler, M.² Yurtseven, M.K.³

20
- 0036544002
- "Robust speech features based on wavelet transform with application to speaker identification"
- C. T. Hsieh, E. Lai, and Y. C. Wang, "Robust speech features based on wavelet transform with application to speaker identification," in Proc. Inst. Elect. Eng. Vis., Image, Signal Process., vol. 149, 2002, pp. 108-114.
- (2002) Proc. Inst. Elect. Eng. Vis., Image, Signal Process. , vol.149 , pp. 108-114
- Hsieh, C.T.¹ Lai, E.² Wang, Y.C.³

21
- 0004056285
- New York: Wiley
- X. D. Huang, A. Acero, and H. W. Hon, Spoken Language Processing. New York: Wiley, 2000.
- (2000) Spoken Language Processing
- Huang, X.D.¹ Acero, A.² Hon, H.W.³

22
- 0030643811
- "The use of harmonic features for speaker recognition"
- B. Imperl, Z. Kacic, and B. Horvat, "The use of harmonic features for speaker recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1997, pp. 1131-1134.
- (1997) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 1131-1134
- Imperl, B.¹ Kacic, Z.² Horvat, B.³

23
- 0001940458
- "Adaptive mixtures of local experts"
- R. A. Jacobs, M. I. Jordan, S. J. Nowlan, and G. E. Hinton, "Adaptive mixtures of local experts," Neural Comput., vol. 3, no. 1, pp. 79-87, 1991.
- (1991) Neural Comput. , vol.3 , Issue.1 , pp. 79-87
- Jacobs, R.A.¹ Jordan, M.I.² Nowlan, S.J.³ Hinton, G.E.⁴

24
- 0034856454
- "Learning statistically efficient features for speaker recognition"
- G. J. Jang, T. W. Lee, and Y. H. Oh, "Learning statistically efficient features for speaker recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, 2001, pp. 437-440.
- (2001) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , pp. 437-440
- Jang, G.J.¹ Lee, T.W.² Oh, Y.H.³

25
- 0028728326
- "Formants AM-FM for speaker identification"
- C. R. Jankowski, T. F. Quatieri, and D. A. Reynolds, "Formants AM-FM for speaker identification," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, 1994, pp. 608-611.
- (1994) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , pp. 608-611
- Jankowski, C.R.¹ Quatieri, T.F.² Reynolds, D.A.³

26
- 0029726518
- "Fine structure features for speaker identification"
- C. R. Jankowski, T. F. Quatieri, and D. A. Reynolds, "Fine structure features for speaker identification," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1996, pp. 689-612.
- (1996) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 612-689
- Jankowski, C.R.¹ Quatieri, T.F.² Reynolds, D.A.³

27
- 0032021555
- "On combining classifiers"
- Feb.
- J. Kittler, M. Hatef, R. Duin, and J. Matas, "On combining classifiers," IEEE Trans. Patt. Anal. Mach. Intell., vol. 20, no. 2, pp. 226-239, Feb. 1998.
- (1998) IEEE Trans. Patt. Anal. Mach. Intell. , vol.20 , Issue.2 , pp. 226-239
- Kittler, J.¹ Hatef, M.² Duin, R.³ Matas, J.⁴

28
- 0020594710
- "Text-independent speaker identification with short utterances"
- K. P. Li and E. H. Wrench, "Text-independent speaker identification with short utterances," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1983, pp. 555-558.
- (1983) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 555-558
- Li, K.P.¹ Wrench, E.H.²

29
- 0030247355
- "Robust speaker recognition: A feature-based approach"
- Sept.
- R. J. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition: A feature-based approach," IEEE Signal Process. Mag., pp. 56-71, Sept. 1996.
- (1996) IEEE Signal Process. Mag. , pp. 56-71
- Mammone, R.J.¹ Zhang, X.² Ramachandran, R.P.³

30
- 0004066260
- New York: Wiley
- G. McLachlan and D. Peel, Finite Mixture Models. New York: Wiley, 2000.
- (2000) Finite Mixture Models
- McLachlan, G.¹ Peel, D.²

31
- 0027632248
- "'Neural-gas' network for vector quantization and its application to time-series prediction"
- Jul.
- T. Martinetz, S. Berkovich, and K. Schulten, "'Neural-gas' network for vector quantization and its application to time-series prediction," IEEE Trans. Neural Netw., vol. 4, no. 4, pp. 558-569, Jul. 1993.
- (1993) IEEE Trans. Neural Netw. , vol.4 , Issue.4 , pp. 558-569
- Martinetz, T.¹ Berkovich, S.² Schulten, K.³

32
- 0003663926
- London, U.K.: Chapman & Hall
- P. McCullagh and J. A. Nelder, Generalized Linear Models. London, U.K.: Chapman & Hall, 1983.
- (1983) Generalized Linear Models
- McCullagh, P.¹ Nelder, J.A.²

33
- 0011904253
- "A comparison of composite features, under degraded speech in speaker recognition"
- M. Pandit and J. Kittler, "A comparison of composite features, under degraded speech in speaker recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1993, pp. 371-374.
- (1993) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 371-374
- Pandit, M.¹ Kittler, J.²

34
- 1542491549
- "Feature selection for a DTW-based speaker verification"
- M. Pandit and J. Kittler, "Feature selection for a DTW-based speaker verification," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1998, pp. 769-772.
- (1998) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 769-772
- Pandit, M.¹ Kittler, J.²

35
- 0036887596
- "Speaker recognition - General classifier approaches and data fusion methods"
- R. P. Ramachandran, K. R. Farrell, R. Ramachandran, and R. J., "Speaker recognition - General classifier approaches and data fusion methods," Pattern Recognit., vol. 35, no. 12, pp. 2801-2821, 2002.
- (2002) Pattern Recognit. , vol.35 , Issue.12 , pp. 2801-2821
- Ramachandran, R.P.¹ Farrell, K.R.² Ramachandran, R.³ J, R.⁴

36
- 0003988385
- "A Gaussian mixture modeling approach to text-independent speaker identification"
- Ph.D. dissertation, Elect. Eng., Georgia Inst. Technol., Atlanta
- D. A. Reynolds, "A Gaussian mixture modeling approach to text-independent speaker identification," Ph.D. dissertation, Elect. Eng., Georgia Inst. Technol., Atlanta, 1992.
- (1992)
- Reynolds, D.A.¹

37
- 85073368368
- "Speaker identification and verification using Gaussian mixture models"
- D. A. Reynolds, "Speaker identification and verification using Gaussian mixture models," in Proc. ESCA Workshop on Automatic Speaker Recog., Identifi. Verification, 1994, pp. 27-30.
- (1994) Proc. ESCA Workshop on Automatic Speaker Recog., Identifi. Verification , pp. 27-30
- Reynolds, D.A.¹

38
- 0028515984
- "Experimental evaluation of features for robust speaker identification"
- Oct.
- D. A. Reynolds, "Experimental evaluation of features for robust speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639-643, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 639-643
- Reynolds, D.A.¹

39
- 0029209272
- "Robust text-independent speaker identification using Gaussian mixture models"
- Jan.
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

40
- 0001941052
- "Recent research in automatic speaker recognition"
- S. Furui and M. M. Sondhi, Eds. Norwell, MA: Kluwer
- A. E. Rosenberg and F. Soong, "Recent research in automatic speaker recognition," in Advances in Speech, Signal Processing, S. Furui and M. M. Sondhi, Eds. Norwell, MA: Kluwer, 1992, pp. 701-738.
- (1992) Advances in Speech, Signal Processing , pp. 701-738
- Rosenberg, A.E.¹ Soong, F.²

41
- 0035746757
- "Joint cohort normalization in a multi-feature speaker verification system"
- C. Sanderson and K. K. Paliwal, "Joint cohort normalization in a multi-feature speaker verification system," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 2001, pp. 232-235.
- (2001) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 232-235
- Sanderson, C.¹ Paliwal, K.K.²

42
- 0024035182
- "On the use of instantaneous and transitional spectral information in speaker recognition"
- Jun.
- F. Soong and A. E. Rosenberg, "On the use of instantaneous and transitional spectral information in speaker recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 6, pp. 871-879, Jun. 1988.
- (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.6 , pp. 871-879
- Soong, F.¹ Rosenberg, A.E.²

43
- 0036505591
- "Capture interspeaker information with a neural network for speaker identification"
- Mar.
- L. Wang, K. Chen, and H. Chi, "Capture interspeaker information with a neural network for speaker identification," IEEE Trans. Neural Netw., vol. 13, no. 2, pp. 436-445, Mar. 2002.
- (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.2 , pp. 436-445
- Wang, L.¹ Chen, K.² Chi, H.³

44
- 85008028997
- "Errata to 'A modified HME architecture for text-dependent speaker identification'"
- Mar.
- K. Chen, D. Xie, and H. Chi, "Errata to 'A modified HME architecture for text-dependent speaker identification'," IEEE Trans. Neural Netw., vol. 8, no. 2, p. 455, Mar. 1997.
- (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.2 , p. 455
- Chen, K.¹ Xie, D.² Chi, H.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.