메뉴 건너뛰기




Volumn 35, Issue 3, 2005, Pages 301-314

On the use of different speech representations for speaker modeling

Author keywords

Different speech representations; Expectation maximazation (EM) algorithm; Generalized Gaussian mixture model (GGMM); KING speech corpus; Soft competition; Speaker modeling; Speaker recognition; Speaker specific information

Indexed keywords

LEARNING ALGORITHMS; MATHEMATICAL MODELS; PARAMETER ESTIMATION; PROBABILITY DISTRIBUTIONS; STATISTICAL METHODS;

EID: 23944498183     PISSN: 10946977     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCC.2005.848166     Document Type: Article
Times cited : (22)

References (44)
  • 1
    • 0015476226 scopus 로고
    • "Automatic speaker recognition based on pitch contours"
    • B. S. Atal, "Automatic speaker recognition based on pitch contours," J. Acoust. Soc. Amer., vol. 52, no. 6, pp. 1687-1697, 1972.
    • (1972) J. Acoust. Soc. Amer. , vol.52 , Issue.6 , pp. 1687-1697
    • Atal, B.S.1
  • 3
    • 0033316261 scopus 로고    scopus 로고
    • "Adaptive weighting of pattern features during learning"
    • Y. Bennani, "Adaptive weighting of pattern features during learning," in Proc. Int. Joint Conf. Neural Networks, 1999, pp. 3008-3013.
    • (1999) Proc. Int. Joint Conf. Neural Networks , pp. 3008-3013
    • Bennani, Y.1
  • 5
    • 0031233424 scopus 로고    scopus 로고
    • "Speaker recognition: A tutorial"
    • J. P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, 1997.
    • (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 8
    • 0032066455 scopus 로고    scopus 로고
    • "A connectionist method for pattern classification with diverse features"
    • K. Chen, "A connectionist method for pattern classification with diverse features," Pattern Recognit. Lett., vol. 19, no. 7, pp. 545-558, 1998.
    • (1998) Pattern Recognit. Lett. , vol.19 , Issue.7 , pp. 545-558
    • Chen, K.1
  • 9
    • 0033253694 scopus 로고    scopus 로고
    • "A modular neural network architecture for pattern classification based on different feature sets"
    • K. Chen and H. Chi, "A modular neural network architecture for pattern classification based on different feature sets," Int. J. Neural Syst., vol. 9, no. 6, pp. 563-581, 1999.
    • (1999) Int. J. Neural Syst. , vol.9 , Issue.6 , pp. 563-581
    • Chen, K.1    Chi, H.2
  • 10
    • 0000291808 scopus 로고    scopus 로고
    • "Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification"
    • K. Chen, L. Wang, and H. Chi, "Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification," Int. J. Pattern Recognit. Artific. Intell., vol. 11, no. 3, pp. 417-445, 1997.
    • (1997) Int. J. Pattern Recognit. Artific. Intell. , vol.11 , Issue.3 , pp. 417-445
    • Chen, K.1    Wang, L.2    Chi, H.3
  • 11
    • 0344497799 scopus 로고
    • "Speaker identification based on hierarchical mixture of experts"
    • Washington, DC
    • K. Chen, D. Xie, and H. Chi, "Speaker identification based on hierarchical mixture of experts," in Proc. World Cong. Neural Networks, Washington, DC, 1995, pp. 1493-1496.
    • (1995) Proc. World Cong. Neural Networks , pp. 1493-1496
    • Chen, K.1    Xie, D.2    Chi, H.3
  • 12
    • 0030244499 scopus 로고    scopus 로고
    • "A modified HME architecture for text-dependent speaker identification"
    • Sep. 6
    • K. Chen, D. Xie, and H. Chi, "A modified HME architecture for text-dependent speaker identification," IEEE Trans. Neural Netw., vol. 7, no. 5, pp: 1309-1313, Sep. 1996.
    • (1996) IEEE Trans. Neural Netw. , vol.7 , Issue.5 , pp. 1309-1313
    • Chen, K.1    Xie, D.2    Chi, H.3
  • 13
    • 0030157020 scopus 로고    scopus 로고
    • "Text-dependent speaker identification based upon input/output HMMs: An empirical study"
    • K. Chen, D. Xie, and H. Chi, "Text-dependent speaker identification based upon input/output HMMs: An empirical study," in Neural Proc. Lett., vol. 3, 1996, pp. 81-89.
    • (1996) Neural Proc. Lett. , vol.3 , pp. 81-89
    • Chen, K.1    Xie, D.2    Chi, H.3
  • 14
    • 0030093848 scopus 로고    scopus 로고
    • "Speaker identification using time-delay HMEs"
    • K. Chen, D. Xie, and H. Chi, "Speaker identification using time-delay HMEs," Int. J. Neural Syst., vol. 7, no. 1, pp. 29-43, 1996.
    • (1996) Int. J. Neural Syst. , vol.7 , Issue.1 , pp. 29-43
    • Chen, K.1    Xie, D.2    Chi, H.3
  • 15
    • 0032876594 scopus 로고    scopus 로고
    • "Improved learning algorithm for mixture of experts in multiclass classification"
    • K. Chen, L. Xu, and H. Chi, "Improved learning algorithm for mixture of experts in multiclass classification," Neural Netw., vol. 12, no. 9, pp. 1229-1252, 1999.
    • (1999) Neural Netw. , vol.12 , Issue.9 , pp. 1229-1252
    • Chen, K.1    Xu, L.2    Chi, H.3
  • 16
    • 0002629270 scopus 로고    scopus 로고
    • "Maximum likelihood from incomplete data via the EM algorithm"
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Roy. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1997) J. Roy. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 17
    • 0028748949 scopus 로고
    • "Growing cell structures: A self-organizing network for unsupervised and supervised learning"
    • B. Fritzke, "Growing cell structures: A self-organizing network for unsupervised and supervised learning," Neural Netw., vol. 7, no. 9, pp. 1441-1660, 1994.
    • (1994) Neural Netw. , vol.7 , Issue.9 , pp. 1441-1660
    • Fritzke, B.1
  • 18
    • 0031223555 scopus 로고    scopus 로고
    • "Recent advances in speaker identification"
    • S. Furui, "Recent advances in speaker identification," Pattern Recognit. Lett., vol. 18, no. 9, pp. 859-872, 1997.
    • (1997) Pattern Recognit. Lett. , vol.18 , Issue.9 , pp. 859-872
    • Furui, S.1
  • 19
    • 0032495298 scopus 로고    scopus 로고
    • "Speaker identification through use of features selected using genetic algorithm"
    • A. Haydar, M. Demirekler, and M. K. Yurtseven, "Speaker identification through use of features selected using genetic algorithm," Electron. Lett., vol. 34, no. 1, pp. 39-40, 1998.
    • (1998) Electron. Lett. , vol.34 , Issue.1 , pp. 39-40
    • Haydar, A.1    Demirekler, M.2    Yurtseven, M.K.3
  • 20
    • 0036544002 scopus 로고    scopus 로고
    • "Robust speech features based on wavelet transform with application to speaker identification"
    • C. T. Hsieh, E. Lai, and Y. C. Wang, "Robust speech features based on wavelet transform with application to speaker identification," in Proc. Inst. Elect. Eng. Vis., Image, Signal Process., vol. 149, 2002, pp. 108-114.
    • (2002) Proc. Inst. Elect. Eng. Vis., Image, Signal Process. , vol.149 , pp. 108-114
    • Hsieh, C.T.1    Lai, E.2    Wang, Y.C.3
  • 31
    • 0027632248 scopus 로고
    • "Neural-gas' network for vector quantization and its application to time-series prediction"
    • Jul.
    • T. Martinetz, S. Berkovich, and K. Schulten, "Neural-gas' network for vector quantization and its application to time-series prediction," IEEE Trans. Neural Netw., vol. 4, no. 4, pp. 558-569, Jul. 1993.
    • (1993) IEEE Trans. Neural Netw. , vol.4 , Issue.4 , pp. 558-569
    • Martinetz, T.1    Berkovich, S.2    Schulten, K.3
  • 33
    • 0011904253 scopus 로고
    • "A comparison of composite features, under degraded speech in speaker recognition"
    • M. Pandit and J. Kittler, "A comparison of composite features, under degraded speech in speaker recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1993, pp. 371-374.
    • (1993) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 371-374
    • Pandit, M.1    Kittler, J.2
  • 35
    • 0036887596 scopus 로고    scopus 로고
    • "Speaker recognition - General classifier approaches and data fusion methods"
    • R. P. Ramachandran, K. R. Farrell, R. Ramachandran, and R. J., "Speaker recognition - General classifier approaches and data fusion methods," Pattern Recognit., vol. 35, no. 12, pp. 2801-2821, 2002.
    • (2002) Pattern Recognit. , vol.35 , Issue.12 , pp. 2801-2821
    • Ramachandran, R.P.1    Farrell, K.R.2    Ramachandran, R.3    J, R.4
  • 36
    • 0003988385 scopus 로고
    • "A Gaussian mixture modeling approach to text-independent speaker identification"
    • Ph.D. dissertation, Elect. Eng., Georgia Inst. Technol., Atlanta
    • D. A. Reynolds, "A Gaussian mixture modeling approach to text-independent speaker identification," Ph.D. dissertation, Elect. Eng., Georgia Inst. Technol., Atlanta, 1992.
    • (1992)
    • Reynolds, D.A.1
  • 38
    • 0028515984 scopus 로고
    • "Experimental evaluation of features for robust speaker identification"
    • Oct.
    • D. A. Reynolds, "Experimental evaluation of features for robust speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639-643, Oct., 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 639-643
    • Reynolds, D.A.1
  • 39
    • 0029209272 scopus 로고
    • "Robust text-independent speaker identification using Gaussian mixture models"
    • Jan.
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan., 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 40
    • 0001941052 scopus 로고
    • "Recent research in automatic speaker recognition"
    • S. Furai and M.M. Sondhi, Eds. Norwell, MA,: Kluwer
    • A. E. Rosenberg and F. Soong, "Recent research in automatic speaker recognition," in Advances in Speech, Signal Processing, S. Furai and M. M. Sondhi, Eds. Norwell, MA,: Kluwer, 1992, pp. 701-738.
    • (1992) Advances in Speech, Signal Processing , pp. 701-738
    • Rosenberg, A.E.1    Soong, F.2
  • 42
    • 0024035182 scopus 로고
    • "On the use of instantaneous and transitional spectral information in speaker recognition"
    • Jun.
    • F. Soong and A. E. Rosenberg, "On the use of instantaneous and transitional spectral information in speaker recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 6, pp. 871-879, Jun. 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.6 , pp. 871-879
    • Soong, F.1    Rosenberg, A.E.2
  • 43
    • 0036505591 scopus 로고    scopus 로고
    • "Capture interspeaker information with a neural network for speaker identification"
    • Mar.
    • L. Wang, K. Chen, and H. Chi, "Capture interspeaker information with a neural network for speaker identification," IEEE Trans. Neural Netw., vol. 13, no. 2, pp. 436-445, Mar. 2002.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.2 , pp. 436-445
    • Wang, L.1    Chen, K.2    Chi, H.3
  • 44
    • 85008028997 scopus 로고    scopus 로고
    • "Errata to 'A modified HME architecture for text-dependent speaker identification'"
    • Mar., for errata see
    • K. Chen, D. Xie, and H. Chi, "Errata to 'A modified HME architecture for text-dependent speaker identification'," IEEE Trans. Neural Netw., vol 8, no. 2, p. 455, Mar., 1997. for errata see.
    • (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.2 , pp. 455
    • Chen, K.1    Xie, D.2    Chi, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.