메뉴 건너뛰기




Volumn 3, Issue 1, 1995, Pages 72-83

Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC PROPERTIES; ALGORITHMS; COMMUNICATION CHANNELS (INFORMATION THEORY); DATABASE SYSTEMS; FUNCTIONS; MATHEMATICAL MODELS; PERFORMANCE; TELEPHONE SYSTEMS; VECTOR QUANTIZATION;

EID: 0029209272     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.365379     Document Type: Article
Times cited : (2458)

References (42)
  • 1
    • 0022186569 scopus 로고
    • Investigation of text-independent speaker identification over telephone channels
    • H. Gish et al, “Investigation of text-independent speaker identification over telephone channels,” in Proc. IEEE ICASSP, 1985, pp. 379-382.
    • (1985) Proc. IEEE ICASSP , pp. 379-382
    • Gish, H.1
  • 2
    • 0022229052 scopus 로고
    • A vector quantization approach to speaker recognition
    • F. Soong et al, “A vector quantization approach to speaker recognition,” in Proc. IEEE ICASSP, 1985, pp. 387-390.
    • (1985) Proc. IEEE ICASSP , pp. 387-390
    • Soong, F.1
  • 3
    • 0026396338 scopus 로고
    • Radial basis function networks for speaker recognition
    • May
    • J. Oglesby and J. Mason, “Radial basis function networks for speaker recognition,” in Proc. IEEE ICASSP, May 1991, pp. 393-396.
    • (1991) Proc. IEEE ICASSP , pp. 393-396
    • Oglesby, J.1    Mason, J.2
  • 4
    • 0016939145 scopus 로고
    • Automatic recognition of speakers from their voices
    • Apr.
    • B. Atal, “Automatic recognition of speakers from their voices,” Proc. IEEE, vol. 64, pp. 460-475, Apr. 1976.
    • (1976) Proc. IEEE , vol.64 , pp. 460-475
    • Atal, B.1
  • 5
    • 0019052875 scopus 로고
    • A study of LPC analysis of speech in additive noise
    • Aug.
    • J. Tierney, “A study of LPC analysis of speech in additive noise,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 389-397, Aug. 1980.
    • (1980) IEEE Trans. Acoust. , vol.ASSP-28 , pp. 389-397
    • Tierney, J.1
  • 7
    • 0015416804 scopus 로고
    • Talker recognition by longtime averaged speech spectrum
    • S. Furui, F. Itakura, and S. Saito, “Talker recognition by longtime averaged speech spectrum,” Electron., Commun. in Japan, vol. 55-A, no. 10, pp. 54-61, 1972.
    • (1972) Electron. , vol.55-A , Issue.10 , pp. 54-61
    • Furui, S.1    Itakura, F.2    Saito, S.3
  • 8
    • 0017626219 scopus 로고
    • Long-term feature averaging for speaker recognition
    • Aug.
    • J. Market, B. Oshika, and A. Gray, Jr., “Long-term feature averaging for speaker recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-25, pp. 330-337, Aug. 1977.
    • (1977) IEEE Trans. Acoust. , vol.ASSP-25 , pp. 330-337
    • Market, J.1    Oshika, B.2    Gray, A.3
  • 9
    • 0022889724 scopus 로고
    • Methods and experiments for text-independent speaker recognition over telephone channels
    • H. Gish et al., “Methods and experiments for text-independent speaker recognition over telephone channels,” in Proc. IEEE ICASSP, 1986, pp. 865-868.
    • (1986) Proc. IEEE ICASSP , pp. 865-868
    • Gish, H.1
  • 10
    • 0026382107 scopus 로고
    • A text-independent speaker recognition method robust against utterance variations
    • T. Matsui and S. Furui, “A text-independent speaker recognition method robust against utterance variations,” in Proc. IEEE ICASSP, 1991, pp. 377-380.
    • (1991) Proc. IEEE ICASSP , pp. 377-380
    • Matsui, T.1    Furui, S.2
  • 11
    • 84941535474 scopus 로고
    • Free-text speaker identification over long distance telephone channel using hypothesized phonetic segmentation
    • Y. Kao, P. Rajasekaran, and J. Baras, “Free-text speaker identification over long distance telephone channel using hypothesized phonetic segmentation,” in Proc. IEEE ICASSP, 1992, pp. II. 177-11. 180.
    • (1992) Proc. IEEE ICASSP
    • Kao, Y.1    Rajasekaran, P.2    Baras, J.3
  • 12
    • 84913871201 scopus 로고
    • Speaker recognition using linear predictive vector code-books
    • R. E. Helms, “Speaker recognition using linear predictive vector code-books,” Ph.D. thesis, Southern Methodist University, 1981.
    • (1981) Ph.D. thesis
    • Helms, R.E.1
  • 13
    • 0027252185 scopus 로고
    • Voice identification using nearest-neighbor distance measure
    • Apr.
    • A. Higgins, L. Bahler, and J. Porter, “Voice identification using nearest-neighbor distance measure,” in Proc. IEEE ICASSP, Apr. 1993, pp. II-375-II-378.
    • (1993) Proc. IEEE ICASSP
    • Higgins, A.1    Bahler, L.2    Porter, J.3
  • 14
    • 84985742249 scopus 로고
    • Linear predictive hidden Markov models and the speech signal
    • May
    • A. B. Poritz, “Linear predictive hidden Markov models and the speech signal,” in Proc. IEEE ICASSP, May 1982, pp. 1291-1294.
    • (1982) Proc. IEEE ICASSP , pp. 1291-1294
    • Poritz, A.B.1
  • 15
    • 0026117640 scopus 로고
    • On the application of mixture AR hidden Markov models to text independent speaker recognition
    • Mar.
    • N. Z. Tishby, “On the application of mixture AR hidden Markov models to text independent speaker recognition,” IEEE Trans. Signal Processing, vol. 39, pp. 563-570, Mar. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , pp. 563-570
    • Tishby, N.Z.1
  • 16
    • 0025629484 scopus 로고
    • Sub-word talker verification using hidden Markov models
    • Apr.
    • A. E. Rosenberg, C. H. Lee, and F. K. Soong, “Sub-word talker verification using hidden Markov models,” in IEEE ICASSP, Apr. 1990, pp. 269-272.
    • (1990) IEEE ICASSP , pp. 269-272
    • Rosenberg, A.E.1    Lee, C.H.2    Soong, F.K.3
  • 17
    • 85009210391 scopus 로고
    • Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs
    • Mar.
    • T. Matsui and S. Furui, “Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs,” in Proc. IEEE ICASSP, Mar. 1992, pp. II. 157-II.164.
    • (1992) Proc. IEEE ICASSP
    • Matsui, T.1    Furui, S.2
  • 18
    • 0026385269 scopus 로고
    • Text-independent talker identification with neural networks
    • May
    • L. Rudasi and S. A. Zahorian, “Text-independent talker identification with neural networks,” in Proc. IEEE ICASSP, May 1991, pp. 389-392.
    • (1991) Proc. IEEE ICASSP , pp. 389-392
    • Rudasi, L.1    Zahorian, S.A.2
  • 19
    • 0026382108 scopus 로고
    • On the use of TDNN-extracted features information in talker identification
    • May
    • Y. Bennani and P. Gallinari, “On the use of TDNN-extracted features information in talker identification,” in Proc. IEEE ICASSP, May 1991, pp. 385-388.
    • (1991) Proc. IEEE ICASSP , pp. 385-388
    • Bennani, Y.1    Gallinari, P.2
  • 20
    • 0028420014 scopus 로고
    • Integrated models of speech and background with application to speaker identification in noise
    • Apr.
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, “Integrated models of speech and background with application to speaker identification in noise,” IEEE Trans. Speech Audio Processing, vol. 2, no. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 21
    • 0003988385 scopus 로고
    • A Gaussian mixture modeling approach to text-independent speaker identification
    • Sept.
    • D. A. Reynolds, “A Gaussian mixture modeling approach to text-independent speaker identification,” Ph.D. thesis, Georgia Inst of Technology, Sept. 1992.
    • (1992) Ph.D. thesis
    • Reynolds, D.A.1
  • 22
    • 0342990123 scopus 로고
    • PC-based TMS320C30 implementation of the Gaussian mixture model text-independent speaker recognition system
    • Nov.
    • D. A. Reynolds, R. C. Rose, and M. J. T. Smith, “PC-based TMS320C30 implementation of the Gaussian mixture model text-independent speaker recognition system,” in Proc. Int. Conf. Signal Processing Appl, Technol, Nov. 1992, pp. 967-973.
    • (1992) Proc. Int. Conf. Signal Processing Appl , pp. 967-973
    • Reynolds, D.A.1    Rose, R.C.2    Smith, M.J.T.3
  • 23
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust. , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 25
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,” J. Royal Stat. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. Royal Stat. Soc. , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 26
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L. Baum et al., “A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains,” Ann. Math Stat., vol. 41, pp. 164-171, 1970.
    • (1970) Ann. Math Stat. , vol.41 , pp. 164-171
    • Baum, L.1
  • 28
    • 0022883703 scopus 로고
    • Noise compensation for speech recognition using probabilistic models
    • J. Holmes and N. Sedgwick, “Noise compensation for speech recognition using probabilistic models,” in Proc. IEEE ICASSP, 1986.
    • (1986) Proc. IEEE ICASSP
    • Holmes, J.1    Sedgwick, N.2
  • 29
    • 0001354471 scopus 로고
    • A constrained formulation of maximum-likelihood estimation for normal mixture distributions
    • R. Hathaway, “A constrained formulation of maximum-likelihood estimation for normal mixture distributions,” Ann. Stat., vol. 13, no. 2, pp. 795-800, 1985.
    • (1985) Ann. Stat. , vol.13 , Issue.2 , pp. 795-800
    • Hathaway, R.1
  • 31
    • 84983583256 scopus 로고
    • An integrated speech-background model for robust speaker identification
    • Mar.
    • D. A. Reynolds and R. C. Rose, “An integrated speech-background model for robust speaker identification,” in Proc. IEEE ICASSP, Mar. 1992, pp. II-185-II-188.
    • (1992) Proc. IEEE ICASSP
    • Reynolds, D.A.1    Rose, R.C.2
  • 32
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • June
    • B. Atal, “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification,” J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, June 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1312
    • Atal, B.1
  • 33
    • 0019583902 scopus 로고
    • Comparison of speaker recognition methods using statistical features and dynamic features
    • June
    • S. Furui, “Comparison of speaker recognition methods using statistical features and dynamic features,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 342-350, June 1981.
    • (1981) IEEE Trans. Acoust. , vol.ASSP-29 , pp. 342-350
    • Furui, S.1
  • 34
    • 0021180281 scopus 로고
    • Investigation of text-independent speaker identification techniques under conditions of variable data
    • M. Krasner et al., “Investigation of text-independent speaker identification techniques under conditions of variable data,” in Proc. IEEE ICASSP, 1984, pp. 18B.5.1-4.
    • (1984) Proc. IEEE ICASSP
    • Krasner, M.1
  • 35
    • 0025414198 scopus 로고
    • On instantaneous and transitional spectral information for text-dependent speaker verification
    • Apr.
    • C. Bernasconi, “On instantaneous and transitional spectral information for text-dependent speaker verification,” Speech Commun., vol. 9, pp. 129-139, Apr. 1990.
    • (1990) Speech Commun. , vol.9 , pp. 129-139
    • Bernasconi, C.1
  • 36
    • 85016663198 scopus 로고
    • RASTA-PLP speech analysis technique
    • Mar.
    • H. Hermansky et al., “RASTA-PLP speech analysis technique,” in Proc. IEEE ICASSP, Mar. 1992, pp. 1.121-1.124.
    • (1992) Proc. IEEE ICASSP
    • Hermansky, H.1
  • 37
    • 0019555090 scopus 로고
    • Cepstral analysis technique for automatic speaker verification
    • Apr.
    • S. Furui, “Cepstral analysis technique for automatic speaker verification,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 254-272, Apr. 1981.
    • (1981) IEEE Trans. Acoust. , vol.ASSP-29 , pp. 254-272
    • Furui, S.1
  • 38
    • 0019532669 scopus 로고
    • On talker verification via orthogonal parameters
    • Feb.
    • R. E. Bogner, “On talker verification via orthogonal parameters,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 1-12, Feb. 1981.
    • (1981) IEEE Trans. Acoust. , vol.ASSP-29 , pp. 1-12
    • Bogner, R.E.1
  • 39
    • 0024035182 scopus 로고
    • On the use of instantaneous and transitional spectral information in speaker recognition
    • June
    • F. Soong and A. Rosenberg, “On the use of instantaneous and transitional spectral information in speaker recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. 36, pp. 871-879, June 1988.
    • (1988) IEEE Trans. Acoust. , vol.36 , pp. 871-879
    • Soong, F.1    Rosenberg, A.2
  • 40
    • 0025682333 scopus 로고
    • Text-independent speaker identification using automatic acoustic segmentation
    • R. C. Rose and D. A. Reynolds, “Text-independent speaker identification using automatic acoustic segmentation,” in Proc. IEEE 1CASSP, 1990, pp. 293-296.
    • (1990) Proc. IEEE 1CASSP , pp. 293-296
    • Rose, R.C.1    Reynolds, D.A.2
  • 41
    • 0021412027 scopus 로고
    • Vector quantization
    • Apr.
    • R. Gray, “Vector quantization,” IEEE ASSP Magazine, pp. 4-29, Apr. 1984.
    • (1984) IEEE ASSP Magazine , pp. 4-29
    • Gray, R.1
  • 42
    • 0026401064 scopus 로고
    • Integration of speaker recognition systems
    • May
    • D. A. Reynolds and L. P. Heck, “Integration of speaker recognition systems,” in Proc. IEEE ICASSP, May 1991, pp. 869-872.
    • (1991) Proc. IEEE ICASSP , pp. 869-872
    • Reynolds, D.A.1    Heck, L.P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.