메뉴 건너뛰기




Volumn 18, Issue 9, 1997, Pages 859-872

Recent advances in speaker recognition

Author keywords

HMM; Likelihood normalization; Speaker recognition; Speaker verification; Speker identification; Text prompted method

Indexed keywords

CHARACTER RECOGNITION; MARKOV PROCESSES; MATHEMATICAL MODELS; VECTOR QUANTIZATION;

EID: 0031223555     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-8655(97)00073-1     Document Type: Article
Times cited : (168)

References (59)
  • 1
    • 0015476226 scopus 로고
    • Automatic speaker recognition based on pitch contours
    • Atal, B., 1972. Automatic speaker recognition based on pitch contours. J. Acoust. Soc. Am. 52 (6), 1687-1697.
    • (1972) J. Acoust. Soc. Am. , vol.52 , Issue.6 , pp. 1687-1697
    • Atal, B.1
  • 2
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Atal, B., 1974. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. J. Acoust. Soc. Am. 55 (6), 1304-1312.
    • (1974) J. Acoust. Soc. Am. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.1
  • 3
    • 0002286386 scopus 로고
    • Speaker verification using connected words
    • Carey, M., Parris, E., 1992. Speaker verification using connected words. Proc. Institute of Acoustics 14 (6), 95-100.
    • (1992) Proc. Institute of Acoustics , vol.14 , Issue.6 , pp. 95-100
    • Carey, M.1    Parris, E.2
  • 5
    • 0022150488 scopus 로고
    • Speaker recognition-identifying people by their voices
    • Doddington, G., 1985. Speaker recognition-identifying people by their voices. Proc. IEEE 73 (11), 1651-1664.
    • (1985) Proc. IEEE , vol.73 , Issue.11 , pp. 1651-1664
    • Doddington, G.1
  • 6
    • 0011862645 scopus 로고
    • Automatically focusing on good discriminating speech segments in speaker recognition
    • vol. 5.2
    • Eatock, J., Mason, J., 1990. Automatically focusing on good discriminating speech segments in speaker recognition. In: Proc. Internat. Conf. Spoken Language Processing, vol. 5.2, pp. 133-136.
    • (1990) Proc. Internat. Conf. Spoken Language Processing , pp. 133-136
    • Eatock, J.1    Mason, J.2
  • 7
    • 0015416804 scopus 로고
    • Talker recognition by longtime averaged speech spectrum
    • Furui, S., Itakura, F., Saito, S., 1972. Talker recognition by longtime averaged speech spectrum. Trans. IECE A55 1 (10), 549-556.
    • (1972) Trans. IECE A55 , vol.1 , Issue.10 , pp. 549-556
    • Furui, S.1    Itakura, F.2    Saito, S.3
  • 8
    • 0043116211 scopus 로고
    • An analysis of long-term variation of feature parameters of speech and its application to talker recognition
    • Furui, S., 1974. An analysis of long-term variation of feature parameters of speech and its application to talker recognition. Trans. IECE A57 (12), 880-887.
    • (1974) Trans. IECE , vol.A57 , Issue.12 , pp. 880-887
    • Furui, S.1
  • 9
    • 0019555090 scopus 로고
    • Cepstral analysis technique for automatic speaker verification
    • Furui, S., 1981. Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoust. Speech Signal Process. 29 (2), 254-272.
    • (1981) IEEE Trans. Acoust. Speech Signal Process. , vol.29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 10
    • 0001174086 scopus 로고
    • Research on individuality features in speech waves and automatic speaker recognition techniques
    • Furui, S., 1986. Research on individuality features in speech waves and automatic speaker recognition techniques. Speech Communication 5 (2), 183-197.
    • (1986) Speech Communication , vol.5 , Issue.2 , pp. 183-197
    • Furui, S.1
  • 12
    • 0042114273 scopus 로고
    • Speaker-independent and speaker-adaptive recognition techniques
    • Furui, S., Sondhi, M.M. (Eds.), Marcel Dekker, New York
    • Furui, S., 1991. Speaker-independent and speaker-adaptive recognition techniques. In: Furui, S., Sondhi, M.M. (Eds.), Advances in Speech Signal Processing. Marcel Dekker, New York, pp. 597-622.
    • (1991) Advances in Speech Signal Processing , pp. 597-622
    • Furui, S.1
  • 13
    • 0026402417 scopus 로고
    • Speaker-dependent-feature extraction, recognition and processing techniques
    • Furui, S., 1991b. Speaker-dependent-feature extraction, recognition and processing techniques. Speech Communication 10 (6), 505-520.
    • (1991) Speech Communication , vol.10 , Issue.6 , pp. 505-520
    • Furui, S.1
  • 15
    • 85135375893 scopus 로고
    • HMM recognition in noise using parallel model combination
    • Berlin
    • Gales, M., Young, S., 1993. HMM recognition in noise using parallel model combination. In: Proc. Eurospeech, Berlin, pp. II-837-840.
    • (1993) Proc. Eurospeech
    • Gales, M.1    Young, S.2
  • 16
    • 85135167035 scopus 로고
    • Experiments with speaker verification over the telephone
    • Madrid
    • Gauvain, J., Lamel, L., Prouts, B., 1995. Experiments with speaker verification over the telephone. In: Proc. Eurospeech, Madrid, pp. 651-654.
    • (1995) Proc. Eurospeech , pp. 651-654
    • Gauvain, J.1    Lamel, L.2    Prouts, B.3
  • 21
    • 0003019863 scopus 로고
    • Speaker verification using randomized phrase prompting
    • Higgins, A., Bahler, L., Porter, J., 1991. Speaker verification using randomized phrase prompting. Digital Signal Process. 1, 89-106.
    • (1991) Digital Signal Process. , vol.1 , pp. 89-106
    • Higgins, A.1    Bahler, L.2    Porter, J.3
  • 26
    • 0018331352 scopus 로고
    • Text-independent speaker recognition from a large linguistically unconstrained time-spaced data base
    • Markel, J., Davi, S., 1979. Text-independent speaker recognition from a large linguistically unconstrained time-spaced data base. IEEE Trans. Acoust. Speech Signal Process. 27 (1), 74-82.
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.1 , pp. 74-82
    • Markel, J.1    Davi, S.2
  • 27
    • 85135371131 scopus 로고
    • Recognition of noisy speech by composition of hidden markov models
    • Berlin
    • Martin, F., Shikano, K., Minami, Y., 1993. Recognition of noisy speech by composition of hidden Markov models. In: Proc. Eurospeech, Berlin, pp. II-1031-1034.
    • (1993) Proc. Eurospeech
    • Martin, F.1    Shikano, K.2    Minami, Y.3
  • 28
    • 0001557861 scopus 로고
    • Text-independent speaker recognition using vocal tract, pitch information
    • Kobe, 5.3
    • Matsui, T., Furui, S., 1990. Text-independent speaker recognition using vocal tract, pitch information. In: Proc. Internat. Conf. on Spoken Language Processing, Kobe, 5.3, pp. 137-140.
    • (1990) Proc. Internat. Conf. on Spoken Language Processing , pp. 137-140
    • Matsui, T.1    Furui, S.2
  • 30
    • 85009210391 scopus 로고
    • Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs
    • San Francisco
    • Matsui, T., Furui, S., 1992. Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs. In: Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing, San Francisco, pp. II-157-160.
    • (1992) Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing
    • Matsui, T.1    Furui, S.2
  • 33
    • 85079093521 scopus 로고
    • Speaker adaptation of tied-mixture-based phoneme models for text-prompted speaker recognition
    • Adelaide, 13.1
    • Matsui, T., Furui, S., 1994. Speaker adaptation of tied-mixture-based phoneme models for text-prompted speaker recognition. In: Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing, Adelaide, 13.1.
    • (1994) Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing
    • Matsui, T.1    Furui, S.2
  • 35
    • 0030125219 scopus 로고    scopus 로고
    • Speaker recognition using HMM composition in noisy environments
    • Matsui, T., Furui, S., 1996b. Speaker recognition using HMM composition in noisy environments. Computer Speech and Language 10, 107-116.
    • (1996) Computer Speech and Language , vol.10 , pp. 107-116
    • Matsui, T.1    Furui, S.2
  • 36
    • 58049084980 scopus 로고
    • Cinematic techniques for speech processing: Temporal decomposition and multivariate linear prediction
    • San Francisco
    • Montacie, C. et al., 1992. Cinematic techniques for speech processing: Temporal decomposition and multivariate linear prediction. In: Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing, San Francisco, pp. I-153-156.
    • (1992) Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing
    • Montacie, C.1
  • 39
    • 0022794148 scopus 로고
    • Speaker recognition
    • O'Shaugnessy, D., 1986. Speaker recognition. IEEE ASSP Mag. 3 (4), 4-17.
    • (1986) IEEE ASSP Mag. , vol.3 , Issue.4 , pp. 4-17
    • O'Shaugnessy, D.1
  • 43
    • 0028420014 scopus 로고
    • Integrated models of signal and background with application to speaker identification in noise
    • Rose, R., Hofstetter, E., Reynolds, R., 1994. Integrated models of signal and background with application to speaker identification in noise. IEEE Trans. Speech Audio Process. 2 (2), 245-257.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.1    Hofstetter, E.2    Reynolds, R.3
  • 44
    • 0000592562 scopus 로고
    • Evaluation of a vector quantization talker recognition system in text independent and text dependent modes
    • Rosenberg, A., Soong, F., 1987. Evaluation of a vector quantization talker recognition system in text independent and text dependent modes. Computer Speech and Language 22, 143-157.
    • (1987) Computer Speech and Language , vol.22 , pp. 143-157
    • Rosenberg, A.1    Soong, F.2
  • 48
    • 0001941052 scopus 로고
    • Recent research in automatic speaker recognition
    • Furui, S., Sondhi, M.M. (Eds.), Marcel Dekker, New York
    • Rosenberg, A., Soong, F., 1991. Recent research in automatic speaker recognition. In: Furui, S., Sondhi, M.M. (Eds.), Advances in Speech Signal Processing. Marcel Dekker, New York, pp. 701-737.
    • (1991) Advances in Speech Signal Processing , pp. 701-737
    • Rosenberg, A.1    Soong, F.2
  • 49
    • 85027135125 scopus 로고
    • The use of cohort normalized scores for speaker verification
    • Banff, Th.sAM.4.2
    • Rosenberg, A., 1992. The use of cohort normalized scores for speaker verification. In: Proc. Internat. Conf. on Spoken Language Processing, Banff, Th.sAM.4.2, pp. 599-602.
    • (1992) Proc. Internat. Conf. on Spoken Language Processing , pp. 599-602
    • Rosenberg, A.1
  • 51
    • 85135162797 scopus 로고
    • Results of a speaker verification service trial using HMM models
    • Madrid
    • Setlur, A., Jacobs, T., 1995. Results of a speaker verification service trial using HMM models. In: Proc. EUROSPEECH'95, Madrid, pp. 639-642.
    • (1995) Proc. EUROSPEECH'95 , pp. 639-642
    • Setlur, A.1    Jacobs, T.2
  • 52
    • 0042114266 scopus 로고
    • Text-independent speaker recognition experiments using codebooks in vector quantization
    • abstract
    • Shikano, K., 1985. Text-independent speaker recognition experiments using codebooks in vector quantization. J. Acoust. Soc. Am. 77, 11, abstract.
    • (1985) J. Acoust. Soc. Am. , vol.77 , pp. 11
    • Shikano, K.1
  • 53
    • 85009265801 scopus 로고
    • An unsupervised, sequential learning algorithm for the segmentation of speech waveforms with multiple speakers
    • San Francisco
    • Siu, M., Yu, G., Gish, H., 1992. An unsupervised, sequential learning algorithm for the segmentation of speech waveforms with multiple speakers. In: Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing, San Francisco, pp. I-189-192.
    • (1992) Proc. IEEE Internat. Conf. on Acoust. Speech, Signal Processing
    • Siu, M.1    Yu, G.2    Gish, H.3
  • 54
    • 0023314827 scopus 로고
    • A vector quantization approach to speaker recognition
    • Soong, F., Rosenberg, A., Juang, B., 1987. A vector quantization approach to speaker recognition. AT&T Tech. J. 66, 14-26.
    • (1987) AT&T Tech. J. , vol.66 , pp. 14-26
    • Soong, F.1    Rosenberg, A.2    Juang, B.3
  • 55
    • 0024035182 scopus 로고
    • On the use of instantaneous and transitional spectral information in speaker recognition
    • Soong, F., Rosenberg, A., 1988. On the use of instantaneous and transitional spectral information in speaker recognition. IEEE Trans. Acoust. Speech Signal Process. 36 (6), 871-879.
    • (1988) IEEE Trans. Acoust. Speech Signal Process. , vol.36 , Issue.6 , pp. 871-879
    • Soong, F.1    Rosenberg, A.2
  • 56
    • 0042615351 scopus 로고
    • Segment based text independent speaker recognition
    • in Japanese
    • Sugiyama, M., 1988. Segment based text independent speaker recognition. In: Proc. Spring Meeting of Acoust. Soc. Japan, pp. 75-76 (in Japanese).
    • (1988) Proc. Spring Meeting of Acoust. Soc. Japan , pp. 75-76
    • Sugiyama, M.1
  • 57
    • 0026117640 scopus 로고
    • On the application of mixture AR hidden Markov models to text independent speaker recognition
    • Tishby, N., 1991. On the application of mixture AR hidden Markov models to text independent speaker recognition. IEEE Trans. Acoust. Speech, Signal Process. 30 (3), 563-570.
    • (1991) IEEE Trans. Acoust. Speech, Signal Process. , vol.30 , Issue.3 , pp. 563-570
    • Tishby, N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.