



Volume 4343 LNAI, 2007, Pages 241-259

Higher-level features in speaker recognition

Author keywords

Automatic speech recognition; High level features; Higher level features; Long range features; Phonetic speaker recognition; Prosodic features; Prosody; Speaker idiosyncrasies; Speaker recognition; Speaker verification; Stylistic features

Indexed keywords

AUTOMATION; FEATURE EXTRACTION; LINGUISTICS; SPEECH RECOGNITION

EID: 36248960119     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-74200-5_14     Document Type: Article
Times cited: 56

References (59)
  • 2
    • 36249031961
    • Classification Methods for Speaker Recognition
    • Müller, C. (ed.), Speaker Classification I, Springer, Heidelberg
    • Sturim, D.E., Campbell, W.M., Reynolds, D.A.: Classification Methods for Speaker Recognition. In: Müller, C. (ed.) Speaker Classification I. LNCS (LNAI), vol. 4343, Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI), vol. 4343
    • Sturim, D.E.; Campbell, W.M.; Reynolds, D.A.
  • 3
    • 36249013798
    • The Many Roles of Speaker Classification in Speaker Verification and Identification
    • Müller, C. (ed.), Speaker Classification I, Springer, Heidelberg
    • Markowitz, J.: The Many Roles of Speaker Classification in Speaker Verification and Identification. In: Müller, C. (ed.) Speaker Classification I. LNCS (LNAI), vol. 4343, Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI), vol. 4343
    • Markowitz, J.
  • 4
    • 36249029527
    • Evaluations of Automatic Speaker Classification Systems
    • Müller, C. (ed.), Speaker Classification I, Springer, Heidelberg
    • Martin, A.F.: Evaluations of Automatic Speaker Classification Systems. In: Müller, C. (ed.) Speaker Classification I. LNCS (LNAI), vol. 4343, Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI), vol. 4343
    • Martin, A.F.
  • 5
    • 0030376663
    • Robust prosodic features for speaker identification
    • Bunnell, H.T., Idsardi, W. (eds.), Philadelphia
    • Carey, M., Parris, E., Lloyd-Thomas, H., Bennett, S.: Robust prosodic features for speaker identification. In: Bunnell, H.T., Idsardi, W. (eds.) Proc. ICSLP. Philadelphia, vol. 3, pp. 1800-1803 (1996)
    • (1996) Proc. ICSLP, vol. 3, pp. 1800-1803
    • Carey, M.; Parris, E.; Lloyd-Thomas, H.; Bennett, S.
  • 6
    • 85135139722
    • A Lognormal Tied Mixture Model of Pitch for Prosody-Based Speaker Recognition
    • Kokkinakis, G., Fakotakis, N., Dermatas, E. (eds.), Rhodes, Greece
    • Sönmez, M.K., Heck, L., Weintraub, M., Shriberg, E.: A Lognormal Tied Mixture Model of Pitch for Prosody-Based Speaker Recognition. In: Kokkinakis, G., Fakotakis, N., Dermatas, E. (eds.) Proc. EUROSPEECH, Rhodes, Greece, pp. 1391-1394 (1997)
    • (1997) Proc. EUROSPEECH, pp. 1391-1394
    • Sönmez, M.K.; Heck, L.; Weintraub, M.; Shriberg, E.
  • 9
    • 85009291564
    • ASR Dependent Techniques for Speaker Identification
    • Hansen, J.H.L., Pellom, B. (eds.), Denver
    • Park, A., Hazen, T.J.: ASR Dependent Techniques for Speaker Identification. In: Hansen, J.H.L., Pellom, B. (eds.) Proc. ICSLP, Denver, pp. 1337-1340 (2002)
    • (2002) Proc. ICSLP, pp. 1337-1340
    • Park, A.; Hazen, T.J.
  • 10
    • 17344377138
    • Speaker Verification Using Text-Constrained Gaussian Mixture Models
    • Orlando
    • Sturim, D.E., Reynolds, D.A., Dunn, R.B., Quatieri, T.F.: Speaker Verification Using Text-Constrained Gaussian Mixture Models. In: Proc. ICASSP, vol. I, Orlando, pp. 677-680 (2002)
    • (2002) Proc. ICASSP, vol. 1, pp. 677-680
    • Sturim, D.E.; Reynolds, D.A.; Dunn, R.B.; Quatieri, T.F.
  • 12
    • 85135167035
    • Experiments with Speaker Verification Over the Telephone
    • Pardo, J.M., Enríquez, E., Ortega, J., Ferreiros, J., Macias, J., Valverde, F.J. (eds.), Madrid
    • Gauvain, J.L., Lamel, L.F., Prouts, B.: Experiments with Speaker Verification Over the Telephone. In: Pardo, J.M., Enríquez, E., Ortega, J., Ferreiros, J., Macias, J., Valverde, F.J. (eds.) Proc. EUROSPEECH, Madrid (1995)
    • (1995) Proc. EUROSPEECH
    • Gauvain, J.L.; Lamel, L.F.; Prouts, B.
  • 13
    • 0030369359
    • Speaker Verification Through Large Vocabulary Continuous Speech Recognition
    • Bunnell, H.T., Idsardi, W. (eds.), Philadelphia
    • Newman, M., Gillick, L., Ito, Y., McAllaster, D., Peskin, B.: Speaker Verification Through Large Vocabulary Continuous Speech Recognition. In: Bunnell, H.T., Idsardi, W. (eds.) Proc. ICSLP. vol. 4, Philadelphia, pp. 2419-2422 (1996)
    • (1996) Proc. ICSLP, vol. 4, pp. 2419-2422
    • Newman, M.; Gillick, L.; Ito, Y.; McAllaster, D.; Peskin, B.
  • 15
    • 33646779908
    • Speaker Detection without Models
    • Philadelphia
    • Gillick, D., Stafford, S., Peskin, B.: Speaker Detection without Models. In: Proc. ICASSP. Philadelphia, vol. 1, pp. 757-760 (2005)
    • (2005) Proc. ICASSP, vol. 1, pp. 757-760
    • Gillick, D.; Stafford, S.; Peskin, B.
  • 20
    • 33646348224
    • Improved Phonetic Speaker Recognition Using Lattice Decoding
    • Philadelphia
    • Hatch, A.O., Peskin, B., Stolcke, A.: Improved Phonetic Speaker Recognition Using Lattice Decoding. In: Proc. ICASSP. Philadelphia, vol. 1, pp. 169-172 (2005)
    • (2005) Proc. ICASSP, vol. 1, pp. 169-172
    • Hatch, A.O.; Peskin, B.; Stolcke, A.
  • 21
    • 18144435041
    • Phonetic Speaker Recognition Using Maximum-Likelihood Binary-Decision Tree Models
    • Hong Kong
    • Navrátil, J., Jin, Q., Andrews, W.D., Campbell, J.P.: Phonetic Speaker Recognition Using Maximum-Likelihood Binary-Decision Tree Models. In: Proc. ICASSP. Hong Kong, vol. 4, pp. 796-799 (2003)
    • (2003) Proc. ICASSP, vol. 4, pp. 796-799
    • Navrátil, J.; Jin, Q.; Andrews, W.D.; Campbell, J.P.
  • 22
  • 23
    • 34547511465
    • Word-Conditioned Phone N-Grams for Speaker Recognition
    • Honolulu
    • Lei, H., Mirghafori, N.: Word-Conditioned Phone N-Grams for Speaker Recognition. In: Proc. ICASSP, Honolulu (2007)
    • (2007) Proc. ICASSP
    • Lei, H.; Mirghafori, N.
  • 24
    • 18144436320
    • Conditional Pronunciation Modeling in Speaker Detection
    • Hong Kong
    • Klusáček, D., Navrátil, J., Reynolds, D.A., Campbell, J.P.: Conditional Pronunciation Modeling in Speaker Detection. In: Proc. ICASSP. Hong Kong, vol. 4, pp. 804-807 (2003)
    • (2003) Proc. ICASSP, vol. 4, pp. 804-807
    • Klusáček, D.; Navrátil, J.; Reynolds, D.A.; Campbell, J.P.
  • 26
    • 85128436986
    • Modeling Dynamic Prosodic Variation for Speaker Verification
    • Mannell, R.H., Robert-Ribes, J. (eds.), Australian Speech Science and Technology Association, Sydney
    • Sönmez, K., Shriberg, E., Heck, L., Weintraub, M.: Modeling Dynamic Prosodic Variation for Speaker Verification. In: Mannell, R.H., Robert-Ribes, J. (eds.) Proc. ICSLP. vol. 7, pp. 3189-3192, Australian Speech Science and Technology Association, Sydney (1998)
    • (1998) Proc. ICSLP, vol. 7, pp. 3189-3192
    • Sönmez, K.; Shriberg, E.; Heck, L.; Weintraub, M.
  • 27
    • 0141521592
    • Modeling Prosodic Dynamics for Speaker Recognition
    • Hong Kong
    • Adami, A.G., Mihaescu, R., Reynolds, D.A., Godfrey, J.J.: Modeling Prosodic Dynamics for Speaker Recognition. In: Proc. ICASSP. Hong Kong, vol. 4, pp. 788-791 (2003)
    • (2003) Proc. ICASSP, vol. 4, pp. 788-791
    • Adami, A.G.; Mihaescu, R.; Reynolds, D.A.; Godfrey, J.J.
  • 29
    • 85143189570
    • Peskin, B., Navrátil, J., Abramson, J., Jones, D., Klusáček, D., Reynolds, D.A., Xiang, B.: Using Prosodic And Conversational Features for High Performance Speaker Recognition: Report From JHU WS'02. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), Hong Kong, pp. 792-795 (2003)
  • 31
    • 21844454996
    • Shriberg, E., Ferrer, L., Kajarekar, S., Venkataraman, A., Stolcke, A.: Modeling prosodic feature sequences for speaker recognition. Speech Communication. (Special Issue on Quantitative Prosody Modelling for Natural Speech Description and Generation) 46(3-4), 455-472 (2005)
  • 33
    • 36248996792
    • A Text-Constrained Prosodic System for Speaker Verification
    • Antwerp, Belgium
    • Shriberg, E., Ferrer, L.: A Text-Constrained Prosodic System for Speaker Verification. In: Proceedings of Interspeech, Antwerp, Belgium (2007)
    • (2007) Proceedings of Interspeech
    • Shriberg, E.; Ferrer, L.
  • 34
    • 85009124414
    • Speaker Recognition Based on Idiolectal Differences Between Speakers
    • Dalsgaard, P., Lindberg, B., Benner, H., Tan, Z. (eds.), Aalborg, Denmark
    • Doddington, G.: Speaker Recognition Based on Idiolectal Differences Between Speakers. In: Dalsgaard, P., Lindberg, B., Benner, H., Tan, Z. (eds.) Proc. EUROSPEECH, Aalborg, Denmark, pp. 2521-2524 (2001)
    • (2001) Proc. EUROSPEECH, pp. 2521-2524
    • Doddington, G.
  • 36
    • 56149108574
    • Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification
    • Antwerp, Belgium
    • Tür, G., Shriberg, E., Stolcke, A., Kajarekar, S.: Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification. In: Proceedings of Interspeech, Antwerp, Belgium (2007)
    • (2007) Proceedings of Interspeech
    • Tür, G.; Shriberg, E.; Stolcke, A.; Kajarekar, S.
  • 40
    • 0015476226
    • Automatic Speaker Recognition Based on Pitch Contours
    • Atal, B.: Automatic Speaker Recognition Based on Pitch Contours. Journal of the Acoustical Society of America 52(6), 1687-1697 (1972)
    • (1972) Journal of the Acoustical Society of America, vol. 52, issue 6, pp. 1687-1697
    • Atal, B.
  • 50
    • 33645895387
    • Advances in Channel Compensation for SVM Speaker Recognition
    • Philadelphia
    • Solomonoff, A., Campbell, W.M., Boardman, I.: Advances in Channel Compensation for SVM Speaker Recognition. In: Proc. ICASSP, Philadelphia, vol. 1, pp. 629-632 (2005)
    • (2005) Proc. ICASSP, vol. 1, pp. 629-632
    • Solomonoff, A.; Campbell, W.M.; Boardman, I.
  • 51
    • 0033884857
    • Score Normalization for Text-Independent Speaker Verification Systems
    • Auckenthaler, R., Carey, M., Lloyd-Thomas, H.: Score Normalization for Text-Independent Speaker Verification Systems. Digital Signal Processing 10(1-3), 42-54 (2000)
    • (2000) Digital Signal Processing, vol. 10, issue 1-3, pp. 42-54
    • Auckenthaler, R.; Carey, M.; Lloyd-Thomas, H.
  • 52
    • 0036289656
    • Generalized Linear Discriminant Sequence Kernels for Speaker Recognition
    • Orlando
    • Campbell, W.M.: Generalized Linear Discriminant Sequence Kernels for Speaker Recognition. In: Proc. ICASSP, Orlando, vol. 1, pp. 161-164 (2002)
    • (2002) Proc. ICASSP, vol. 1, pp. 161-164
    • Campbell, W.M.
  • 53
    • 33645887246
    • Support Vector Machines Using GMM Supervectors for Speaker Verification
    • Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support Vector Machines Using GMM Supervectors for Speaker Verification. IEEE Signal Processing Letters 13(5), 308-311 (2006)
    • (2006) IEEE Signal Processing Letters, vol. 13, issue 5, pp. 308-311
    • Campbell, W.M.; Sturim, D.E.; Reynolds, D.A.
  • 54
    • 36249003154
    • A Study of Acoustic Correlates of Speaker Age
    • Müller, C. (ed.), Speaker Classification II, Springer, Heidelberg
    • Schötz, S., Müller, C.: A Study of Acoustic Correlates of Speaker Age. In: Müller, C. (ed.) Speaker Classification II. LNCS (LNAI), vol. 4441, Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI), vol. 4441
    • Schötz, S.; Müller, C.
  • 55
    • 36248938764
    • Speaker Characteristics
    • Müller, C. (ed.), Speaker Classification I, Springer, Heidelberg
    • Schultz, T.: Speaker Characteristics. In: Müller, C. (ed.) Speaker Classification I. LNCS (LNAI), vol. 4343, Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI), vol. 4343
    • Schultz, T.
  • 56
    • 36249013993
    • Real-life Emotion Recognition in Speech
    • Müller, C. (ed.), Speaker Classification II, Springer, Heidelberg
    • Devillers, L., Vidrascu, L.: Real-life Emotion Recognition in Speech. In: Müller, C. (ed.) Speaker Classification II. LNCS (LNAI), vol. 4441, Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI), vol. 4441
    • Devillers, L.; Vidrascu, L.
  • 57
    • 33947641603
    • Combining Prosodic, Lexical and Cepstral Systems for Deceptive Speech Detection
    • Graciarena, M., Shriberg, E., Stolcke, A., Enos, F., Hirschberg, J., Kajarekar, S.: Combining Prosodic, Lexical and Cepstral Systems for Deceptive Speech Detection. In: Proc. ICASSP, vol. 1, pp. 1033-1036 (2006)
    • (2006) Proc. ICASSP, vol. 1, pp. 1033-1036
    • Graciarena, M.; Shriberg, E.; Stolcke, A.; Enos, F.; Hirschberg, J.; Kajarekar, S.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.