Volume , Issue , 2012, Pages 365-388

Prosodic features for speaker recognition

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS SPEECH RECOGNITION;

EID: 84955099902     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/9781461402633_13     Document Type: Chapter
Times cited : (4)

References (45)
  • 1
    • 85025697637
    • Heck LP (2002) Integrating high-level information for robust speaker recognition in Johns Hopkins University workshop on SuperSID, Baltimore, Maryland. http://www.cslp.jhu.edu/ws2002/groups/supersid.
    • Heck, L.P.1
  • 2
    • 85009124414
    • Speaker recognition based on idiolectic differences between speakers
    • Aalborg, Denmark
    • Doddington GG (2001) Speaker recognition based on idiolectic differences between speakers. Proc. Eurospeech, Aalborg, Denmark, pp 2521-2524.
    • (2001) Proc. Eurospeech , pp. 2521-2524
    • Doddington, G.G.1
  • 3
    • 0031233424
    • Speaker recognition: A tutorial
    • Campbell JP (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437-1462.
    • (1997) Proc IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 4
    • 79953181342
    • Multilevel implicit features for language and speaker recognition
    • Indian Institute of Technology, Madras
    • Mary L (2006) Multilevel implicit features for language and speaker recognition. Ph.D. Thesis, Indian Institute of Technology, Madras.
    • (2006) Ph.D. Thesis
    • Mary, L.1
  • 5
    • 70350125882
    • An overview of text-independent speaker recognition: From features to supervectors
    • Kinnunen T, Li H (2010) An overview of text-independent speaker recognition: from features to supervectors. Speech Commun 52:12-40.
    • (2010) Speech Commun , vol.52 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 6
    • 85025589847
    • NIST (2001) Speaker recognition evaluation website: http://www.nist.gov/speech/tests/spk/2001.
  • 8
    • 0034275920
    • Prosody-based automatic segmentation of speech into sentences and topics
    • Shriberg E, Stolcke A, Hakkani-Tur D, Tur G (2000) Prosody-based automatic segmentation of speech into sentences and topics. Speech Commun 32:127-154.
    • (2000) Speech Commun , vol.32 , pp. 127-154
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tur, D.3    Tur, G.4
  • 9
    • 85135139722
    • A lognormal tied mixture model of pitch for prosody-based speaker recognition
    • Sonmez MK, Heck L, Weintraub M, Shriberg E (1997) A lognormal tied mixture model of pitch for prosody-based speaker recognition. Proc. Eurospeech, Rhodes, Greece, 3, pp 1391-1394.
    • (1997) Proc. Eurospeech , vol.3 , pp. 1391-1394
    • Sonmez, M.K.1    Heck, L.2    Weintraub, M.3    Shriberg, E.4
  • 10
    • 0017876719
    • Correlation analysis of the physiological factors controlling fundamental voice frequency
    • Atkinson JE (1978) Correlation analysis of the physiological factors controlling fundamental voice frequency. J Acoust Soc Am 63(1):211-222.
    • (1978) J Acoust Soc Am , vol.63 , Issue.1 , pp. 211-222
    • Atkinson, J.E.1
  • 11
    • 22544440896
    • Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
    • Yegnanarayana B, Prasanna SRM, Zachariah JM, Gupta CS (2005) Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. IEEE Trans Speech Audio Process 13(4):575-582.
    • (2005) IEEE Trans Speech Audio Process , vol.13 , Issue.4 , pp. 575-582
    • Yegnanarayana, B.1    Prasanna, S.2    Zachariah, J.M.3    Gupta, C.S.4
  • 12
    • 0015476226
    • Automatic speaker recognition based on pitch contours
    • Atal B (1972) Automatic speaker recognition based on pitch contours. J Acoust Soc Am 52(3):1687-1697.
    • (1972) J Acoust Soc Am , vol.52 , Issue.3 , pp. 1687-1697
    • Atal, B.1
  • 13
    • 0141521592
    • Modeling prosodic dynamics for speaker recognition
    • Hong Kong, China
    • Adami AG, Mihaescu R, Reynolds DA, Godfrey JJ (2003) Modeling prosodic dynamics for speaker recognition. Proc. ICASSP, Hong Kong, China, 4, pp 788-791.
    • (2003) Proc. ICASSP , vol.4 , pp. 788-791
    • Adami, A.G.1    Mihaescu, R.2    Reynolds, D.A.3    Godfrey, J.J.4
  • 14
    • 0016495091
    • Linear prediction: A tutorial review
    • Makhoul J (1975) Linear prediction: a tutorial review. Proc IEEE 63:561-580.
    • (1975) Proc IEEE , vol.63 , pp. 561-580
    • Makhoul, J.1
  • 15
    • 0019555090
    • Cepstral analysis technique for automatic speaker verification
    • Furui S (1981) Cepstral analysis technique for automatic speaker verification. IEEE Trans Speech Audio Process 29:254-272.
    • (1981) IEEE Trans Speech Audio Process , vol.29 , pp. 254-272
    • Furui, S.1
  • 16
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Reynolds DA, Rose R (1995) Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 3:72-83.
    • (1995) IEEE Trans Speech Audio Process , vol.3 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.2
  • 17
    • 0029725601
    • The effect of handset variability on speaker recognition performance: Experiments on the Switchboard corpus
    • Atlanta, GA, USA
    • Reynolds DA (1996) The effect of handset variability on speaker recognition performance: Experiments on the Switchboard corpus. Proc. ICASSP, Atlanta, GA, USA, 1, pp 113-116.
    • (1996) Proc. ICASSP , vol.1 , pp. 113-116
    • Reynolds, D.A.1
  • 18
    • 0030361268
    • On using prosodic cues in automatic language identification
    • Philadelphia, PA, USA
    • Thyme-Gobbel AE, Hutchins SE (1996) On using prosodic cues in automatic language identification. Proc. Int. Conf. Spoken Language Processing, Philadelphia, PA, USA, 3, pp 1768-1772.
    • (1996) Proc. Int. Conf. Spoken Language Processing , vol.3 , pp. 1768-1772
    • Thyme-Gobbel, A.E.1    Hutchins, S.E.2
  • 19
    • 52949094265
    • Extraction and representation of prosodic features for language and speaker recognition
    • Mary L, Yegnanarayana B (2008) Extraction and representation of prosodic features for language and speaker recognition. Speech Commun 50:782-796.
    • (2008) Speech Commun , vol.50 , pp. 782-796
    • Mary, L.1    Yegnanarayana, B.2
  • 20
    • 85032752200
    • Forensic automatic speaker recognition
    • Drygajlo A (2007) Forensic automatic speaker recognition. IEEE Signal Process Mag 132-135.
    • (2007) IEEE Signal Process Mag , pp. 132-135
    • Drygajlo, A.1
  • 22
    • 29044446618
    • Technical speaker recognition: Evaluation, types and testing of evidence
    • Rose P (2006) Technical speaker recognition: evaluation, types and testing of evidence. Comp Speech Lang 20:159-191.
    • (2006) Comp Speech Lang , vol.20 , pp. 159-191
    • Rose, P.1
  • 24
    • 0039453719
    • Modeling dynamic prosodic variation for speaker verification
    • Sydney, Australia
    • Sonmez MK, Shriberg E, Heck L, Weintraub M (1998) Modeling dynamic prosodic variation for speaker verification. Proc. ICSLP, Sydney, Australia, 7, pp 3189-3192.
    • (1998) Proc. ICSLP , vol.7 , pp. 3189-3192
    • Sonmez, M.K.1    Shriberg, E.2    Heck, L.3    Weintraub, M.4
  • 25
    • 0141521592
    • Modeling prosodic dynamics for speaker recognition
    • Hong Kong, China
    • Adami AG, Mihaescu R, Reynolds DA, Godfrey JJ (2003) Modeling prosodic dynamics for speaker recognition. Proc. ICASSP, Hong Kong, China, 4, pp 788-791.
    • (2003) Proc. ICASSP , vol.4 , pp. 788-791
    • Adami, A.G.1    Mihaescu, R.2    Reynolds, D.A.3    Godfrey, J.J.4
  • 26
    • 0141856298
    • Using prosodic and conversational features for high-performance speaker recognition: Report from JHU WS'02
    • Hong Kong, China
    • Peskin B, Navratil J, Abramson J, Jones D, Klusacek D, Reynolds D, Xiang B (2003) Using prosodic and conversational features for high-performance speaker recognition: report from JHU WS'02. Proc. ICASSP, Hong Kong, China, 4, pp 792-795.
    • (2003) Proc. ICASSP , vol.4 , pp. 792-795
    • Peskin, B.1    Navratil, J.2    Abramson, J.3    Jones, D.4    Klusacek, D.5    Reynolds, D.6    Xiang, B.7
  • 27
    • 27644531433
    • Rhythmic unit extraction and modelling for automatic language identification
    • Rouas J, Farinas J, Pellegrino F, Andre-Obrecht R (2005) Rhythmic unit extraction and modelling for automatic language identification. Speech Commun 47:436-456.
    • (2005) Speech Commun , vol.47 , pp. 436-456
    • Rouas, J.1    Farinas, J.2    Pellegrino, F.3    Andre-Obrecht, R.4
  • 28
    • 33745354229
    • Language identification using acoustic log-likelihoods of syllable-like units
    • Nagarajan T, Murthy HA (2006) Language identification using acoustic log-likelihoods of syllable-like units. Speech Commun 48:913-926.
    • (2006) Speech Commun , vol.48 , pp. 913-926
    • Nagarajan, T.1    Murthy, H.A.2
  • 29
    • 51449092448
    • Continuous prosodic features and formant modeling with joint factor analysis for speaker verification
    • Dehak N, Kenny P, Dumouchel P (2007) Continuous prosodic features and formant modeling with joint factor analysis for speaker verification. Proc. of Interspeech, pp 1234-1237.
    • (2007) Proc. of Interspeech , pp. 1234-1237
    • Dehak, N.1    Kenny, P.2    Dumouchel, P.3
  • 31
    • 0031760948
    • The frame/content theory of evolution of speech production
    • MacNeilage PF (1998) The frame/content theory of evolution of speech production. Behav Brain Sci 21:499-546.
    • (1998) Behav Brain Sci , vol.21 , pp. 499-546
    • MacNeilage, P.F.1
  • 32
    • 0002635113
    • Physiological organization of syllables: A review
    • Krakow RA (1999) Physiological organization of syllables: a review. J Phonetics 27:23-54.
    • (1999) J Phonetics , vol.27 , pp. 23-54
    • Krakow, R.A.1
  • 33
    • 2942726537
    • On the phonetics and phonology of “segmental anchoring” of F0: Evidence from German
    • Atterer M, Ladd DR (2004) On the phonetics and phonology of “segmental anchoring” of F0: evidence from German. J Phonetics 32:177-197.
    • (2004) J Phonetics , vol.32 , pp. 177-197
    • Atterer, M.1    Ladd, D.R.2
  • 34
    • 0009591303
    • Significance of vowel onset point for speech analysis
    • Indian Institute of Science
    • Prasanna SRM, Gangashetty SV, Yegnanarayana B (2001) Significance of vowel onset point for speech analysis. Proc. Signal Proc. Com, Indian Institute of Science, pp. 81-88.
    • (2001) Proc. Signal Proc. Com , pp. 81-88
    • Prasanna, S.1    Gangashetty, S.V.2    Yegnanarayana, B.3
  • 35
    • 33745205178
    • Event-based analysis of speech
    • Indian Institute of Technology, Madras
    • Prasanna SRM (2004) Event-based analysis of speech. Ph D Thesis, Indian Institute of Technology, Madras.
    • (2004) Ph D Thesis
    • Prasanna, S.1
  • 36
    • 65249110603
    • Detection of vowel onset point events using excitation source information
    • Prasanna SRM, Yegnanarayana B (2005) Detection of vowel onset point events using excitation source information. Proc. of Interspeech, pp 1133-1136.
    • (2005) Proc. of Interspeech , pp. 1133-1136
    • Prasanna, S.1    Yegnanarayana, B.2
  • 37
    • 0036288088
    • Detection of vowel onset point in speech
    • Orlando, FL, USA
    • Prasanna SRM, Zachariah JM (2002) Detection of vowel onset point in speech. Proc. IEEE Int Conf Acoust Speech, Signal Processing, Orlando, FL, USA 4:4159.
    • (2002) Proc. IEEE Int Conf Acoust Speech, Signal Processing , vol.4 , pp. 4159
    • Prasanna, S.1    Zachariah, J.M.2
  • 40
    • 0018656516
    • Epoch extraction from linear prediction residual for identification of closed glottis interval
    • Ananthapadmanabha TV, Yegnanarayana B (1979) Epoch extraction from linear prediction residual for identification of closed glottis interval. IEEE Trans ASSP 27:309-319.
    • (1979) IEEE Trans ASSP , vol.27 , pp. 309-319
    • Ananthapadmanabha, T.V.1    Yegnanarayana, B.2
  • 42
    • 0034008810
    • Analysis and synthesis of intonation using the tilt model
    • Taylor P (2000) Analysis and synthesis of intonation using the tilt model. J Acoust Soc Am 107(3):1697-1714.
    • (2000) J Acoust Soc Am , vol.107 , Issue.3 , pp. 1697-1714
    • Taylor, P.1
  • 45
    • 0035989168
    • AANN-An alternative for GMM for pattern recognition
    • Yegnanarayana B, Kishore SP (2002) AANN-An alternative for GMM for pattern recognition. Neural Netw 15(3):459-469.
    • (2002) Neural Netw , vol.15 , Issue.3 , pp. 459-469
    • Yegnanarayana, B.1    Kishore, S.P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.