메뉴 건너뛰기




Volumn , Issue , 2013, Pages 2559-2563

Addressee detection for dialog systems using temporal and spectral dimensions of speaking style

Author keywords

Addressee detection; Dialog system; Language model; Online processing; Out of domain data; Prosody; Speaking style; Spectral tilt; Vocal effort

Indexed keywords

COMPUTATIONAL LINGUISTICS; EXPERIMENTS; SPEECH SYNTHESIS;

EID: 84906248474     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (31)

References (28)
  • 1
    • 83455206599 scopus 로고    scopus 로고
    • A comparison of addressee detection methods for multiparty conversations
    • R. op den Akker and D. Traum, A comparison of addressee detection methods for multiparty conversations, Proceedings of Diaholmia, pp. 99-106, 2009.
    • (2009) Proceedings of Diaholmia , pp. 99-106
    • Op Den Akker, R.1    Traum, D.2
  • 3
    • 84857736407 scopus 로고    scopus 로고
    • Multiparty turn taking in situated dialog: Study, lessons, and directions
    • Portland, OR, June
    • D. Bohus and E. Horvitz, Multiparty turn taking in situated dialog: Study, lessons, and directions, Proceedings ACL SIGDIAL, pp. 98-109, Portland, OR, June 2011.
    • (2011) Proceedings ACL SIGDIAL , pp. 98-109
    • Bohus, D.1    Horvitz, E.2
  • 4
    • 85009128838 scopus 로고    scopus 로고
    • Continuous listening for unconstrained spoken dialog
    • B. Yuan, T. Huang, and X. Tang, editors, Beijing, Oct. China Military Friendship Publish
    • T. Paek, E. Horvitz, and E. Ringger, Continuous listening for unconstrained spoken dialog, B. Yuan, T. Huang, and X. Tang, editors, Proc. ICSLP, vol. 1, pp. 138-141, Beijing, Oct. 2000. China Military Friendship Publish.
    • (2000) Proc. ICSLP , vol.1 , pp. 138-141
    • Paek, T.1    Horvitz, E.2    Ringger, E.3
  • 8
    • 84878404626 scopus 로고    scopus 로고
    • Learning when to listen: Detecting system-addressed speech in humanhuman- computer dialog
    • Portland, Oregon, Sep
    • E. Shriberg, A. Stolcke, D. Hakkani-Tr, and L. Heck, Learning when to listen: Detecting system-addressed speech in humanhuman- computer dialog, Proc. Interspeech, pp. 334-337, Portland, Oregon, Sep. 2012.
    • (2012) Proc. Interspeech , pp. 334-337
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tr, D.3    Heck, L.4
  • 11
    • 0345098384 scopus 로고
    • Multi-site data collection for a spoken language corpus
    • MADCOW, Harriman, NY, Feb. Defense Advanced Research Projects Agency, Information Science and Technology Office
    • MADCOW, Multi-site data collection for a spoken language corpus, Proc. DARPA SNP Workshop, pp. 7-14, Harriman, NY, Feb. 1992. Defense Advanced Research Projects Agency, Information Science and Technology Office.
    • (1992) Proc. Darpa Snp Workshop , pp. 7-14
  • 16
    • 0031023993 scopus 로고    scopus 로고
    • Glottal characteristics of female speakers: Acousticcorrelates
    • H. Hanson, Glottal characteristics of female speakers: Acousticcorrelates, Journal of the Acoustical Society of America, vol. 101, pp. 466-481, 1997.
    • (1997) Journal of the Acoustical Society of America , vol.101 , pp. 466-481
    • Hanson, H.1
  • 17
    • 0034123760 scopus 로고    scopus 로고
    • Acoustic effects of variation in vocal effort by men women and children
    • H. Traunmüller and A. Eriksson, Acoustic effects of variation in vocal effort by men, women, and children, Journal of the Acoustical Society of America, vol. 107, pp. 3438-3451, 2000.
    • (2000) Journal of the Acoustical Society of America , vol.107 , pp. 3438-3451
    • Traunmüller, H.1    Eriksson, A.2
  • 18
    • 84878620106 scopus 로고    scopus 로고
    • Cries and whispers - classification of vocal effort in expressive speech
    • Portland, Oregon, Sep
    • N. Obin, Cries and whispers - classification of vocal effort in expressive speech, Proc. Interspeech, Portland, Oregon, Sep. 2012.
    • (2012) Proc. Interspeech
    • Obin, N.1
  • 21
    • 85090317334 scopus 로고
    • A pitch extraction reference database
    • J. M. Pardo, E. Enŕiquez, J. Ortega, J. Ferreiros, J. Maćias, and F. J. Valverde, editors, Madrid, Sep
    • F. Plante, G. F. Meyer, and W. A. Ainsworth, A pitch extraction reference database, in J. M. Pardo, E. Enŕiquez, J. Ortega, J. Ferreiros, J. Maćias, and F. J. Valverde, editors, Proc. EUROSPEECH, pp. 837-840, Madrid, Sep. 1995.
    • (1995) Proc. EUROSPEECH , pp. 837-840
    • Plante, F.1    Meyer, G.F.2    Ainsworth, W.A.3
  • 22
    • 85093707396 scopus 로고
    • Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching
    • Berlin, Sep
    • P. C. Bagshaw, S. M. Hiller, and M. A. Jack, Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching, Proc. EUROSPEECH, pp. 1003-1006, Berlin, Sep. 1993.
    • (1993) Proc. EUROSPEECH , pp. 1003-1006
    • Bagshaw, P.C.1    Hiller, S.M.2    Jack, M.A.3
  • 23
    • 34047246313 scopus 로고    scopus 로고
    • Age, sex, and vowel dependencies of acoustic measures related to the voice source
    • M. Iseli, Y.-L. Shue, and A. Alwan, Age, sex, and vowel dependencies of acoustic measures related to the voice source, Journal of the Acoustical Society of America, vol. 121, pp. 2283-2295, 2007.
    • (2007) Journal of the Acoustical Society of America , vol.121 , pp. 2283-2295
    • Iseli, M.1    Shue, Y.-L.2    Alwan, A.3
  • 24
    • 85135173867 scopus 로고    scopus 로고
    • Speech recognition using on-line estimation of speaking rate
    • G. Kokkinakis, N. Fakotakis, and E. Dermatas, editors, Rhodes, Greece, Sep
    • N. Morgan, E. Fosler, and N. Mirghafori, Speech recognition using on-line estimation of speaking rate, in G. Kokkinakis, N. Fakotakis, and E. Dermatas, editors, Proc. EUROSPEECH, vol. 4, pp. 2079-2082, Rhodes, Greece, Sep. 1997.
    • (1997) Proc. EUROSPEECH , vol.4 , pp. 2079-2082
    • Morgan, N.1    Fosler, E.2    Mirghafori, N.3
  • 25
    • 84906264325 scopus 로고    scopus 로고
    • Efficient estimation of maximum entropy language models with N-gram features: An SRILM extension
    • Portland, Oregon, Sep
    • T. Alumäe and M. Kurimo, Efficient estimation of maximum entropy language models with N-gram features: An SRILM extension, Proc. Interspeech, pp. 1820-1823, Portland, Oregon, Sep. 2012.
    • (2012) Proc. Interspeech , pp. 1820-1823
    • Alumäe, T.1    Kurimo, M.2
  • 26
    • 0033905095 scopus 로고    scopus 로고
    • Boostexter: A boosting-based system for text categorization
    • R. E. Schapire and Y. Singer, Boostexter: A boosting-based system for text categorization, Machine Learning, vol. 39, pp. 135- 168, 2000.
    • (2000) Machine Learning , vol.39 , pp. 135-168
    • Schapire, R.E.1    Singer, Y.2
  • 28
    • 0033902487 scopus 로고    scopus 로고
    • Applying logistic regression to the fusion of the NIST'99 1-speaker submissions
    • Jan
    • S. Pigeon, P. Druyts, and P. Verlinde, Applying logistic regression to the fusion of the NIST'99 1-speaker submissions, Digital Signal Processing, vol. 10, pp. 237-248, Jan. 2000.
    • (2000) Digital Signal Processing , vol.10 , pp. 237-248
    • Pigeon, S.1    Druyts, P.2    Verlinde, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.