메뉴 건너뛰기




Volumn 52, Issue 3, 2010, Pages 236-245

Speaker adaptation of language and prosodic models for automatic dialog act segmentation of speech

Author keywords

Dialog act segmentation; Language modeling; Prosody modeling; Speaker adaptation; Spoken language understanding

Indexed keywords

AUTOMATIC SEGMENTATIONS; AUTOMATIC SPEECH RECOGNITION; DIALOG ACTS; FUTURE DIRECTIONS; IN-DEGREE; LANGUAGE MODELING; LINEAR COMBINATIONS; PROSODY MODELING; SPEAKER ADAPTATION; SPEECH UNDERSTANDING; SPOKEN LANGUAGE UNDERSTANDING;

EID: 73649084746     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2009.10.005     Document Type: Article
Times cited : (5)

References (37)
  • 1
    • 85009142125 scopus 로고    scopus 로고
    • Language model adaptation based on PLSA of topics and speakers
    • Jeju, Korea
    • Akita, Y., Kawahara, T., 2004. Language model adaptation based on PLSA of topics and speakers. In: Proc. INTERSPEECH 2004-ICSLP, Jeju, Korea.
    • (2004) Proc. INTERSPEECH 2004-ICSLP
    • Akita, Y.1    Kawahara, T.2
  • 2
    • 34547544592 scopus 로고    scopus 로고
    • Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines
    • Pittsburgh, PA, USA
    • Akita, Y., Saikou, M., Nanjo, H., Kawahara, T., 2006. Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines. In: Proc. INTERSPEECH 2006 - ICSLP, Pittsburgh, PA, USA.
    • (2006) Proc. INTERSPEECH 2006 - ICSLP
    • Akita, Y.1    Saikou, M.2    Nanjo, H.3    Kawahara, T.4
  • 3
    • 85031636667 scopus 로고
    • Language model speaker adaptation
    • Madrid, Spain
    • Besling, S., Meier, H.-G., 1995. Language model speaker adaptation. In: Proc. EUROSPEECH, Madrid, Spain.
    • (1995) Proc. EUROSPEECH
    • Besling, S.1    Meier, H.-G.2
  • 6
    • 70349233562 scopus 로고    scopus 로고
    • project: dialog act labeling guide. Tech. Rep. TR-04-002, ICSI, Berkeley, CA, USA
    • Dhillon, R., Bhagat, S., Carvey, H., Shriberg, E., 2004. Meeting recorder project: dialog act labeling guide. Tech. Rep. TR-04-002, ICSI, Berkeley, CA, USA.
    • (2004) Meeting recorder
    • Dhillon, R.1    Bhagat, S.2    Carvey, H.3    Shriberg, E.4
  • 8
    • 3042826816 scopus 로고    scopus 로고
    • Speech-to-text and speech-to-speech summarization of spontaneous speech
    • Furui S., Kikuchi T., Shinnaka Y., and Hori C. Speech-to-text and speech-to-speech summarization of spontaneous speech. IEEE Trans. Speech Audio Process. 12 4 (2004) 401-408
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.4 , pp. 401-408
    • Furui, S.1    Kikuchi, T.2    Shinnaka, Y.3    Hori, C.4
  • 9
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales M. Maximum likelihood linear transformations for HMM-based speech recognition. Comput. Speech Language 12 (1998) 75-98
    • (1998) Comput. Speech Language , vol.12 , pp. 75-98
    • Gales, M.1
  • 10
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains
    • Gauvain J.-L., and Lee C.-H. Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains. IEEE Trans. Speech Audio Process. 2 2 (1994) 291-298
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 11
    • 73649135829 scopus 로고    scopus 로고
    • Hirst A., and Cristo A.D. (Eds), Cambridge University Press
    • In: Hirst A., and Cristo A.D. (Eds). Intonation Syst. (1998), Cambridge University Press
    • (1998) Intonation Syst.
  • 12
    • 85009291541 scopus 로고    scopus 로고
    • Maximum entropy model for punctuation annotation from speech
    • Denver, CO, USA
    • Huang, J., Zweig, G., 2002. Maximum entropy model for punctuation annotation from speech. In: Proc. of ICSLP 2002, Denver, CO, USA.
    • (2002) Proc. of ICSLP
    • Huang, J.1    Zweig, G.2
  • 15
    • 85083514474 scopus 로고    scopus 로고
    • Parsing conversational speech using enhanced segmentation
    • Boston, MA, USA
    • Kahn, J.G., Ostendorf, M., Chelba, C., 2004. Parsing conversational speech using enhanced segmentation. In: Proc. HLT-NAACL, Boston, MA, USA.
    • (2004) Proc. HLT-NAACL
    • Kahn, J.G.1    Ostendorf, M.2    Chelba, C.3
  • 16
    • 0242552344 scopus 로고    scopus 로고
    • A combined punctuation generation and speech recognition system and its performance enhancement using prosody
    • Kim J.H., and Woodland P. A combined punctuation generation and speech recognition system and its performance enhancement using prosody. Speech Communication 41 4 (2003) 563-577
    • (2003) Speech Communication , vol.41 , Issue.4 , pp. 563-577
    • Kim, J.H.1    Woodland, P.2
  • 17
    • 0028996876 scopus 로고
    • Improved backing-off for M-gram language modeling
    • Detroit, MI, USA
    • Kneser, R., Ney, H., 1995. Improved backing-off for M-gram language modeling. In: Proc. ICASSP, Detroit, MI, USA.
    • (1995) Proc. ICASSP
    • Kneser, R.1    Ney, H.2
  • 18
    • 56149093006 scopus 로고    scopus 로고
    • Speaker adaptation of language models for automatic dialog act segmentation of meetings
    • Antwerp, Belgium
    • Kolář, J., Liu, Y., Shriberg, E., 2007. Speaker adaptation of language models for automatic dialog act segmentation of meetings. In: Proc. INTERSPEECH 2007, Antwerp, Belgium.
    • (2007) Proc. INTERSPEECH
    • Kolář, J.1    Liu, Y.2    Shriberg, E.3
  • 19
    • 44949209648 scopus 로고    scopus 로고
    • On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings
    • Pittsburgh, PA, USA
    • Kolář, J., Shriberg, E., Liu, Y., 2006a. On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings. In: Proc. INTERSPEECH 2006 - ICSLP, Pittsburgh, PA, USA.
    • (2006) Proc. INTERSPEECH 2006 - ICSLP
    • Kolář, J.1    Shriberg, E.2    Liu, Y.3
  • 20
    • 33750242108 scopus 로고    scopus 로고
    • Kolář, J., Shriberg, E., Liu, Y., 2006b. Using prosody for automatic sentence segmentation of multi-party meetings. In: Text, Speech and Dialogue. Lecture Notes in Artificial Intelligence, 4188, pp. 629-636.
    • Kolář, J., Shriberg, E., Liu, Y., 2006b. Using prosody for automatic sentence segmentation of multi-party meetings. In: Text, Speech and Dialogue. Lecture Notes in Artificial Intelligence, Vol. 4188, pp. 629-636.
  • 21
    • 33746529930 scopus 로고    scopus 로고
    • A study in machine learning from imbalanced data for sentence boundary detection in speech
    • Liu Y., Chawla N., Harper M., Shriberg E., and Stolcke A. A study in machine learning from imbalanced data for sentence boundary detection in speech. Comput. Speech Language 20 (2006) 468-494
    • (2006) Comput. Speech Language , vol.20 , pp. 468-494
    • Liu, Y.1    Chawla, N.2    Harper, M.3    Shriberg, E.4    Stolcke, A.5
  • 22
    • 85117232702 scopus 로고    scopus 로고
    • Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech
    • Barcelona, Spain
    • Liu, Y., Stolcke, A., Shriberg, E., Harper, M., 2004. Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech. In: Proc. EMNLP. Barcelona, Spain.
    • (2004) Proc. EMNLP
    • Liu, Y.1    Stolcke, A.2    Shriberg, E.3    Harper, M.4
  • 23
    • 44849094956 scopus 로고    scopus 로고
    • Using conditional random fields for sentence boundary detection in speech
    • Ann Arbor, MI, USA
    • Liu, Y., Stolcke, A., Shriberg, E., Harper, M., 2005. Using conditional random fields for sentence boundary detection in speech. In: Proc. ACL, Ann Arbor, MI, USA.
    • (2005) Proc. ACL
    • Liu, Y.1    Stolcke, A.2    Shriberg, E.3    Harper, M.4
  • 26
    • 0003008756 scopus 로고
    • A hierarchical stochastic model for automatic prediction of prosodic boundary locations
    • Ostendorf M., and Veilleux N. A hierarchical stochastic model for automatic prediction of prosodic boundary locations. Comput. Linguistics 20 1 (1994) 27-54
    • (1994) Comput. Linguistics , vol.20 , Issue.1 , pp. 27-54
    • Ostendorf, M.1    Veilleux, N.2
  • 28
    • 0034275920 scopus 로고    scopus 로고
    • Prosody-based automatic segmentation of speech into sentences and topics
    • Shriberg E., Stolcke A., Hakkani-Tür D., and Tür G. Prosody-based automatic segmentation of speech into sentences and topics. Speech Communication 32 1-2 (2000) 127-154
    • (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 127-154
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tür, D.3    Tür, G.4
  • 29
    • 85128436986 scopus 로고    scopus 로고
    • Modeling dynamic prosodic variation for speaker verification
    • Sydney, Australia
    • Sönmez, K., Shriberg, E., Heck, L., Weintraub, M., 1998. Modeling dynamic prosodic variation for speaker verification. In: Proc. ICSLP, Sydney, Australia.
    • (1998) Proc. ICSLP
    • Sönmez, K.1    Shriberg, E.2    Heck, L.3    Weintraub, M.4
  • 30
    • 85009230921 scopus 로고    scopus 로고
    • Sentence boundary detection in Arabic speech
    • Geneva, Switzerland
    • Srivastava, A., Kubala, F., 2003. Sentence boundary detection in Arabic speech. In: Proc. EUROSPEECH, Geneva, Switzerland.
    • (2003) Proc. EUROSPEECH
    • Srivastava, A.1    Kubala, F.2
  • 31
    • 84891308106 scopus 로고    scopus 로고
    • SRILM - An extensible language modeling toolkit
    • Denver, CO, USA
    • Stolcke, A., 2002. SRILM - An extensible language modeling toolkit. In: Proc. ICSLP, Denver, CO, USA.
    • (2002) Proc. ICSLP
    • Stolcke, A.1
  • 34
    • 34547505381 scopus 로고    scopus 로고
    • Unsupervised language model adaptation for meeting recognition
    • Honolulu, HI, USA
    • Tur, G., Stolcke, A., 2007. Unsupervised language model adaptation for meeting recognition. In: Proc. ICASSP, Honolulu, HI, USA.
    • (2007) Proc. ICASSP
    • Tur, G.1    Stolcke, A.2
  • 35
    • 85135190196 scopus 로고    scopus 로고
    • Integrated dialog act segmentation and classification using prosodic features and language models
    • Rhodes, Greece
    • Warnke, V., Kompe, R., Niemann, H., Nöth, E., 1997. Integrated dialog act segmentation and classification using prosodic features and language models. In: Proc. EUROSPEECH, Rhodes, Greece.
    • (1997) Proc. EUROSPEECH
    • Warnke, V.1    Kompe, R.2    Niemann, H.3    Nöth, E.4
  • 36
    • 70450161117 scopus 로고    scopus 로고
    • Joint segmentation and classification of dialog acts using conditional random fields
    • Brighton, UK
    • Zimmermann, M., 2009. Joint segmentation and classification of dialog acts using conditional random fields. In: Proc. INTERSPEECH 2009, Brighton, UK.
    • (2009) Proc. INTERSPEECH
    • Zimmermann, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.