메뉴 건너뛰기




Volumn 34, Issue 4, 2008, Pages 487-511

Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING; SPEECH PROCESSING; SUPERVISED LEARNING;

EID: 51449120317     PISSN: 08912017     EISSN: 15309312     Source Type: Journal    
DOI: 10.1162/coli.2008.07-028-R2-05-82     Document Type: Article
Times cited : (129)

References (50)
  • 1
    • 9444289010 scopus 로고    scopus 로고
    • Information states and dialog move engines
    • Available at
    • Bohlin, Peter, Robin Cooper, Elisabet Engdahl, and Staffan Larsson. 1999. Information states and dialog move engines. Electronic Transactions in AI, 3(9). Available at www. ep. liu. se/ej /etai/1999/D/.
    • (1999) Electronic Transactions in AI , vol.3 , Issue.9
    • Bohlin, P.1    Cooper, R.2    Engdahl, E.3    Larsson, S.4
  • 2
    • 51349089807 scopus 로고    scopus 로고
    • DIPPER: Description and formalisation of an information-state update dialogue system architecture
    • Sapporo
    • Bos, Johan, Ewan Klein, Oliver Lemon, and Tetsushi Oka. 2003. DIPPER: Description and formalisation of an information-state update dialogue system architecture. In Proceedings of the 4th SIGdial Workshop on Discourse and Dialogue, pages 115-124, Sapporo.
    • (2003) Proceedings of the 4th SIGdial Workshop on Discourse and Dialogue , pp. 115-124
    • Bos, J.1    Klein, E.2    Lemon, O.3    Oka, T.4
  • 4
    • 26444573745 scopus 로고    scopus 로고
    • Denecke, Matthias, Kohji Dohsaka, and Mikio Nakano, 2005. Fast reinforcement learning of dialogue policies using stable function approximation. In K. Y. Su, J. Tsujii, J.-H. Lee, and O. Y. Kwong, Natural Language Processing, IJCNLP 2004. Springer, Berlin, pages 1-11.
    • Denecke, Matthias, Kohji Dohsaka, and Mikio Nakano, 2005. Fast reinforcement learning of dialogue policies using stable function approximation. In K. Y. Su, J. Tsujii, J.-H. Lee, and O. Y. Kwong, Natural Language Processing, IJCNLP 2004. Springer, Berlin, pages 1-11.
  • 7
    • 85149152675 scopus 로고    scopus 로고
    • Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systems
    • Barcelona
    • Gabsdil Malte and Oliver Lemon. 2004. Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systems. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics, pages 344-351, Barcelona.
    • (2004) Proceedings of the 42nd Meeting of the Association for Computational Linguistics , pp. 344-351
    • Malte, G.1    Lemon, O.2
  • 12
    • 0033677177 scopus 로고    scopus 로고
    • Goddeau, D. and J. Pineau. 2000. Fast reinforcement learning of dialog strategies. In Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 11-1233-1236, Istanbul.
    • Goddeau, D. and J. Pineau. 2000. Fast reinforcement learning of dialog strategies. In Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 11-1233-1236, Istanbul.
  • 14
    • 85009087667 scopus 로고    scopus 로고
    • Information state and dialogue management in the TRINDI Dialogue Move Engine Toolkit
    • Larsson, Staffan and David Traum. 2000. Information state and dialogue management in the TRINDI Dialogue Move Engine Toolkit. Natural Language Engineering, 6(3-4):323-340.
    • (2000) Natural Language Engineering , vol.6 , Issue.3-4 , pp. 323-340
    • Larsson, S.1    Traum, D.2
  • 15
    • 48749112832 scopus 로고    scopus 로고
    • Evaluating effectiveness and portability of reinforcement learned dialogue strategies with real users: The TALK TownInfo evaluation
    • Aruba
    • Lemon, Oliver, Kallirroi Georgila, and James Henderson. 2006. Evaluating effectiveness and portability of reinforcement learned dialogue strategies with real users: the TALK TownInfo evaluation. In Proceedings of the IEEE/ACL 2006 Workshop on Spoken Language Technology, pages 178-181, Aruba.
    • (2006) Proceedings of the IEEE/ACL 2006 Workshop on Spoken Language Technology , pp. 178-181
    • Lemon, O.1    Georgila, K.2    Henderson, J.3
  • 16
    • 84857773811 scopus 로고    scopus 로고
    • Integration of learning and adaptivity with the ISU approach
    • Technical Report D4.1, TALK Project
    • Lemon, Oliver, Kallirroi Georgila, James Henderson, Malte Gabsdil, Ivan Meza-Ruiz, and Steve Young. 2005. Integration of learning and adaptivity with the ISU approach. Technical Report D4.1, TALK Project.
    • (2005)
    • Lemon, O.1    Georgila, K.2    Henderson, J.3    Gabsdil, M.4    Meza-Ruiz, I.5    Young, S.6
  • 17
    • 84893350028 scopus 로고    scopus 로고
    • An ISU dialogue system exhibiting reinforcement learning of dialogue policies: Generic slot-filling in the TALK in-car system
    • Trento
    • Lemon, Oliver, Kallirroi Georgila, James Henderson, and Matthew Stuttle. 2006. An ISU dialogue system exhibiting reinforcement learning of dialogue policies: generic slot-filling in the TALK in-car system. In Proceedings of the Demonstrations of EACL, pages 119-122, Trento.
    • (2006) Proceedings of the Demonstrations of EACL , pp. 119-122
    • Lemon, O.1    Georgila, K.2    Henderson, J.3    Stuttle, M.4
  • 18
    • 57349119494 scopus 로고    scopus 로고
    • Showcase exhibiting reinforcement learning for dialogue strategies in the in-car domain
    • Technical Report D4.2, TALK Project
    • Lemon, Oliver, Kallirroi Georgila, and Matthew Stuttle. 2005. Showcase exhibiting reinforcement learning for dialogue strategies in the in-car domain. Technical Report D4.2, TALK Project.
    • (2005)
    • Lemon, O.1    Georgila, K.2    Stuttle, M.3
  • 20
    • 0033894474 scopus 로고    scopus 로고
    • A stochastic model of human-machine interaction for learning dialog strategies
    • Levin, Esther, Roberto Pieraccini, and Wieland Eckert. 2000. A stochastic model of human-machine interaction for learning dialog strategies. IEEE Transactions on Speech and Audio Processing, 8(1):11-23.
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.1 , pp. 11-23
    • Levin, E.1    Pieraccini, R.2    Eckert, W.3
  • 23
    • 33750253118 scopus 로고    scopus 로고
    • A probabilistic framework for dialog simulation and optimal strategy learning
    • Pietquin, Olivier and Thierry Dutoit. 2006b. A probabilistic framework for dialog simulation and optimal strategy learning. IEEE Transactions on Speech and Audio Processing, 14(2):589-599.
    • (2006) IEEE Transactions on Speech and Audio Processing , vol.14 , Issue.2 , pp. 589-599
    • Pietquin, O.1    Dutoit, T.2
  • 30
    • 33747607273 scopus 로고    scopus 로고
    • A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies
    • Schatzmann, Jost, Karl Weilhammer, Matthew N. Stuttle, and Steve Young. 2006. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. The Knowledge Engineering Review, 21:97-126.
    • (2006) The Knowledge Engineering Review , vol.21 , pp. 97-126
    • Schatzmann, J.1    Weilhammer, K.2    Stuttle, M.N.3    Young, S.4
  • 32
    • 33846263279 scopus 로고    scopus 로고
    • Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning
    • San Diego, CA
    • Scheffler, Konrad and Steve Young. 2002. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning. In Proceedings of the Human Language Technology Conference, pages 12-19, San Diego, CA.
    • (2002) Proceedings of the Human Language Technology Conference , pp. 12-19
    • Scheffler, K.1    Young, S.2
  • 34
    • 85158142417 scopus 로고    scopus 로고
    • Empirical evaluation of a reinforcement learning dialogue system
    • Whistler
    • Singh, Satinder, Michael Kearns, Diane Litman, and Marilyn Walker. 2000a. Empirical evaluation of a reinforcement learning dialogue system. In Proceedings of the AAAI, pages 645-651, Whistler.
    • (2000) Proceedings of the AAAI , pp. 645-651
    • Singh, S.1    Kearns, M.2    Litman, D.3    Walker, M.4
  • 36
    • 0037841376 scopus 로고    scopus 로고
    • Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
    • Singh, Satinder, Diane Litman, Michael Kearns, and Marilyn Walker. 2002. Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system. Journal of Artificial Intelligence Research (JAIR), 16:105-133.
    • (2002) Journal of Artificial Intelligence Research (JAIR) , vol.16 , pp. 105-133
    • Singh, S.1    Litman, D.2    Kearns, M.3    Walker, M.4
  • 39
    • 33846187514 scopus 로고    scopus 로고
    • DATE: A dialogue act tagging scheme for evaluation of spoken dialogue systems
    • San Diego, CA
    • Walker, M. and R. Passonneau. 2001. DATE: A dialogue act tagging scheme for evaluation of spoken dialogue systems. In Proceedings of the Human Language Technology Conference, pages 1-8, San Diego, CA.
    • (2001) Proceedings of the Human Language Technology Conference , pp. 1-8
    • Walker, M.1    Passonneau, R.2
  • 43
    • 84974698591 scopus 로고    scopus 로고
    • Towards developing general models of usability with PARADISE
    • Walker, Marilyn A., Candace A. Kamm, and Diane J. Litman. 2000. Towards developing general models of usability with PARADISE. Natural Language Engineering, 6(3):363-377.
    • (2000) Natural Language Engineering , vol.6 , Issue.3 , pp. 363-377
    • Walker, M.A.1    Kamm, C.A.2    Litman, D.J.3
  • 47
  • 48
    • 85118874370 scopus 로고    scopus 로고
    • Using Wizard-of-Oz simulations to bootstrap reinforcement-learning-based dialog management systems
    • Sapporo
    • Williams, Jason and Steve Young. 2003. Using Wizard-of-Oz simulations to bootstrap reinforcement-learning-based dialog management systems. In Proceedings of the 4th SIGdial Workshop on Discourse and Dialogue, pages 135-139, Sapporo.
    • (2003) Proceedings of the 4th SIGdial Workshop on Discourse and Dialogue , pp. 135-139
    • Williams, J.1    Young, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.