메뉴 건너뛰기




Volumn , Issue , 2013, Pages 61-69

Social signal and user adaptation in reinforcement learning-based dialogue management

Author keywords

Dialogue management; Reinforcement learning; Reward shaping; Social signals; User adaptation; Value function approximation

Indexed keywords

DIALOGUE MANAGEMENT; REWARD SHAPING; SOCIAL SIGNALS; USER ADAPTATION; VALUE FUNCTION APPROXIMATION;

EID: 84882935417     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2493525.2493535     Document Type: Conference Paper
Times cited : (9)

References (28)
  • 3
    • 49949094272 scopus 로고    scopus 로고
    • Emotion and reinforcement: Affective facial expressions facilitate robot learning
    • volume 4451 of Lecture Notes in Computer Science
    • J. Broekens and P. Haazebroek. Emotion and reinforcement: Affective facial expressions facilitate robot learning. In Artificial Intelligence for Human Computing, volume 4451 of Lecture Notes in Computer Science, pages 113-132, 2007.
    • (2007) Artificial Intelligence for Human Computing , pp. 113-132
    • Broekens, J.1    Haazebroek, P.2
  • 5
    • 25144507455 scopus 로고    scopus 로고
    • Positive affect as implicit motivator: On the nonconscious operation of behavioral goals
    • Aug.
    • R. Custers and H. Aarts. Positive affect as implicit motivator: On the nonconscious operation of behavioral goals. Journal of Personality and Social Psychology, 89(2):129-142, Aug. 2005.
    • (2005) Journal of Personality and Social Psychology , vol.89 , Issue.2 , pp. 129-142
    • Custers, R.1    Aarts, H.2
  • 10
    • 76649127744 scopus 로고    scopus 로고
    • Tracking in reinforcement learning
    • volume 5863 of Lecture Notes in Computer Science
    • M. Geist, O. Pietquin, and G. Fricout. Tracking in reinforcement learning. In Neural Information Processing, volume 5863 of Lecture Notes in Computer Science, pages 502-511, 2009.
    • (2009) Neural Information Processing , pp. 502-511
    • Geist, M.1    Pietquin, O.2    Fricout, G.3
  • 11
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • May
    • L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence Journal, 101(1-2):99-134, May 1998.
    • (1998) Artificial Intelligence Journal , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 12
    • 85024429815 scopus 로고
    • A new approach to linear filtering and prediction problems
    • R. Kalman. A new approach to linear filtering and prediction problems. Journal of Basic Engineering, 82:35-45, 1960.
    • (1960) Journal of Basic Engineering , vol.82 , pp. 35-45
    • Kalman, R.1
  • 14
    • 0030635367 scopus 로고    scopus 로고
    • Learning dialogue strategies within the markov decision process framework
    • E. Levin, R. Pieraccini, and W. Eckert. Learning dialogue strategies within the markov decision process framework. In ASRU, 1997.
    • (1997) ASRU
    • Levin, E.1    Pieraccini, R.2    Eckert, W.3
  • 15
    • 0141596576 scopus 로고    scopus 로고
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • A. Y. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In ICML, 1999.
    • (1999) ICML
    • Ng, A.Y.1    Harada, D.2    Russell, S.3
  • 16
    • 84882945174 scopus 로고    scopus 로고
    • Unsupervised clustering of probability distributions of semantic graphs for pomdp based spoken dialogue systems with summary space
    • F. Pinault and F. Lefèvre. Unsupervised clustering of probability distributions of semantic graphs for pomdp based spoken dialogue systems with summary space. In IJCAI 7th Workshop on knowledge and reasoning in practical dialogue systems, 2011.
    • (2011) IJCAI 7th Workshop on Knowledge and Reasoning in Practical Dialogue Systems
    • Pinault, F.1    Lefèvre, F.2
  • 18
    • 84880768440 scopus 로고    scopus 로고
    • A bayesian approach to imitation in reinforcement learning
    • B. Price and C. Boutilier. A bayesian approach to imitation in reinforcement learning. In IJCAI, 2003.
    • (2003) IJCAI
    • Price, B.1    Boutilier, C.2
  • 19
    • 84880707672 scopus 로고    scopus 로고
    • Spoken dialogue management using probabilistic reasoning
    • N. Roy, J. Pineau, and S. Thrun. Spoken dialogue management using probabilistic reasoning. In ACL, 2000.
    • (2000) ACL
    • Roy, N.1    Pineau, J.2    Thrun, S.3
  • 20
    • 33747607273 scopus 로고    scopus 로고
    • A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies
    • June
    • J. Schatzmann, K. Weilhammer, M. Stuttle, and S. Young. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. Knowledge Engineering Review, 21(2):97-126, June 2006.
    • (2006) Knowledge Engineering Review , vol.21 , Issue.2 , pp. 97-126
    • Schatzmann, J.1    Weilhammer, K.2    Stuttle, M.3    Young, S.4
  • 23
    • 84882998069 scopus 로고    scopus 로고
    • On the role of tracking in stationary environments
    • R. S. Sutton, A. Koop, and D. Silver. On the role of tracking in stationary environments. In ICML, 2007.
    • (2007) ICML
    • Sutton, R.S.1    Koop, A.2    Silver, D.3
  • 24
    • 77950862681 scopus 로고    scopus 로고
    • Bayesian update of dialogue state: A pomdp framework for spoken dialogue systems
    • B. Thomson and S. Young. Bayesian update of dialogue state: A pomdp framework for spoken dialogue systems. Computer Speech and Language, 24(4):562-588, 2010.
    • (2010) Computer Speech and Language , vol.24 , Issue.4 , pp. 562-588
    • Thomson, B.1    Young, S.2
  • 25
    • 9444259273 scopus 로고    scopus 로고
    • The information state approach to dialogue management
    • volume 22 of Text, Speech and Language Technology
    • D. R. Traum and S. Larsson. The information state approach to dialogue management. In Current and New Directions in Discourse and Dialogue, volume 22 of Text, Speech and Language Technology, pages 325-353, 2003.
    • (2003) Current and New Directions in Discourse and Dialogue , pp. 325-353
    • Traum, D.R.1    Larsson, S.2
  • 26
    • 61549132763 scopus 로고    scopus 로고
    • Social signal processing: Survey of an emerging domain
    • A. Vinciarelli, M. Pantic, and H. Bourlard. Social signal processing: Survey of an emerging domain. Image and Vision Computing, 27(12):1743-1759, 2009.
    • (2009) Image and Vision Computing , vol.27 , Issue.12 , pp. 1743-1759
    • Vinciarelli, A.1    Pantic, M.2    Bourlard, H.3
  • 27
    • 85065183198 scopus 로고    scopus 로고
    • Paradise: A framework for evaluating spoken dialogue agents
    • M. A. Walker, D. J. Litman, C. A. Kamm, and A. Abella. Paradise: A framework for evaluating spoken dialogue agents. In ACL, 1997.
    • (1997) ACL
    • Walker, M.A.1    Litman, D.J.2    Kamm, C.A.3    Abella, A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.