SCOPUS Information Retrieval Platform

ACM International Conference Proceeding Series

2013, Pages 61-69

Social signal and user adaptation in reinforcement learning-based dialogue management

(2) Ferreira, Emmanuel; Lefèvre, Fabrice

Author keywords

Dialogue management; Reinforcement learning; Reward shaping; Social signals; User adaptation; Value function approximation

Indexed keywords

DIALOGUE MANAGEMENT; REWARD SHAPING; SOCIAL SIGNALS; USER ADAPTATION; VALUE FUNCTION APPROXIMATION

ARTIFICIAL INTELLIGENCE; COMMUNICATION

REINFORCEMENT LEARNING

EID: 84882935417; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: 10.1145/2493525.2493535; Document Type: Conference Paper

Times cited (9)

References (28)

1
- 84872159748
- Learning the reward model of dialogue POMDPs from data
- A. Boularias, H. R. Chinaei, and B. Chaib-draa. Learning the reward model of dialogue POMDPs from data. In NIPS 2010 Workshop of Machine Learning for Assistive Techniques, 2010.
- (2010) NIPS 2010 Workshop of Machine Learning for Assistive Techniques
- Boularias, A.¹ Chinaei, H.R.² Chaib-Draa, B.³

2
- 77949366790
- Spotting agreement and disagreement: A survey of nonverbal audiovisual cues and tools
- K. Bousmalis, M. Mehu, and M. Pantic. Spotting agreement and disagreement: A survey of nonverbal audiovisual cues and tools. In Proceedings of the International Conference on Affective Computing and Intelligent Interaction, 2009.
- (2009) Proceedings of the International Conference on Affective Computing and Intelligent Interaction
- Bousmalis, K.¹ Mehu, M.² Pantic, M.³

3
- 49949094272
- Emotion and reinforcement: Affective facial expressions facilitate robot learning
- J. Broekens and P. Haazebroek. Emotion and reinforcement: Affective facial expressions facilitate robot learning. In Artificial Intelligence for Human Computing, volume 4451 of Lecture Notes in Computer Science, pages 113-132, 2007.
- (2007) Artificial Intelligence for Human Computing , pp. 113-132
- Broekens, J.¹ Haazebroek, P.²

4
- 84865717612
- User simulation in dialogue systems using inverse reinforcement learning
- S. Chandramohan, M. Geist, F. Lefèvre, and O. Pietquin. User simulation in dialogue systems using inverse reinforcement learning. In Interspeech, 2011.
- (2011) Interspeech
- Chandramohan, S.¹ Geist, M.² Lefèvre, F.³ Pietquin, O.⁴

5
- 25144507455
- Positive affect as implicit motivator: On the nonconscious operation of behavioral goals
- R. Custers and H. Aarts. Positive affect as implicit motivator: On the nonconscious operation of behavioral goals. Journal of Personality and Social Psychology, 89(2):129-142, Aug. 2005.
- (2005) Journal of Personality and Social Psychology , vol.89 , Issue.2 , pp. 129-142
- Custers, R.¹ Aarts, H.²

6
- 84865703906
- Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
- L. Daubigney, M. Gasic, S. Chandramohan, M. Geist, O. Pietquin, and S. Young. Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system. In Interspeech, 2011.
- (2011) Interspeech
- Daubigney, L.¹ Gasic, M.² Chandramohan, S.³ Geist, M.⁴ Pietquin, O.⁵ Young, S.⁶

7
- 84872138024
- A comprehensive reinforcement learning framework for dialogue management optimization
- L. Daubigney, M. Geist, S. Chandramohan, and O. Pietquin. A comprehensive reinforcement learning framework for dialogue management optimization. Journal on Selected Topics in Signal Processing, 6(8):891-902, 2012.
- (2012) Journal on Selected Topics in Signal Processing , vol.6 , Issue.8 , pp. 891-902
- Daubigney, L.¹ Geist, M.² Chandramohan, S.³ Pietquin, O.⁴

8
- 84857755225
- Gaussian processes for fast policy optimisation of POMDP-based dialogue managers
- M. Gašić, F. Jurčíček, S. Keizer, F. Mairesse, B. Thomson, K. Yu, and S. Young. Gaussian processes for fast policy optimisation of POMDP-based dialogue managers. In SIGDIAL, 2010.
- (2010) SIGDIAL
- Gašić, M.¹ Jurčíček, F.² Keizer, S.³ Mairesse, F.⁴ Thomson, B.⁵ Yu, K.⁶ Young, S.⁷

9
- 78651465938
- Kalman temporal differences
- M. Geist and O. Pietquin. Kalman temporal differences. Journal of Artificial Intelligence Research (JAIR), 39(1):483-532, Sept. 2010.
- (2010) Journal of Artificial Intelligence Research (JAIR) , vol.39 , Issue.1 , pp. 483-532
- Geist, M.¹ Pietquin, O.²

10
- 76649127744
- Tracking in reinforcement learning
- M. Geist, O. Pietquin, and G. Fricout. Tracking in reinforcement learning. In Neural Information Processing, volume 5863 of Lecture Notes in Computer Science, pages 502-511, 2009.
- (2009) Neural Information Processing , pp. 502-511
- Geist, M.¹ Pietquin, O.² Fricout, G.³

11
- 0032073263
- Planning and acting in partially observable stochastic domains
- L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence Journal, 101(1-2):99-134, May 1998.
- (1998) Artificial Intelligence Journal , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

12
- 85024429815
- A new approach to linear filtering and prediction problems
- R. Kalman. A new approach to linear filtering and prediction problems. Journal of Basic Engineering, 82:35-45, 1960.
- (1960) Journal of Basic Engineering , vol.82 , pp. 35-45
- Kalman, R.¹

13
- 84857715566
- Parameter estimation for agenda-based user simulation
- S. Keizer, M. Gašić, F. Jurčíček, F. Mairesse, B. Thomson, K. Yu, and S. Young. Parameter estimation for agenda-based user simulation. In SIGDIAL, 2010.
- (2010) SIGDIAL
- Keizer, S.¹ Gašić, M.² Jurčíček, F.³ Mairesse, F.⁴ Thomson, B.⁵ Yu, K.⁶ Young, S.⁷

14
- 0030635367
- Learning dialogue strategies within the Markov decision process framework
- E. Levin, R. Pieraccini, and W. Eckert. Learning dialogue strategies within the Markov decision process framework. In ASRU, 1997.
- (1997) ASRU
- Levin, E.¹ Pieraccini, R.² Eckert, W.³

15
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- A. Y. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In ICML, 1999.
- (1999) ICML
- Ng, A.Y.¹ Harada, D.² Russell, S.³

16
- 84882945174
- Unsupervised clustering of probability distributions of semantic graphs for POMDP-based spoken dialogue systems with summary space
- F. Pinault and F. Lefèvre. Unsupervised clustering of probability distributions of semantic graphs for POMDP-based spoken dialogue systems with summary space. In IJCAI 7th Workshop on Knowledge and Reasoning in Practical Dialogue Systems, 2011.
- (2011) IJCAI 7th Workshop on Knowledge and Reasoning in Practical Dialogue Systems
- Pinault, F.¹ Lefèvre, F.²

17
- 52249090123
- Anytime point-based approximations for large POMDPs
- J. Pineau, G. Gordon, and S. Thrun. Anytime point-based approximations for large POMDPs. Journal of Artificial Intelligence Research, 27:335-380, 2006.
- (2006) Journal of Artificial Intelligence Research , vol.27 , pp. 335-380
- Pineau, J.¹ Gordon, G.² Thrun, S.³

18
- 84880768440
- A Bayesian approach to imitation in reinforcement learning
- B. Price and C. Boutilier. A Bayesian approach to imitation in reinforcement learning. In IJCAI, 2003.
- (2003) IJCAI
- Price, B.¹ Boutilier, C.²

19
- 84880707672
- Spoken dialogue management using probabilistic reasoning
- N. Roy, J. Pineau, and S. Thrun. Spoken dialogue management using probabilistic reasoning. In ACL, 2000.
- (2000) ACL
- Roy, N.¹ Pineau, J.² Thrun, S.³

20
- 33747607273
- A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies
- J. Schatzmann, K. Weilhammer, M. Stuttle, and S. Young. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. Knowledge Engineering Review, 21(2):97-126, June 2006.
- (2006) Knowledge Engineering Review , vol.21 , Issue.2 , pp. 97-126
- Schatzmann, J.¹ Weilhammer, K.² Stuttle, M.³ Young, S.⁴

21
- 84867332081
- Paralinguistics in speech and language - state-of-the-art and the challenge
- B. Schuller, S. Steidl, A. Batliner, F. Burkhardt, L. Devillers, C. Müller, and S. Narayanan. Paralinguistics in speech and language - state-of-the-art and the challenge. Computer Speech and Language (CSL), Special Issue on "Paralinguistics in Naturalistic Speech and Language", (1):4-39, Jan. 2012.
- (2012) Computer Speech and Language (CSL), Special Issue on "Paralinguistics in Naturalistic Speech and Language" , Issue.1 , pp. 4-39
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Burkhardt, F.⁴ Devillers, L.⁵ Müller, C.⁶ Narayanan, S.⁷

22
- 67549087536
- Reinforcement learning: An introduction
- R. S. Sutton and A. G. Barto. Reinforcement learning: An introduction. IEEE Transactions on Neural Networks, 9(5):1054-1054, 1998.
- (1998) IEEE Transactions on Neural Networks , vol.9 , Issue.5 , pp. 1054-1054
- Sutton, R.S.¹ Barto, A.G.²

23
- 84882998069
- On the role of tracking in stationary environments
- R. S. Sutton, A. Koop, and D. Silver. On the role of tracking in stationary environments. In ICML, 2007.
- (2007) ICML
- Sutton, R.S.¹ Koop, A.² Silver, D.³

24
- 77950862681
- Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
- B. Thomson and S. Young. Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems. Computer Speech and Language, 24(4):562-588, 2010.
- (2010) Computer Speech and Language , vol.24 , Issue.4 , pp. 562-588
- Thomson, B.¹ Young, S.²

25
- 9444259273
- The information state approach to dialogue management
- D. R. Traum and S. Larsson. The information state approach to dialogue management. In Current and New Directions in Discourse and Dialogue, volume 22 of Text, Speech and Language Technology, pages 325-353, 2003.
- (2003) Current and New Directions in Discourse and Dialogue , pp. 325-353
- Traum, D.R.¹ Larsson, S.²

26
- 61549132763
- Social signal processing: Survey of an emerging domain
- A. Vinciarelli, M. Pantic, and H. Bourlard. Social signal processing: Survey of an emerging domain. Image and Vision Computing, 27(12):1743-1759, 2009.
- (2009) Image and Vision Computing , vol.27 , Issue.12 , pp. 1743-1759
- Vinciarelli, A.¹ Pantic, M.² Bourlard, H.³

27
- 85065183198
- PARADISE: A framework for evaluating spoken dialogue agents
- M. A. Walker, D. J. Litman, C. A. Kamm, and A. Abella. PARADISE: A framework for evaluating spoken dialogue agents. In ACL, 1997.
- (1997) ACL
- Walker, M.A.¹ Litman, D.J.² Kamm, C.A.³ Abella, A.⁴

28
- 70349231178
- The hidden information state model: A practical framework for POMDP-based spoken dialogue management
- S. Young, M. Gašić, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson, and K. Yu. The hidden information state model: A practical framework for POMDP-based spoken dialogue management. Computer Speech and Language, 24(2):150-174, 2010.
- (2010) Computer Speech and Language , vol.24 , Issue.2 , pp. 150-174
- Young, S.¹ Gašić, M.² Keizer, S.³ Mairesse, F.⁴ Schatzmann, J.⁵ Thomson, B.⁶ Yu, K.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.