SCOPUS Information Retrieval Platform

Computer Speech and Language

Volume 26, Issue 3, 2012, Pages 168-192

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

(3) Jurčíček, Filip (a); Thomson, Blaise (a); Young, Steve (a)

(a) UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

Dialogue management; POMDP; Reinforcement learning; Spoken dialogue systems

Indexed keywords

DESIGN CONSIDERATIONS; DIALOGUE MANAGEMENT; DIALOGUE MODELS; DIALOGUE SYSTEMS; INFORMATION DOMAINS; MODEL PARAMETERS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; POMDP; REINFORCEMENT ALGORITHMS; REINFORCEMENT TECHNIQUE; REWARD FUNCTION; SPOKEN DIALOGUE SYSTEM;

BEHAVIORAL RESEARCH; LEARNING ALGORITHMS; MARKOV PROCESSES; MATHEMATICAL MODELS; REINFORCEMENT; REINFORCEMENT LEARNING; SPEECH PROCESSING;

PARAMETER ESTIMATION;

EID: 84855300452 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2011.09.004 Document Type: Article

Times cited: (48)

References (40)

1
- 14344253499
- Ph.D. Thesis, Australian National University
- Aberdeen, D.A., 2003. Policy-gradient algorithms for partially observable Markov decision processes. Ph.D. Thesis, Australian National University.
- (2003) Policy-gradient Algorithms for Partially Observable Markov Decision Processes
- Aberdeen, D.A.¹

2
- 0000396062
- Natural Gradient Works Efficiently in Learning
- Amari S. Natural gradient works efficiently in learning Neural Computation 10 2 1998 251 276 (Pubitemid 128463152)
- (1998) Neural Computation , vol.10 , Issue.2 , pp. 251-276
- Amari, S.-I.¹

3
- 33846516584
- Springer
- Bishop C. Pattern Recognition and Machine Learning 2006 Springer
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.¹

4
- 85079986206
- Black A., and Lenzo K. Flite: A Small, Fast Speech Synthesis Engine 2008 http://www.speech.cs.cmu.edu/flite/index.html
- (2008) Flite: A Small, Fast Speech Synthesis Engine
- Black, A.¹ Lenzo, K.²

5
- 34548212336
- Efficient model learning for dialog management
- DOI 10.1145/1228716.1228726, HRI 2007 - Proceedings of the 2007 ACM/IEEE Conference on Human-Robot Interaction - Robot as Team Member
- Doshi F., and Roy N. Efficient model learning for dialog management HRI'07: Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction 2007 ACM New York, NY, USA 65 72 http://doi.acm.org/10.1145/1228716.1228726 (Pubitemid 47327128)
- (2007) HRI 2007 - Proceedings of the 2007 ACM/IEEE Conference on Human-Robot Interaction - Robot as Team Member , pp. 65-72
- Doshi, F.¹ Roy, N.²

6
- 56749163138
- Spoken language interaction with model uncertainty: An adaptive human robot interaction system
- 10.1080/09540090802413145
- Doshi F., and Roy N. Spoken language interaction with model uncertainty: an adaptive human robot interaction system Connection Science 20 4 2008 299 318 10.1080/09540090802413145
- (2008) Connection Science , vol.20 , Issue.4 , pp. 299-318
- Doshi, F.¹ Roy, N.²

7
- 77958539351
- The infinite partially observable Markov decision process
- Bengio Y. Schuurmans D. Lafferty J. Williams C.K.I. Culotta A.
- Doshi-Velez F. The infinite partially observable Markov decision process Bengio Y. Schuurmans D. Lafferty J. Williams C.K.I. Culotta A. Advances in Neural Information Processing Systems, vol. 22 2009 477 485
- (2009) Advances in Neural Information Processing Systems, Vol. 22 , pp. 477-485
- Doshi-Velez, F.¹

8
- 2942598511
- Evaluation and usability of multimodal spoken language dialogue systems
- 10.1016/j.specom.2004.02.001
- Dybkjaer L., Bernsen N.O., and Minker W. Evaluation and usability of multimodal spoken language dialogue systems Speech Communication 43 1-2 2004 33 54 10.1016/j.specom.2004.02.001
- (2004) Speech Communication , vol.43 , Issue.1-2 , pp. 33-54
- Dybkjaer, L.¹ Bernsen, N.O.² Minker, W.³

9
- 48749101568
- Automatic annotation of COMMUNICATOR dialogue data for learning dialogue strategies and user simulations
- Georgila K., Lemon O., and Henderson J. Automatic annotation of COMMUNICATOR dialogue data for learning dialogue strategies and user simulations Ninth Workshop on the Semantics and Pragmatics of Dialogue 2005
- (2005) Ninth Workshop on the Semantics and Pragmatics of Dialogue
- Georgila, K.¹ Lemon, O.² Henderson, J.³

10
- 84942484786
- Ridge regression: Biased estimation for nonorthogonal problems
- Hoerl A.E., and Kennard R.W. Ridge regression: biased estimation for nonorthogonal problems Technometrics 12 1970 55 67
- (1970) Technometrics , vol.12 , pp. 55-67
- Hoerl, A.E.¹ Kennard, R.W.²

11
- 84865718200
- Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk
- Jurčíček F., Keizer S., Gašić M., Mairesse F., Thomson B., Yu K., and Young S. Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk Proc. Interspeech 2011
- (2011) Proc. Interspeech
- Jurčíček, F.¹ Keizer, S.² Gašić, M.³ Mairesse, F.⁴ Thomson, B.⁵ Yu, K.⁶ Young, S.⁷

12
- 79959813974
- Natural Belief-Critic: A reinforcement algorithm for parameter estimation in statistical spoken dialogue systems
- Kobayashi T. Hirose K. Nakamura S.
- Jurčíček F., Thomson B., Keizer S., Mairesse F., Gašić M., Yu K., and Young S. Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems Kobayashi T. Hirose K. Nakamura S. Proc. Interspeech. ISCA 2010 90 93
- (2010) Proc. Interspeech. ISCA , pp. 90-93
- Jurčíček, F.¹ Thomson, B.² Keizer, S.³ Mairesse, F.⁴ Gašić, M.⁵ Yu, K.⁶ Young, S.⁷

13
- 80052051092
- Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as pomdps
- JUNE 6:1-6:26.
- Jurčíček F., Thomson B., and Young S. Natural actor and belief critic: reinforcement algorithm for learning parameters of dialogue systems modelled as pomdps ACM Transactions on Speech and Language Processing 7 June 2011 6:1-6:26. http://doi.acm.org/10.1145/1966407.1966411
- (2011) ACM Transactions on Speech and Language Processing , vol.7
- Jurčíček, F.¹ Thomson, B.² Young, S.³

14
- 84867191183
- Effects of user modeling on POMDP-based dialogue systems
- Kim D., Sim H.S., Kim K., Kim J.H., Kim H., and Sung J.W. Effects of user modeling on POMDP-based dialogue systems Proceedings of Interspeech 2008
- (2008) Proceedings of Interspeech
- Kim, D.¹ Sim, H.S.² Kim, K.³ Kim, J.H.⁴ Kim, H.⁵ Sung, J.W.⁶

15
- 84898938510
- Actor-critic algorithms
- Konda V., and Tsitsiklis J. Actor-critic algorithms Advances in Neural Information Processing Systems 12 2000 1008 1014
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1008-1014
- Konda, V.¹ Tsitsiklis, J.²

16
- 33646413135
- Natural Actor-Critic
- Springer
- Peters J., Vijayakumar S., and Schaal S. Natural Actor-Critic European Conference on Machine Learning (ECML) 2005 Springer 280 291
- (2005) European Conference on Machine Learning (ECML) , pp. 280-291
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

17
- 80051605697
- Bayesian reinforcement learning for pomdp-based dialogue systems
- Png S., and Pineau J. Bayesian reinforcement learning for pomdp-based dialogue systems ICASSP'11: International Conference on Acoustics, Speech and Signal Processing 2011
- (2011) ICASSP'11: International Conference on Acoustics, Speech and Signal Processing
- Png, S.¹ Pineau, J.²

18
- 84880707672
- Spoken dialogue management using probabilistic reasoning
- Association for Computational Linguistics Morristown, NJ, USA
- Roy N., Pineau J., and Thrun S. Spoken dialogue management using probabilistic reasoning ACL'00: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics 2000 Association for Computational Linguistics Morristown, NJ, USA 93 100 http://dx.doi.org/10.3115/1075218.1075231
- (2000) ACL'00: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics , pp. 93-100
- Roy, N.¹ Pineau, J.² Thrun, S.³

19
- 77950862049
- Ph.D. Thesis, University of Cambridge
- Schatzmann, J., 2008. Statistical user modeling for dialogue systems. Ph.D. Thesis, University of Cambridge.
- (2008) Statistical User Modeling for Dialogue Systems
- Schatzmann, J.¹

20
- 33846257740
- Effects of the user model on simulation-based learning of dialogue strategies
- Schatzmann J., Stuttle M.N., Weilhammer K., and Young S. Effects of the user model on simulation-based learning of dialogue strategies IEEE ASRU'05: Proc. IEEE Workshop Automatic Speech Recognition and Understanding 2005
- (2005) IEEE ASRU'05: Proc. IEEE Workshop Automatic Speech Recognition and Understanding
- Schatzmann, J.¹ Stuttle, M.N.² Weilhammer, K.³ Young, S.⁴

21
- 0004102479
- MIT Press Cambridge, MA
- Sutton R., and Barto A. Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning 1998 MIT Press Cambridge, MA
- (1998) Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning
- Sutton, R.¹ Barto, A.²

22
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- MIT Press
- Sutton R., McAllester D., Singh S., and Mansour Y. Policy gradient methods for reinforcement learning with function approximation Advances in Neural Information Processing Systems, vol. 12 2000 MIT Press 1057 1063
- (2000) Advances in Neural Information Processing Systems, Vol. 12 , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

23
- 79959834356
- Using automatically transcribed dialogs to learn user models in a spoken dialog system
- Morristown, USA
- Syed U., and Williams J.D. Using automatically transcribed dialogs to learn user models in a spoken dialog system HLT Morristown, USA 2008 121 124
- (2008) HLT , pp. 121-124
- Syed, U.¹ Williams, J.D.²

24
- 79951783485
- Ph.D. Thesis, University of Cambridge
- Thomson, B., 2010. Statistical methods for spoken dialogue management. Ph.D. Thesis, University of Cambridge.
- (2010) Statistical Methods for Spoken Dialogue Management
- Thomson, B.¹

25
- 84867194827
- User study of the Bayesian Update of Dialogue State approach to dialogue management
- Brisbane, Australia
- Thomson B., Gašić M., Keizer S., Mairesse F., Schatzmann J., Yu K., and Young S. User study of the Bayesian Update of Dialogue State approach to dialogue management. Interspeech 2008 Brisbane, Australia 2008
- (2008) Interspeech 2008
- Thomson, B.¹ Gašić, M.² Keizer, S.³ Mairesse, F.⁴ Schatzmann, J.⁵ Yu, K.⁶ Young, S.⁷

26
- 79951792262
- Parameter learning for POMDP spoken dialogue models
- 10.1109/SLT.2010.5700863
- Thomson B., Jurčíček F., Gašić M., Keizer S., Mairesse F., Yu K., and Young S. Parameter learning for POMDP spoken dialogue models IEEE SLT '10: Spoken Language Technology Workshop 2010 271 276 10.1109/SLT.2010.5700863
- (2010) IEEE SLT '10: Spoken Language Technology Workshop , pp. 271-276
- Thomson, B.¹ Jurčíček, F.² Gašić, M.³ Keizer, S.⁴ Mairesse, F.⁵ Yu, K.⁶ Young, S.⁷

27
- 51449096257
- Bayesian update of dialogue state for robust dialogue systems
- Las Vegas
- Thomson B., Schatzmann J., and Young S. Bayesian update of dialogue state for robust dialogue systems Int Conf Acoustics Speech and Signal Processing ICASSP Las Vegas 2008
- (2008) Int Conf Acoustics Speech and Signal Processing ICASSP
- Thomson, B.¹ Schatzmann, J.² Young, S.³

28
- 77950862681
- Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
- Thomson B., and Young S. Bayesian update of dialogue state: a POMDP framework for spoken dialogue systems Computer Speech and Language 24 4 2010 562 588
- (2010) Computer Speech and Language , vol.24 , Issue.4 , pp. 562-588
- Thomson, B.¹ Young, S.²

29
- 84979592723
- Evaluating interactive dialogue systems: Extending component evaluation to integrated system evaluation
- Stroudsburg, PA, USA
- Walker M.A., Litman D.J., Kamm C.A., and Abella A. Evaluating interactive dialogue systems: extending component evaluation to integrated system evaluation Interactive Spoken Dialog Systems on Bringing Speech and NLP Together in Real Applications. ISDS'97. Association for Computational Linguistics Stroudsburg, PA, USA 1997 1 8
- (1997) Interactive Spoken Dialog Systems on Bringing Speech and NLP Together in Real Applications. ISDS'97. Association for Computational Linguistics , pp. 1-8
- Walker, M.A.¹ Litman, D.J.² Kamm, C.A.³ Abella, A.⁴

30
- 0026366683
- Understanding spontaneous speech
- Toronto, Canada
- Ward W. Understanding spontaneous speech Proc. Int. Conf. Acoustics, Speech and Signal Processing Toronto, Canada 1991 365 368
- (1991) Proc. Int. Conf. Acoustics, Speech and Signal Processing , pp. 365-368
- Ward, W.¹

31
- 77957283019
- Recurrent policy gradients
- Wierstra D., Förster A., Peters J., and Schmidhuber J. Recurrent policy gradients Logic Journal of the IGPL 18 5 2010 620 634
- (2010) Logic Journal of the IGPL , vol.18 , Issue.5 , pp. 620-634
- Wierstra, D.¹ Förster, A.² Peters, J.³ Schmidhuber, J.⁴

32
- 66149160386
- Integrating expert knowledge into POMDP optimization for spoken dialog systems
- Williams J.D. Integrating expert knowledge into POMDP optimization for spoken dialog systems Proceedings of the AAAI-08 Workshop on Advancements in POMDP Solvers 2008
- (2008) Proceedings of the AAAI-08 Workshop on Advancements in POMDP Solvers
- Williams, J.D.¹

33
- 33846220727
- Scaling up POMDPs for dialog management: The summary POMDP method
- Cancun, Mexico
- Williams J.D., and Young S. Scaling up POMDPs for dialog management: the summary POMDP method. IEEE ASRU'05: Proc. IEEE Workshop Automatic Speech Recognition and Understanding Cancun, Mexico November 2005
- (2005) IEEE ASRU'05: Proc. IEEE Workshop Automatic Speech Recognition and Understanding
- Williams, J.D.¹ Young, S.²

34
- 33750703175
- Partially observable Markov decision processes for spoken dialog systems
- DOI 10.1016/j.csl.2006.06.008, PII S0885230806000283
- Williams J.D., and Young S. Partially observable Markov decision processes for spoken dialog systems Computer Speech and Language 21 2 2007 393 422 (Pubitemid 44709839)
- (2007) Computer Speech and Language , vol.21 , Issue.2 , pp. 393-422
- Williams, J.D.¹ Young, S.²

35
- 52949143575
- Scaling POMDPs for spoken dialog management
- Williams J.D., and Young S. Scaling POMDPs for spoken dialog management IEEE Audio, Speech and Language Processing 15 7 2007 2116 2129
- (2007) IEEE Audio, Speech and Language Processing , vol.15 , Issue.7 , pp. 2116-2129
- Williams, J.D.¹ Young, S.²

36
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning Machine Learning 8 1992 229 256
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.J.¹

37
- 85009291577
- Talking to Machines (Statistically speaking)
- Denver, CO
- Young S. Talking to Machines (Statistically speaking) Int. Conf. Spoken Language Processing Denver, CO 2002
- (2002) Int. Conf. Spoken Language Processing
- Young, S.¹

38
- 33745208895
- Young S. ATK: An Application Toolkit for HTK 2005 http://mi.eng.cam.ac.uk/research/dialogue/atk-home
- (2005) ATK: An Application Toolkit for HTK
- Young, S.¹

39
- 78549277875
- Tech. rep., Cambridge University Engineering Dept
- Young, S., 2007. CUED Standard Dialogue Acts. Tech. rep., Cambridge University Engineering Dept. http://mi.eng.cam.ac.uk/research/dialogue/LocalDocs/dastd.pdf.
- (2007) CUED Standard Dialogue Acts
- Young, S.¹

40
- 70349231178
- The Hidden Information State Model: A practical framework for POMDP-based spoken dialogue management
- Young S., Gašić M., Keizer S., Mairesse F., Schatzmann J., Thomson B., and Yu K. The Hidden Information State Model: a practical framework for POMDP-based spoken dialogue management Computer Speech and Language 24 2 2010 150 174
- (2010) Computer Speech and Language , vol.24 , Issue.2 , pp. 150-174
- Young, S.¹ Gašić, M.² Keizer, S.³ Mairesse, F.⁴ Schatzmann, J.⁵ Thomson, B.⁶ Yu, K.⁷

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.