SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2012, Pages 4989-4992

Off-policy learning in large-scale POMDP-based dialogue systems

(3) Daubigney, Lucie a,c Geist, Matthieu a Pietquin, Olivier a,b

a SUPELEC (France)

b CNRS (France)

c LORIA (France)

Author keywords

Reinforcement Learning; Spoken Dialogue Systems

Indexed keywords

DIALOGUE SYSTEMS; GAUSSIAN PROCESSES; OPTIMAL POLICIES; OPTIMAL STRATEGIES; OPTIMISATIONS; PERCEPTRON; REAL-WORLD SYSTEM; SCALE-UP; SPOKEN DIALOGUE SYSTEM; STATE OF THE ART; VALUE FUNCTIONS;

LEARNING ALGORITHMS; OPTIMIZATION; REINFORCEMENT LEARNING; SIGNAL PROCESSING;

SPEECH PROCESSING;

EID: 84867619228 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6289040 Document Type: Conference Paper

Times cited : (18)

References (19)

1
- 0004102479
- The MIT press
- R.S. Sutton and A.G. Barto, Reinforcement learning: An introduction, The MIT press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 84898955256
- Reinforcement learning for spoken dialogue systems
- S. Singh, M. Kearns, D. Litman, and M.Walker, "Reinforcement learning for spoken dialogue systems," in Proc. NIPS'99, 1999.
- Proc. NIPS'99, 1999
- Singh, S.¹ Kearns, M.² Litman, D.³ Walker, M.⁴

3
- 84867590498
- A stochastic model of human-machine interaction for learning dialog strategies
- E. Levin, R. Pieraccini, and W. Eckert, "A stochastic model of human-machine interaction for learning dialog strategies," IEEE TSAP, 2000.
- (2000) IEEE TSAP
- Levin, E.¹ Pieraccini, R.² Eckert, W.³

4
- 84867603402
- A probabilistic framework for dialog simulation and optimal strategy learning
- O. Pietquin and T. Dutoit, "A probabilistic framework for dialog simulation and optimal strategy learning," IEEE TSAP, 2006.
- (2006) IEEE TSAP
- Pietquin, O.¹ Dutoit, T.²

5
- 70349231178
- The hidden information state model: A practical framework for POMDP-based spoken dialogue management
- S. Young, M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson, and K. Yu, "The hidden information state model: A practical framework for POMDP-based spoken dialogue management," Computer Speech & Language, 2010.
- (2010) Computer Speech & Language
- Young, S.¹ Gasic, M.² Keizer, S.³ Mairesse, F.⁴ Schatzmann, J.⁵ Thomson, B.⁶ Yu, K.⁷

6
- 34247587828
- User modeling for spoken dialogue system evaluation
- W. Eckert, E. Levin, and R. Pieraccini, "User modeling for spoken dialogue system evaluation," in Proc. ASRU'97, December 1997.
- Proc. ASRU'97, December 1997
- Eckert, W.¹ Levin, E.² Pieraccini, R.³

7
- 33747607273
- A survey of statistical user simulation techniques for rl of dialogue management strategies
- J. Schatzmann, K. Weilhammer, M. Stuttle, and S. Young, "A survey of statistical user simulation techniques for rl of dialogue management strategies," The Knowledge Engineering Review, 2006.
- (2006) The Knowledge Engineering Review
- Schatzmann, J.¹ Weilhammer, K.² Stuttle, M.³ Young, S.⁴

8
- 33846257740
- Effects of the user model on simulation-based learning of dialogue strategies
- J. Schatzmann, M. N. Stuttle, K. Weilhammer, and S. Young, "Effects of the user model on simulation-based learning of dialogue strategies," in Proc. of ASRU'05, 2005.
- Proc. of ASRU'05, 2005
- Schatzmann, J.¹ Stuttle, M.N.² Weilhammer, K.³ Young, S.⁴

9
- 84857755225
- Gaussian processes for fast policy optimisation of POMDPbased dialogue managers
- M. Gašić, F. Jurčíček, S. Keizer, F. Mairesse, B. Thomson, K. Yu, and S. Young, "Gaussian processes for fast policy optimisation of POMDPbased dialogue managers," in Proc. of SIGDIAL 11, 2010.
- Proc. of SIGDIAL 11, 2010
- Gašić, M.¹ Jurčíček, F.² Keizer, S.³ Mairesse, F.⁴ Thomson, B.⁵ Yu, K.⁶ Young, S.⁷

10
- 79959813974
- Natural Belief-Critic: A reinforcement algorithm for parameter estimation in statistical spoken dialogue systems
- F. Jurcicek, B. Thomson, S. Keizer, M. Gasic, F. Mairesse, K. Yu, and S. Young, "Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems," in Interspeech' 10, 2010.
- (2010) Interspeech' 10
- Jurcicek, F.¹ Thomson, B.² Keizer, S.³ Gasic, M.⁴ Mairesse, F.⁵ Yu, K.⁶ Young, S.⁷

11
- 33846220727
- Scaling up POMDPs for dialogue management: The summary POMDP method
- J. Williams and S. Young, "Scaling up POMDPs for dialogue management: the summary POMDP method," in Proc. of ASRU, 2005.
- Proc. of ASRU, 2005.
- Williams, J.¹ Young, S.²

12
- 78651465938
- Kalman Temporal Differences
- M. Geist and O. Pietquin, "Kalman Temporal Differences," JAIR, 2010.
- (2010) JAIR
- Geist, M.¹ Pietquin, O.²

13
- 84881039547
- Sample Efficient Online Learning of Optimal Dialogue Policies with Kalman Temporal Differences
- O. Pietquin, M. Geist, and S. Chandramohan, "Sample Efficient Online Learning of Optimal Dialogue Policies with Kalman Temporal Differences," in Proc. of IJCAI 2011, 2011.
- (2011) Proc. of IJCAI 2011
- Pietquin, O.¹ Geist, M.² Chandramohan, S.³

14
- 33750703175
- Partially observable Markov decision processes for spoken dialog systems
- J. Williams and S. Young, "Partially observable Markov decision processes for spoken dialog systems," Comp. Speech and Language, 2007.
- (2007) Comp. Speech and Language
- Williams, J.¹ Young, S.²

15
- 85048464801
- Agenda-based user simulation for bootstrapping a pomdp dialogue system
- J. Schatzmann, B. Thomson, K. Weilhammer, H. Ye, and S. Young, "Agenda-based user simulation for bootstrapping a pomdp dialogue system.," in HLT/NAACL 2007, 2007.
- (2007) HLT/NAACL 2007
- Schatzmann, J.¹ Thomson, B.² Weilhammer, K.³ Ye, H.⁴ Young, S.⁵

16
- 84867601978
- Managing Uncertainty within the KTD Framework
- JMLR C&WP
- M. Geist and O. Pietquin, "Managing Uncertainty within the KTD Framework," in Proc. of the AL&E workshop, 2011, JMLR C&WP.
- Proc. of the AL&E Workshop, 2011
- Geist, M.¹ Pietquin, O.²

17
- 84865703906
- Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
- L. Daubigney, M. Gasic, S. Chandramohan, M. Geist, O. Pietquin, and S. Young, "Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system," in Proc. of Interspeech 2011, 2011.
- (2011) Proc. of Interspeech 2011
- Daubigney, L.¹ Gasic, M.² Chandramohan, S.³ Geist, M.⁴ Pietquin, O.⁵ Young, S.⁶

18
- 71149109483
- Near-Bayesian Exploration in Polynomial Time
- J. Z. Kolter and A. Y. Ng, "Near-Bayesian Exploration in Polynomial Time," in Proc. of ICML 09, 2009.
- Proc. of ICML 09, 2009
- Kolter, J.Z.¹ Ng, A.Y.²

19
- 31844451013
- Reinforcement Learning with Gaussian Processes
- Y. Engel, S. Mannor, and R. Meir, "Reinforcement Learning with Gaussian Processes," in Proc of ICML 05, 2005.
- Proc of ICML 05, 2005
- Engel, Y.¹ Mannor, S.² Meir, R.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.