SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2009, Pages 2475-2478

Reinforcement learning for dialog management using least-squares policy iteration and fast feature selection

(3) Li, Lihong a Williams, Jason D b Balakrishnan, Suhrid b

a RUTGERS UNIVERSITY (United States)

b AT AND T LABS RESEARCH (United States)

Author keywords

Dialog management; Partially observable Markov decision processes; Spoken dialog systems

Indexed keywords

DIALOG MANAGEMENT; DIALOG SYSTEMS; FEATURE SELECTION; LEAST SQUARE; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; POLICY ITERATION; SPOKEN DIALOG SYSTEMS;

DECISION MAKING; FEATURE EXTRACTION; HUMAN COMPUTER INTERACTION; MARKOV PROCESSES; REINFORCEMENT; REINFORCEMENT LEARNING; SPEECH COMMUNICATION;

LEARNING ALGORITHMS;

EID: 70450186275 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (37)

References (15)

1
- 0033894474
- A stochastic model of human-machine interaction for learning dialog strategies
- E. Levin, R. Pieraccini, and W. Eckert, "A stochastic model of human-machine interaction for learning dialog strategies," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 11-23, 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.1 , pp. 11-23
- Levin, E.¹ Pieraccini, R.² Eckert, W.³

2
- 84880707672
- Spoken dialogue management using probabilistic reasoning
- N. Roy, J. Pineau, and S. Thrun, "Spoken dialogue management using probabilistic reasoning," in ACL, 2000, pp. 93-100.
- (2000) ACL , pp. 93-100
- Roy, N.¹ Pineau, J.² Thrun, S.³

3
- 51449120317
- Hybrid reinforcement/ supervised learning of dialogue policies from fixed data sets
- J. Henderson, O. Lemon, and K. Georgila, "Hybrid reinforcement/ supervised learning of dialogue policies from fixed data sets," Computational Linguistics, vol. 34, no. 4, pp. 487-511, 2008.
- (2008) Computational Linguistics , vol.34 , Issue.4 , pp. 487-511
- Henderson, J.¹ Lemon, O.² Georgila, K.³

4
- 51449096257
- Bayesian update of dialogue state for robust dialogue systems
- B. Thomson, J. Schatzmann, and S. Young, "Bayesian update of dialogue state for robust dialogue systems," in ICASSP, 2008.
- (2008) ICASSP
- Thomson, B.¹ Schatzmann, J.² Young, S.³

5
- 66149160386
- Integrating expert knowledge into POMDP optimization for spoken dialog systems
- J. D. Williams, "Integrating expert knowledge into POMDP optimization for spoken dialog systems," in AAAI-08 Workshop on Advancements in POMDP Solvers, 2008.
- (2008) AAAI-08 Workshop on Advancements in POMDP Solvers
- Williams, J.D.¹

6
- 34547982545
- Analyzing feature generation for value-function approximation
- R. Parr, C. Painter-Wakefield, L. Li, and M. L. Littman, "Analyzing feature generation for value-function approximation," in Int. Conf. Mach. Learning, 2007, pp. 737-744.
- (2007) Int. Conf. Mach. Learning , pp. 737-744
- Parr, R.¹ Painter-Wakefield, C.² Li, L.³ Littman, M.L.⁴

7
- 35748957806
- Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes
- S. Mahadevan and M. Maggioni, "Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes," Journall of Machine Learning Research, vol. 8, pp. 2169-2231, 2007.
- (2007) Journall of Machine Learning Research , vol.8 , pp. 2169-2231
- Mahadevan, S.¹ Maggioni, M.²

8
- 4644323293
- Least-squares policy iteration
- M. G. Lagoudakis and R. Parr, "Least-squares policy iteration," Journall of Machine Learning Research, vol. 4, pp. 1107-1149, 2003.
- (2003) Journall of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

9
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

10
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

11
- 70450138370
- J. D. Williams, Demonstration of a POMDP voice dialer, In ACL/HLT, 2008.
- J. D. Williams, "Demonstration of a POMDP voice dialer," In ACL/HLT, 2008.

12
- 70450127475
- The best of both worlds: Unifying conventional dialog systems and pomdps
- -, "The best of both worlds: Unifying conventional dialog systems and pomdps," in ICSLP-08, 2008.
- (2008) ICSLP-08

13
- 14344279109
- An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email
- M. Walker, "An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email," Journal of Articial Intelligence Research, vol. 12, pp. 387-416, 2000.
- (2000) Journal of Articial Intelligence Research , vol.12 , pp. 387-416
- Walker, M.¹

14
- 84859906764
- Automatic learning and evaluation of user-centered objective functions for dialogue system optimisation
- V. Rieser and O. Lemon, "Automatic learning and evaluation of user-centered objective functions for dialogue system optimisation," in Proc LREC, Marrakech, 2008.
- (2008) Proc LREC, Marrakech
- Rieser, V.¹ Lemon, O.²

15
- 51449123233
- Using dialogue acts to learn better repair strategies for spoken dialogue systems
- Las Vegas
- M. Frampton and O. Lemon, "Using dialogue acts to learn better repair strategies for spoken dialogue systems," in Proc ICASSP, Las Vegas, 2008.
- (2008) Proc ICASSP
- Frampton, M.¹ Lemon, O.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.