SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2007, Pages 276-283

Estimating the reliability of MDP policies: A confidence interval approach

Author keywords

[No Author keywords available]

Indexed keywords

CONFIDENCE INTERVAL; CONTROL POLICY; FEATURE SETS; HIGH RELIABILITY; MARKOV DECISION PROCESSES; STATE-SPACE;

COMPUTATIONAL LINGUISTICS; FEATURE EXTRACTION; MARKOV PROCESSES; REINFORCEMENT LEARNING; RELIABILITY;

FINANCIAL DATA PROCESSING;

EID: 84858400406 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (17)

References (12)

1
- 85105809948
- Inductive learning algorithms and representations for text categorization
- S. Dumais, J. Platt, D. Heckerman, and M. Sahami. 1998. Inductive learning algorithms and representations for text categorization. In Conference on Information and Knowledge Management.
- (1998) Conference on Information and Knowledge Management
- Dumais, S.¹ Platt, J.² Heckerman, D.³ Sahami, M.⁴

2
- 33847636990
- Reinforcement learning of dialogue strategies using the user's last dialogue act
- M. Frampton and O. Lemon. 2005. Reinforcement learning of dialogue strategies using the user's last dialogue act. In IJCAI Wkshp. on K&R in Practical Dialogue Systems.
- (2005) IJCAI Wkshp. on K&R in Practical Dialogue Systems
- Frampton, M.¹ Lemon, O.²

3
- 33751381449
- Hybrid reinforcement/supervised learning for dialogue policies from communicator data
- J. Henderson, O. Lemon, and K. Georgila. 2005. Hybrid reinforcement/supervised learning for dialogue policies from communicator data. In IJCAI Wkshp. on K&R in Practical Dialogue Systems.
- (2005) IJCAI Wkshp. on K&R in Practical Dialogue Systems
- Henderson, J.¹ Lemon, O.² Georgila, K.³

4
- 39649090194
- Active learning in partially observable Markov decision processes
- R. Jaulmes, J. Pineau, and D. Precup. 2005. Active learning in partially observable markov decision processes. In European Conference on Machine Learning.
- (2005) European Conference on Machine Learning
- Jaulmes, R.¹ Pineau, J.² Precup, D.³

5
- 85135155957
- A stochastic model of computer-human interaction for learning dialogues
- E. Levin and R. Pieraccini. 1997. A stochastic model of computer-human interaction for learning dialogues. In Proc. of EUROSPEECH '97.
- (1997) Proc. of EUROSPEECH '97
- Levin, E.¹ Pieraccini, R.²

6
- 74049153470
- The Markov assumption in spoken dialogue management
- T. Paek and D. Chickering. 2005. The markov assumption in spoken dialogue management. In 6th SIGDial Workshop on Discourse and Dialogue.
- (2005) 6th SIGDial Workshop on Discourse and Dialogue
- Paek, T.¹ Chickering, D.²

7
- 84858417612
- Construction of confidence intervals for neural networks based on least squares estimation
- I. Rivals and L. Personnaz. 2002. Construction of confidence intervals for neural networks based on least squares estimation. In Neural Networks.
- (2002) Neural Networks
- Rivals, I.¹ Personnaz, L.²

8
- 0037806811
- The boosting approach to machine learning: An overview
- R. Schapire. 2002. The boosting approach to machine learning: An overview. In MSRI Workshop on Nonlinear Estimation and Classification.
- (2002) MSRI Workshop on Nonlinear Estimation and Classification
- Schapire, R.¹

9
- 84898955256
- Reinforcement learning for spoken dialogue systems
- S. Singh, M. Kearns, D. Litman, and M. Walker. 1999. Reinforcement learning for spoken dialogue systems. In Proc. NIPS '99.
- (1999) Proc. NIPS '99
- Singh, S.¹ Kearns, M.² Litman, D.³ Walker, M.⁴

10
- 0004007508
- The MIT Press
- R. Sutton and A. Barto. 1998. Reinforcement Learning. The MIT Press.
- (1998) Reinforcement Learning
- Sutton, R.¹ Barto, A.²

11
- 74049119541
- Comparing the utility of state features in spoken dialogue using reinforcement learning
- J. Tetreault and D. Litman. 2006. Comparing the utility of state features in spoken dialogue using reinforcement learning. In NAACL.
- (2006) NAACL
- Tetreault, J.¹ Litman, D.²

12
- 14344279109
- An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email
- M. Walker. 2000. An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email. JAIR, 12.
- (2000) JAIR , vol.12
- Walker, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.