메뉴 건너뛰기




Volumn , Issue , 2007, Pages 276-283

Estimating the reliability of MDP policies: A confidence interval approach

Author keywords

[No Author keywords available]

Indexed keywords

CONFIDENCE INTERVAL; CONTROL POLICY; FEATURE SETS; HIGH RELIABILITY; MARKOV DECISION PROCESSES; STATE-SPACE;

EID: 84858400406     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (17)

References (12)
  • 5
    • 85135155957 scopus 로고    scopus 로고
    • A stochastic model of computer-human interaction for learning dialogues
    • E. Levin and R. Pieraccini. 1997. A stochastic model of computer-human interaction for learning dialogues. In Proc. of EUROSPEECH '97.
    • (1997) Proc. of EUROSPEECH '97
    • Levin, E.1    Pieraccini, R.2
  • 7
    • 84858417612 scopus 로고    scopus 로고
    • Construction of confidence intervals for neural networks based on least squares estimation
    • I. Rivals and L. Personnaz. 2002. Construction of confidence intervals for neural networks based on least squares estimation. In Neural Networks.
    • (2002) Neural Networks
    • Rivals, I.1    Personnaz, L.2
  • 11
    • 74049119541 scopus 로고    scopus 로고
    • Comparing the utility of state features in spoken dialogue using reinforcement learning
    • J. Tetreault and D. Litman. 2006. Comparing the utility of state features in spoken dialogue using reinforcement learning. In NAACL.
    • (2006) NAACL
    • Tetreault, J.1    Litman, D.2
  • 12
    • 14344279109 scopus 로고    scopus 로고
    • An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email
    • M. Walker. 2000. An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email. JAIR, 12.
    • (2000) JAIR , vol.12
    • Walker, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.