메뉴 건너뛰기




Volumn , Issue , 2002, Pages

Predictive representations of state

Author keywords

[No Author keywords available]

Indexed keywords

DYNAMICAL SYSTEMS; MARKOV PROCESSES;

EID: 84898982129     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (281)

References (12)
  • 1
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • Baum, L. E., Petrie, T., Soules, G., & Weiss, N. (1970). A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics, 41, 164-171.
    • (1970) Annals of Mathematical Statistics , vol.41 , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 3
    • 0026998041 scopus 로고
    • Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
    • San Jose, California: AAAI Press
    • Chrisman, L. (1992). Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. Proceedings of the Tenth National Conference on Artificial Intelligence (pp. 183-188). San Jose, California: AAAI Press.
    • (1992) Proceedings of the Tenth National Conference on Artificial Intelligence , pp. 183-188
    • Chrisman, L.1
  • 4
    • 0034198996 scopus 로고    scopus 로고
    • Observable operator models for discrete stochastic time series
    • Jaeger, H. (2000). Observable operator models for discrete stochastic time series. Neural Computation, 12, 1371-1398.
    • (2000) Neural Computation , vol.12 , pp. 1371-1398
    • Jaeger, H.1
  • 5
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 6
    • 0002679852 scopus 로고
    • A survey of algorithmic methods for partially observable Markov decision processes
    • Lovejoy, W. S. (1991). A survey of algorithmic methods for partially observable Markov decision processes. Annals of Operations Research, 28, 47-65.
    • (1991) Annals of Operations Research , vol.28 , pp. 47-65
    • Lovejoy, W.S.1
  • 8
    • 0028430130 scopus 로고
    • Diversity-based inference of finite automata
    • Rivest, R. L., & Schapire, R. E. (1994). Diversity-based inference of finite automata. Journal of the ACM, 41, 555-589.
    • (1994) Journal of the ACM , vol.41 , pp. 555-589
    • Rivest, R.L.1    Schapire, R.E.2
  • 11
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 181-211.
    • (1999) Artificial Intelligence , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.