Volume 2006, 2006, Pages 217-224

Dealing with non-stationary environments using context detection

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MATHEMATICAL MODELS; PROBLEM SOLVING; SIGNAL FILTERING AND PREDICTION

EID: 33749262176     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 55

References (9)
  • 2
    • 33749246501
    • Hidden-mode Markov decision processes for nonstationary sequential decision making
    • London, UK: Springer-Verlag
    • Choi, S. P. M., Yeung, D.-Y., & Zhang, N. L. (2001). Hidden-mode Markov decision processes for nonstationary sequential decision making. Sequence Learning - Paradigms, Algorithms, and Applications (pp. 264-287). London, UK: Springer-Verlag.
    • (2001) Sequence Learning - Paradigms, Algorithms, and Applications , pp. 264-287
    • Choi, S.P.M.1    Yeung, D.-Y.2    Zhang, N.L.3
  • 5
    • 0027684215
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103-130.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.W.1    Atkeson, C.G.2
  • 9
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., & Singh, S. P. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.