메뉴 건너뛰기




Volumn , Issue , 1995, Pages 1080-1086

Reinforcement Learning by Probability Matching

Author keywords

[No Author keywords available]

Indexed keywords

NETWORK ARCHITECTURE; PROBABILITY DISTRIBUTIONS;

EID: 85156273296     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (14)

References (5)
  • 3
    • 0003617454 scopus 로고
    • PhD Thesis, Dept. of Computer and Information Science, University of Massachusetts, Amherst, MA
    • Sutton, R. S. (1984). Temporal credit assignment in reinforcement learning. PhD Thesis, Dept. of Computer and Information Science, University of Massachusetts, Amherst, MA.
    • (1984) Temporal credit assignment in reinforcement learning
    • Sutton, R. S.1
  • 4
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams, R. J. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R. J.1
  • 5
    • 0041154467 scopus 로고
    • Function optimization using connectionist reinforcement learning algorithms
    • Williams, R. J. and Peng, J. (1991). Function optimization using connectionist reinforcement learning algorithms. Connection Science, 3:241-268.
    • (1991) Connection Science , vol.3 , pp. 241-268
    • Williams, R. J.1    Peng, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.