SCOPUS 정보 검색 플랫폼

NIPS 1995: Proceedings of the 8th International Conference on Neural Information Processing Systems

Volumn , Issue , 1995, Pages 1080-1086

Reinforcement Learning by Probability Matching

(2) Sabes, Philip N a Jordan, Michael I a

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

NETWORK ARCHITECTURE; PROBABILITY DISTRIBUTIONS;

ASSOCIATIVE REINFORCEMENT LEARNING; LEARNING RULES; LOCAL MINIMUMS; MATCHING ALGORITHM; MATCHINGS; MIXTURE OF EXPERTS NETWORK; PROBABILITY MATCHING; PROBABILITY: DISTRIBUTIONS; REINFORCEMENT LEARNINGS; SIMPLE++;

REINFORCEMENT LEARNING;

EID: 85156273296 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (14)

References (5)

1
- 0001940458
- Adaptive mixtures of local experts
- Jacobs, R. A., Jordan, M. I., Nowlan, S. J., and Hinton, G. E. (1991). Adaptive mixtures of local experts. Neural Computation, 3:79-87.
- (1991) Neural Computation , vol.3 , pp. 79-87
- Jacobs, R. A.¹ Jordan, M. I.² Nowlan, S. J.³ Hinton, G. E.⁴

2
- 0011886618
- PhD Thesis, Dept. of Electrical Engineering, India Institute of Science, Bangalore
- Phansalkar, V. V. (1991). Learning automata algorithms for connectionist systems - local and global convergence. PhD Thesis, Dept. of Electrical Engineering, India Institute of Science, Bangalore.
- (1991) Learning automata algorithms for connectionist systems - local and global convergence
- Phansalkar, V. V.¹

3
- 0003617454
- PhD Thesis, Dept. of Computer and Information Science, University of Massachusetts, Amherst, MA
- Sutton, R. S. (1984). Temporal credit assignment in reinforcement learning. PhD Thesis, Dept. of Computer and Information Science, University of Massachusetts, Amherst, MA.
- (1984) Temporal credit assignment in reinforcement learning
- Sutton, R. S.¹

4
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams, R. J. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256.
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R. J.¹

5
- 0041154467
- Function optimization using connectionist reinforcement learning algorithms
- Williams, R. J. and Peng, J. (1991). Function optimization using connectionist reinforcement learning algorithms. Connection Science, 3:241-268.
- (1991) Connection Science , vol.3 , pp. 241-268
- Williams, R. J.¹ Peng, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.