SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems

Volumn , Issue , 2004, Pages

Distributed optimization in adaptive networks

(2) Moallemi, Ciamac C a Van Roy, Benjamin a

Author keywords

[No Author keywords available]

Indexed keywords

COMMUNICATION; GRADIENT METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; MOBILE DEVICES; ORDINARY DIFFERENTIAL EQUATIONS; SENSOR NETWORKS;

AGGREGATE PERFORMANCE; DISTRIBUTED COMMUNICATIONS; DISTRIBUTED OPTIMIZATION; ELECTRONIC COMPONENT; LOCAL COMMUNICATIONS; MARKOV DECISION PROCESSES; PERFORMANCE OBJECTIVE; POLICY GRADIENT METHODS;

OPTIMIZATION;

EID: 33746360402 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (30)

References (9)

1
- 0034439308
- Stochastic optimization of controlled markov decision processes
- P. L. Bartlett and J. Baxter. Stochastic Optimization of Controlled Markov Decision Processes. In IEEE Conference on Decision and Control, pages 124-129, 2000.
- (2000) IEEE Conference on Decision and Control , pp. 124-129
- Bartlett, P.L.¹ Baxter, J.²

2
- 0036477347
- Estimation and approximation bounds for gradient-based reinforcement learning
- P. L. Bartlett and J. Baxter. Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning. Journal of Computer and System Sciences, 64:133-150, 2002.
- (2002) Journal of Computer and System Sciences , vol.64 , pp. 133-150
- Bartlett, P.L.¹ Baxter, J.²

3
- 0013535965
- Infinite-horizon gradient-based policy search
- J. Baxter and P. L. Bartlett. Infinite-Horizon Gradient-Based Policy Search. Journal of Artificial Intelligence Research, 15:319-350, 2001.
- (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 319-350
- Baxter, J.¹ Bartlett, P.L.²

4
- 0013495368
- Infinite-horizon gradient-based policy search: II Gradient ascent algorithms and experiments
- J. Baxter, P. L. Bartlett, and L.Weaver. Infinite-Horizon Gradient-Based Policy Search: II. Gradient Ascent Algorithms and Experiments. Journal of Artificial Intelligence Research, 15:351-381, 2001.
- (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 351-381
- Baxter, J.¹ Bartlett, P.L.² Weaver, L.³

5
- 85153938292
- Reinforcement learning algorithms for partially observable markov decision problems
- T. Jaakkola, S. P. Singh, and M. I. Jordan. Reinforcement Learning Algorithms for Partially Observable Markov Decision Problems. In Advances in Neural Information Processing Systems 7, pages 345-352, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 345-352
- Jaakkola, T.¹ Singh, S.P.² Jordan, M.I.³

6
- 0004066022
- Springer- Verlag, New York, NY
- H. J. Kushner and G. Yin. Stochastic Approximation Algorithms and Applications. Springer- Verlag, New York, NY, 1997.
- (1997) Stochastic Approximation Algorithms and Applications
- Kushner, H.J.¹ Yin, G.²

7
- 84899014596
- Call admission control and routing in integrated service networks
- P. Marbach, O. Mihatsch, and J.N. Tsitsiklis. Call Admission Control and Routing in Integrated Service Networks. In IEEE Conference on Decision and Control, 1998.
- (1998) IEEE Conference on Decision and Control
- Marbach, P.¹ Mihatsch, O.² Tsitsiklis, J.N.³

8
- 0035249254
- Simulation-based optimization of markov reward processes
- P. Marbach and J.N. Tsitsiklis. Simulation-Based Optimization of Markov Reward Processes. IEEE Transactions on Automatic Control, 46(2):191-209, 2001.
- (2001) IEEE Transactions on Automatic Control , vol.46 , Issue.2 , pp. 191-209
- Marbach, P.¹ Tsitsiklis, J.N.²

9
- 84898950476
- URL
- C. C. Moallemi and B. Van Roy. Appendix to NIPS Submission. URL: Http://www.moallemi.com/ciamac/papers/nips-2003-appendix.pdf, 2003.
- (2003) Appendix to NIPS Submission
- Moallemi, C.C.¹ Van Roy, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.