메뉴 건너뛰기




Volumn , Issue , 2004, Pages

Distributed optimization in adaptive networks

Author keywords

[No Author keywords available]

Indexed keywords

COMMUNICATION; GRADIENT METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; MOBILE DEVICES; ORDINARY DIFFERENTIAL EQUATIONS; SENSOR NETWORKS;

EID: 33746360402     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (30)

References (9)
  • 1
    • 0034439308 scopus 로고    scopus 로고
    • Stochastic optimization of controlled markov decision processes
    • P. L. Bartlett and J. Baxter. Stochastic Optimization of Controlled Markov Decision Processes. In IEEE Conference on Decision and Control, pages 124-129, 2000.
    • (2000) IEEE Conference on Decision and Control , pp. 124-129
    • Bartlett, P.L.1    Baxter, J.2
  • 2
    • 0036477347 scopus 로고    scopus 로고
    • Estimation and approximation bounds for gradient-based reinforcement learning
    • P. L. Bartlett and J. Baxter. Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning. Journal of Computer and System Sciences, 64:133-150, 2002.
    • (2002) Journal of Computer and System Sciences , vol.64 , pp. 133-150
    • Bartlett, P.L.1    Baxter, J.2
  • 4
    • 0013495368 scopus 로고    scopus 로고
    • Infinite-horizon gradient-based policy search: II Gradient ascent algorithms and experiments
    • J. Baxter, P. L. Bartlett, and L.Weaver. Infinite-Horizon Gradient-Based Policy Search: II. Gradient Ascent Algorithms and Experiments. Journal of Artificial Intelligence Research, 15:351-381, 2001.
    • (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 351-381
    • Baxter, J.1    Bartlett, P.L.2    Weaver, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.