메뉴 건너뛰기




Volumn , Issue , 2010, Pages

Reinforcement learning with a Gaussian mixture model

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL EFFICIENCY; GAUSSIAN DISTRIBUTION; GAUSSIAN NOISE (ELECTRONIC); ITERATIVE METHODS; PROBABILITY DENSITY FUNCTION;

EID: 79959391832     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IJCNN.2010.5596306     Document Type: Conference Paper
Times cited : (28)

References (25)
  • 1
    • 0000439527 scopus 로고
    • Optimal global rates of convergence for nonparametric regression
    • C. Stone, "Optimal global rates of convergence for nonparametric regression," The Annals of Statistics, vol. 10, no. 4, pp. 1040-1053, 1982.
    • (1982) The Annals of Statistics , vol.10 , Issue.4 , pp. 1040-1053
    • Stone, C.1
  • 3
    • 61849173491 scopus 로고    scopus 로고
    • Gaussian process dynamic programming
    • M. Diesenroth, C. Rasmussen, and J. Peters, "Gaussian process dynamic programming," Neurocomputing, vol. 72, no. 7-9, pp. 1508-1524, 2009.
    • (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1508-1524
    • Diesenroth, M.1    Rasmussen, C.2    Peters, J.3
  • 6
    • 84880694195 scopus 로고
    • Stable function approximation in dynamic programming
    • G. J. Gordon, "Stable function approximation in dynamic programming," in ICML, 1995, pp. 261-268.
    • (1995) ICML , pp. 261-268
    • Gordon, G.J.1
  • 7
    • 21844465127 scopus 로고    scopus 로고
    • Tree-based batch mode reinforcement learning
    • D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," J. Mach. Learn. Res., vol. 6, pp. 503-556, 2005.
    • (2005) J. Mach. Learn. Res. , vol.6 , pp. 503-556
    • Ernst, D.1    Geurts, P.2    Wehenkel, L.3
  • 8
    • 0036832956 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning
    • D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
    • Ormoneit, D.1    Sen, S.2
  • 10
    • 33646398129 scopus 로고    scopus 로고
    • Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
    • -, "Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method," Lecture notes in computer science, vol. 3720, pp. 317-328, 2005.
    • (2005) Lecture Notes in Computer Science , vol.3720 , pp. 317-328
    • Riedmiller, M.1
  • 12
    • 34249833101 scopus 로고
    • Q-learning
    • [Online]. Available
    • C. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3-4, pp. 279-292, 1992. [Online]. Available: http://jmvidal.cse.sc.edu/ library/watkins92a.pdf
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.1    Dayan, P.2
  • 15
    • 0141571972 scopus 로고    scopus 로고
    • On Gaussian radial basis function approximations: Interpretation, extensions, and learning strategies
    • M. Figueiredo, "On Gaussian radial basis function approximations: Interpretation, extensions, and learning strategies," Pattern Recognition, International Conference on, vol. 2, pp. 618-621, 2000.
    • (2000) Pattern Recognition, International Conference on , vol.2 , pp. 618-621
    • Figueiredo, M.1
  • 20
    • 0034131785 scopus 로고    scopus 로고
    • On-line em algorithm for the normalized Gaussian network
    • M.-A. Sato and S. Ishii, "On-line em algorithm for the normalized Gaussian network," Neural Comput., vol. 12, no. 2, pp. 407-432, 2000.
    • (2000) Neural Comput. , vol.12 , Issue.2 , pp. 407-432
    • Sato, M.-A.1    Ishii, S.2
  • 22
    • 0002788893 scopus 로고    scopus 로고
    • A view of the em algorithm that justifies incremental, sparse, and other variants
    • Norwell, MA, USA: Kluwer Academic Publishers
    • R. Neal and G. Hinton, "A view of the em algorithm that justifies incremental, sparse, and other variants," in Proceedings of the NATO Advanced Study Institute on Learning in graphical models. Norwell, MA, USA: Kluwer Academic Publishers, 1998, pp. 355-368.
    • (1998) Proceedings of the NATO Advanced Study Institute on Learning in Graphical Models , pp. 355-368
    • Neal, R.1    Hinton, G.2
  • 23
    • 0002210775 scopus 로고
    • The role of exploration in learning control
    • D.White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold
    • S. Thrun, "The role of exploration in learning control," in Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D.White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold, 1992.
    • (1992) Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches
    • Thrun, S.1
  • 24
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
    • (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
    • Doya, K.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.