메뉴 건너뛰기




Volumn 4, Issue , 2016, Pages 2809-2822

Graying the black box: Understanding DQNs

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS;

EID: 84998679057     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (61)

References (42)
  • 5
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, Thomas G. Hierarchical reinforcement learning with the MAXQ value function decomposition. J. Artif. Intell. Res.(JAIR), 13:227-303, 2000.
    • (2000) J. Artif. Intell. Res.(JAIR) , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 6
    • 84998993668 scopus 로고    scopus 로고
    • Learning embedded maps of markov processes
    • Citeseer
    • Engel, Yaakov and Mannor, Shie. Learning embedded maps of markov processes. In in Proceedings of ICML 2001. Citeseer, 2001.
    • (2001) Proceedings of ICML 2001
    • Engel, Y.1    Mannor, S.2
  • 12
    • 85032751123 scopus 로고    scopus 로고
    • Manifold-learning-based feature extraction for classification of hyperspectral data: A review of advances in manifold learning
    • IEEE
    • Lunga, Dalton, Prasad, Santasriya, Crawford, Melba M, and Ersoy, Ozan. Manifold-learning-based feature extraction for classification of hyperspectral data: A review of advances in manifold learning. Signal Processing Magazine, IEEE, 31(1):55-66, 2014.
    • (2014) Signal Processing Magazine , vol.31 , Issue.1 , pp. 55-66
    • Lunga, D.1    Prasad, S.2    Crawford, M.M.3    Ersoy, O.4
  • 13
  • 15
    • 84945250000 scopus 로고    scopus 로고
    • Q-cutdynamic discovery of sub-goals in reinforcement learning
    • Q, Springer
    • Menache, Ishai, Mannor, Shie, and Shimkin, Nahum. Q-cutdynamic discovery of sub-goals in reinforcement learning. In Machine Learning: ECML 2002, pp. 295- 306. Springer, 2002.
    • (2002) Machine Learning: ECML 2002 , pp. 295-306
    • Menache, I.1    Mannor, S.2    Shimkin, N.3
  • 21
    • 0346738900 scopus 로고    scopus 로고
    • Flexible decomposition algorithms for weakly coupled Markov decision problems
    • Morgan Kaufmann Publishers Inc
    • Parr, Ronald. Flexible decomposition algorithms for weakly coupled Markov decision problems. In Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, pp. 422-430. Morgan Kaufmann Publishers Inc., 1998.
    • (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence , pp. 422-430
    • Parr, R.1
  • 22
    • 21344435992 scopus 로고    scopus 로고
    • Invariant visual representation by single neurons in the human brain
    • Quiroga, R Quian, Reddy, Leila, Kreiman, Gabriel, Koch, Christof, and Fried, Itzhak. Invariant visual representation by single neurons in the human brain. Nature, 435 (7045): 1102-1107, 2005.
    • (2005) Nature , vol.435 , Issue.7045 , pp. 1102-1107
    • Quiroga, R.Q.1    Reddy, L.2    Kreiman, G.3    Koch, C.4    Fried, I.5
  • 24
    • 33646398129 scopus 로고    scopus 로고
    • Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
    • Springer
    • Riedmiller, Martin. Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method. In Machine Learning: ECML 2005, pp. 317- 328. Springer, 2005.
    • (2005) Machine Learning: ECML 2005 , pp. 317-328
    • Riedmiller, M.1
  • 31
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, Richard S, Precup, Doina, and Singh, Satinder. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial intelligence, 112( 1): 181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 33
    • 0034704229 scopus 로고    scopus 로고
    • A global geometric framework for nonlinear dimensionality reduction
    • Tenenbaum, Joshua B, De Silva, Vin, and Langford, John C. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319-2323, 2000.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
    • Tenenbaum, J.B.1    De Silva, V.2    Langford, J.C.3
  • 34
    • 0029276036 scopus 로고
    • Temporal difference learning and TD- Gammon
    • Tesauro, Gerald. Temporal difference learning and TD- Gammon. Communications of the ACM, 38(3):58-68, 1995.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 35
    • 0031998630 scopus 로고    scopus 로고
    • Learning metric-topological maps for indoor mobile robot navigation
    • Thrun, Sebastian. Learning metric-topological maps for indoor mobile robot navigation. Artificial Intelligence, 99(1):21-71, 1998.
    • (1998) Artificial Intelligence , vol.99 , Issue.1 , pp. 21-71
    • Thrun, S.1
  • 36
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • Tsitsiklis, John N and Van Roy, Benjamin. An analysis of temporal-difference learning with function approximation. Automatic Control, IEEE Transactions on, 42(5): 674-690, 1997.
    • (1997) Automatic Control, IEEE Transactions on , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 37
    • 84919775831 scopus 로고    scopus 로고
    • Accelerating t-SNE using tree- based algorithms
    • Van Der Maaten, Laurens. Accelerating t-SNE using tree- based algorithms. The Journal of Machine Learning Research, 15(1):3221-3245, 2014.
    • (2014) The Journal of Machine Learning Research , vol.15 , Issue.1 , pp. 3221-3245
    • Van Der Maaten, L.1
  • 42
    • 84906489074 scopus 로고    scopus 로고
    • Visualizing and understanding convolutional networks
    • Springer
    • Zeiler, Matthew D and Fergus, Rob. Visualizing and understanding convolutional networks. In Computer Vision- ECCV2014, pp. 818-833. Springer, 2014.
    • (2014) Computer Vision- ECCV2014 , pp. 818-833
    • Zeiler, M.D.1    Fergus, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.