-
1
-
-
0030302511
-
Cyclic flows, Markov process and stochastic traffic assignment
-
T. Akamatsu, Cyclic flows, Markov process and stochastic traffic assignment, Transport. Res. B 30 (5) (1996) 369-386.
-
(1996)
Transport. Res. B
, vol.30
, Issue.5
, pp. 369-386
-
-
Akamatsu, T.1
-
3
-
-
84890245567
-
-
Wiley, New York
-
M.S. Bazaraa, H.D. Sherali, C.M. Shetty, Nonlinear Programming: Theory and Algorithms, Wiley, New York, 1993.
-
(1993)
Nonlinear Programming: Theory and Algorithms
-
-
Bazaraa, M.S.1
Sherali, H.D.2
Shetty, C.M.3
-
7
-
-
0000719863
-
Packet routing in dynamically changing networks: A reinforcement learning approach
-
J.A. Boyan, M.L. Littman, Packet routing in dynamically changing networks: A reinforcement learning approach, Adv. Neural Inf. Process. Syst. (NIPS) 6 (1994) 671-678.
-
(1994)
Adv. Neural Inf. Process. Syst. (NIPS)
, vol.6
, pp. 671-678
-
-
Boyan, J.A.1
Littman, M.L.2
-
8
-
-
11344275321
-
Fastest mixing Markov chain on a graph
-
S. Boyd, P. Diaconis, L. Xiao, Fastest mixing Markov chain on a graph, SIAM Rev. (2004) 667-689.
-
(2004)
SIAM Rev
, pp. 667-689
-
-
Boyd, S.1
Diaconis, P.2
Xiao, L.3
-
12
-
-
0034342516
-
On the existence of fixed points for approximate value iteration and temporal-difference learning
-
D.P. de Farias, B.V. Roy, On the existence of fixed points for approximate value iteration and temporal-difference learning, J. Opt. Theory Appl. 105 (2000) 22-32.
-
(2000)
J. Opt. Theory Appl
, vol.105
, pp. 22-32
-
-
de Farias, D.P.1
Roy, B.V.2
-
14
-
-
0015078345
-
A probabilistic multipath assignment model that obviates path enumeration
-
R. Dial, A probabilistic multipath assignment model that obviates path enumeration, Transport. Res. 5 (1971) 83-111.
-
(1971)
Transport. Res
, vol.5
, pp. 83-111
-
-
Dial, R.1
-
15
-
-
33847766633
-
Random-walk computation of similarities between nodes of a graph, with application to collaborative recommendation
-
F. Fouss, A. Pirotte, J.-M. Renders, M. Saerens, Random-walk computation of similarities between nodes of a graph, with application to collaborative recommendation, IEEE Trans. Knowl. Data Eng. 19 (3) (2007) 355-369.
-
(2007)
IEEE Trans. Knowl. Data Eng
, vol.19
, Issue.3
, pp. 355-369
-
-
Fouss, F.1
Pirotte, A.2
Renders, J.-M.3
Saerens, M.4
-
16
-
-
4844223639
-
-
M. Guo, Y. Liu, J. Malec, A new Q-learning algorithm based on the metropolis criterion, IEEE Trans. Syst. Man Cybernet. B: Cybernet. 34 (5) (2004) 2140-2143.
-
-
-
-
22
-
-
0032679082
-
Exploration of multi-state environments: Local measures and back-propagation of uncertainty
-
N. Meuleau, P. Bourgine, Exploration of multi-state environments: Local measures and back-propagation of uncertainty, Mach. Learn. 35 (1999) 117-154.
-
(1999)
Mach. Learn
, vol.35
, pp. 117-154
-
-
Meuleau, N.1
Bourgine, P.2
-
23
-
-
33746878798
-
-
AI Memo 2001-003, Massachusetts Institute of Technology
-
N. Meuleau, L. Peshkin, K. Kim, Exploration in gradient-based reinforcement learning, AI Memo 2001-003, Massachusetts Institute of Technology, 2001.
-
(2001)
Exploration in gradient-based reinforcement learning
-
-
Meuleau, N.1
Peshkin, L.2
Kim, K.3
-
29
-
-
0003998394
-
-
Addison-Wesley, Reading, MA
-
H. Raiffa, Decision Analysis, Addison-Wesley, Reading, MA, 1970.
-
(1970)
Decision Analysis
-
-
Raiffa, H.1
-
30
-
-
85030589365
-
-
G. Rummery, M. Niranjan, On-line Q-learning using connectionist systems, Technical Report CUED/F-INFENG/TR 166, Cambridge University Engineering Department, 1994.
-
-
-
-
31
-
-
22944459214
-
The principal components analysis of a graph, and its relationships to spectral clustering
-
Proceedings of the 15th European Conference on Machine Learning (ECML 2004), Springer, Berlin
-
M. Saerens, F. Fouss, L. Yen, P. Dupont, The principal components analysis of a graph, and its relationships to spectral clustering, in: Proceedings of the 15th European Conference on Machine Learning (ECML 2004), Lecture Notes in Artificial Intelligence, vol. 3201, Springer, Berlin, 2004, pp. 371-383.
-
(2004)
Lecture Notes in Artificial Intelligence
, vol.3201
, pp. 371-383
-
-
Saerens, M.1
Fouss, F.2
Yen, L.3
Dupont, P.4
-
32
-
-
85030575492
-
-
G. Shani, R. Brafman, S. Shimony, Adaptation for changing stochastic environments through online POMDP policy learning, in: Workshop on Reinforcement Learning in Non-Stationary Environments, ECML 2005, 2005, pp. 61-70.
-
-
-
-
33
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
S. Singh, R. Sutton, Reinforcement learning with replacing eligibility traces, Mach. Learn. 22 (1996) 123-158.
-
(1996)
Mach. Learn
, vol.22
, pp. 123-158
-
-
Singh, S.1
Sutton, R.2
-
35
-
-
33746329499
-
The fastest mixing Markov process on a graph and a connection to a maximum variance unfolding problem
-
J. Sun, S. Boyd, L. Xiao, P. Diaconis, The fastest mixing Markov process on a graph and a connection to a maximum variance unfolding problem, SIAM Rev. (2006) 681-699.
-
(2006)
SIAM Rev
, pp. 681-699
-
-
Sun, J.1
Boyd, S.2
Xiao, L.3
Diaconis, P.4
-
36
-
-
0004102479
-
-
The MIT Press, Cambridge, MA
-
R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction, The MIT Press, Cambridge, MA, 1998.
-
(1998)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
37
-
-
39649107929
-
A one-parameter family of distributed consensus algorithms with boundary: From shortest paths to mean hitting times
-
A. Tahbaz, A. Jadbabaie, A one-parameter family of distributed consensus algorithms with boundary: From shortest paths to mean hitting times, in: Proceedings of IEEE Conference on Decision and Control, 2006, pp. 4664-4669.
-
(2006)
Proceedings of IEEE Conference on Decision and Control
, pp. 4664-4669
-
-
Tahbaz, A.1
Jadbabaie, A.2
-
38
-
-
0004126844
-
-
third ed., Academic Press, New York
-
H.M. Taylor, S. Karlin, An Introduction to Stochastic Modeling, third ed., Academic Press, New York, 1998.
-
(1998)
An Introduction to Stochastic Modeling
-
-
Taylor, H.M.1
Karlin, S.2
-
39
-
-
0003411271
-
Efficient exploration in reinforcement learning
-
Technical Report, School of Computer Science, Carnegie Mellon University
-
S. Thrun, Efficient exploration in reinforcement learning, Technical Report, School of Computer Science, Carnegie Mellon University, 1992.
-
(1992)
-
-
Thrun, S.1
-
40
-
-
0002210775
-
The role of exploration in learning control
-
D. White, D. Sofge (Eds.), Van Nostrand Reinhold, Princeton, NJ
-
S. Thrun, The role of exploration in learning control, in: D. White, D. Sofge (Eds.), Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, Van Nostrand Reinhold, Princeton, NJ, 1992.
-
(1992)
Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches
-
-
Thrun, S.1
-
41
-
-
27744518715
-
-
MIT Press, Cambridge
-
S. Thrun, W. Burgard, D. Fox, Probabilistic Robotics, MIT Press, Cambridge, 2005.
-
(2005)
Probabilistic Robotics
-
-
Thrun, S.1
Burgard, W.2
Fox, D.3