-
1
-
-
14344251217
-
Apprenticeship learning via inverse reinforcement learning
-
Greiner, R. and Schuurmans, D. (eds.), ACM Press
-
Abbeel, P. and Ng, A. Apprenticeship learning via inverse reinforcement learning. In Greiner, R. and Schuurmans, D. (eds.), Proc. of 21st International Conference on Machine Learning (ICML 2004). ACM Press, 2004.
-
(2004)
Proc. of 21st International Conference on Machine Learning (ICML 2004)
-
-
Abbeel, P.1
Ng, A.2
-
2
-
-
77955809093
-
Autonomous helicopter aerobatics through apprenticeship learning
-
Abbeel, P., Coates, A., and Ng, A. Y. Autonomous helicopter aerobatics through apprenticeship learning. International Journal of Robotics Research, 29 (13):1608-1639, 2010.
-
(2010)
International Journal of Robotics Research
, vol.29
, Issue.13
, pp. 1608-1639
-
-
Abbeel, P.1
Coates, A.2
Ng, A.Y.3
-
3
-
-
63149159130
-
A survey of robot learning from demonstration
-
Argali, B. D., Chernova, S., Veloso, M., and Browning, B. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5):469-483, 2009.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argali, B.D.1
Chernova, S.2
Veloso, M.3
Browning, B.4
-
4
-
-
78649507911
-
A bayesian sampling approach to exploration in reinforcement learning
-
AUAI Press
-
Asmuth, J., Li, L., Littman, M., Nouri, A., and Wingate, D. A bayesian sampling approach to exploration in reinforcement learning. In Proc. of the 25th Annual Conference on Uncertainty in Artificial Intelligence (UAI'09), pp. 19-26. AUAI Press, 2009.
-
(2009)
Proc. of the 25th Annual Conference on Uncertainty in Artificial Intelligence (UAI'09)
, pp. 19-26
-
-
Asmuth, J.1
Li, L.2
Littman, M.3
Nouri, A.4
Wingate, D.5
-
6
-
-
84862293297
-
Relative entropy inverse reinforcement learning
-
Boularias, A., Kober, J., and Peters, J. Relative entropy inverse reinforcement learning. Journal of Machine Learning Research: Workshop and Conference Proceedings (AISTATS 2011), 15:182-189, 2011.
-
(2011)
Journal of Machine Learning Research: Workshop and Conference Proceedings (AISTATS 2011)
, vol.15
, pp. 182-189
-
-
Boularias, A.1
Kober, J.2
Peters, J.3
-
7
-
-
79955875655
-
Inverse reinforcement learning in partially observable environments
-
Choi, J. and Kim, K.-E. Inverse reinforcement learning in partially observable environments. Journal of Machine Learning Research, 12:691-730, 2011.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 691-730
-
-
Choi, J.1
Kim, K.-E.2
-
9
-
-
77955814312
-
Learning to navigate through crowded environments
-
Henry, P., Vollmer, C., Ferris, B., and Fox, D. Learning to navigate through crowded environments. In Proc. of 2010 IEEE International Conference of Robotics and Automation (ICRA 2010), pp. 981-986, 2010.
-
(2010)
Proc. of 2010 IEEE International Conference of Robotics and Automation (ICRA 2010)
, pp. 981-986
-
-
Henry, P.1
Vollmer, C.2
Ferris, B.3
Fox, D.4
-
10
-
-
85162071686
-
What makes some POMDP problems easy to approximate?
-
Platt, J., Koller, D., Singer, Y., and Roweis, S. (eds.) MIT Press, Cambridge, MA
-
Hsu, D., Lee, W. S., and Rong, N. What makes some POMDP problems easy to approximate? In Platt, J., Koller, D., Singer, Y., and Roweis, S. (eds.), Advances in Neural Information Processing Systems 20, pp. 689-696. MIT Press, Cambridge, MA, 2008.
-
(2008)
Advances in Neural Information Processing Systems 20
, pp. 689-696
-
-
Hsu, D.1
Lee, W.S.2
Rong, N.3
-
12
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
Kaelbling, L. P., Littman, M. L., and Cassandra, A. R. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
-
(1998)
Artificial Intelligence
, vol.101
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
13
-
-
84863243133
-
Frame-based probabilistic framework for spoken dialog management using dialog examples
-
Kim, K., Lee, C., Jung, S., and Lee, G. G. A frame-based probabilistic framework for spoken dialog management using dialog examples. In Proc. of the 9th SIGdial Workshop on Discourse and Dialogue, pp. 120-127, 2008.
-
(2008)
Proc. of the 9th SIGdial Workshop on Discourse and Dialogue
, pp. 120-127
-
-
Kim, K.1
Lee, C.2
Jung, S.3
Lee, G.G.A.4
-
14
-
-
70349645087
-
SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces
-
Kurniawati, H., Hsu, D., and Lee, W. S. SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Proc. Robotics: Science and Systems, 2008.
-
(2008)
Proc. Robotics: Science and Systems
-
-
Kurniawati, H.1
Hsu, D.2
Lee, W.S.3
-
15
-
-
80053423076
-
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
-
ACL
-
Meguro, T., Higashinaka, R., Minami, Y., and Dohsaka, K. Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes. In Proc. of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 761-769. ACL, 2010.
-
(2010)
Proc. of the 23rd International Conference on Computational Linguistics (COLING 2010)
, pp. 761-769
-
-
Meguro, T.1
Higashinaka, R.2
Minami, Y.3
Dohsaka, K.4
-
16
-
-
5744249209
-
Equations of state calculations by fast computing machines
-
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E. Equations of state calculations by fast computing machines. Journal of Chemical Physics, 21:1087-1092, 1953.
-
(1953)
Journal of Chemical Physics
, vol.21
, pp. 1087-1092
-
-
Metropolis, N.1
Rosenbluth, A.W.2
Rosenbluth, M.N.3
Teller, A.H.4
Teller, E.5
-
17
-
-
72449199041
-
Training parsers by inverse reinforcement learning
-
Neu, G. and Szepesvári, C. Training parsers by inverse reinforcement learning. Machine Learning, 77(2-3): 303-337, 2009.
-
(2009)
Machine Learning
, vol.77
, Issue.2-3
, pp. 303-337
-
-
Neu, G.1
Szepesvári, C.2
-
18
-
-
85011436515
-
Direct search algorithms for optimization calculations
-
Powell, M. J. D. Direct search algorithms for optimization calculations. Acta Numerica, 7:287-336, 1998.
-
(1998)
Acta Numerica
, vol.7
, pp. 287-336
-
-
Powell, M.J.D.1
-
20
-
-
85162018872
-
Bayes-adaptive POMDPs
-
Platt, J. C., Koller, D., Singer, Y., and Roweis, S. T. (eds.)
-
Ross, S., Chaib-draa, B., and Pineau, J. Bayes-adaptive POMDPs. In Platt, J. C., Koller, D., Singer, Y., and Roweis, S. T. (eds.), Advances in Neural Information Processing Systems 20, 2008.
-
(2008)
Advances in Neural Information Processing Systems 20
-
-
Ross, S.1
Chaib-draa, B.2
Pineau, J.3
-
21
-
-
0015658957
-
The optimal control of partially observable Markov processes over a finite horizon
-
Smallwood, R. and Sondik, E. The optimal control of partially observable Markov processes over a finite horizon,. Operations Research, 21:1071-1088, 1973.
-
(1973)
Operations Research
, vol.21
, pp. 1071-1088
-
-
Smallwood, R.1
Sondik, E.2
-
22
-
-
79951792262
-
Parameter learning for POMDP spoken dialogue models
-
IEEE
-
Thomson, B., Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Yu, K., and Young, S. Parameter learning for POMDP spoken dialogue models. In Proc. of the 3rd IEEE Workshop on Spoken Language Technology (SLT 2010), pp. 271-276. IEEE, 2010.
-
(2010)
Proc. of the 3rd IEEE Workshop on Spoken Language Technology (SLT 2010)
, pp. 271-276
-
-
Thomson, B.1
Jurčíček, F.2
Gašić, M.3
Keizer, S.4
Mairesse, F.5
Yu, K.6
Young, S.7
-
23
-
-
84863276973
-
Partially observable Markov decision processes with continuous observations for dialogue management
-
Williams, J. D., Poupart, P., and Young, S. Partially observable Markov decision processes with continuous observations for dialogue management. In Proc. of the 6th SIGdial Workshop on Discourse and Dialogue, pp. 25-34. 2005.
-
(2005)
Proc. of the 6th SIGdial Workshop on Discourse and Dialogue
, pp. 25-34
-
-
Williams, J.D.1
Poupart, P.2
Young, S.3
-
24
-
-
77956500986
-
Modeling interaction via the principle of maximum causal entropy
-
Fürnkranz, J. and Joachims, T. (eds.), Omnipress
-
Ziebart, B., Bragnell, J. A., and Dey, A. K. Modeling interaction via the principle of maximum causal entropy. In Fürnkranz, J. and Joachims, T. (eds.), Proc. of the 27th International Conference on Machine Learning (ICML 2010), pp. 1255-1262. Omnipress, 2010.
-
(2010)
Proc. of the 27th International Conference on Machine Learning (ICML 2010)
, pp. 1255-1262
-
-
Ziebart, B.1
Bragnell, J.A.2
Dey, A.K.3
|