SCOPUS Information Retrieval Platform

Proceedings of the 29th International Conference on Machine Learning, ICML 2012

Volume 2, 2012, Pages 1495-1502

Apprenticeship learning for model parameters of partially observable environments

(2) Makino, Takaki a; Takeuchi, Johane b

a INSTITUTE OF INDUSTRIAL SCIENCE (Japan)

b HONDA RESEARCH INSTITUTE JAPAN CO LTD (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SELECTION; APPRENTICESHIP LEARNING; DIALOGUE SYSTEMS; ENVIRONMENT MODELS; EXPLICIT MODELING; MODEL PARAMETERS; OPTIMAL ACTIONS; PARTIALLY OBSERVABLE ENVIRONMENTS;

LEARNING SYSTEMS; SPEECH PROCESSING;

APPRENTICES;

EID: 84867126700 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (13)

References (24)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- Greiner, R. and Schuurmans, D. (eds.), ACM Press
- Abbeel, P. and Ng, A. Apprenticeship learning via inverse reinforcement learning. In Greiner, R. and Schuurmans, D. (eds.), Proc. of 21st International Conference on Machine Learning (ICML 2004). ACM Press, 2004.
- (2004) Proc. of 21st International Conference on Machine Learning (ICML 2004)
- Abbeel, P.¹ Ng, A.²

2
- 77955809093
- Autonomous helicopter aerobatics through apprenticeship learning
- Abbeel, P., Coates, A., and Ng, A. Y. Autonomous helicopter aerobatics through apprenticeship learning. International Journal of Robotics Research, 29 (13):1608-1639, 2010.
- (2010) International Journal of Robotics Research , vol.29 , Issue.13 , pp. 1608-1639
- Abbeel, P.¹ Coates, A.² Ng, A.Y.³

3
- 63149159130
- A survey of robot learning from demonstration
- Argall, B. D., Chernova, S., Veloso, M., and Browning, B. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5):469-483, 2009.
- (2009) Robotics and Autonomous Systems , vol.57 , Issue.5 , pp. 469-483
- Argall, B.D.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

4
- 78649507911
- A bayesian sampling approach to exploration in reinforcement learning
- AUAI Press
- Asmuth, J., Li, L., Littman, M., Nouri, A., and Wingate, D. A bayesian sampling approach to exploration in reinforcement learning. In Proc. of the 25th Annual Conference on Uncertainty in Artificial Intelligence (UAI'09), pp. 19-26. AUAI Press, 2009.
- (2009) Proc. of the 25th Annual Conference on Uncertainty in Artificial Intelligence (UAI'09) , pp. 19-26
- Asmuth, J.¹ Li, L.² Littman, M.³ Nouri, A.⁴ Wingate, D.⁵

5
- 0030242097
- Input-output HMM's for sequence processing
- Bengio, Y. and Frasconi, P. Input-output HMM's for sequence processing. IEEE Transactions on Neural Networks, 7(5):1231-1249, 1996.
- (1996) IEEE Transactions on Neural Networks , vol.7 , Issue.5 , pp. 1231-1249
- Bengio, Y.¹ Frasconi, P.²

6
- 84862293297
- Relative entropy inverse reinforcement learning
- Boularias, A., Kober, J., and Peters, J. Relative entropy inverse reinforcement learning. Journal of Machine Learning Research: Workshop and Conference Proceedings (AISTATS 2011), 15:182-189, 2011.
- (2011) Journal of Machine Learning Research: Workshop and Conference Proceedings (AISTATS 2011) , vol.15 , pp. 182-189
- Boularias, A.¹ Kober, J.² Peters, J.³

7
- 79955875655
- Inverse reinforcement learning in partially observable environments
- Choi, J. and Kim, K.-E. Inverse reinforcement learning in partially observable environments. Journal of Machine Learning Research, 12:691-730, 2011.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 691-730
- Choi, J.¹ Kim, K.-E.²

8
- 0003860037
- Chapman&Hall/CRC
- Gilks, W., Richardson, S., and Spiegelhalter, D. Markov Chain Monte Carlo in Practice. Chapman&Hall/CRC, 1996.
- (1996) Markov Chain Monte Carlo in Practice
- Gilks, W.¹ Richardson, S.² Spiegelhalter, D.³

9
- 77955814312
- Learning to navigate through crowded environments
- Henry, P., Vollmer, C., Ferris, B., and Fox, D. Learning to navigate through crowded environments. In Proc. of 2010 IEEE International Conference of Robotics and Automation (ICRA 2010), pp. 981-986, 2010.
- (2010) Proc. of 2010 IEEE International Conference of Robotics and Automation (ICRA 2010) , pp. 981-986
- Henry, P.¹ Vollmer, C.² Ferris, B.³ Fox, D.⁴

10
- 85162071686
- What makes some POMDP problems easy to approximate?
- Platt, J., Koller, D., Singer, Y., and Roweis, S. (eds.) MIT Press, Cambridge, MA
- Hsu, D., Lee, W. S., and Rong, N. What makes some POMDP problems easy to approximate? In Platt, J., Koller, D., Singer, Y., and Roweis, S. (eds.), Advances in Neural Information Processing Systems 20, pp. 689-696. MIT Press, Cambridge, MA, 2008.
- (2008) Advances in Neural Information Processing Systems 20 , pp. 689-696
- Hsu, D.¹ Lee, W.S.² Rong, N.³

11
- 79956052831
- Johnson, S. G. The NLopt nonlinear-optimization package. http://ab-initio.mit.edu/nlopt, 2008.
- (2008) The NLopt Nonlinear-optimization Package
- Johnson, S.G.¹

12
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L. P., Littman, M. L., and Cassandra, A. R. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

13
- 84863243133
- A frame-based probabilistic framework for spoken dialog management using dialog examples
- Kim, K., Lee, C., Jung, S., and Lee, G. G. A frame-based probabilistic framework for spoken dialog management using dialog examples. In Proc. of the 9th SIGdial Workshop on Discourse and Dialogue, pp. 120-127, 2008.
- (2008) Proc. of the 9th SIGdial Workshop on Discourse and Dialogue , pp. 120-127
- Kim, K.¹ Lee, C.² Jung, S.³ Lee, G.G.⁴

14
- 70349645087
- SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces
- Kurniawati, H., Hsu, D., and Lee, W. S. SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Proc. Robotics: Science and Systems, 2008.
- (2008) Proc. Robotics: Science and Systems
- Kurniawati, H.¹ Hsu, D.² Lee, W.S.³

15
- 80053423076
- Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
- ACL
- Meguro, T., Higashinaka, R., Minami, Y., and Dohsaka, K. Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes. In Proc. of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 761-769. ACL, 2010.
- (2010) Proc. of the 23rd International Conference on Computational Linguistics (COLING 2010) , pp. 761-769
- Meguro, T.¹ Higashinaka, R.² Minami, Y.³ Dohsaka, K.⁴

16
- 5744249209
- Equation of state calculations by fast computing machines
- Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E. Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21:1087-1092, 1953.
- (1953) Journal of Chemical Physics , vol.21 , pp. 1087-1092
- Metropolis, N.¹ Rosenbluth, A.W.² Rosenbluth, M.N.³ Teller, A.H.⁴ Teller, E.⁵

17
- 72449199041
- Training parsers by inverse reinforcement learning
- Neu, G. and Szepesvári, C. Training parsers by inverse reinforcement learning. Machine Learning, 77(2-3): 303-337, 2009.
- (2009) Machine Learning , vol.77 , Issue.2-3 , pp. 303-337
- Neu, G.¹ Szepesvári, C.²

18
- 85011436515
- Direct search algorithms for optimization calculations
- Powell, M. J. D. Direct search algorithms for optimization calculations. Acta Numerica, 7:287-336, 1998.
- (1998) Acta Numerica , vol.7 , pp. 287-336
- Powell, M.J.D.¹

19
- 77956052826
- Bayesian inverse reinforcement learning
- Ramachandran, D. and Amir, E. Bayesian inverse reinforcement learning. In Proc. of International Joint Conference of Artificial Intelligence (IJCAI-2007), pp. 2586-2591, 2007.
- (2007) Proc. of International Joint Conference of Artificial Intelligence (IJCAI-2007) , pp. 2586-2591
- Ramachandran, D.¹ Amir, E.²

20
- 85162018872
- Bayes-adaptive POMDPs
- Platt, J. C., Koller, D., Singer, Y., and Roweis, S. T. (eds.)
- Ross, S., Chaib-draa, B., and Pineau, J. Bayes-adaptive POMDPs. In Platt, J. C., Koller, D., Singer, Y., and Roweis, S. T. (eds.), Advances in Neural Information Processing Systems 20, 2008.
- (2008) Advances in Neural Information Processing Systems 20
- Ross, S.¹ Chaib-draa, B.² Pineau, J.³

21
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Smallwood, R. and Sondik, E. The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21:1071-1088, 1973.
- (1973) Operations Research , vol.21 , pp. 1071-1088
- Smallwood, R.¹ Sondik, E.²

22
- 79951792262
- Parameter learning for POMDP spoken dialogue models
- IEEE
- Thomson, B., Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Yu, K., and Young, S. Parameter learning for POMDP spoken dialogue models. In Proc. of the 3rd IEEE Workshop on Spoken Language Technology (SLT 2010), pp. 271-276. IEEE, 2010.
- (2010) Proc. of the 3rd IEEE Workshop on Spoken Language Technology (SLT 2010) , pp. 271-276
- Thomson, B.¹ Jurčíček, F.² Gašić, M.³ Keizer, S.⁴ Mairesse, F.⁵ Yu, K.⁶ Young, S.⁷

23
- 84863276973
- Partially observable Markov decision processes with continuous observations for dialogue management
- Williams, J. D., Poupart, P., and Young, S. Partially observable Markov decision processes with continuous observations for dialogue management. In Proc. of the 6th SIGdial Workshop on Discourse and Dialogue, pp. 25-34. 2005.
- (2005) Proc. of the 6th SIGdial Workshop on Discourse and Dialogue , pp. 25-34
- Williams, J.D.¹ Poupart, P.² Young, S.³

24
- 77956500986
- Modeling interaction via the principle of maximum causal entropy
- Fürnkranz, J. and Joachims, T. (eds.), Omnipress
- Ziebart, B., Bagnell, J. A., and Dey, A. K. Modeling interaction via the principle of maximum causal entropy. In Fürnkranz, J. and Joachims, T. (eds.), Proc. of the 27th International Conference on Machine Learning (ICML 2010), pp. 1255-1262. Omnipress, 2010.
- (2010) Proc. of the 27th International Conference on Machine Learning (ICML 2010) , pp. 1255-1262
- Ziebart, B.¹ Bagnell, J.A.² Dey, A.K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.