SCOPUS 정보 검색 플랫폼

International Journal of Intelligent Computing and Cybernetics

Volumn 5, Issue 3, 2012, Pages 293-311

A survey of inverse reinforcement learning techniques

(2) Zhifei, Shao a Joo, Er Meng a

a NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

Author keywords

Artificial intelligence; Inverse reinforcement learning; Learning methods; Reinforcement learning; Reward function

Indexed keywords

COMPLEX PROBLEMS; DESIGN/METHODOLOGY/APPROACH; DYNAMIC ENVIRONMENTS; FUNDAMENTAL THEORY; INVERSE REINFORCEMENT LEARNING; LATEST DEVELOPMENT; LEARNING METHODS; REINFORCEMENT LEARNING TECHNIQUES; REWARD FUNCTION; SEQUENTIAL DECISION MAKING; SUCCINCT REPRESENTATION;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; SURVEYS;

REINFORCEMENT LEARNING;

EID: 84865146660 PISSN: 1756378X EISSN: 17563798 Source Type: Journal
DOI: 10.1108/17563781211255862 Document Type: Article

Times cited : (102)

References (58)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- Abbeel, P. and Ng, A. (2004), "Apprenticeship learning via inverse reinforcement learning" in Proceedings of the 21st International Conference on Machine Learning, p. 1.
- (2004) Proceedings of the 21st International Conference on Machine Learning , pp. 1
- Abbeel, P.¹ Ng, A.²

2
- 77955809093
- Autonomous helicopter aerobatics through apprenticeship learning
- Abbeel, P., Coates, A. and Ng, A. (2010), "Autonomous helicopter aerobatics through apprenticeship learning" in International Journal of Robotics Research, Vol. 29, No. 13, pp. 1608-39.
- (2010) International Journal of Robotics Research , vol.29 , Issue.13 , pp. 1608-1639
- Abbeel, P.¹ Coates, A.² Ng, A.³

3
- 84883027643
- Autonomous autorotation of an RC helicopter
- Abbeel, P., Coates, A., Hunter, T. and Ng, A. (2009), "Autonomous autorotation of an RC helicopter" in Proceedings of the International Symposium on Experimental Robotics, pp. 385-94.
- (2009) Proceedings of the International Symposium on Experimental Robotics , pp. 385-394
- Abbeel, P.¹ Coates, A.² Hunter, T.³ Ng, A.⁴

4
- 84864030941
- An application of reinforcement learning to aerobatic helicopter flight
- Abbeel, P., Coates, A., Quigley, M. and Ng, A. (2007), "An application of reinforcement learning to aerobatic helicopter flight" in Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference, p. 1.
- (2007) Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference , pp. 1
- Abbeel, P.¹ Coates, A.² Quigley, M.³ Ng, A.⁴

5
- 67650136522
- Apprenticeship learning for motion planning with application to parking lot navigation
- Abbeel, P., Dolgov, D., Ng, A. and Thrun, S. (2008), "Apprenticeship learning for motion planning with application to parking lot navigation" in Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on, pp. 1083-90.
- (2008) Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on , pp. 1083-1090
- Abbeel, P.¹ Dolgov, D.² Ng, A.³ Thrun, S.⁴

6
- 0000396062
- Natural gradient works efficiently in learning
- Amari, S. (1998), "Natural gradient works efficiently in learning" in Neural Computation, Vol. 10, No. 2, pp. 251-76.
- (1998) Neural Computation , vol.10 , Issue.2 , pp. 251-276
- Amari, S.¹

7
- 63149159130
- A survey of robot learning from demonstration
- Argall, B., Chernova, S., Veloso, M. and Browning, B. (2009), "A survey of robot learning from demonstration" in Robotics and Autonomous Systems, Vol. 57, No. 5, pp. 469-83.
- (2009) Robotics and Autonomous Systems , vol.57 , Issue.5 , pp. 469-483
- Argall, B.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

8
- 0002130986
- Robot learning from demonstration
- Morgan Kaufmann, Burlington, MA
- Atkeson, C. and Schaal, S. (1997), "Robot learning from demonstration" in Proceedings of the International Conference on Machine Learning (ICML'97), Morgan Kaufmann, Burlington, MA, pp. 12-20.
- (1997) Proceedings of the International Conference on Machine Learning (ICML'97) , pp. 12-20
- Atkeson, C.¹ Schaal, S.²

9
- 80053440459
- Apprenticeship learning about multiple intentions
- Babes, M., Marivate, V., Littman, M. and Subramanian, K. (2010), "Apprenticeship learning about multiple intentions", Proceedings of International Conference on Machine Learning (ICML 2011).
- (2010) Proceedings of International Conference on Machine Learning (ICML 2011)
- Babes, M.¹ Marivate, V.² Littman, M.³ Subramanian, K.⁴

10
- 85162002716
- Bootstrapping apprenticeship learning
- Boularias, A. and Chaib-Draa, B. (2011), "Bootstrapping apprenticeship learning", Proceedings of Neural Information Processing Systems, 2010.
- (2011) Proceedings of Neural Information Processing Systems, 2010
- Boularias, A.¹ Chaib-Draa, B.²

11
- 84862293297
- Relative entropy inverse reinforcement learning
- Boularias, A., Kober, J. and Peters, J. (2011), "Relative entropy inverse reinforcement learning" in Proceedings of Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), JMLR WC&P, Vol. 15, pp. 182-9.
- (2011) Proceedings of Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), JMLR WC&P , vol.15 , pp. 182-189
- Boularias, A.¹ Kober, J.² Peters, J.³

12
- 84865717612
- User simulation in dialogue systems using inverse reinforcement learning
- Chandramohan, S., Geist, M., Lefevre, F. and Pietquin, O. (2011), "User simulation in dialogue systems using inverse reinforcement learning", Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence (Italy), August.
- (2011) Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence (Italy), August
- Chandramohan, S.¹ Geist, M.² Lefevre, F.³ Pietquin, O.⁴

13
- 78751695440
- Inverse reinforcement learning in partially observable environments
- Choi, J. and Kim, K. (2009), "Inverse reinforcement learning in partially observable environments" in Proceedings of the 21st International Joint Conference on Artifical Intelligence (IJCAI), pp. 1028-33.
- (2009) Proceedings of the 21st International Joint Conference on Artifical Intelligence (IJCAI) , pp. 1028-1033
- Choi, J.¹ Kim, K.²

14
- 78651512500
- A mobile robot that understands pedestrian spatial behaviors
- Chung, S. and Huang, H. (2010), "A mobile robot that understands pedestrian spatial behaviors" in Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on, pp. 5861-6.
- (2010) Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on , pp. 5861-5866
- Chung, S.¹ Huang, H.²

15
- 56449129785
- Learning for control from multiple demonstrations
- Coates, A., Abbeel, P. and Ng, A. (2008), "Learning for control from multiple demonstrations" in Proceedings of the 25th International Conference on Machine Learning, pp. 144-51.
- (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 144-151
- Coates, A.¹ Abbeel, P.² Ng, A.³

16
- 78649831352
- Selecting operator queries using expected myopic gain
- Cohn, R., Maxim, M., Durfee, E. and Singh, S. (2010), "Selecting operator queries using expected myopic gain" in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 40-7.
- (2010) IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology , pp. 40-47
- Cohn, R.¹ Maxim, M.² Durfee, E.³ Singh, S.⁴

17
- 84877776997
- Bayesian multitask inverse reinforcement learning
- Dimitrakakis, C. and Rothkopf, C. (2011), "Bayesian multitask inverse reinforcement learning", paper presented at the 9th European Workshop on Reinforcement Learning (EWRL 2011), Athens, Greece, 9-11 September.
- (2011) Paper Presented at the 9th European Workshop on Reinforcement Learning (EWRL 2011), Athens, Greece, 9-11 September
- Dimitrakakis, C.¹ Rothkopf, C.²

18
- 0000030684
- The expected-utility hypothesis and the measurability of utility
- Friedman, M. and Savage, L. (1952), "The expected-utility hypothesis and the measurability of utility" in The Journal of Political Economy, No. 6, pp. 463-74.
- (1952) The Journal of Political Economy , Issue.6 , pp. 463-474
- Friedman, M.¹ Savage, L.²

19
- 84871697922
- Donut as I do: Learning from failed demonstrations
- Grollman, D. and Billard, A. (2011), "Donut as I do: learning from failed demonstrations" in IEEE International Conference on Robotics and Automation, Shanghai, 9-13 May, pp. 9-13.
- (2011) IEEE International Conference on Robotics and Automation, Shanghai, 9-13 May , pp. 9-13
- Grollman, D.¹ Billard, A.²

20
- 0029720233
- Human-to-robot skill transfer using the spore approximation
- Grudic, G. and Lawrence, P. (1996), "Human-to-robot skill transfer using the spore approximation" in Robotics and Automation, Proceedings, 1996 IEEE International Conference on, Vol. 4, pp. 2962-7.
- (1996) Robotics and Automation, Proceedings, 1996 IEEE International Conference on , vol.4 , pp. 2962-2967
- Grudic, G.¹ Lawrence, P.²

21
- 77955814312
- Learning to navigate through crowded environments
- Henry, P., Vollmer, C., Ferris, B. and Fox, D. (2010), "Learning to navigate through crowded environments" in Robotics and Automation (ICRA), 2010 IEEE International Conference on, pp. 981-6.
- (2010) Robotics and Automation (ICRA), 2010 IEEE International Conference on , pp. 981-986
- Henry, P.¹ Vollmer, C.² Ferris, B.³ Fox, D.⁴

22
- 2342632212
- Solving a huge number of similar tasks: A combination of multi-task learning and a hierarchical Bayesian approach
- Heskes, T. (1998), "Solving a huge number of similar tasks: a combination of multi-task learning and a hierarchical Bayesian approach" in Proceedings of the 15th International Conference on Machine Learning (ICML'98), pp. 233-41.
- (1998) Proceedings of the 15th International Conference on Machine Learning (ICML'98) , pp. 233-241
- Heskes, T.¹

23
- 11944275853
- Information theory and statistical mechanics
- Jaynes, E. (1957), "Information theory and statistical mechanics" in Physical Review, Vol. 108, No. 2, p. 171.
- (1957) Physical Review , vol.108 , Issue.2 , pp. 171
- Jaynes, E.¹

24
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L., Littman, M. and Moore, A. (1996), "Reinforcement learning: a survey" in Journal of Artificial Intelligence Research, Vol. 4, pp. 237-85.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.² Moore, A.³

25
- 84865134051
- The Snowbird Workshop (submitted to)
- Kalakrishnan, M., Theodorou, E. and Schaal, S. (2010), "Inverse reinforcement learning with PI 2", The Snowbird Workshop (submitted to).
- (2010) Inverse reinforcement learning with PI 2
- Kalakrishnan, M.¹ Theodorou, E.² Schaal, S.³

26
- 0004001439
- Cambridge University Press
- Keeney, R. and Raiffa, H. (1993), Decisions with Multiple Objectives: Preferences and Value Tradeoffs, Cambridge University Press, Cambridge.
- (1993) Decisions with Multiple Objectives: Preferences and Value Tradeoffs
- Keeney, R.¹ Raiffa, H.²

27
- 77953327625
- Imitation and reinforcement learning, practical algorithms for motor primitives in robotics
- Kober, J. and Peters, J. (2010), "Imitation and reinforcement learning, practical algorithms for motor primitives in robotics" in Robotics and Automation Magazine, IEEE, Vol. 17, No. 2, pp. 55-62.
- (2010) Robotics and Automation Magazine, IEEE , vol.17 , Issue.2 , pp. 55-62
- Kober, J.¹ Peters, J.²

28
- 85162069513
- Hierarchical apprenticeship learning with application to quadruped locomotion
- MIT Press, Cambridge, MA
- Kolter, J., Abbeel, P. and Ng, A. (2008), "Hierarchical apprenticeship learning with application to quadruped locomotion", Advances in Neural Information Processing Systems, MIT Press, Cambridge, MA.
- (2008) Advances in Neural Information Processing Systems
- Kolter, J.¹ Abbeel, P.² Ng, A.³

29
- 0142192295
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- Morgan Kaufmann, Burlington, MA
- Lafferty, J., McCallum, A. and Pereira, F. (2011), "Conditional random fields: probabilistic models for segmenting and labeling sequence data" in Proceedings of the International Conference on Machine Learning (ICML 2011), Morgan Kaufmann, Burlington, MA, pp. 282-9.
- (2011) Proceedings of the International Conference on Machine Learning (ICML 2011) , pp. 282-289
- Lafferty, J.¹ McCallum, A.² Pereira, F.³

30
- 84865112972
- ACM SIGGRAPH 2010 papers, New York, NY
- Lee, S. and Popovi, Z. (2010), "Learning behavior styles with inverse reinforcement learning" in ACM, New York, NY, pp. 1-7, ACM SIGGRAPH 2010 papers.
- (2010) Learning behavior styles with inverse reinforcement learning , pp. 1-7
- Lee, S.¹ Popovi, Z.²

31
- 70349966131
- Active learning for reward estimation in inverse reinforcement learning
- Lopes, M., Melo, F. and Montesano, L. (2009), "Active learning for reward estimation in inverse reinforcement learning" in Machine Learning and Knowledge Discovery in Databases, Vol. 5782, No. 1, pp. 31-46.
- (2009) Machine Learning and Knowledge Discovery in Databases , vol.5782 , Issue.1 , pp. 31-46
- Lopes, M.¹ Melo, F.² Montesano, L.³

32
- 79953146294
- Robot self-initiative and personalization by learning through repeated interactions
- Mason, M. and Lopes, M. (2011), "Robot self-initiative and personalization by learning through repeated interactions" in Proceedings of the 6th International Conference on Human-robot Interaction, pp. 433-40.
- (2011) Proceedings of the 6th International Conference on Human-Robot Interaction , pp. 433-440
- Mason, M.¹ Lopes, M.²

33
- 78049399307
- Learning from demonstration using MDP induced metrics
- Springer, Berlin
- Melo, F. and Lopes, M. (2010), "Learning from demonstration using MDP induced metrics" in Machine Learning and Knowledge Discovery in Databases, Springer, Berlin, pp. 385-401.
- (2010) Machine Learning and Knowledge Discovery in Databases , pp. 385-401
- Melo, F.¹ Lopes, M.²

34
- 84865148144
- A survey of POMDP solution techniques
- Murphy, K. (2000), "A survey of POMDP solution techniques" in Environment, Vol. 2, p. X3.
- (2000) Environment , vol.2
- Murphy, K.¹

35
- 80053212134
- Apprenticeship learning using inverse reinforcement learning and gradient methods
- Neu, G. and Szepesvári, C. (2007), "Apprenticeship learning using inverse reinforcement learning and gradient methods" in Proceedings of the Twenty-third Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI-07), pp. 295-302.
- (2007) Proceedings of the Twenty-Third Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI-07) , pp. 295-302
- Neu, G.¹ Szepesvári, C.²

36
- 0042547347
- Algorithms for inverse reinforcement learning
- Ng, A. and Russell, S. (2000), "Algorithms for inverse reinforcement learning" in Proceedings of the Seventeenth International Conference on Machine Learning, pp. 663-70.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning , pp. 663-670
- Ng, A.¹ Russell, S.²

37
- 0003212629
- Efficient training of artificial neural networks for autonomous navigation
- Pomerleau, D. (1991), "Efficient training of artificial neural networks for autonomous navigation" in Neural Computation, Vol. 3, No. 1, pp. 88-97.
- (1991) Neural Computation , vol.3 , Issue.1 , pp. 88-97
- Pomerleau, D.¹

38
- 85102627959
- Wiley, New York, NY
- Puterman, M. (1994), Markov Decision Processes: Discrete Stochastic Dynamic Programming, Wiley, New York, NY.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

39
- 80053156567
- Inverse reinforcement learning with Gaussian process
- Qiao, Q. and Beling, P. (2011), "Inverse reinforcement learning with Gaussian process" in American Control Conference (ACC), pp. 113-18.
- (2011) American Control Conference (ACC) , pp. 113-118
- Qiao, Q.¹ Beling, P.²

40
- 77956052826
- Bayesian inverse reinforcement learning
- Ramachandran, D. and Amir, E. (2007), "Bayesian inverse reinforcement learning", Proceedings of the 20th International Joint Conference on Artificial Intelligence.
- (2007) Proceedings of the 20th International Joint Conference on Artificial Intelligence
- Ramachandran, D.¹ Amir, E.²

41
- 33749252753
- Maximum margin planning
- Ratliff, N., Bagnell, J. and Zinkevich, M. (2006), "Maximum margin planning" in Proceedings of the 23rd International Conference on Machine Learning, pp. 729-36.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 729-736
- Ratliff, N.¹ Bagnell, J.² Zinkevich, M.³

42
- 67650957592
- Learning to search: Functional gradient techniques for imitation learning
- Ratliff, N., Silver, D. and Bagnell, J. (2009), "Learning to search: functional gradient techniques for imitation learning" in Autonomous Robots, No. 1, pp. 25-53.
- (2009) Autonomous Robots , Issue.1 , pp. 25-53
- Ratliff, N.¹ Silver, D.² Bagnell, J.³

43
- 0000033354
- An empirical Bayes approach to statistics
- Robbins, H. (1992), "An empirical Bayes approach to statistics" in Breakthroughs in Statistics: Foundations and Basic Theory, Vol. 1, p. 388.
- (1992) Breakthroughs in Statistics: Foundations and Basic Theory , vol.1 , pp. 388
- Robbins, H.¹

44
- 80052420104
- Preference elicitation and inverse reinforcement learning
- Rothkopf, C. and Dimitrakakis, C. (2011), "Preference elicitation and inverse reinforcement learning" in Proceedings of 22nd European Conference on Machine Learning ECML, Part III, LNAI 6913, pp. 34-48.
- (2011) Proceedings of 22nd European Conference on Machine Learning ECML, Part III, LNAI 6913 , pp. 34-48
- Rothkopf, C.¹ Dimitrakakis, C.²

45
- 0031640746
- Learning agents for uncertain environments (extended abstract)
- Russell, S. (1998), "Learning agents for uncertain environments (extended abstract)" in Proceedings of the Eleventh Annual Conference on Computational Learning Theory, pp. 101-3.
- (1998) Proceedings of the Eleventh Annual Conference on Computational Learning Theory , pp. 101-103
- Russell, S.¹

46
- 0033151712
- Is imitation learning the route to humanoid robots?
- Schaal, S. (1999), "Is imitation learning the route to humanoid robots?" in Trends in Cognitive Sciences, Vol. 3, No. 6, pp. 233-42.
- (1999) Trends in Cognitive Sciences , vol.3 , Issue.6 , pp. 233-242
- Schaal, S.¹

47
- 78650179844
- Modified reward function on abstract features in inverse reinforcement learning
- Springer
- Shen-yi, C., Hui, Q., Jia, F., Zhuo-jun, J., Miao-liang, Z., Springer (2010), "Modified reward function on abstract features in inverse reinforcement learning" in Journal of Zhejiang University - Science C, Vol. 11, No. 9, pp. 718-23.
- (2010) Journal of Zhejiang University - Science C , vol.11 , Issue.9 , pp. 718-723
- Shen-yi, C.¹ Hui, Q.² Jia, F.³ Zhuo-jun, J.⁴ Miao-liang, Z.⁵

48
- 33845622083
- Inverse reinforcement learning with evaluation
- Silva, V., Costa, A. and Lima, P. (2006), "Inverse reinforcement learning with evaluation" in IEEE International Conference on Robotics and Automation (ICRA06), Orlando, FL, USA, pp. 4246-51.
- (2006) IEEE International Conference on Robotics and Automation (ICRA06), Orlando, FL, USA , pp. 4246-4251
- Silva, V.¹ Costa, A.² Lima, P.³

49
- 77957947591
- Learning from demonstration for autonomous navigation in complex unstructured terrain
- Silver, D., Bagnell, J. and Stentz, A. (2010), "Learning from demonstration for autonomous navigation in complex unstructured terrain" in The International Journal of Robotics Research, Vol. 29, No. 12, p. 1565.
- (2010) The International Journal of Robotics Research , vol.29 , Issue.12 , pp. 1565
- Silver, D.¹ Bagnell, J.² Stentz, A.³

50
- 79957999943
- Perceptual interpretation for autonomous navigation through dynamic imitation learning
- Silver, D., Bagnell, J. and Stentz, A. (2011), "Perceptual interpretation for autonomous navigation through dynamic imitation learning" in International Symposium on Robotics Research, pp. 433-49.
- (2011) International Symposium on Robotics Research , pp. 433-449
- Silver, D.¹ Bagnell, J.² Stentz, A.³

51
- 0003871607
- PhD thesis, Stanford University, Stanford, CA
- Sondik, E. (1971), "The optimal control of partially observable Markov processes", Stanford University, Stanford, CA, PhD thesis.
- (1971) The optimal control of partially observable Markov processes
- Sondik, E.¹

52
- 0003420416
- MIT Press, Cambridge, MA
- Sutton, R. and Barto, A. (1998), Introduction to Reinforcement Learning, MIT Press, Cambridge, MA.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.¹ Barto, A.²

53
- 85162012324
- A game-theoretic approach to apprenticeship learning
- MIT Press, Cambridge, MA
- Syed, U. and Schapire, R. (2008), "A game-theoretic approach to apprenticeship learning" in Advances in Neural Information Processing Systems, MIT Press, Cambridge, MA, pp. 1449-56.
- (2008) Advances in Neural Information Processing Systems , pp. 1449-1456
- Syed, U.¹ Schapire, R.²

54
- 77955839705
- Parameterized maneuver learning for autonomous helicopter flight
- Tang, J., Singh, A., Goehausen, N. and Abbeel, P. (2010), "Parameterized maneuver learning for autonomous helicopter flight" in Robotics and Automation (ICRA), 2010 IEEE International Conference on, pp. 1142-8.
- (2010) Robotics and Automation (ICRA), 2010 IEEE International Conference on , pp. 1142-1148
- Tang, J.¹ Singh, A.² Goehausen, N.³ Abbeel, P.⁴

55
- 77955836276
- Reinforcement learning of motor skills in high dimensions: A path integral approach
- Theodorou, E., Buchli, J. and Schaal, S. (2010), "Reinforcement learning of motor skills in high dimensions: a path integral approach" in Robotics and Automation (ICRA), 2010 IEEE International Conference on, pp. 2397-403.
- (2010) Robotics and Automation (ICRA), 2010 IEEE International Conference on , pp. 2397-2403
- Theodorou, E.¹ Buchli, J.² Schaal, S.³

56
- 84866840265
- Enabling environment design via active indirect elicitation
- Zhang, H. and Parkes, D. (2008), "Enabling environment design via active indirect elicitation", Proceedings Workshop on Preference Handling, Chicago, IL.
- (2008) Proceedings Workshop on Preference Handling, Chicago, IL
- Zhang, H.¹ Parkes, D.²

57
- 57749097473
- Maximum entropy inverse reinforcement learning
- Ziebart, B., Maas, A., Bagnell, J. and Dey, A. (2008), "Maximum entropy inverse reinforcement learning" in Proceedings 23rd AAAI Conference Artificial Intelligence, pp. 1433-8.
- (2008) Proceedings 23rd AAAI Conference Artificial Intelligence , pp. 1433-1438
- Ziebart, B.¹ Maas, A.² Bagnell, J.³ Dey, A.⁴

58
- 0003949807
- Cambridge University Press, New York, NY
- Leishman, J. (2006), Principles of Helicopter Aerodynamics, Cambridge University Press, New York, NY.
- (2006) Principles of Helicopter Aerodynamics
- Leishman, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.