SCOPUS Information Search Platform

IEEE Transactions on Cognitive and Developmental Systems

Volume 9, Issue 1, 2017, Pages 44-55

Learning From Explanations Using Sentiment and Advice in RL

Authors (6): Krening, Samantha (a); Harrison, Brent (a); Feigh, Karen M (a); Isbell, Charles Lee (a); Riedl, Mark (a); Thomaz, Andrea (a)

a GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Advice; reinforcement learning (RL); sentiment

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; PASSIVE FILTERS; SOFTWARE AGENTS; TEACHING;

ADVICE; COGNITIVE LOADS; FALSE NEGATIVES; MACHINE LEARNING TECHNIQUES; REINFORCEMENT LEARNING AGENT; SENTIMENT; SENTIMENT ANALYSIS; STATE INFORMATION;

REINFORCEMENT LEARNING;

EID: 85015665009
PISSN: 23798920
EISSN: 23798939
Source Type: Journal
DOI: 10.1109/TCDS.2016.2628365
Document Type: Article

Times cited: 72

References (22)

1
- 31844444663
- Apprenticeship learning via inverse reinforcement learning
- Banff, AB, Canada
- P. Abbeel and A. Y. Ng, "Apprenticeship learning via inverse reinforcement learning," in Proc. 21st Int. Conf. Mach. Learn., Banff, AB, Canada, 2004, p. 1.
- (2004) Proc. 21st Int. Conf. Mach. Learn. , pp. 1
- Abbeel, P.¹ Ng, A.Y.²

2
- 69549135371
- Learning robot motion control with demonstration and advice-operators
- Nice, France
- B. D. Argall, B. Browning, and M. Veloso, "Learning robot motion control with demonstration and advice-operators," in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS), Nice, France, 2008, pp. 399-404.
- (2008) Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS) , pp. 399-404
- Argall, B.D.¹ Browning, B.² Veloso, M.³

3
- 63149159130
- A survey of robot learning from demonstration
- B. D. Argall, S. Chernova, M. Veloso, and B. Browning, "A survey of robot learning from demonstration," Robot. Auton. Syst., vol. 57, no. 5, pp. 469-483, 2009.
- (2009) Robot. Auton. Syst. , vol.57 , Issue.5 , pp. 469-483
- Argall, B.D.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

4
- 84899837449
- Robot learning from human teachers
- S. Chernova and A. L. Thomaz, "Robot learning from human teachers," Synth. Lectures Artif. Intell. Mach. Learn., vol. 8, no. 3, pp. 1-121, 2014.
- (2014) Synth. Lectures Artif. Intell. Mach. Learn. , vol.8 , Issue.3 , pp. 1-121
- Chernova, S.¹ Thomaz, A.L.²

5
- 84899422939
- Object focused Q-learning for autonomous agents
- St. Paul, MN, USA
- L. C. Cobo, C. L. Isbell, and A. L. Thomaz, "Object focused Q-learning for autonomous agents," in Proc. Int. Conf. Auton. Agents Multi Agent Syst., St. Paul, MN, USA, 2013, pp. 1061-1068.
- (2013) Proc. Int. Conf. Auton. Agents Multi Agent Syst. , pp. 1061-1068
- Cobo, L.C.¹ Isbell, C.L.² Thomaz, A.L.³

6
- 56449093331
- An object-oriented representation for efficient reinforcement learning
- Helsinki, Finland
- C. Diuk, A. Cohen, and M. L. Littman, "An object-oriented representation for efficient reinforcement learning," in Proc. 25th Int. Conf. Mach. Learn., Helsinki, Finland, 2008, pp. 240-247.
- (2008) Proc. 25th Int. Conf. Mach. Learn. , pp. 240-247
- Diuk, C.¹ Cohen, A.² Littman, M.L.³

7
- 84899032502
- Policy shaping: Integrating human feedback with reinforcement learning
- S. Griffith, K. Subramanian, J. Scholz, C. Isbell, and A. L. Thomaz, "Policy shaping: Integrating human feedback with reinforcement learning," in Proc. Adv. Neural Inf. Process. Syst., 2013, pp. 2625-2633.
- (2013) Proc. Adv. Neural Inf. Process. Syst. , pp. 2625-2633
- Griffith, S.¹ Subramanian, K.² Scholz, J.³ Isbell, C.⁴ Thomaz, A.L.⁵

8
- 84876838136
- Object-oriented representation and hierarchical reinforcement learning in Infinite Mario
- Athens, Greece
- M. Joshi, R. Khobragade, S. Sarda, U. Deshpande, and S. Mohan, "Object-oriented representation and hierarchical reinforcement learning in Infinite Mario," in Proc. IEEE 24th Int. Conf. Tools Artif. Intell. (ICTAI), vol. 1. Athens, Greece, 2012, pp. 1076-1081.
- (2012) Proc. IEEE 24th Int. Conf. Tools Artif. Intell. (ICTAI) , vol.1 , pp. 1076-1081
- Joshi, M.¹ Khobragade, R.² Sarda, S.³ Deshpande, U.⁴ Mohan, S.⁵

9
- 85014298650
- Object-focused advice in reinforcement learning
- Singapore
- S. Krening, B. Harrison, K. M. Feigh, C. Isbell, and A. Thomaz, "Object-focused advice in reinforcement learning," in Proc. Int. Conf. Auton. Agents Multi Agent Syst., Singapore, 2016, pp. 1447-1448.
- (2016) Proc. Int. Conf. Auton. Agents Multi Agent Syst. , pp. 1447-1448
- Krening, S.¹ Harrison, B.² Feigh, K.M.³ Isbell, C.⁴ Thomaz, A.⁵

10
- 32144462307
- Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer
- San Jose, CA, USA
- G. Kuhlmann, P. Stone, R. Mooney, and J. Shavlik, "Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer," in Proc. AAAI Workshop Supervisory Control Learn. Adapt. Syst., San Jose, CA, USA, 2004.
- (2004) Proc. AAAI Workshop Supervisory Control Learn. Adapt. Syst.
- Kuhlmann, G.¹ Stone, P.² Mooney, R.³ Shavlik, J.⁴

11
- 84964661682
- Grounding English commands to reward functions
- Rome, Italy, Jul.
- J. MacGlashan et al., "Grounding English commands to reward functions," in Proc. Robot. Sci. Syst., Rome, Italy, Jul. 2015.
- (2015) Proc. Robot. Sci. Syst.
- MacGlashan, J.¹

12
- 29344474034
- Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression
- Pittsburgh, PA, USA
- R. Maclin, J. Shavlik, L. Torrey, T. Walker, and E. Wild, "Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression," in Proc. Nat. Conf. Artif. Intell., vol. 20. Pittsburgh, PA, USA, 2005, pp. 819-824.
- (2005) Proc. Nat. Conf. Artif. Intell. , vol.20 , pp. 819-824
- Maclin, R.¹ Shavlik, J.² Torrey, L.³ Walker, T.⁴ Wild, E.⁵

13
- 85117622017
- The Stanford CoreNLP natural language processing toolkit
- Baltimore, MD, USA
- C. D. Manning et al., "The Stanford CoreNLP natural language processing toolkit," in Proc. 52nd Annu. Meeting Assoc. Comput. Linguist. Syst. Demonstrations, Baltimore, MD, USA, 2014, pp. 55-60.
- (2014) Proc. 52nd Annu. Meeting Assoc. Comput. Linguist. Syst. Demonstrations , pp. 55-60
- Manning, C.D.¹

14
- 84911454077
- An interactive approach for situated task specification through verbal instructions
- Paris, France
- C. Meriçli, S. D. Klee, J. Paparian, and M. Veloso, "An interactive approach for situated task specification through verbal instructions," in Proc. Int. Conf. Auton. Agents Multi Agent Syst., Paris, France, 2014, pp. 1069-1076.
- (2014) Proc. Int. Conf. Auton. Agents Multi Agent Syst. , pp. 1069-1076
- Meriçli, C.¹ Klee, S.D.² Paparian, J.³ Veloso, M.⁴

15
- 84859895244
- Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales
- Ann Arbor, MI, USA
- B. Pang and L. Lee, "Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales," in Proc. 43rd Annu. Meeting Assoc. Comput. Linguist., Ann Arbor, MI, USA, 2005, pp. 115-124.
- (2005) Proc. 43rd Annu. Meeting Assoc. Comput. Linguist. , pp. 115-124
- Pang, B.¹ Lee, L.²

16
- 48449095896
- Opinion mining and sentiment analysis
- B. Pang and L. Lee, "Opinion mining and sentiment analysis," Found. Trends Inf. Retrieval, vol. 2, nos. 1-2, pp. 1-135, 2008.
- (2008) Found. Trends Inf. Retrieval , vol.2 , Issue.1-2 , pp. 1-135
- Pang, B.¹ Lee, L.²

17
- 84865040660
- Instructing a reinforcement learner
- Marco Island, FL, USA
- M. S. Sivamurugan and B. Ravindran, "Instructing a reinforcement learner," in Proc. FLAIRS Conf., Marco Island, FL, USA, 2012.
- (2012) Proc. FLAIRS Conf.
- Sivamurugan, M.S.¹ Ravindran, B.²

18
- 0004168373
- New York, NY, USA: D. Appleton
- B. F. Skinner, The Behavior of Organisms: An Experimental Analysis. New York, NY, USA: D. Appleton, 1938.
- (1938) The Behavior of Organisms: An Experimental Analysis
- Skinner, B.F.¹

19
- 0019891981
- Selection by consequences
- B. F. Skinner, "Selection by consequences," Science, vol. 213, no. 4507, pp. 501-504, 1981.
- (1981) Science , vol.213 , Issue.4507 , pp. 501-504
- Skinner, B.F.¹

20
- 84926358845
- Recursive deep models for semantic compositionality over a sentiment treebank
- Seattle, WA, USA
- R. Socher et al., "Recursive deep models for semantic compositionality over a sentiment treebank," in Proc. Conf. Empir. Methods Nat. Lang. Process. (EMNLP), Seattle, WA, USA, 2013, pp. 1631-1642.
- (2013) Proc. Conf. Empir. Methods Nat. Lang. Process. (EMNLP) , pp. 1631-1642
- Socher, R.¹

21
- 0004102479
- Cambridge, MA, USA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, vol. 1. Cambridge, MA, USA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction , vol.1
- Sutton, R.S.¹ Barto, A.G.²

22
- 79959428573
- The 2009 Mario AI competition
- Barcelona, Spain
- J. Togelius, S. Karakovskiy, and R. Baumgarten, "The 2009 Mario AI competition," in Proc. IEEE Congr. Evol. Comput. (CEC), Barcelona, Spain, 2010, pp. 1-8.
- (2010) Proc. IEEE Congr. Evol. Comput. (CEC) , pp. 1-8
- Togelius, J.¹ Karakovskiy, S.² Baumgarten, R.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.