메뉴 건너뛰기




Volumn 9, Issue 1, 2017, Pages 44-55

Learning From Explanations Using Sentiment and Advice in RL

Author keywords

Advice; reinforcement learning (RL); sentiment

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; PASSIVE FILTERS; SOFTWARE AGENTS; TEACHING;

EID: 85015665009     PISSN: 23798920     EISSN: 23798939     Source Type: Journal    
DOI: 10.1109/TCDS.2016.2628365     Document Type: Article
Times cited : (72)

References (22)
  • 1
    • 31844444663 scopus 로고    scopus 로고
    • Apprenticeship learning via inverse reinforcement learning
    • Banff, AB, Canada
    • P. Abbeel and A. Y. Ng, "Apprenticeship learning via inverse reinforcement learning," in Proc. 21st Int. Conf. Mach. Learn., Banff, AB, Canada, 2004, p. 1.
    • (2004) Proc. 21st Int. Conf. Mach. Learn. , pp. 1
    • Abbeel, P.1    Ng, A.Y.2
  • 3
  • 6
    • 56449093331 scopus 로고    scopus 로고
    • An object-oriented representation for efficient reinforcement learning
    • Helsinki, Finland
    • C. Diuk, A. Cohen, and M. L. Littman, "An object-oriented representation for efficient reinforcement learning," in Proc. 25th Int. Conf. Mach. Learn., Helsinki, Finland, 2008, pp. 240-247.
    • (2008) Proc. 25th Int. Conf. Mach. Learn. , pp. 240-247
    • Diuk, C.1    Cohen, A.2    Littman, M.L.3
  • 11
    • 84964661682 scopus 로고    scopus 로고
    • Grounding english commands to reward functions
    • Rome, Italy, Jul.
    • J. MacGlashan et al., "Grounding English commands to reward functions," in Proc. Robot. Sci. Syst., Rome, Italy, Jul. 2015.
    • (2015) Proc. Robot. Sci. Syst.
    • MacGlashan, J.1
  • 12
    • 29344474034 scopus 로고    scopus 로고
    • Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression
    • Pittsburgh, PA, USA
    • R. Maclin, J. Shavlik, L. Torrey, T. Walker, and E. Wild, "Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression," in Proc. Nat. Conf. Artif. Intell., vol. 20. Pittsburgh, PA, USA, 2005, pp. 819-824.
    • (2005) Proc. Nat. Conf. Artif. Intell. , vol.20 , pp. 819-824
    • Maclin, R.1    Shavlik, J.2    Torrey, L.3    Walker, T.4    Wild, E.5
  • 15
    • 84859895244 scopus 로고    scopus 로고
    • Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales
    • Ann Arbor, MI, USA
    • B. Pang and L. Lee, "Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales," in Proc. 43rd Annu. Meeting Assoc. Comput. Linguist., Ann Arbor, MI, USA, 2005, pp. 115-124.
    • (2005) Proc. 43rd Annu. Meeting Assoc. Comput. Linguist. , pp. 115-124
    • Pang, B.1    Lee, L.2
  • 16
    • 48449095896 scopus 로고    scopus 로고
    • Opinion mining and sentiment analysis
    • B. Pang and L. Lee, "Opinion mining and sentiment analysis," Found. Trends Inf. Retrieval, vol. 2, nos. 1-2, pp. 1-135, 2008.
    • (2008) Found. Trends Inf. Retrieval , vol.2 , Issue.1-2 , pp. 1-135
    • Pang, B.1    Lee, L.2
  • 17
    • 84865040660 scopus 로고    scopus 로고
    • Instructing a reinforcement learner
    • Marco Island, FL, USA
    • M. S. Sivamurugan and B. Ravindran, "Instructing a reinforcement learner," in Proc. FLAIRS Conf., Marco Island, FL, USA, 2012.
    • (2012) Proc. FLAIRS Conf.
    • Sivamurugan, M.S.1    Ravindran, B.2
  • 19
    • 0019891981 scopus 로고
    • Selection by consequences
    • B. F. Skinner, "Selection by consequences," Science, vol. 213, no. 4507, pp. 501-504, 1981.
    • (1981) Science , vol.213 , Issue.4507 , pp. 501-504
    • Skinner, B.F.1
  • 20
    • 84926358845 scopus 로고    scopus 로고
    • Recursive deep models for semantic compositionality over a sentiment treebank
    • Seattle, WA, USA
    • R. Socher et al., "Recursive deep models for semantic compositionality over a sentiment treebank," in Proc. Conf. Empir. Methods Nat. Lang. Process. (EMNLP), vol. 1631. Seattle, WA, USA, pp. 1631-1642, 2013.
    • (2013) Proc. Conf. Empir. Methods Nat. Lang. Process. (EMNLP) , vol.1631 , pp. 1631-1642
    • Socher, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.