Volume 11, Issue 6, 2011, Pages 4097-4109

Knowledge of opposite actions for reinforcement learning

Author keywords

NOQ(λ) algorithm; Opposite action; Opposition weight; Opposition-based learning (OBL); OQ(λ) algorithm; Q(λ); Reinforcement learning

Indexed keywords

ACTION SPACES; DISCRETE STATE; FASTER CONVERGENCE; MACHINE INTELLIGENCE; MODEL FREE; OPPOSITE ACTION; OPPOSITION WEIGHT; OPPOSITION-BASED LEARNING; OPTIMAL POLICIES; REAL-WORLD PROBLEM; STATE SPACE; VALUE FUNCTIONS

EID: 79956159518     PISSN: 15684946     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.asoc.2011.01.045     Document Type: Article
Times cited : (18)

References (39)
  • 1. A. Ayesh, Emotionally motivated reinforcement learning based controller, IEEE SMC, The Hague, The Netherlands, 2004.
  • 2. H.R. Berenji, Fuzzy Q-learning: a new approach for fuzzy dynamic programming, IEEE World Congress on Computational Intelligence, Fuzzy Systems, vol. 1, 1994, pp. 486-491.
  • 4. Collins Cobuild English Dictionary, HarperCollins Publishers, 77-85 Fulham Palace Road, London, England, 2000.
  • 5. R.H. Crites, A.G. Barto, Improving elevator performance using reinforcement learning, in: D.S. Touretzky, M.C. Mozer, M.E. Hasselmo (Eds.), Advances in Neural Information Processing Systems, vol. 8, MIT Press, Cambridge, MA, 1996.
  • 6. K. Driessens, S. Dzeroski, Integrating guidance into relational reinforcement learning, Machine Learning, vol. 57, 2004, pp. 271-304.
  • 13. S. Mahadevan, L.P. Kaelbling, The NSF Workshop on Reinforcement Learning: Summary and Observations, AI Magazine, 1996.
  • 17. A. Potapov, M.K. Ali, Convergence of reinforcement learning algorithms and acceleration of learning, Physical Review E, vol. 67, 2003, 026706.
  • 19. S. Rahnamayan, H.R. Tizhoosh, M.M.A. Salama, Opposition-based differential evolution for optimization of noisy problems, IEEE Congress on Evolutionary Computation (CEC 2006), Vancouver, 2006, pp. 1865-1872.
  • 20. S. Rahnamayan, H.R. Tizhoosh, M.M.A. Salama, A novel population initialization method for accelerating evolutionary algorithms, Computers and Mathematics with Applications, vol. 53, no. 10, 2007, pp. 1605-1614. DOI: 10.1016/j.camwa.2006.07.013.
  • 24. D. Shapiro, P. Langley, R. Shachter, Using background knowledge to speed reinforcement learning in physical agents, AGENTS'01, Montreal, Quebec, Canada, 2001.
  • 27. M. Shokri, H.R. Tizhoosh, M.S. Kamel, Tradeoff between exploration and exploitation of OQ(λ) with non-Markovian update in dynamic environments, IEEE International Joint Conference on Neural Networks, 2008, pp. 2916-2922.
  • 28. M. Shokri, H.R. Tizhoosh, M.S. Kamel, The concept of opposition and its use in Q-learning and Q(λ) techniques, in: H.R. Tizhoosh, M. Ventresca (Eds.), Oppositional Concepts in Computational Intelligence, Springer Physica-Verlag, 2008.
* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.