SCOPUS Information Search Platform

Advanced Information and Knowledge Processing

Volume , Issue , 2007, Pages 251-275

Reinforcement Agents for E-Learning Applications

(3) Tizhoosh, Hamid R. a; Shokri, Maryam a; Kamel, Mohamed a

Author keywords

Learning Object; Markov Decision Process; Multiagent System; Partially Observable Markov Decision Process; Reinforcement Agent

Indexed keywords

EID: 85076593059 PISSN: 16103947 EISSN: 21978441 Source Type: Book Series
DOI: 10.1007/978-1-84628-758-9_9 Document Type: Chapter

Times cited : (3)

References (39)

1
- 85078114915
- The Netherlands: IEEE SMC
- Ayesh, A. (2004) Emotionally Motivated Reinforcement Learning Based Controller. The Hague, The Netherlands: IEEE SMC.
- (2004) Emotionally Motivated Reinforcement Learning Based Controller. The Hague
- Ayesh, A.¹

2
- 0028731609
- Fuzzy Q-learning: A new approach for fuzzy dynamic programming problems
- Orlando, FL
- Berenji, H.R. (1994) Fuzzy Q-learning: a new approach for fuzzy dynamic programming problems. Third IEEE International Conference on Fuzzy Systems, Orlando, FL.
- (1994) Third IEEE International Conference on Fuzzy Systems
- Berenji, H.R.¹

3
- 84899032145
- All learning is local: Multi-agent learning in global reward games, Advances in Neural Information Processing Systems 16
- Chang, Y.H., Ho, T., Kaelbling, L.P. (2004) All learning is local: Multi-agent learning in global reward games, Advances in Neural Information Processing Systems 16, Vancouver, (NIPS-03).
- (2004) Vancouver
- Chang, Y.H.¹ Ho, T.² Kaelbling, L.P.³

4
- 85078188985
- AAMAS03, Melbourne, Australia
- Chalkiadakis, G., Boutilier, C. (2003) Coordination in Multiagent Reinforcement Learning: A Bayesian Approach, AAMAS03, Melbourne, Australia, 1418.
- (2003) Coordination in Multiagent Reinforcement Learning: A Bayesian Approach
- Chalkiadakis, G.¹ Boutilier, C.²

5
- 85078179656
- Claus, C., Boutilier, C. (1998) The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, Department of Computer Science, University of British Columbia, Canada (American Association for Artificial Intelligence).
- (1998) The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, Department of Computer Science, University of British Columbia, Canada (American Association for Artificial Intelligence)
- Claus, C.¹ Boutilier, C.²

6
- 85040789943
- Department of Computer Science, University of British Columbia, Vancouver, Canada Computer Science Division, University of California Berkeley
- Dearden, R., Friedman, N., Russell, S. (1998) Bayesian Q-learning, Department of Computer Science, University of British Columbia, Vancouver, Canada Computer Science Division, University of California Berkeley.
- (1998) Bayesian Q-Learning
- Dearden, R.¹ Friedman, N.² Russell, S.³

7
- 0009835609
- Edinburgh: PhD Thesis, University of Edinburgh
- Gadanho, S. (1999) Reinforcement Learning in Autonomous Robots: An Empirical Investigation of the Role of Emotions. Edinburgh: PhD Thesis, University of Edinburgh.
- (1999) Reinforcement Learning in Autonomous Robots: An Empirical Investigation of the Role of Emotions
- Gadanho, S.¹

8
- 0035247059
- An Introduction to hidden Markov models and Bayesian networks
- Ghahramani, Z. (2001) An Introduction to hidden Markov models and Bayesian networks. International Journal of Pattern Recognition and Artificial Intelligence, 15(1):9–42.
- (2001) International Journal of Pattern Recognition and Artificial Intelligence , vol.15 , Issue.1 , pp. 9-42
- Ghahramani, Z.¹

9
- 0028730301
- Fuzzy Q-learning and dynamical fuzzy Q-learning
- Glorennec, P.Y. (1994) Fuzzy Q-learning and dynamical fuzzy Q-learning. Proceedings of the Third IEEE International Conference on Fuzzy Systems, IEEE Press, Piscataway, NJ, pp. 474–479.
- (1994) Proceedings of the Third IEEE International Conference on Fuzzy Systems, IEEE Press, Piscataway, NJ , pp. 474-479
- Glorennec, P.Y.¹

10
- 0030711314
- Fuzzy Q-Learning
- Glorennec, P.Y., Jouffe, L. (1997) Fuzzy Q-Learning. Proceedings of Sixth International Conference on Fuzzy Systems, Barcelona, Spain, pp. 659–662.
- (1997) Proceedings of Sixth International Conference on Fuzzy Systems, Barcelona, Spain , pp. 659-662
- Glorennec, P.Y.¹ Jouffe, L.²

11
- 85078215532
- Mixed-Initiative Interaction, IEEE Intelligent Systems, September/October
- Hearst, M.A. (1999) Trends & Controversies, Mixed-Initiative Interaction, IEEE Intelligent Systems, September/October.
- (1999) Trends & Controversies
- Hearst, M.A.¹

12
- 0032634178
- May
- Horvitz, E. (May, 1999) Principles of Mixed-Initiative User Interfaces. Proceedings of CHI’99, ACM SIGCHI Conference on Human Factors in Computing Systems, Pittsburgh, PA.
- (1999) Principles of Mixed-Initiative User Interfaces. Proceedings of CHI’99, ACM SIGCHI Conference on Human Factors in Computing Systems, Pittsburgh, PA
- Horvitz, E.¹

13
- 85153938292
- Reinforcement learning algorithm for partially observable markov decision problems
- Jaakkola, T., Singh, S.P., Jordan, M.I. (1994) Reinforcement learning algorithm for partially observable markov decision problems, In Advances in Neural Information Processing Systems (NIPS), 7.
- (1994) Advances in Neural Information Processing Systems (NIPS) , vol.7
- Jaakkola, T.¹ Singh, S.P.² Jordan, M.I.³

14
- 0032140718
- Fuzzy inference system learning by reinforcement methods
- Jouffe, L. (1999) Fuzzy inference system learning by reinforcement methods, IEEE Transactions on Systems, Man and Cybernetics, 28:338–355.
- (1999) IEEE Transactions on Systems, Man and Cybernetics , vol.28 , pp. 338-355
- Jouffe, L.¹

15
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L.P., Littman, M.L., Cassandra, A.R. (1998) Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99–134.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

16
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L.P., Littman, M.L., Moore, A.W. (1996) Reinforcement learning: a survey. Journal of Artificial Intelligence Research, 4:237–285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

17
- 17444405333
- Hidden Markov models with states depending on observations source, Pattern Recognition Letters Archive, New York
- Li, Y. (2005) Hidden Markov models with states depending on observations source, Pattern Recognition Letters Archive, New York, NY: Elsevier Science Inc. 26(7): 977–984.
- (2005) NY: Elsevier Science Inc. , vol.26 , Issue.7 , pp. 977-984
- Li, Y.¹

18
- 85138579181
- Learning Policies for Partially Observable Environments: Scaling Up
- Littman, M.L., Cassandra, A.R., Kaelbling, L.P. (1995) Learning Policies for Partially Observable Environments: Scaling Up, Proceedings of the Twelfth International Conference on Machine Learning.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning
- Littman, M.L.¹ Cassandra, A.R.² Kaelbling, L.P.³

19
- 0141819580
- PEGASUS: A policy search method for large MDPs and POMDPs
- UAI), Proceedings of the Sixteenth Conference
- Ng, A.Y., Jordan, M.I. (2000) PEGASUS: A policy search method for large MDPs and POMDPs, Uncertainty in artificial intelligence (UAI), Proceedings of the Sixteenth Conference.
- (2000) Uncertainty in Artificial Intelligence
- Ng, A.Y.¹ Jordan, M.I.²

20
- 85078084757
- POMDPs for Dummies, Subtitled: POMDPs and Their Algorithms, Sans Formula!
- Online Tutorial, Brown University, Department of Computer Science, POMDPs for Dummies, Subtitled: POMDPs and Their Algorithms, Sans Formula!, http://www.cs.brown.edu/research/ai/pomdp/tutorial/index.html.
- Online Tutorial, Brown University, Department of Computer Science

21
- 0003391330
- San Mateo, CA: Morgan Kaufmann Publishers
- Pearl, J. (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Mateo, CA: Morgan Kaufmann Publishers.
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
- Pearl, J.¹

22
- 51649118863
- New York: Springer-Verlag
- Pham, T.D. (2002). Perception-Based Hidden Markov Models: A Theoretical Framework for Data Mining and Knowledge Discovery. Soft Computing, 6: 400–405. New York: Springer-Verlag.
- (2002) Perception-Based Hidden Markov Models: A Theoretical Framework for Data Mining and Knowledge Discovery. Soft Computing, 6 , pp. 400-405
- Pham, T.D.¹

23
- 0036570250
- Reinforcement learning agent
- Ribeiro, C. (2002) Reinforcement learning agent. Artificial Intelligence Review 17:223–250.
- (2002) Artificial Intelligence Review , vol.17 , pp. 223-250
- Ribeiro, C.¹

24
- 84880707672
- Spoken dialogue management using probabilistic reasoning
- Hong Kong
- Roy, N., Pineau, J., Thrun, S. (2000) Spoken dialogue management using probabilistic reasoning, In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), Hong Kong.
- (2000) Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000)
- Roy, N.¹ Pineau, J.² Thrun, S.³

25
- 0003584577
- NJ: Pearson Education Inc
- Russell, S.J., Norvig, P. (2003) Artificial Intelligence: A Modern Approach. NJ: Pearson Education Inc.
- (2003) Artificial Intelligence: A Modern Approach
- Russell, S.J.¹ Norvig, P.²

26
- 85078090479
- Sarawagi, S., Cohen, W.W. (2004) Semi-Markov Conditional Random Fields for Information Extraction, NIPS 2004 (Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, NIPS 2004, December 13–18, 2004, Vancouver, British Columbia, Canada]).
- (2004) Semi-Markov Conditional Random Fields for Information Extraction, NIPS 2004 (Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, NIPS 2004, December 13–18, 2004, Vancouver, British Columbia, Canada])
- Sarawagi, S.¹ Cohen, W.W.²

27
- 85078145845
- Advanced Topics in Artificial Intelligence, University of Waterloo
- Shokri, M. (2004) Adjustable Autonomy in Reinforced Image Thresholding, Report, CS 886: Advanced Topics in Artificial Intelligence, University of Waterloo.
- (2004) Adjustable Autonomy in Reinforced Image Thresholding, Report, CS 886
- Shokri, M.¹

28
- 0141762732
- Using Reinforcement Learning for Image Thresholding
- Shokri, M., Tizhoosh, H.R. (2003) Using Reinforcement Learning for Image Thresholding, Canadian Conference on Electrical and Computer Engineering, 1:1231–1234.
- (2003) Canadian Conference on Electrical and Computer Engineering , vol.1 , pp. 1231-1234
- Shokri, M.¹ Tizhoosh, H.R.²

29
- 4544335895
- Canadian Conference on Computer and Robot Vision
- Shokri, M., Tizhoosh, H.R. (2004) Q(λ)-Based Image Thresholding, Canadian Conference on Computer and Robot Vision.
- (2004) Q(λ)-Based Image Thresholding
- Shokri, M.¹ Tizhoosh, H.R.²

30
- 0003631802
- Massachusetts Institute of Technology, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Department of Brain and Cognitive Science
- Smyth, P., Heckerman, D., Jordan, M. (1996) Probabilistic Independence Networks for Hidden Markov Models, Massachusetts Institute of Technology, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Department of Brain and Cognitive Science.
- (1996) Probabilistic Independence Networks for Hidden Markov Models
- Smyth, P.¹ Heckerman, D.² Jordan, M.³

31
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R.S., Barto, A.G. (1998) Reinforcement Learning: An Introduction, Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

32
- 85078141188
- Thacker, N.A., Lacey, A.J. (1998) Tutorial: The Kalman Filter, Imaging Science and Biomedical Engineering Division, Medical School, University of Manchester, Stopford Building, Oxford Road, Manchester, M13 9PT.
- (1998) Tutorial: The Kalman Filter, Imaging Science and Biomedical Engineering Division, Medical School, University of Manchester, Stopford Building, Oxford Road, Manchester, M13 9PT
- Thacker, N.A.¹ Lacey, A.J.²

33
- 5444243723
- A Framework for the initialization of student models in Web-based intelligent tutoring systems
- Tsiriga, V., Virvou, M. (2004) A Framework for the initialization of student models in Web-based intelligent tutoring systems. User Modeling and User-Adapted Interaction, 14:289–316.
- (2004) User Modeling and User-Adapted Interaction , vol.14 , pp. 289-316
- Tsiriga, V.¹ Virvou, M.²

34
- 14344279109
- An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
- Walker, M.A. (2000) An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email, Journal of Artificial Intelligence Research (JAIR), 12:387–416.
- (2000) Journal of Artificial Intelligence Research (JAIR) , vol.12 , pp. 387-416
- Walker, M.A.¹

35
- 0004049893
- Cambridge: Cambridge University
- Watkins, C.J.H. (1989) Learning from Delayed Rewards. Cambridge: Cambridge University.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.H.¹

36
- 34249833101
- Technical note, Q-learning
- Watkins, C.J.H., Dayan, P. (1992) Technical note, Q-learning. Machine Learning, 8:279–292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.H.¹ Dayan, P.²

37
- 0003504917
- Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes
- Bled, Slovenia, June 27–30. (nominated for best paper award at ICML-99)
- Wang, G., Mahadevan, S. (1999) Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes, Proceedings of the 16th International Conference on Machine Learning (ICML ’99), Bled, Slovenia, June 27–30. (nominated for best paper award at ICML-99).
- (1999) Proceedings of the 16th International Conference on Machine Learning (ICML ’99)
- Wang, G.¹ Mahadevan, S.²

38
- 0036644611
- Maximum entropy-based optimal threshold selection using deterministic reinforcement learning with controlled randomization
- Yin, P.Y. (2002) Maximum entropy-based optimal threshold selection using deterministic reinforcement learning with controlled randomization. Signal Processing 82:993–1006.
- (2002) Signal Processing , vol.82 , pp. 993-1006
- Yin, P.Y.¹

39
- 85078082004
- Zhang, W., Dietterich, T.G. (1995) Value Function Approximations and Job-Shop Scheduling, Submitted to the Workshop on Value Function Approximation in Reinforcement Learning at ICML-95.
- (1995) Value Function Approximations and Job-Shop Scheduling, Submitted to the Workshop on Value Function Approximation in Reinforcement Learning at ICML-95
- Zhang, W.¹ Dietterich, T.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.