SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 1856, Issue , 2000, Pages 292-303

VQQL. Applying vector quantization to reinforcement learning

(2) Fernáandez, Fernando a Borrajo, Daniel a

a UNIVERSIDAD CARLOS III DE MADRID (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

CLUSTERING ALGORITHMS; DIGITAL TO ANALOG CONVERSION; LEARNING ALGORITHMS; MACHINE LEARNING; QUANTIZATION (SIGNAL); REINFORCEMENT LEARNING; SPORTS; VECTORS;

CLUSTERING MECHANISM; DYNAMIC DOMAINS; GENERALIZED LLOYD ALGORITHM; NEW MECHANISMS; OPTIMAL POLICIES; REINFORCEMENT LEARNING TECHNIQUES; ROBOSOCCER SIMULATOR; VECTOR QUANTIZERS;

VECTOR QUANTIZATION;

EID: 84944872843 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-45327-x_24 Document Type: Conference Paper

Times cited : (11)

References (18)

1
- 33244460774
- Path-tracking control of non-holonomic car-like robot with reinforcement learning
- Manuela Veloso, editor, Stockholm, Sweden, July-August, IJCAI Press
- Jacky Baltes and Yuming Lin. Path-tracking control of non-holonomic car-like robot with reinforcement learning. In Manuela Veloso, editor, Working notes of the IJCAI’99 Third International Workshop on Robocup, pages 17-21, Stockholm, Sweden, July-August 1999. IJCAI Press.
- (1999) Working notes of the IJCAI’99 Third International Workshop on Robocup , pp. 17-21
- Baltes, J.¹ Lin, Y.²

2
- 85166207010
- Exploiting structure in policy construction
- Montreal, Quebec, Canada, August, Morgan Kaufmann
- Craig Boutilier, Richard Dearden, and Moises Goldszmidt. Exploiting structure in policy construction. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95), pages 1104-1111, Montreal, Quebec, Canada, August 1995. Morgan Kaufmann.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95) , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

3
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- David Chapman and Leslie P. Kaelbling. Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. Proceedings of the International Joint Conference on Artificial Intelligence, 1991.
- (1991) Proceedings of the International Joint Conference on Artificial Intelligence
- Chapman, D.¹ Kaelbling Leslie, P.²

4
- 0008364145
- Reinforcement learning using funtional approximation for generalization and their application to cart centering and fractal compression
- Thomas Dean, editor, Stockholm, Sweden, August
- C. Claussen, S. Gutta, and H. Wechsler. Reinforcement learning using funtional approximation for generalization and their application to cart centering and fractal compression. In Thomas Dean, editor, Proceedings of Sixteenth International Joint Coference on Artificial Intelligence, volume 2, pages 1362-1367, Stockholm, Sweden, August 1999.
- (1999) Proceedings of Sixteenth International Joint Coference on Artificial Intelligence , vol.2 , pp. 1362-1367
- Claussen, C.¹ Gutta, S.² Wechsler, H.³

5
- 0031370386
- Model minimization in markov decision processes
- AAAI Press
- Thomas Dean and Robert Givan. Model minimization in markov decision processes. In Proceedings of the American Association of Artificial Intelligence (AAAI-97). AAAI Press, 1997.
- (1997) Proceedings of the American Association of Artificial Intelligence (AAAI-97)
- Dean, T.¹ Givan, R.²

6
- 84943298044
- Message-based bucket brigade: An algorithm for the appointment of credit problem
- European Workshop on Machine Learning Yves Kodratoff, editor, Springer-Verlag
- Marco Dorigo. Message-based bucket brigade: An algorithm for the appointment of credit problem. In Yves Kodratoff, editor, Machine Learning. European Workshop on Machine Learning, LNAI 482, pages 235-244. Springer-Verlag, 1991.
- (1991) Machine Learning , vol.482 , pp. 235-244
- Dorigo, M.¹

7
- 0003959189
- Kluwer Academic Publishers
- Allen Gersho and Robert M. Gray. Vector Quantization and Signal Compression. Kluwer Academic Publishers, 1992.
- (1992) Vector Quantization and Signal Compression
- Gersho, A.¹ Gray Robert, M.²

8
- 0025489075
- The self-organizing map
- T. Kohonen. The self-organizing map. In Proceedings of IEEE, volume 2, pages 1464-1480, 1990.
- (1990) Proceedings of IEEE , vol.2 , pp. 1464-1480
- Kohonen, T.¹

9
- 0002224896
- Scaling-up reinforcement learning for robot control
- Amherst, MA, June, Morgan Kaufman
- Long-Ji Lin. Scaling-up reinforcement learning for robot control. In Proceedings of the Tenth International Conference on Machine Learning, pages 182-189, Amherst, MA, June 1993. Morgan Kaufman.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 182-189
- Lin, L.-J.¹

10
- 0018918171
- An algorithm for vector quantizer design
- Com-28, N 1
- Yoseph Linde, Andre Buzo, and Robet M. Gray. An algorithm for vector quantizer design. In IEEE Transactions on Communications, Vol1. Com-28, N 1, pages 8495, 1980.
- (1980) IEEE Transactions on Communications , vol.1 , pp. 8495
- Linde, Y.¹ Buzo, A.² Gray Robet, M.³

11
- 0020102027
- Least squares quantization in pcm
- number 28 in IT, March
- S. P. Lloyd. Least squares quantization in pcm. In IEEE Transactions on Information Theory, number 28 in IT, pages 127-135, March 1982.
- (1982) In IEEE Transactions on Information Theory , pp. 127-135
- LloydS, P.¹

12
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- S. Mahavedan and J. Connell. Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55: 311-365, 1992.
- (1992) Artificial Intelligence , vol.55 , pp. 311-365
- Mahavedan, S.¹ Connell, J.²

13
- 78649956702
- Explanation based learning: A comparison of symbolic and neural network approaches
- University of Massachusetts, Amherts, MA, USA, Morgan Kaufmann
- Tom M. Mitchell and Sebastian B. Thrun. Explanation based learning: A comparison of symbolic and neural network approaches. In Proceedings of the Tenth International Conference on Machine Learning, pages 197-204, University of Massachusetts, Amherts, MA, USA, 1993. Morgan Kaufmann.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 197-204
- Mitchell Tom, M.¹ Thrun Sebastian, B.²

14
- 0002267046
- Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued spaces
- Andrew W. Moore. Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued spaces. Proceedings in Eighth International Machine Learning Workshop, 1991.
- (1991) Proceedings in Eighth International Machine Learning Workshop
- Moore Andrew, W.¹

15
- 0006488247
- The party-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
- J.D. Cowan, G. Tesauro, and J. Alspector, editors, San Mateo, CA, Morgan Kaufmann
- Andrew W. Moore. The party-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces. In J.D. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems, pages 711-718, San Mateo, CA, 1994. Morgan Kaufmann.
- (1994) Advances in Neural Information Processing Systems , pp. 711-718
- Moore Andrew, W.¹

16
- 0004350056
- version 4.02 edition, January
- Itsuki Noda. Soccer Server Manual, version 4.02 edition, January 1999.
- (1999) Soccer Server Manual
- Noda, I.¹

17
- 0003328519
- Team-partitioned, opaque-transition reinforcement learning
- M. Asada and H. Kitano, editors, Berlin, Springer Verlag
- Peter Stone and Manuela Veloso. Team-partitioned, opaque-transition reinforcement learning. In M. Asada and H. Kitano, editors, RoboCup-98: Robot Soccer World Cup II, Berlin, 1999. Springer Verlag.
- (1999) RoboCup-98: Robot Soccer World Cup II
- Stone, P.¹ Veloso, M.²

18
- 34249833101
- Technical note: Q-learning
- May
- C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4): 279-292, May 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.