SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2007, Pages 1065-1070

An experts algorithm for transfer learning

Author keywords

[No Author keywords available]

Indexed keywords

CURRENT SITUATION; MARKOV DECISION PROCESSES; PRIOR EXPERIENCE; PRIORI KNOWLEDGE; TRANSFER LEARNING; TWO DOMAINS;

ARTIFICIAL INTELLIGENCE; MARKOV PROCESSES;

ALGORITHMS;

EID: 84880892531 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (43)

References (13)

1
- 0037709910
- The non-stochastic multi-armed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The non-stochastic multi-armed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 0031140246
- How to use expert advice
- Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, and Robert E. Schapire. How to use expert advice. Journal of the Association for Computing Machinery, 44(3):427-485, 1997.
- (1997) Journal of the Association for Computing Machinery , vol.44 , Issue.3 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Haussler, D.³ Helmbold, D.P.⁴ Schapire, R.E.⁵

3
- 33646519163
- Exploration-exploitation tradeoffs for experts algorithms in reactive environments
- Daniela Pucci de Farias and Nimrod Megiddo. Exploration-exploitation tradeoffs for experts algorithms in reactive environments. In Advances in Neural Information Processing Systems 17, pages 409-416, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.17 , pp. 409-416
- Pucci De Farias, D.¹ Megiddo, N.²

4
- 0032137328
- Tracking the best expert
- Mark Herbster and Manfred Warmuth. Tracking the best expert. Machine Learning, 32(2):151-78, 1998.
- (1998) Machine Learning , vol.32 , Issue.2 , pp. 151-178
- Herbster, M.¹ Warmuth, M.²

5
- 0029617280
- Covergence results for the EM approach to mixtures of experts architectures
- Michael I. Jordan and Lei Xu. Covergence results for the EM approach to mixtures of experts architectures. Neural Networks, 8:1409-1431, 1995.
- (1995) Neural Networks , vol.8 , pp. 1409-1431
- Jordan, M.I.¹ Xu, L.²

6
- 0036832954
- Near-optimal reinforcement learning in polynomial time
- Michael Kearns and Satinder Singh. Near-optimal reinforcement learning in polynomial time. Machine Learning, 49:209-232, 2002.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.¹ Singh, S.²

7
- 0002899547
- Asymptotically efficient allocation rules
- T. L. Lai and Herbert Robbins. Asymptotically efficient allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

8
- 84880771557
- SMDP homomorphisms: An algebraic approach to abstraction in semi markov decision processes
- Balaraman Ravindran and Andrew Barto. SMDP homomorphisms: An algebraic approach to abstraction in semi markov decision processes. In Proceedings of the Eighteenth International Joint Converence on Artificial Intelligence (IJCAI 03), pages 1011-1016, 2003.
- (2003) Proceedings of the Eighteenth International Joint Converence on Artificial Intelligence (IJCAI 03) , pp. 1011-1016
- Ravindran, B.¹ Barto, A.²

9
- 84966203785
- Some aspects of the sequential design of experiments
- Herbert Robbins. Some aspects of the sequential design of experiments. Bulletins of the American Mathematical Society, 58:527-535, 1952.
- (1952) Bulletins of the American Mathematical Society , vol.58 , pp. 527-535
- Robbins, H.¹

11
- 0032050241
- Model-based average reward reinforcement learning
- Prasad Tadepalli and DoKyeong Ok. Model-based average reward reinforcement learning. Artificial Intelligence, 100:177-224, 1998.
- (1998) Artificial Intelligence , vol.100 , pp. 177-224
- Tadepalli, P.¹ Ok, D.²

12
- 27544473171
- Behavior transfer for value-function-based reinforcement learning
- Matthew E. Taylor and Peter Stone. Behavior transfer for value-function-based reinforcement learning. In The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, 2005.
- (2005) The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 53-59
- Taylor, M.E.¹ Stone, P.²

13
- 33646406807
- Multi-armed bandit algorithms and empirical evaluation
- Joannès Vermorel and Mehryar Mohri. Multi-armed bandit algorithms and empirical evaluation. In Proceedings of the 16th European Conference on Machine Learning, pages 437-448, 2005.
- (2005) Proceedings of the 16th European Conference on Machine Learning , pp. 437-448
- Vermorel, J.¹ Mohri, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.