SCOPUS 정보 검색 플랫폼

30th International Conference on Machine Learning, ICML 2013

Volumn , Issue PART 3, 2013, Pages 2503-2511

Online learning under delayed feedback

(3) Joulani, Pooria a György, András a Szepesvari, Csaba a

a UNIVERSITY OF ALBERTA (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER AIDED INSTRUCTION; LEARNING SYSTEMS;

DELAYED FEEDBACK; EFFECT OF DELAYS; LOWER COMPLEXITY; META-ALGORITHMS; ONLINE LEARNING ALGORITHMS; STOCHASTIC PROBLEMS; SYSTEMATIC STUDY; WEB BASED LEARNING;

E-LEARNING;

EID: 84897506818 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (127)

References (13)

1
- 85162387277
- Distributed delayed stochastic optimization
- Shawe-Taylor, J., Zemel, R.S., Bartlett, P., Pereira, F., and Weinberger, K.Q. (eds.)
- Agarwal, Alekh and Duchi, John. Distributed delayed stochastic optimization. In Shawe-Taylor, J., Zemel, R.S., Bartlett, P., Pereira, F., and Weinberger, K.Q. (eds.), Advances in Neural Information Processing Systems 24 (NIPS), pp. 873-881, 2011.
- (2011) Advances in Neural Information Processing Systems 24 (NIPS) , pp. 873-881
- Agarwal, A.¹ Duchi, J.²

2
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- May
- Auer, Peter, Cesa-Bianchi, Nicolò, and Fischer, Paul. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, May 2002.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 84926078662
- Cambridge University Press, New York, NY, USA, ISBN 0521841089
- Cesa-Bianchi, Nicolò and Lugosi, Gábor. Prediction, Learning, and Games. Cambridge University Press, New York, NY, USA, 2006. ISBN 0521841089.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

4
- 84867115523
- Parallelizing exploration-exploitation trade-offs with gaussian process bandit optimization
- Omnipress
- Desautels, Thomas, Krause, Andreas, and Burdick, Joel. Parallelizing exploration-exploitation trade-offs with gaussian process bandit optimization. In Proceedings of the 29th International Conference on Machine Learning (ICML), Edinburgh, Scotland, UK, 2012. Omnipress.
- Proceedings of the 29th International Conference on Machine Learning (ICML), Edinburgh, Scotland, UK, 2012
- Desautels, T.¹ Krause, A.² Burdick, J.³

5
- 80053154335
- Efficient optimal learning for contextual bandits
- Corvallis, Oregon, AUAI Press
- Dudik, Miroslav, Hsu, Daniel, Kale, Satyen, Karampatziakis, Nikos, Langford, John, Reyzin, Lev, and Zhang, Tong. Efficient optimal learning for contextual bandits. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI), pp. 169-178, Corvallis, Oregon, 2011. AUAI Press.
- (2011) Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI) , pp. 169-178
- Dudik, M.¹ Hsu, D.² Kale, S.³ Karampatziakis, N.⁴ Langford, J.⁵ Reyzin, L.⁶ Zhang, T.⁷

6
- 84898437076
- The KL-UCB algorithm for bounded stochastic bandits and beyond
- Budapest, Hungary, July
- Garivier, Aurélien and Cappe, Olivier. The KL-UCB algorithm for bounded stochastic bandits and beyond. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), volume 19, pp. 359-376, Budapest, Hungary, July 2011.
- (2011) Proceedings of the 24th Annual Conference on Learning Theory (COLT) , vol.19 , pp. 359-376
- Garivier, A.¹ Cappe, O.²

7
- 84897506818
- Online learning under delayed feedback
- Extended version of a paper submitted to URL
- Joulani, Pooria, György, András, and Szepesvári, Csaba. Online learning under delayed feedback. Extended version of a paper submitted to ICML-2013, 2013. URL http://webdocs.cs.ualberta.ca/~pooria/ publications/DelayedFeedback-ICML2013-Extended.pdf.
- (2013) ICML-2013
- Joulani, P.¹ György, A.² Szepesvári, C.³

8
- 80052488062
- Slow learners are fast
- Bengio, Y., Schuurmans, D., Lafferty, J., Williams, C. K. I., and Culotta, A. (eds.)
- Langford, John, Smola, Alexander, and Zinkevich, Martin. Slow learners are fast. In Bengio, Y., Schuurmans, D., Lafferty, J., Williams, C. K. I., and Culotta, A. (eds.), Advances in Neural Information Processing Systems 22, pp. 2331-2339. 2009.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 2331-2339
- Langford, J.¹ Smola, A.² Zinkevich, M.³

9
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- New York, NY, USA, ACM
- Li, Lihong, Chu, Wei, Langford, John, and Schapire, Robert E. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International Conference on World Wide Web (WWW), pp. 661-670, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the 19th International Conference on World Wide Web (WWW) , pp. 661-670
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

10
- 33646498288
- On-line learning with delayed label feedback
- Jain, Sanjay, Simon, HansUlrich, and Tomita, Etsuji (eds.), Algorithmic Learning Theory, Springer Berlin Heidelberg
- Mesterharm, Chris J. On-line learning with delayed label feedback. In Jain, Sanjay, Simon, HansUlrich, and Tomita, Etsuji (eds.), Algorithmic Learning Theory, volume 3734 of Lecture Notes in Computer Science, pp. 399-413. Springer Berlin Heidelberg, 2005.
- (2005) Lecture Notes in Computer Science , vol.3734 , pp. 399-413
- Mesterharm, C.J.¹

11
- 56749126921
- PhD thesis, Department of Computer Science, Rutgers University, New Brunswick, NJ
- Mesterharm, Chris J. Improving on-line learning. PhD thesis, Department of Computer Science, Rutgers University, New Brunswick, NJ, 2007.
- (2007) Improving On-line Learning
- Mesterharm, C.J.¹

12
- 85162052729
- Online markov decision processes under bandit feedback
- Lafferty, J., Williams, C. K. I., Shawe-Taylor, J., Zemel, R.S., and Culotta, A. (eds.)
- Neu, Gergely, György, András, Szepesvári, Csaba, and Antos, Andras. Online markov decision processes under bandit feedback. In Lafferty, J., Williams, C. K. I., Shawe-Taylor, J., Zemel, R.S., and Culotta, A. (eds.), Advances in Neural Information Processing Systems 23 (NIPS), pp. 1804-1812, 2010.
- (2010) Advances in Neural Information Processing Systems 23 (NIPS) , pp. 1804-1812
- Neu, G.¹ György, A.² Szepesvári, C.³ Antos, A.⁴

13
- 0036648904
- On delayed prediction of individual sequences
- September
- Weinberger, Marcelo J. and Ordentlich, Erik. On delayed prediction of individual sequences. IEEE Transactions on Information Theory, 48 (7): 1959-1976, September 2002.
- (2002) IEEE Transactions on Information Theory , vol.48 , Issue.7 , pp. 1959-1976
- Weinberger, M.J.¹ Ordentlich, E.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.