메뉴 건너뛰기




Volumn , Issue PART 3, 2013, Pages 2503-2511

Online learning under delayed feedback

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER AIDED INSTRUCTION; LEARNING SYSTEMS;

EID: 84897506818     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (127)

References (13)
  • 1
    • 85162387277 scopus 로고    scopus 로고
    • Distributed delayed stochastic optimization
    • Shawe-Taylor, J., Zemel, R.S., Bartlett, P., Pereira, F., and Weinberger, K.Q. (eds.)
    • Agarwal, Alekh and Duchi, John. Distributed delayed stochastic optimization. In Shawe-Taylor, J., Zemel, R.S., Bartlett, P., Pereira, F., and Weinberger, K.Q. (eds.), Advances in Neural Information Processing Systems 24 (NIPS), pp. 873-881, 2011.
    • (2011) Advances in Neural Information Processing Systems 24 (NIPS) , pp. 873-881
    • Agarwal, A.1    Duchi, J.2
  • 2
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • May
    • Auer, Peter, Cesa-Bianchi, Nicolò, and Fischer, Paul. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, May 2002.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 6
    • 84898437076 scopus 로고    scopus 로고
    • The KL-UCB algorithm for bounded stochastic bandits and beyond
    • Budapest, Hungary, July
    • Garivier, Aurélien and Cappe, Olivier. The KL-UCB algorithm for bounded stochastic bandits and beyond. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), volume 19, pp. 359-376, Budapest, Hungary, July 2011.
    • (2011) Proceedings of the 24th Annual Conference on Learning Theory (COLT) , vol.19 , pp. 359-376
    • Garivier, A.1    Cappe, O.2
  • 7
    • 84897506818 scopus 로고    scopus 로고
    • Online learning under delayed feedback
    • Extended version of a paper submitted to URL
    • Joulani, Pooria, György, András, and Szepesvári, Csaba. Online learning under delayed feedback. Extended version of a paper submitted to ICML-2013, 2013. URL http://webdocs.cs.ualberta.ca/~pooria/ publications/DelayedFeedback-ICML2013-Extended.pdf.
    • (2013) ICML-2013
    • Joulani, P.1    György, A.2    Szepesvári, C.3
  • 8
    • 80052488062 scopus 로고    scopus 로고
    • Slow learners are fast
    • Bengio, Y., Schuurmans, D., Lafferty, J., Williams, C. K. I., and Culotta, A. (eds.)
    • Langford, John, Smola, Alexander, and Zinkevich, Martin. Slow learners are fast. In Bengio, Y., Schuurmans, D., Lafferty, J., Williams, C. K. I., and Culotta, A. (eds.), Advances in Neural Information Processing Systems 22, pp. 2331-2339. 2009.
    • (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 2331-2339
    • Langford, J.1    Smola, A.2    Zinkevich, M.3
  • 10
    • 33646498288 scopus 로고    scopus 로고
    • On-line learning with delayed label feedback
    • Jain, Sanjay, Simon, HansUlrich, and Tomita, Etsuji (eds.), Algorithmic Learning Theory, Springer Berlin Heidelberg
    • Mesterharm, Chris J. On-line learning with delayed label feedback. In Jain, Sanjay, Simon, HansUlrich, and Tomita, Etsuji (eds.), Algorithmic Learning Theory, volume 3734 of Lecture Notes in Computer Science, pp. 399-413. Springer Berlin Heidelberg, 2005.
    • (2005) Lecture Notes in Computer Science , vol.3734 , pp. 399-413
    • Mesterharm, C.J.1
  • 11
    • 56749126921 scopus 로고    scopus 로고
    • PhD thesis, Department of Computer Science, Rutgers University, New Brunswick, NJ
    • Mesterharm, Chris J. Improving on-line learning. PhD thesis, Department of Computer Science, Rutgers University, New Brunswick, NJ, 2007.
    • (2007) Improving On-line Learning
    • Mesterharm, C.J.1
  • 12
    • 85162052729 scopus 로고    scopus 로고
    • Online markov decision processes under bandit feedback
    • Lafferty, J., Williams, C. K. I., Shawe-Taylor, J., Zemel, R.S., and Culotta, A. (eds.)
    • Neu, Gergely, György, András, Szepesvári, Csaba, and Antos, Andras. Online markov decision processes under bandit feedback. In Lafferty, J., Williams, C. K. I., Shawe-Taylor, J., Zemel, R.S., and Culotta, A. (eds.), Advances in Neural Information Processing Systems 23 (NIPS), pp. 1804-1812, 2010.
    • (2010) Advances in Neural Information Processing Systems 23 (NIPS) , pp. 1804-1812
    • Neu, G.1    György, A.2    Szepesvári, C.3    Antos, A.4
  • 13


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.