-
1
-
-
84867882483
-
Regret bounds for the adaptive control of linear quadratic systems
-
Kakade S, von Luxburg U, eds
-
Abbasi-Yadkori Y, Szepesvári C(2011) Regret bounds for the adaptive control of linear quadratic systems. Kakade S, von Luxburg U, eds. 24th Annual Conf. Learn. Theory 4COLT5, Vol. 19, 1-26.
-
(2011)
24th Annual Conf. Learn. Theory 4COLT5
, vol.19
, pp. 1-26
-
-
Abbasi-Yadkori, Y.1
Szepesvári, C.2
-
2
-
-
85162561761
-
Improved algorithms for linear stochastic bandits
-
Shawe-Taylor J, Zemel RS, Bartlett P, Pereira FCN, Weinberger KQ, eds
-
Abbasi-Yadkori Y, Pal D, Szepesvári C(2011) Improved algorithms for linear stochastic bandits. Shawe-Taylor J, Zemel RS, Bartlett P, Pereira FCN, Weinberger KQ, eds. Advances in Neural Information Processing Systems 24, 2312-2320.
-
(2011)
Advances in Neural Information Processing Systems
, vol.24
, pp. 2312-2320
-
-
Abbasi-Yadkori, Y.1
Pal, D.2
Szepesvári, C.3
-
3
-
-
70350228783
-
Dynamic pricing for nonperishable products with demand learning
-
Araman VF, Caldentey R (2009) Dynamic pricing for nonperishable products with demand learning. Oper. Res. 57(5): 1169-1188.
-
(2009)
Oper. Res.
, vol.57
, Issue.5
, pp. 1169-1188
-
-
Araman, V.F.1
Caldentey, R.2
-
4
-
-
84871023469
-
The multiplicative weights update method: A meta-algorithm and applications
-
Arora S, Hazan E, Kale S (2012) The multiplicative weights update method: A meta-algorithm and applications. Theory Comput. 8: 121-164.
-
(2012)
Theory Comput.
, Issue.8
, pp. 121-164
-
-
Arora, S.1
Hazan, E.2
Kale, S.3
-
5
-
-
0036568025
-
Finite-time analysis of the multi-armed bandit problem
-
Auer P, Cesa-Bianchi N, Fischer P (2002) Finite-time analysis of the multi-armed bandit problem. Machine Learn. 47(2): 235-256.
-
(2002)
Machine Learn.
, vol.47
, Issue.2
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
6
-
-
38049040954
-
Improved rates for the stochastic continuum-armed bandit problem
-
Bshouty N, Gentile C, eds., Lecture Notes in Computer Science, Springer, Berlin
-
Auer P, Ortner R, Szepesvári C(2007) Improved rates for the stochastic continuum-armed bandit problem. Bshouty N, Gentile C, eds. Learning Theory, Lecture Notes in Computer Science, Vol. 4539 (Springer, Berlin), 454-468.
-
(2007)
Learning Theory
, vol.4539
, pp. 454-468
-
-
Auer, P.1
Ortner, R.2
Szepesvári, C.3
-
7
-
-
25844499294
-
A partially observed markov decision process for dynamic pricing
-
Aviv Y, Pazgal A (2005) A partially observed Markov decision process for dynamic pricing. Management Sci. 51(9): 1400-1416.
-
(2005)
Management Sci.
, vol.51
, Issue.9
, pp. 1400-1416
-
-
Aviv, Y.1
Pazgal, A.2
-
8
-
-
0001028358
-
An inverse matrix adjustment arising in discriminant analysis
-
Bartlett MS (1951) An inverse matrix adjustment arising in discriminant analysis. Ann. Math. Statist. 22(1): 107-111.
-
(1951)
Ann. Math. Statist.
, vol.22
, Issue.1
, pp. 107-111
-
-
Bartlett, M.S.1
-
9
-
-
80053161827
-
Regal: A regularization based algorithm for reinforcement learning in weakly communicating mdps
-
(AUAI Press, Arlington, VA
-
Bartlett PL, Tewari A (2009) REGAL: A regularization based algorithm for reinforcement learning in weakly communicating MDPs. Proc. Twenty-Fifth Conf. Uncertainty in Artificial Intelligence UAI '09 (AUAI Press, Arlington, VA), 35-42.
-
(2009)
Proc. Twenty-Fifth Conf. Uncertainty in Artificial Intelligence UAI '09
, pp. 35-42
-
-
Bartlett, P.L.1
Tewari, A.2
-
10
-
-
84968512234
-
Convergence rates in the law of large numbers
-
Baum LE, Katz M (1965) Convergence rates in the law of large numbers. Trans. Amer. Math. Soc. 120(1): 108-123.
-
(1965)
Trans. Amer. Math. Soc.
, vol.120
, Issue.1
, pp. 108-123
-
-
Baum, L.E.1
Katz, M.2
-
12
-
-
70350251174
-
Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
-
Besbes O, Zeevi A (2009) Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Oper. Res. 57(6): 1407-1420.
-
(2009)
Oper. Res.
, vol.57
, Issue.6
, pp. 1407-1420
-
-
Besbes, O.1
Zeevi, A.2
-
13
-
-
79952957936
-
On the minimax complexity of pricing in a changing environment
-
Besbes O, Zeevi A (2011) On the minimax complexity of pricing in a changing environment. Oper. Res. 59(1): 66-79.
-
(2011)
Oper. Res.
, vol.59
, Issue.1
, pp. 66-79
-
-
Besbes, O.1
Zeevi, A.2
-
15
-
-
84866361207
-
Dynamic pricing under a general parametric choice model
-
Broder J, Rusmevichientong P (2012) Dynamic pricing under a general parametric choice model. Oper. Res. 60(4): 965-980.
-
(2012)
Oper. Res.
, vol.60
, Issue.4
, pp. 965-980
-
-
Broder, J.1
Rusmevichientong, P.2
-
16
-
-
84874045238
-
Regret analysis of stochastic and non-stochastic multi-armed bandit problems
-
Bubeck S, Cesa-Bianchi N (2012) Regret analysis of stochastic and non-stochastic multi-armed bandit problems. Foundations Trends Machine Learn. 58(1): 1-122.
-
(2012)
Foundations Trends Machine Learn.
, vol.58
, Issue.1
, pp. 1-122
-
-
Bubeck, S.1
Cesa-Bianchi, N.2
-
18
-
-
34547314820
-
Learning and pricing in an internet environment with binomial demand
-
Carvalho AX, Puterman ML (2005) Learning and pricing in an internet environment with binomial demand. J. Revenue Pricing Management 3(4): 320-336.
-
(2005)
J. Revenue Pricing Management
, vol.3
, Issue.4
, pp. 320-336
-
-
Carvalho, A.X.1
Puterman, M.L.2
-
19
-
-
0033570858
-
Strong consistency of maximum quasi-likelihood estimate in generalized linear models via a last time
-
Chang YI (1999) Strong consistency of maximum quasi-likelihood estimate in generalized linear models via a last time. Statist. Probab. Lett. 45(3): 237-246.
-
(1999)
Statist. Probab. Lett.
, vol.45
, Issue.3
, pp. 237-246
-
-
Chang, Y.I.1
-
21
-
-
34047241098
-
Bayesian strategies for dynamic pricing in e-commerce
-
Cope E (2007) Bayesian strategies for dynamic pricing in e-commerce. Naval Res. Logist. 54(3): 265-281.
-
(2007)
Naval Res. Logist.
, vol.54
, Issue.3
, pp. 265-281
-
-
Cope, E.1
-
22
-
-
67649577204
-
Regret and convergence bounds for a class of continuum-armed bandit problems
-
Cope EW (2009) Regret and convergence bounds for a class of continuum-armed bandit problems. IEEE Trans. Automatic Control 54(6): 1243-1253.
-
(2009)
IEEE Trans. Automatic Control
, vol.54
, Issue.6
, pp. 1243-1253
-
-
Cope, E.W.1
-
23
-
-
84898072179
-
Stochastic linear optimization under bandit feedback
-
Omnipress, Madison, WI
-
Dani V, Hayes TP, Kakade SM (2008) Stochastic linear optimization under bandit feedback. 21st Annual Conf. Learn. Theory (COLT) (Omnipress, Madison, WI), 355-366.
-
(2008)
21st Annual Conf. Learn. Theory (COLT
, pp. 355-366
-
-
Dani, V.1
Hayes, T.P.2
Kakade, S.M.3
-
24
-
-
0001393823
-
Convergence rates for probabilities of moderate deviations
-
Davis JA (1968) Convergence rates for probabilities of moderate deviations. Ann. Math. Statist. 39(6): 2016-2028.
-
(1968)
Ann. Math. Statist.
, vol.39
, Issue.6
, pp. 2016-2028
-
-
Davis, J.A.1
-
27
-
-
84897030013
-
Simultaneously learning and optimizing using controlled variance pricing
-
ePub ahead of print December 10
-
den Boer AV, Zwart B (2013) Simultaneously learning and optimizing using controlled variance pricing. Management Sci., ePub ahead of print December 10, http://dx.doi.org/10.1287/mnsc.2013.1788.
-
(2013)
Management Sci
-
-
Den Boer, A.V.1
Zwart, B.2
-
28
-
-
33747165365
-
Multidimensional real analysis: Differentiation
-
(Cambridge University Press, Cambridge, UK
-
Duistermaat JJ, Kolk JAC (2004) Multidimensional Real Analysis: Differentiation, Cambridge Studies in Advanced Mathematics (86) (Cambridge University Press, Cambridge, UK).
-
(2004)
Cambridge Studies in Advanced Mathematics
, vol.86
-
-
Duistermaat, J.J.1
Kolk, J.A.C.2
-
29
-
-
0001241005
-
On a theorem of hsu and robbins
-
Erdǒs P (1949) On a theorem of Hsu and Robbins. Ann. Math. Statist. 20(2): 286-291.
-
(1949)
Ann. Math. Statist.
, vol.20
, Issue.2
, pp. 286-291
-
-
Erdǒs, P.1
-
30
-
-
0001407291
-
Remark on my paper "on a theorem of hsu and robbins
-
Erdǒs P (1950) Remark on my paper "On a theorem of Hsu and Robbins." Ann. Math. Statist. 21(1): 138.
-
(1950)
Ann. Math. Statist.
, vol.21
, Issue.1
, pp. 138
-
-
Erdos, P.1
-
32
-
-
77249163740
-
Dynamic pricing with a prior on market response
-
Farias VF, van Roy B (2010) Dynamic pricing with a prior on market response. Oper. Res. 58(1): 16-29.
-
(2010)
Oper. Res.
, vol.58
, Issue.1
, pp. 16-29
-
-
Farias, V.F.1
Van Roy, B.2
-
33
-
-
85162071043
-
Parametric bandits: The generalized linear case
-
Lafferty J, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A, eds. (Curran Associates, Red Hook, NY
-
Filippi S, Cappe O, Garivier A, Szepesvari C(2010) Parametric bandits: The generalized linear case. Lafferty J, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A, eds. Advances in Neural Information Processing Systems 23 (Curran Associates, Red Hook, NY), 586-594.
-
(2010)
Advances in Neural Information Processing Systems
, vol.23
, pp. 586-594
-
-
Filippi, S.1
Cappe, O.2
Garivier, A.3
Szepesvari, C.4
-
35
-
-
0000390402
-
Quasi-likelihood and optimal estimation
-
Godambe VP, Heyde CC (1987) Quasi-likelihood and optimal estimation. Internat. Statist. Rev. 55(3): 231-244.
-
(1987)
Internat. Statist. Rev.
, vol.55
, Issue.3
, pp. 231-244
-
-
Godambe, V.P.1
Heyde, C.C.2
-
36
-
-
0024682770
-
Updating the inverse of a matrix
-
Hager WW (1989) Updating the inverse of a matrix. SIAM Rev. 31(2): 221-239.
-
(1989)
SIAM Rev.
, vol.31
, Issue.2
, pp. 221-239
-
-
Hager, W.W.1
-
38
-
-
84861366496
-
Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution
-
Harrison JM, Keskin NB, Zeevi A (2012) Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution. Management Sci. 58(3): 570-586.
-
(2012)
Management Sci.
, vol.58
, Issue.3
, pp. 570-586
-
-
Harrison, J.M.1
Keskin, N.B.2
Zeevi, A.3
-
41
-
-
0003129058
-
Complete convergence and the law of large numbers
-
Hsu PL, Robbins H (1947) Complete convergence and the law of large numbers. Proc. Natl. Acad. Sci. USA 33(2): 25-31.
-
(1947)
Proc. Natl. Acad. Sci. USA
, vol.33
, Issue.2
, pp. 25-31
-
-
Hsu, P.L.1
Robbins, H.2
-
42
-
-
77951952841
-
Near-optimal regret bounds for reinforcement learning
-
Jaksch T, Ortner R, Auer P (2010) Near-optimal regret bounds for reinforcement learning. J. Machine Learn. Res. 11: 1563-1600.
-
(2010)
J. Machine Learn. Res.
, vol.11
, pp. 1563-1600
-
-
Jaksch, T.1
Ortner, R.2
Auer, P.3
-
43
-
-
0001761982
-
The probability in the tail of a distribution
-
Katz ML (1963) The probability in the tail of a distribution. Ann. Math. Statist. 34(1): 312-318.
-
(1963)
Ann. Math. Statist.
, vol.34
, Issue.1
, pp. 312-318
-
-
Katz, M.L.1
-
45
-
-
84898981061
-
Nearly tight bounds for the continuum-armed bandit problem
-
Saul LK, Weiss Y, Bottou L, eds. MIT Press, Cambridge, MA
-
Kleinberg R (2005) Nearly tight bounds for the continuum-armed bandit problem. Saul LK, Weiss Y, Bottou L, eds. Advances in Neural Information Processing Systems 17 (MIT Press, Cambridge, MA), 697-704.
-
(2005)
Advances in Neural Information Processing Systems
, vol.17
, pp. 697-704
-
-
Kleinberg, R.1
-
46
-
-
0345412655
-
The value of knowing a demand curve: Bounds on regret for online posted-price auctions
-
(IEEE Computer Society, Washington, DC
-
Kleinberg R, Leighton T (2003) The value of knowing a demand curve: Bounds on regret for online posted-price auctions. Proc. 44th IEEE Sympos. Foundations of Comput. Sci. (IEEE Computer Society, Washington, DC), 594-605.
-
(2003)
Proc. 44th IEEE Sympos. Foundations of Comput. Sci
, pp. 594-605
-
-
Kleinberg, R.1
Leighton, T.2
-
47
-
-
0000400695
-
Limit theorems for delayed sums
-
Lai TL (1974) Limit theorems for delayed sums. Ann. Probab. 2(3): 432-440.
-
(1974)
Ann. Probab.
, vol.2
, Issue.3
, pp. 432-440
-
-
Lai, T.L.1
-
48
-
-
0000258837
-
Least squares estimates in stochastic regression models with applications to identification and control of dynamic systems
-
Lai TL, Wei CZ (1982) Least squares estimates in stochastic regression models with applications to identification and control of dynamic systems. Ann. Statist. 10(1): 154-166.
-
(1982)
Ann. Statist.
, vol.10
, Issue.1
, pp. 154-166
-
-
Lai, T.L.1
Wei, C.Z.2
-
49
-
-
84906707736
-
-
Massachusetts Institute of Technology Cambridge, MA. Retrieved April 4, 2011
-
Le Guen T (2008) Data-driven pricing. Master's thesis, Massachusetts Institute of Technology, Cambridge, MA. Retrieved April 4, 2011, http://hdl.handle.net/1721.1/45627.
-
(2008)
Data-driven pricing. Master's thesis
-
-
Le Guen, T.1
-
50
-
-
34247526098
-
Relative entropy, exponential utility, and robust dynamic pricing
-
Lim AEB, Shanthikumar JG (2007) Relative entropy, exponential utility, and robust dynamic pricing. Oper. Res. 55(2): 198-214.
-
(2007)
Oper. Res.
, vol.55
, Issue.2
, pp. 198-214
-
-
Lim, A.E.B.1
Shanthikumar, J.G.2
-
51
-
-
75149179504
-
-
Working paper, University of California, Berkeley
-
Lim AEB, Shanthikumar JG, Watewai T (2008) Robust multi-product pricing. Working paper, University of California, Berkeley, http://dx.doi.org/10.2139/ ssrn.1078012.
-
(2008)
Robust Multi-Product Pricing
-
-
Lim, A.E.B.1
Shanthikumar, J.G.2
Watewai, T.3
-
52
-
-
34047243343
-
-
Working paper, University of Stanford, Stanford
-
Lobo MS, Boyd S (2003) Pricing and learning with uncertain demand. Working paper, University of Stanford, Stanford, http://www.stanford.edu/~boyd/ papers/pdf/pric-learn-unc-dem.pdf.
-
(2003)
Pricing and Learning With Uncertain Demand
-
-
Lobo, M.S.1
Boyd, S.2
-
53
-
-
0003998877
-
-
4th ed. (Springer Verlag, New York
-
Loève M (1977) Probability Theory I, 4th ed. (Springer Verlag, New York).
-
(1977)
Probability Theory I
-
-
Loève, M.1
-
54
-
-
0001898235
-
Quasi-likelihood functions
-
McCullagh P (1983) Quasi-likelihood functions. Ann. Statist. 11(1): 59-67.
-
(1983)
Ann. Statist.
, vol.11
, Issue.1
, pp. 59-67
-
-
McCullagh, P.1
-
56
-
-
0028369721
-
On the convergence of least squares estimates in white noise
-
Nassiri-Toussi K, Ren W (1994) On the convergence of least squares estimates in white noise. IEEE Trans. Automatic Control 39(2): 364-368.
-
(1994)
IEEE Trans. Automatic Control
, vol.39
, Issue.2
, pp. 364-368
-
-
Nassiri-Toussi, K.1
Ren, W.2
-
57
-
-
0041055966
-
Fisher's method of scoring. Internat. Statist
-
Osborne MR (1992) Fisher's method of scoring. Internat. Statist. Rev./Revue Internationale de Statistique 60(1): 99-117.
-
(1992)
Rev./Revue Internationale de Statistique
, vol.60
, Issue.1
, pp. 99-117
-
-
Osborne, M.R.1
-
58
-
-
37849021822
-
Optimal experimental design and some related control problems
-
Pronzato L (2008) Optimal experimental design and some related control problems. Automatica 44(2): 303-325.
-
(2008)
Automatica
, vol.44
, Issue.2
, pp. 303-325
-
-
Pronzato, L.1
-
59
-
-
70349622251
-
Asymptotic properties of nonlinear estimates in stochastic models with finite design space
-
Pronzato L (2009) Asymptotic properties of nonlinear estimates in stochastic models with finite design space. Statist. Probab. Lett. 79(21): 2307-2313.
-
(2009)
Statist. Probab. Lett.
, vol.79
, Issue.21
, pp. 2307-2313
-
-
Pronzato, L.1
-
60
-
-
0001058483
-
A two-armed bandit theory of market pricing
-
Rothschild M (1974) A two-armed bandit theory of market pricing. J. Econom. Theory 9(2): 185-202.
-
(1974)
J. Econom. Theory
, vol.9
, Issue.2
, pp. 185-202
-
-
Rothschild, M.1
-
62
-
-
7444258597
-
Precise asymptotics for a series of t.l
-
Spataru A (2004) Precise asymptotics for a series of T.L. Lai. Proc. Amer. Math. Soc. 132(11): 3387-3395.
-
(2004)
Lai. Proc. Amer. Math. Soc.
, vol.132
, Issue.11
, pp. 3387-3395
-
-
Spataru, A.1
-
64
-
-
34548083417
-
Baum-katz-nagaev type results for martingales
-
Stoica G (2007) Baum-Katz-Nagaev type results for martingales. J. Math. Anal. Appl. 336(2): 1489-1492.
-
(2007)
J. Math. Anal. Appl.
, vol.336
, Issue.2
, pp. 1489-1492
-
-
Stoica, G.1
-
66
-
-
84863381440
-
Algorithms for infinitely many-armed bandits
-
Koller D, Schuurmans D, Bengio Y, Bottou L, eds. (Curran Associates, Red Hook, NY
-
Wang Y, Audibert JY, Munos R (2009) Algorithms for infinitely many-armed bandits. Koller D, Schuurmans D, Bengio Y, Bottou L, eds. Advances in Neural Information Processing Systems 21 (Curran Associates, Red Hook, NY), 1729-1736.
-
(2009)
Advances in Neural Information Processing Systems
, vol.21
, pp. 1729-1736
-
-
Wang, Y.1
Audibert, J.Y.2
Munos, R.3
-
67
-
-
0016335739
-
Quasi-likelihood functions, generalized, linear models, and the gauss-newton method
-
Wedderburn RWM (1974) Quasi-likelihood functions, generalized, linear models, and the Gauss-Newton method. Biometrika 61(3): 439-447.
-
(1974)
Biometrika
, vol.61
, Issue.3
, pp. 439-447
-
-
Wedderburn, R.W.M.1
-
68
-
-
39449117970
-
Dynamic pricing in e-services under demand uncertainty
-
Xia CH, Dube P (2007) Dynamic pricing in e-services under demand uncertainty. Production Oper. Management 16(6): 701-712.
-
(2007)
Production Oper. Management
, vol.16
, Issue.6
, pp. 701-712
-
-
Xia, C.H.1
Dube, P.2
-
69
-
-
49649123810
-
Rate of strong consistency of maximum quasi-likelihood estimator in multivariate generalized linear models
-
Yin C, Zhang H, Zhao L (2008) Rate of strong consistency of maximum quasi-likelihood estimator in multivariate generalized linear models. Comm. Statist.-Theory Methods 37(19): 3115-3123.
-
(2008)
Comm. Statist.-Theory Methods
, vol.37
, Issue.19
, pp. 3115-3123
-
-
Yin, C.1
Zhang, H.2
Zhao, L.3
-
70
-
-
80053457608
-
Unimodal bandits
-
Getoor L, Scheffer T, eds., Bellevue, Washington, (Omnipress, Madison, WI
-
Yu JY, Mannor S (2011) Unimodal bandits. Getoor L, Scheffer T, eds. Proc. 28th Internat. Conf. Machine Learn., Bellevue, Washington, (Omnipress, Madison, WI), 41-48.
-
(2011)
Proc. 28th Internat. Conf. Machine Learn.
, pp. 41-48
-
-
Yu, J.Y.1
Mannor, S.2
|