-
1
-
-
33646357710
-
Empirical comparison of various reinforcement learning strategies for sequential targeted marketing
-
Abe, N., Pednault, E., Wang, H., Zadrozny, B., Fan, W., and Apte, C. (2002). Empirical comparison of various reinforcement learning strategies for sequential targeted marketing. In International Conference on Data Mining, pages 3-10.
-
(2002)
International Conference on Data Mining
, pp. 3-10
-
-
Abe, N.1
Pednault, E.2
Wang, H.3
Zadrozny, B.4
Fan, W.5
Apte, C.6
-
2
-
-
12244261880
-
Cross channel optimized marketing by reinforcement learning
-
Abe, N., Verma, N., Schroko, R., and Apte, C. (2004). Cross channel optimized marketing by reinforcement learning. In International Conference on Knowledge Discovery and Data Mining (KDD), pages 767-772.
-
(2004)
International Conference on Knowledge Discovery and Data Mining (KDD)
, pp. 767-772
-
-
Abe, N.1
Verma, N.2
Schroko, R.3
Apte, C.4
-
3
-
-
49949111037
-
Parallel dynamic programming
-
Kronsjö, L. and Shumsheruddin, D., editors, John Wiley & Sons, Inc.
-
Archibald, T. (1992). Parallel dynamic programming. In Kronsjö, L. and Shumsheruddin, D., editors, Advances in parallel algorithms, pages 343-367. John Wiley & Sons, Inc.
-
(1992)
Advances in Parallel Algorithms
, pp. 343-367
-
-
Archibald, T.1
-
5
-
-
60249089838
-
Assigning discounts in a marketing campaign by using reinforcement learning and neural networks
-
doi: 10.1016/j.eswa.2008.10.064
-
Gomez-Perez, G., Martin-Guerrero, J. D., Soria-Olivas, E., Balaguer-Ballester, E., Palomares, A., and Casariego, N. (2008). Assigning discounts in a marketing campaign by using reinforcement learning and neural networks. Expert Systems with Applications, (doi: 10.1016/j.eswa.2008.10.064).
-
(2008)
Expert Systems with Applications
-
-
Gomez-Perez, G.1
Martin-Guerrero, J.D.2
Soria-Olivas, E.3
Balaguer-Ballester, E.4
Palomares, A.5
Casariego, N.6
-
6
-
-
77956543367
-
Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine
-
Graepel, T., Candela, J. Q., Borchert, T., and Herbrich, R. (2010). Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine. In 27th International Conference on Machine Learning, pages 13-20.
-
(2010)
27th International Conference on Machine Learning
, pp. 13-20
-
-
Graepel, T.1
Candela, J.Q.2
Borchert, T.3
Herbrich, R.4
-
7
-
-
49949106524
-
Parallel reinforcement learning with linear function approximation
-
Grounds, M. and Kudenko, D. (2007). Parallel reinforcement learning with linear function approximation. In Adaptive Agents and Multi-Agent Systems, pages 60-74.
-
(2007)
Adaptive Agents and Multi-Agent Systems
, pp. 60-74
-
-
Grounds, M.1
Kudenko, D.2
-
8
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
PII S000437029800023X
-
Kaelbling, L., Littman, M., and Cassandra, A. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134. (Pubitemid 128387390)
-
(1998)
Artificial Intelligence
, vol.101
, Issue.1-2
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
9
-
-
84897487186
-
A contextual-bandit approach to personalized news article recommendation
-
abs/1003.0146
-
Li, L., Chu, W., Langford, J., and Schapire, R. E. (2010). A contextual-bandit approach to personalized news article recommendation. CoRR, abs/1003.0146.
-
(2010)
CoRR
-
-
Li, L.1
Chu, W.2
Langford, J.3
Schapire, R.E.4
-
10
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. In 11th International Conference on Machine Learning, pages 157-163.
-
(1994)
11th International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
11
-
-
0242540456
-
Sequential cost-sensitive decision making with reinforcement learning
-
Pednault, E., Abe, N., Zadrozny, B., Wang, H., Fan, W., and Apte, C. (2002). Sequential cost-sensitive decision making with reinforcement learning. In International Conference on Knowledge Discovery and Data Mining (KDD).
-
(2002)
International Conference on Knowledge Discovery and Data Mining (KDD)
-
-
Pednault, E.1
Abe, N.2
Zadrozny, B.3
Wang, H.4
Fan, W.5
Apte, C.6
-
13
-
-
33847202724
-
Learning to predict by the method of temporal differences
-
Sutton, R. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3(1):9-44.
-
(1988)
Machine Learning
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.1
-
14
-
-
0026971570
-
Adapting bias by gradient descent: An incremental version of delta-bar-delta
-
Sutton, R. (1992). Adapting bias by gradient descent: An incremental version of delta-bar-delta. In 10th National Conference on Artificial Intelligence, pages 171-176.
-
(1992)
10th National Conference on Artificial Intelligence
, pp. 171-176
-
-
Sutton, R.1
-
16
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
DOI 10.1016/S0004-3702(99)00052-1
-
Sutton, R., Precup, D., and Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211. (Pubitemid 32079890)
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1-2
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3