SCOPUS Information Search Platform

Volume 55, Issue 5 II, 2007, Pages 2170-2181

Q-learning algorithms for constrained Markov decision processes with randomized monotone policies: Application to MIMO transmission control

(2) Djonin, Dejan V. (a); Krishnamurthy, Vikram (b)

a Dyaptive Systems (Canada)

b UNIVERSITY OF BRITISH COLUMBIA (Canada)

Author keywords

Constrained Markov decision process (CMDP); Delay constraints; Monotone policies; Q-learning; Randomized policies; Reinforcement learning; Supermodularity; Transmission scheduling; V-BLAST

Indexed keywords

CONVERGENCE OF NUMERICAL METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; OPTIMIZATION; QUALITY OF SERVICE; SCHEDULING ALGORITHMS; STOCHASTIC CONTROL SYSTEMS; TIME VARYING SYSTEMS;

CONSTRAINED MARKOV DECISION PROCESS; DELAY CONSTRAINTS; MONOTONE POLICIES; TRANSMISSION SYSTEMS;

DATA COMMUNICATION SYSTEMS;

EID: 34247898874 PISSN: 1053587X EISSN: None Source Type: Journal
DOI: 10.1109/TSP.2007.893228 Document Type: Article

Times cited : (82)

References (26)

1
- 85066695690
- V-BLAST power and rate control under delay constraints in Markovian fading channels - Optimality of randomized monotonic policies
- accepted for publication
- D. V. Djonin and V. Krishnamurthy, "V-BLAST power and rate control under delay constraints in Markovian fading channels - Optimality of randomized monotonic policies," IEEE Trans. Signal Process., 2007, accepted for publication.
- (2007) IEEE Trans. Signal Process
- Djonin, D.V.¹ Krishnamurthy, V.²

2
- 0003233588
- Transmission policies for time varying channels with average delay constraints
- Sep
- B. E. Collins and R. Cruz, "Transmission policies for time varying channels with average delay constraints," in Proc. Allerton Conf. Commun., Contr. Comput., Sep. 1999, pp. 709-717.
- (1999) Proc. Allerton Conf. Commun., Contr. Comput , pp. 709-717
- Collins, B.E.¹ Cruz, R.²

3
- 85008016250
- Optimal and suboptimal packet scheduling over time-varying fading channels
- Feb
- A. K. Karmokar, D. V. Djonin, and V. K. Bhargava, "Optimal and suboptimal packet scheduling over time-varying fading channels," IEEE Trans. Wireless Commun., vol. 5, no. 2, pp. 446-457, Feb. 2006.
- (2006) IEEE Trans. Wireless Commun , vol.5 , Issue.2 , pp. 446-457
- Karmokar, A.K.¹ Djonin, D.V.² Bhargava, V.K.³

4
- 34247862895
- Delay limited optimal and suboptimal power and bit loading algorithms for OFDM systems over correlated fading
- St. Louis
- M. J. Hossain, D. V. Djonin, and V. K. Bhargava, "Delay limited optimal and suboptimal power and bit loading algorithms for OFDM systems over correlated fading," in Proc. GLOBECOM 2005, St. Louis, 2005, pp. 3448-3453.
- (2005) Proc. GLOBECOM 2005 , pp. 3448-3453
- Hossain, M.J.¹ Djonin, D.V.² Bhargava, V.K.³

5
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.²

6
- 0032022195
- On limits of wireless communication in a fading environment when using multiple antennas
- Mar
- G. J. Foschini and M. J. Gans, "On limits of wireless communication in a fading environment when using multiple antennas," Wireless Pers. Commun., vol. 6, pp. 311-335, Mar. 1998.
- (1998) Wireless Pers. Commun , vol.6 , pp. 311-335
- Foschini, G.J.¹ Gans, M.J.²

7
- 0030234863
- Layered space-time architecture for wireless communication in a fading environment when using multielement antennas
- Oct
- G. J. Foschini, "Layered space-time architecture for wireless communication in a fading environment when using multielement antennas," Bell Labs Tech. J., pp. 41-59, Oct. 1996.
- (1996) Bell Labs Tech. J , pp. 41-59
- Foschini, G.J.¹

8
- 0035189565
- Low complexity algorithm for rate quantization in extended V-BLAST
- S. Chung, H. C. Howard, and A. Lozano, "Low complexity algorithm for rate quantization in extended V-BLAST," in Proc. IEEE VTC' 2001, 2001, pp. 910-914.
- (2001) Proc. IEEE VTC' 2001 , pp. 910-914
- Chung, S.¹ Howard, H.C.² Lozano, A.³

9
- 0345308612
- Low complexity per-antenna rate and power control approach for closed-loop V-BLAST
- Nov
- H. Zhang, L. Dai, S. Zhou, and Y. Yao, "Low complexity per-antenna rate and power control approach for closed-loop V-BLAST," IEEE Trans. Commun., vol. 51, no. 11, pp. 1783-1787, Nov. 2003.
- (2003) IEEE Trans. Commun , vol.51 , Issue.11 , pp. 1783-1787
- Zhang, H.¹ Dai, L.² Zhou, S.³ Yao, Y.⁴

10
- 4544239777
- Spreading code optimization and adaptation in CDMA via discrete stochastic approximation
- Sep
- V. Krishnamurthy, X. Wang, and G. Yin, "Spreading code optimization and adaptation in CDMA via discrete stochastic approximation," IEEE Trans. Inf. Theory, vol. 50, no. 9, pp. 1927-1949, Sep. 2004.
- (2004) IEEE Trans. Inf. Theory , vol.50 , Issue.9 , pp. 1927-1949
- Krishnamurthy, V.¹ Wang, X.² Yin, G.³

11
- 0020970738
- Neuron-like elements that can solve difficult learning control problems
- A. Barto, R. Sutton, and C. Anderson, "Neuron-like elements that can solve difficult learning control problems," IEEE Trans. Syst., Man, Cybern., vol. SMC-13, pp. 834-846, 1983.
- (1983) IEEE Trans. Syst., Man, Cybern , vol.SMC-13 , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

12
- 0035249254
- Simulation-based optimization of Markov reward processes
- Feb
- P. Marbach and J. N. Tsitsiklis, "Simulation-based optimization of Markov reward processes," IEEE Trans. Autom. Contr., vol. 46, no. 2, pp. 191-209, Feb. 2001.
- (2001) IEEE Trans. Autom. Contr , vol.46 , Issue.2 , pp. 191-209
- Marbach, P.¹ Tsitsiklis, J.N.²

13
- 85066649783
- London, U.K, Chapman and Hall
- E. Altman, Constrained Markov Decision Processes: Stochastic Modeling. London, U.K.: Chapman and Hall, 1999.
- (1999) Constrained Markov Decision Processes: Stochastic Modeling
- Altman, E.¹

14
- 0003998452
- New York: Wiley
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York: Wiley, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

15
- 0003565783
- Belmont, MA: Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control. Belmont, MA: Athena Scientific, 1996, vol. 2.
- (1996) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.¹

16
- 0035439783
- Degrees of freedom in adaptive modulation: A unified view
- Sep
- S. T. Chung and A. J. Goldsmith, "Degrees of freedom in adaptive modulation: A unified view," IEEE Trans. Commun., vol. 49, no. 9, pp. 1561-1571, Sep. 2001.
- (2001) IEEE Trans. Commun , vol.49 , Issue.9 , pp. 1561-1571
- Chung, S.T.¹ Goldsmith, A.J.²

17
- 84890749086
- Princeton, NJ: Princeton Univ. Press
- D. M. Topkis, Supermodularity and Complementarity. Princeton, NJ: Princeton Univ. Press, 1998.
- (1998) Supermodularity and Complementarity
- Topkis, D.M.¹

18
- 31144459451
- Berlin, Germany: Springer-Verlag
- E. Altman, B. Gaujal, and A. Hordijk, Discrete-Event Control of Stochastic Networks: Multimodularity and Regularity. Berlin, Germany: Springer-Verlag, 2003.
- (2003) Discrete-Event Control of Stochastic Networks: Multimodularity and Regularity
- Altman, E.¹ Gaujal, B.² Hordijk, A.³

19
- 0003713964
- 2nd ed. Belmont, MA: Athena Scientific
- D. P. Bertsekas, Nonlinear Programming, 2nd ed. Belmont, MA: Athena Scientific, 1999.
- (1999) Nonlinear Programming
- Bertsekas, D.P.¹

20
- 0004055894
- Cambridge, U.K, Cambridge Univ. Press
- S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge, U.K.: Cambridge Univ. Press, 2003.
- (2003) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

21
- 0004098975
- New York: Wiley
- D. G. Luenberger, Optimization by Vector Space Methods. New York: Wiley, 1969.
- (1969) Optimization by Vector Space Methods
- Luenberger, D.G.¹

22
- 0003452601
- New York: Springer-Verlag
- H. Kushner and D. Clark, Stochastic Approximation Methods for Constrained and Unconstrained Systems. New York: Springer-Verlag, 1978.
- (1978) Stochastic Approximation Methods for Constrained and Unconstrained Systems
- Kushner, H.¹ Clark, D.²

23
- 9944258743
- 1st ed. New York: Springer-Verlag
- H. Kushner and G. Yin, Stochastic Approximation Algorithms and Applications, 1st ed. New York: Springer-Verlag, 1997.
- (1997) Stochastic Approximation Algorithms and Applications
- Kushner, H.¹ Yin, G.²

24
- 0022151359
- Optimal policies for controlled Markov chains with a constraint
- F. J. Beutler and K. W. Ross, "Optimal policies for controlled Markov chains with a constraint," J. Math. Anal. Appl., vol. 112, pp. 236-252, 1985.
- (1985) J. Math. Anal. Appl , vol.112 , pp. 236-252
- Beutler, F.J.¹ Ross, K.W.²

25
- 1542348670
- Constrained stochastic approximation algorithms for adaptive control of constrained markov decision processes
- F. Vazquez Abad and V. Krishnamurthy, "Constrained stochastic approximation algorithms for adaptive control of constrained markov decision processes," in Proc. 42nd IEEE Conf. Decision Contr., 2003, pp. 2823-2828.
- (2003) Proc. 42nd IEEE Conf. Decision Contr , pp. 2823-2828
- Vazquez Abad, F.¹ Krishnamurthy, V.²

26
- 1542350203
- Implementation of gradient estimation to a constrained Markov decision problem
- V. Krishnamurthy, F. Vazquez Abad, and K. Martin, "Implementation of gradient estimation to a constrained Markov decision problem," in Proc. 42nd IEEE Conf. Decision Contr., 2003, pp. 4841-4846.
- (2003) Proc. 42nd IEEE Conf. Decision Contr , pp. 4841-4846
- Krishnamurthy, V.¹ Vazquez Abad, F.² Martin, K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.