메뉴 건너뛰기




Volumn 14, Issue 1, 2004, Pages 1-30

Combining importance sampling and temporal difference control variates to simulate Markov chains

Author keywords

Importance sampling; Markov chains; Rare events; Stochastic approximation; Temporal difference methods; Variance reduction

Indexed keywords

IMPORTANCE SAMPLING; RARE EVENTS; STOCHASTIC APPROXIMATION; TEMPORAL DIFFERENCE METHODS; VARIANCE REDUCTION;

EID: 3042683131     PISSN: 10493301     EISSN: None     Source Type: Journal    
DOI: 10.1145/974734.974735     Document Type: Article
Times cited : (22)

References (29)
  • 1
    • 0027628986 scopus 로고
    • Variance reduction through smoothing and control variates for Markov chain simulations
    • ANDRADOTTIR, S., HEYMAN, D., AND OTT, T. J. 1993. Variance reduction through smoothing and control variates for Markov chain simulations. ACM Trans. Mod. Comput. Sim. 3, 3, 167-189.
    • (1993) ACM Trans. Mod. Comput. Sim. , vol.3 , Issue.3 , pp. 167-189
    • Andradottir, S.1    Heyman, D.2    Ott, T.J.3
  • 2
    • 0000025104 scopus 로고
    • On the choice of alternative measures in importance sampling with Markov chains
    • ANDRADOTTIR, S., HEYMAN, D., AND OTT, T. J. 1995. On the choice of alternative measures in importance sampling with Markov chains. Operat. Res. 43, 3, 509-519.
    • (1995) Operat. Res. , vol.43 , Issue.3 , pp. 509-519
    • Andradottir, S.1    Heyman, D.2    Ott, T.J.3
  • 5
    • 0033876515 scopus 로고    scopus 로고
    • The o.d.e. method for convergence of stochastic approximation and reinforcement learning
    • BORKAR, V. AND MEYN, S. 2000. The o.d.e. method for convergence of stochastic approximation and reinforcement learning. SIAM J. Contr. Opt. 38, 2, 347-469.
    • (2000) SIAM J. Contr. Opt. , vol.38 , Issue.2 , pp. 347-469
    • Borkar, V.1    Meyn, S.2
  • 7
    • 0016436892 scopus 로고
    • Simulating stable stochastic systems, III: Regenerative processes and discrete-event simulations
    • CRANE, M. AND IGLEHART, D. 1975. Simulating stable stochastic systems, iii: Regenerative processes and discrete-event simulations. Operat. Res. 23, 33-45.
    • (1975) Operat. Res. , vol.23 , pp. 33-45
    • Crane, M.1    Iglehart, D.2
  • 9
    • 0026406217 scopus 로고
    • Optimally efficient estimation of the statistics of rare events in queuing networks
    • FRATER, M., LENNON, T., AND ANDERSON, B. 1991. Optimally efficient estimation of the statistics of rare events in queuing networks. IEEE Trans. Automat. Contr. 36, 12, 1395-1404.
    • (1991) IEEE Trans. Automat. Contr. , vol.36 , Issue.12 , pp. 1395-1404
    • Frater, M.1    Lennon, T.2    Anderson, B.3
  • 10
    • 84976805635 scopus 로고
    • Analysis of an importance sampling estimator for tandem queues
    • GLASSERMAN, P. AND KOU, S. 1995. Analysis of an importance sampling estimator for tandem queues. ACM Trans. Mod. Comput. Sim. 5, 1, 22-42.
    • (1995) ACM Trans. Mod. Comput. Sim. , vol.5 , Issue.1 , pp. 22-42
    • Glasserman, P.1    Kou, S.2
  • 11
    • 0031481011 scopus 로고    scopus 로고
    • Counterexamples in importance sampling for large deviations probabilities
    • GLASSERMAN, P. AND WANG, Y. 1997. Counterexamples in importance sampling for large deviations probabilities. Ann. Appl. Prob. 7, 731-746.
    • (1997) Ann. Appl. Prob. , vol.7 , pp. 731-746
    • Glasserman, P.1    Wang, Y.2
  • 12
    • 0001240715 scopus 로고
    • Importance sampling for stochastic simulations
    • GLYNN, P. AND IGLEHART, D. 1989. Importance sampling for stochastic simulations. Manage. Sci. 35, 11, 1367-1392.
    • (1989) Manage. Sci. , vol.35 , Issue.11 , pp. 1367-1392
    • Glynn, P.1    Iglehart, D.2
  • 14
    • 0019060753 scopus 로고
    • Variance reduction techniques for the simulation of Markov processes, I: Multiple estimates
    • HEIDELBERGER, P. 1980a. Variance reduction techniques for the simulation of Markov processes, I: Multiple estimates. IBM J. Res. Develop. 24, 570-581.
    • (1980) IBM J. Res. Develop. , vol.24 , pp. 570-581
    • Heidelberger, P.1
  • 15
    • 3042601611 scopus 로고
    • Variance reduction techniques for the simulation of Markov processes, II: Matrix iterative methods
    • HEIDELBERGER, P. 1980b. Variance reduction techniques for the simulation of Markov processes, ii: Matrix iterative methods. Acta Informatica 13, 21-37.
    • (1980) Acta Informatica , vol.13 , pp. 21-37
    • Heidelberger, P.1
  • 16
    • 84976842451 scopus 로고
    • Fast simulation of rare events in queuing and reliability models
    • HEIDELBERGER, P. 1995. Fast simulation of rare events in queuing and reliability models. ACM Trans. Model. Comput. Sim. 5, 1, 43-85.
    • (1995) ACM Trans. Model. Comput. Sim. , vol.5 , Issue.1 , pp. 43-85
    • Heidelberger, P.1
  • 18
    • 0035521425 scopus 로고    scopus 로고
    • Importance sampling and the cyclic approach
    • JUNEJA, S. 2001. Importance sampling and the cyclic approach. Operat. Res. 49, 6, 900-912.
    • (2001) Operat. Res. , vol.49 , Issue.6 , pp. 900-912
    • Juneja, S.1
  • 19
    • 3042544432 scopus 로고    scopus 로고
    • Efficient rare event simulation using importance sampling: An introduction
    • J. C. Misra, Ed. Narosa Publishing House, New Delhi, India
    • JUNEJA, S. 2003. Efficient rare event simulation using importance sampling: An introduction. In Computational Mathematics, Modelling and Algorithms, J. C. Misra, Ed. Narosa Publishing House, New Delhi, India, 357-396.
    • (2003) Computational Mathematics, Modelling and Algorithms , pp. 357-396
    • Juneja, S.1
  • 20
    • 0035322354 scopus 로고    scopus 로고
    • Efficient simulation of Markov chains with small transition probabilities
    • JUNEJA, S. AND SHAHABUDDIN, P. 2001. Efficient simulation of Markov chains with small transition probabilities. Manage. Sci. 47, 4, 547-562.
    • (2001) Manage. Sci. , vol.47 , Issue.4 , pp. 547-562
    • Juneja, S.1    Shahabuddin, P.2
  • 22
    • 0024479530 scopus 로고
    • A quick simulation method for excessive backlogs in networks of queues
    • PAREKH, S. AND WALRAND, J. 1989. A quick simulation method for excessive backlogs in networks of queues. IEEE Trans. Automat. Contr. 34, 1, 54-66.
    • (1989) IEEE Trans. Automat. Contr. , vol.34 , Issue.1 , pp. 54-66
    • Parekh, S.1    Walrand, J.2
  • 24
    • 0031139210 scopus 로고    scopus 로고
    • Optimization of computer simulation models with rare events
    • RUBINSTEIN, R. 1997. Optimization of computer simulation models with rare events. European J. Operat. Res. 99, 89-112.
    • (1997) European J. Operat. Res. , vol.99 , pp. 89-112
    • Rubinstein, R.1
  • 26
    • 0032114627 scopus 로고    scopus 로고
    • Analytical mean squared error curves for temporal difference learning
    • SINGH, S. AND DAYAN, P. 1998. Analytical mean squared error curves for temporal difference learning. Mach. Learn. 32, 5-40.
    • (1998) Mach. Learn. , vol.32 , pp. 5-40
    • Singh, S.1    Dayan, P.2
  • 27
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • SINGH, S. AND SUTTON, R. 1996. Reinforcement learning with replacing eligibility traces. Mach. Learn. 22, 123-158.
    • (1996) Mach. Learn. , vol.22 , pp. 123-158
    • Singh, S.1    Sutton, R.2
  • 28
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • SUTTON, R. 1988. Learning to predict by the methods of temporal differences. Mach. Learn. 3, 9-44.
    • (1988) Mach. Learn. , vol.3 , pp. 9-44
    • Sutton, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.