Continuous time Markov decision programming with average reward criterion and unbounded reward rate

Shaohui Zheng

doi:10.1007/BF02080199

This paper deals with the continuous time Markov decision programming (briefly CTMDP) with unbounded reward rate. The economic criterion is the long-run average reward. To the models with countable state space and compact metric action sets, we present a set of sufficient conditions to ensure the existence of the stationary optimal policies.

Loading next page...

References (11)

Z. Dong (1979)
CONTINUOUS TIME MARKOVIAN DECISION PROGRAMMING WITH AVERAGE RETURN CRITERION——COUNTABLE STATE AND ACTION SETS
A. Federgruen, A. Hordijk, H. C. Tijms (1978)
Dynamic Programming and Its Application
Zeqing Dong (1982)
Introduction to Markov Decision Programming, Inst. of Appl. Math.
Shaohui Zheng (1989)
A Class of Continuous Time Markov Decision Programming with Average Return Criterion--The Existence of Optimal Policies
Acta Mathematicae Applicatae Sinica, 12
A. Federgruen, A. Hordijk, H. C. Tijms (1979)
Denumerable State Semi-Markov Decision Processes with Unbound Costs, Average Cost Criterion
Stoch. Proc. Appl., 9
A. Federgruen, A. Hordijk, H. Tijms (1977)
RECURRENCE CONDITIONS IN DENUMERABLE STATE MARKOV DECISION PROCESSES
B. Doshi (1976)
Continuous time control of Markov processes on an arbitrary state space: Average return criterion
Stochastic Processes and their Applications, 4
K. L. Chung (1967)
Markov Chains with Stationary Transition Probability
A. Federgruen, A. Hordijk, H. Tijms (1979)
Denumerable state semi-markov decision processes with unbounded costs, average cost criterion : (preprint)
Shaohui Zheng (1989)
The Existence of the Stationary Optimal Policy, For A Class of Continuous Time Average Markov Decision Programming
Acta Mathematicae Applicatae Sinica, 12
F. M. David (1983)
Markov Chain

Publisher: Springer Journals
Subject: Mathematics; Applications of Mathematics; Math Applications in Computer Science; Theoretical, Mathematical and Computational Physics
ISSN: 0168-9673
eISSN: 1618-3932
DOI: 10.1007/BF02080199
Publisher site: See Article on Publisher Site

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Continuous time Markov decision programming with average reward criterion and unbounded reward rate

Continuous time Markov decision programming with average reward criterion and unbounded reward rate

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Continuous time Markov decision programming with average reward criterion and unbounded reward rate

Continuous time Markov decision programming with average reward criterion and unbounded reward rate

References (11)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies