Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

Onésimo Hernández-Lerma; Jean Lasserre

doi:10.1023/A:1005781013253

Loading next page...

References (31)

E. Gordienko, O. Hernández-Lerma (1995)
Average cost Markov control processes with weighted norms: existence of canonical policies
Applicationes Mathematicae, 23
O. Hernández-Lerma, J. Lasserre (1999)
Discrete-time Markov control processes
P. Glynn (1989)
A Lyapunov Bound for Solutions of Poisson's Equation
A. Hordijk, M. Puterman (1987)
On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case
Math. Oper. Res., 12
O. Hernández-Lerma, J. Lasserre (1994)
Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces---Unbounded Costs
Siam Journal on Control and Optimization, 32
M. Duflo (1990)
Méthodes récursives aléatoires
O. Hernández-Lerma, J. B. Lasserre (1996)
Discrete-Time Markov Control Processes: Basic Optimality Criteria
R. Dekker (1987)
Counter examples for compact action Markov decision chains with average reward criteria
Stochastic Models, 3
A. Arapostathis, V. Borkar, E. Fernández-Gaucherand, M. Ghosh, S. Marcus (1993)
Discrete-time controlled Markov processes with average cost criterion: a survey
Siam Journal on Control and Optimization, 31
I. Schochetman, Robert Smith (1991)
Convergence of selections with applications in optimization
Journal of Mathematical Analysis and Applications, 155
L. Sennott (1989)
Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
Oper. Res., 37
O. Vega-Amaya (1996)
Overtaking optimality for a class of production-inventory systems
M. Schäl (1993)
Average Optimality in Dynamic Programming with General State Space
Math. Oper. Res., 18
A. Hordijk, M. L. Puterman (1987)
On the convergence of policy iteration in undiscounted finite state Markov decision processes: The unichain case
Math. Oper. Res., 12
O. Hernández-Lerma, J. Lasserre (1990)
Average cost optimal policies for Markov control processes with Borel state space and unbounded costs
Systems & Control Letters, 15
Raúl Montes-de-Oca, O. Hernández-Lerma (1996)
Value iteration in average cost Markov control processes on Borel spaces
Acta Applicandae Mathematica, 42
M. Puterman (1994)
Markov Decision Processes: Discrete Stochastic Dynamic Programming
G. Klimov (1985)
Existence of a final distribution for an irreducible Feller process with invariant measure
Mathematical notes of the Academy of Sciences of the USSR, 37
O. Hernández-Lerma (1993)
Existence of average optimal policies in Markov control processes with strictly unbounded costs
Kybernetika, 29
Patrick Billingsley (1970)
Convergence of Probability Measures
The Mathematical Gazette, 54
P. Schweitzer (1985)
On undiscounted markovian decision processes with compact action spaces
Rairo-operations Research, 19
I. Schochetman (1990)
Pointwise versions of the maximum theorem with applications in optimization
Applied Mathematics Letters, 3
E. Denardo, B. Fox (1968)
Multichain Markov Renewal Programs
Siam Journal on Applied Mathematics, 16
E. Gordienko, O. Hernández-Lerma (1994)
Average cost Markov control processes with weighted norms: value iteration
Applicationes Mathematicae, 23
M. Schäl (1975)
Conditions for optimality and for the limit of n-stage optimal policies to be optimal
Zeit. Wahrs. verw. Geb., 32
R. Dekker (1987)
Counter examples for compact action Markov decision chains with average reward criteria
Comm. Statist. Stochastic Models, 3
O. Hernández-Lerma (1965)
Controlled Markov Processes
V. Beneš (1968)
Finite regular invariant measures for Feller processes
Journal of Applied Probability, 5
S. Meyn, R. Tweedie (1993)
Markov Chains and Stochastic Stability
K. Yosida (1978)
Functional Analysis
M. L. Puterman (1994)
Markov Decision Processes

Publisher: Springer Journals
Copyright: Copyright © 1997 by Kluwer Academic Publishers
Subject: Mathematics; Mathematics, general; Computer Science, general; Theoretical, Mathematical and Computational Physics; Complex Systems; Classical Mechanics
ISSN: 0167-8019
eISSN: 1572-9036
DOI: 10.1023/A:1005781013253
Publisher site: See Article on Publisher Site

Abstract

This paper studies the policy iteration algorithm (PIA) for average cost Markov control processes on Borel spaces. Two classes of MCPs are considered. One of them allows some restricted-growth unbounded cost functions and compact control constraint sets; the other one requires strictly unbounded costs and the control constraint sets may be non-compact. For each of these classes, the PIA yields, under suitable assumptions, the optimal (minimum) cost, an optimal stationary control policy, and a solution to the average cost optimality equation.

Journal

Acta Applicandae Mathematicae – Springer Journals

Published: Oct 15, 2004

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

References (31)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies