Abstract
From the Publisher: The past decade has seen considerable theoretical and applied research on Markov decision processes, as well as the growing use of these models in ecology, economics, communications engineering, and other fields where outcomes are uncertain and sequential decision-making processes are needed. A timely response to this increased activity, Martin L. Puterman's new work provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models. It discusses all major research directions in the field, highlights many significant applications of Markov decision process models, and explores numerous important topics that have previously been neglected or given cursory coverage in the literature.

Markov Decision Processes focuses primarily on infinite horizon discrete time models and models with discrete state spaces, while also examining models with arbitrary state spaces, finite horizon models, and continuous-time discrete state models. The book is organized around optimality criteria, using a common framework centered on the optimality (Bellman) equation to present results. Results are stated in a theorem-proof format and elaborated through discussion and examples, including results that are not available in any other book. A two-state Markov decision process model, introduced in Chapter 3, is analyzed repeatedly throughout the book to demonstrate many results and algorithms.

Markov Decision Processes covers recent research advances in such areas as countable state space models with average reward criterion, constrained models, and models with risk-sensitive optimality criteria. It also explores several topics that have received little or no attention in other books, including modified policy iteration, multichain models with average reward criterion, and sensitive optimality. In addition, a Bibliographic Remarks section in each chapter comments on relevant historical developments in the literature.
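The book's organizing device, for the discounted infinite-horizon case, is the optimality (Bellman) equation. In one common notation (the symbols below follow a standard convention and are not necessarily the book's own), it reads

$$
v^*(s) \;=\; \max_{a \in A_s} \Big\{\, r(s,a) + \gamma \sum_{j \in S} p(j \mid s, a)\, v^*(j) \,\Big\}, \qquad s \in S,
$$

where $\gamma \in [0,1)$ is the discount factor: the optimal value of a state is the best achievable one-step reward plus the discounted optimal value of the successor state. To give a concrete feel for the two-state models mentioned above, here is a minimal value-iteration sketch in Python; the transition probabilities and rewards are invented for illustration and are not the book's Chapter 3 example.

```python
import numpy as np

# Hypothetical two-state, two-action MDP (not the book's Chapter 3 model).
# P[a, s, j] = probability of moving from state s to state j under action a.
# r[s, a]    = expected one-step reward for taking action a in state s.
gamma = 0.9
P = np.array([
    [[0.8, 0.2],    # action a0, rows are current states s0, s1
     [0.3, 0.7]],
    [[0.5, 0.5],    # action a1
     [0.1, 0.9]],
])
r = np.array([
    [5.0, 10.0],    # rewards in s0 for a0, a1
    [-1.0, 2.0],    # rewards in s1 for a0, a1
])

# Value iteration: repeatedly apply the Bellman optimality update
# v(s) <- max_a { r(s,a) + gamma * sum_j P(j|s,a) v(j) } until convergence.
v = np.zeros(2)
for _ in range(10_000):
    q = r + gamma * np.einsum('asj,j->sa', P, v)  # q[s, a]
    v_new = q.max(axis=1)
    if np.max(np.abs(v_new - v)) < 1e-10:
        v = v_new
        break
    v = v_new

print("optimal values:", v)
print("greedy policy :", q.argmax(axis=1))  # best action index per state
```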
Related Publications
Markov: A methodology for the solution of infinite time horizon Markov decision processes
Algorithms are described for determining optimal policies for finite state, finite action, infinite discrete time horizon Markov decision processes. Both value‐improvem...
Estimation of dynamic structural models, problems and prospects: discrete decision processes
This is the first in a two-part survey of recent developments in the rapidly growing literature on methods for solving and estimating dynamic structural models. Part I focusses ...
A Probabilistic Production and Inventory Problem
R. Howard (R. Howard. 1960. Dynamic Programming and Markov Processes. John Wiley and Sons, Inc., New York.) and A. Manne (A. Manne. 1960. Linear programming and sequential decisi...
Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V. Denardo. https://doi.org/10.1137/1009030
Actor-Critic Reinforcement Learning with Energy-Based Policies
We consider reinforcement learning in Markov decision processes with high dimensional state and action spaces. We parametrize policies using energy-based models (particularly re...
Publication Info
- Year: 1995
- Type: article
- Volume: 90
- Issue: 429
- Pages: 392-392
- Citations: 8422
- Access: Closed
Identifiers
- DOI: 10.2307/2291177