Related Publications
A Probabilistic Production and Inventory Problem
R. Howard (R. Howard. 1960. Dynamic Programming and Markov Processes. John Wiley and Sons, Inc., New York.) and A. Manne (A. Manne. 1960. Linear programming and sequential decisi...
Decentralized learning in finite Markov chains
The principal contribution of this paper is a new result on the decentralized control of finite Markov chains with unknown transition probabilities and rewards. One decentralize...
Markov Decision Processes: Discrete Stochastic Dynamic Programming.
From the Publisher: The past decade has seen considerable theoretical and applied research on Markov decision processes, as well as the growing use of these models in ecology, ...
Actor-Critic Reinforcement Learning with Energy-Based Policies
We consider reinforcement learning in Markov decision processes with high dimensional state and action spaces. We parametrize policies using energy-based models (particularly re...
Generalization in Reinforcement Learning: Safely Approximating the Value Function
A straightforward approach to the curse of dimensionality in reinforcement learning and dynamic programming is to replace the lookup table with a generalizing function approximat...
Publication Info
- Year: 1992
- Type: article
- Volume: 8
- Issue: 3-4
- Pages: 279-292
- Citations: 8791
- Access: Closed
Identifiers
- DOI: 10.1007/bf00992698