Related Publications
Greedy function approximation: A gradient boosting machine.
Function estimation/approximation is viewed from the perspective of numerical optimization in function space, rather than parameter space. A connection is made between stagewise...
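The abstract's core idea, stagewise additive modeling as steepest descent in function space, can be sketched in a few lines. This is a minimal sketch under squared-error loss (where the negative functional gradient is simply the residual), not Friedman's full algorithm; the choice of decision-tree base learner and the hyperparameters are assumptions for illustration.

```python
# Minimal gradient-boosting sketch under squared-error loss: each stage fits a
# weak learner to the negative functional gradient, which here is the residual.
# Hyperparameters (n_rounds, learning_rate, max_depth) are illustrative.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def gradient_boost(X, y, n_rounds=100, learning_rate=0.1, max_depth=3):
    f0 = float(np.mean(y))                 # best constant fit for squared error
    pred = np.full(len(y), f0)
    trees = []
    for _ in range(n_rounds):
        residual = y - pred                # negative gradient of 1/2 * (y - F)^2
        tree = DecisionTreeRegressor(max_depth=max_depth).fit(X, residual)
        pred += learning_rate * tree.predict(X)   # stagewise additive update
        trees.append(tree)
    return f0, trees

def boosted_predict(f0, trees, X, learning_rate=0.1):
    return f0 + learning_rate * sum(t.predict(X) for t in trees)
```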
Training Recurrent Networks by Evolino
In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for...
Function Optimization using Connectionist Reinforcement Learning Algorithms
Any non-associative reinforcement learning algorithm can be viewed as a method for performing function optimization through (possibly noise-corrupted) sampling of function value...
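As a rough illustration of this view (not the paper's own algorithms), the sketch below uses a REINFORCE-style rule with a Gaussian search distribution to optimize a function from sampled values alone; the Gaussian policy, the running-average baseline, and all step sizes are assumptions.

```python
# Function optimization through (possibly noisy) sampling of function values:
# a REINFORCE-style update on the mean of a Gaussian search distribution.
import numpy as np

def reinforce_optimize(f, mu0=0.0, sigma=0.5, lr=0.01, n_steps=5000, seed=0):
    rng = np.random.default_rng(seed)
    mu, baseline = float(mu0), 0.0
    for _ in range(n_steps):
        x = rng.normal(mu, sigma)          # sample a candidate point
        r = f(x)                           # possibly noise-corrupted value
        baseline += 0.1 * (r - baseline)   # variance-reducing reward baseline
        # Score-function gradient of the Gaussian mean: (x - mu) / sigma**2
        mu += lr * (r - baseline) * (x - mu) / sigma**2
    return mu

# Maximize f(x) = -(x - 3)^2 starting from mu = 0; mu drifts toward 3.
print(reinforce_optimize(lambda x: -(x - 3.0) ** 2))
```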
Optimization for training neural nets
Various techniques of optimizing criterion functions to train neural-net classifiers are investigated. These techniques include three standard deterministic techniques (variable...
ADADELTA: An Adaptive Learning Rate Method
We present a novel per-dimension learning rate method for gradient descent called ADADELTA. The method dynamically adapts over time using only first order information and has mi...
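The update the ADADELTA abstract describes keeps decaying averages of squared gradients and squared parameter updates, and the ratio of their RMS values sets a per-dimension step size with no hand-tuned global learning rate. A minimal sketch, assuming the commonly cited defaults rho = 0.95 and eps = 1e-6; the toy driver loop is illustrative.

```python
# Per-dimension ADADELTA step: the RMS of recent updates over the RMS of
# recent gradients scales each coordinate, using only first-order information.
import numpy as np

def adadelta_step(x, grad, state, rho=0.95, eps=1e-6):
    state["Eg2"] = rho * state["Eg2"] + (1 - rho) * grad**2
    dx = -np.sqrt(state["Edx2"] + eps) / np.sqrt(state["Eg2"] + eps) * grad
    state["Edx2"] = rho * state["Edx2"] + (1 - rho) * dx**2
    return x + dx, state

# Toy usage: 500 steps on f(x) = x^2 (gradient 2x) starting from x = 5.
x = np.array([5.0])
state = {"Eg2": np.zeros_like(x), "Edx2": np.zeros_like(x)}
for _ in range(500):
    x, state = adadelta_step(x, 2 * x, state)
```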
Publication Info
- Year: 1995
- Type: book-chapter
- Pages: 30-37
- Citations: 948
- Access: Closed
Identifiers
- DOI: 10.1016/b978-1-55860-377-6.50013-x