First- and Second-Order Methods for Learning: Between Steepest Descent and Newton's Method

1992 · Neural Computation · 1,187 citations

Abstract

On-line first-order backpropagation is sufficiently fast and effective for many large-scale classification problems but for very high precision mappings, batch processing may be the method of choice. This paper reviews first- and second-order optimization methods for learning in feedforward neural networks. The viewpoint is that of optimization: many methods can be cast in the language of optimization techniques, allowing the transfer to neural nets of detailed results about computational complexity and safety procedures to ensure convergence and to avoid numerical problems. The review is not intended to deliver detailed prescriptions for the most appropriate methods in specific applications, but to illustrate the main characteristics of the different methods and their mutual relations.
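
For readers skimming the abstract, the two extremes named in the title can be summarized by their weight-update rules: steepest descent takes repeated steps along the negative gradient scaled by a learning rate, while Newton's method rescales the gradient by the inverse Hessian. The Python sketch below is not taken from the paper; the toy quadratic loss, the matrix A, and the step size are illustrative assumptions chosen only to show the contrast.

# Minimal sketch (not from the paper): contrast the two update rules named in the
# title on a toy quadratic loss E(w) = 0.5 * w^T A w - b^T w, whose gradient is
# A w - b and whose Hessian is the constant matrix A.
import numpy as np

A = np.array([[3.0, 0.5],
              [0.5, 1.0]])   # toy Hessian (assumed symmetric positive definite)
b = np.array([1.0, -2.0])

def grad(w):
    """First-order information only: the gradient of the toy loss."""
    return A @ w - b

# Steepest descent: repeated small steps along the negative gradient.
w_sd = np.zeros(2)
eta = 0.1                    # hand-tuned learning rate (illustrative value)
for _ in range(100):
    w_sd -= eta * grad(w_sd)

# Newton's method: rescale the gradient by the inverse Hessian.
# On a quadratic loss this reaches the minimizer in a single step.
w_newton = np.zeros(2)
w_newton -= np.linalg.solve(A, grad(w_newton))

print("steepest descent, 100 steps:", w_sd)
print("Newton, 1 step:             ", w_newton)
print("exact minimizer A^{-1} b:   ", np.linalg.solve(A, b))

On this quadratic the single Newton step lands exactly on the minimizer, whereas steepest descent needs many small steps; the methods reviewed in the paper occupy the middle ground, approximating second-order information at a cost closer to that of a gradient step.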

Keywords

Computer science, Backpropagation, Artificial neural network, Convergence, Gradient descent, Artificial intelligence, Feedforward neural network, Mathematical optimization, Feed forward, Algorithm, Mathematics

Related Publications

Optimization for training neural nets

Various techniques of optimizing criterion functions to train neural-net classifiers are investigated. These techniques include three standard deterministic techniques (variable...

1992 · IEEE Transactions on Neural Networks · 210 citations

Publication Info

Year
1992
Type
article
Volume
4
Issue
2
Pages
141-166
Citations
1,187
Access
Closed

Citation Metrics

1,187 (OpenAlex)

Cite This

Roberto Battiti (1992). First- and Second-Order Methods for Learning: Between Steepest Descent and Newton's Method. Neural Computation, 4(2), 141-166. https://doi.org/10.1162/neco.1992.4.2.141

Identifiers

DOI
10.1162/neco.1992.4.2.141