Abstract
Techniques for optimizing criterion functions to train neural-net classifiers are investigated: three standard deterministic techniques (variable metric, conjugate gradient, and steepest descent) and a new stochastic technique. It is found that the stochastic technique is preferable on problems with large training sets and that the convergence rates of the variable metric and conjugate gradient techniques are similar.
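To make the contrast concrete, below is a minimal, illustrative sketch (not the paper's actual experiments) comparing full-batch steepest descent with a per-example stochastic update on a toy one-unit sigmoid classifier; the data, learning rates, and network size are assumptions chosen only for demonstration.

```python
# Illustrative sketch only: contrasts full-batch steepest descent with a
# per-example stochastic update on a tiny sigmoid classifier trained under
# a squared-error criterion. Toy data and hyperparameters are assumptions,
# not values from the paper.
import numpy as np

rng = np.random.default_rng(0)

# Toy two-class data: 200 points in 2-D, labels from a noisy linear rule.
X = rng.normal(size=(200, 2))
true_w = np.array([1.5, -2.0])
y = (X @ true_w + 0.3 * rng.normal(size=200) > 0).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss_and_grad(w, X, y):
    """Mean squared-error criterion and its gradient for a sigmoid unit."""
    p = sigmoid(X @ w)
    err = p - y
    loss = np.mean(err ** 2)
    grad = 2.0 * X.T @ (err * p * (1 - p)) / len(y)
    return loss, grad

# Steepest descent: one full-batch gradient step per epoch.
w_batch = np.zeros(2)
for epoch in range(200):
    _, g = loss_and_grad(w_batch, X, y)
    w_batch -= 0.5 * g

# Stochastic technique: update the weights after every training example.
w_sgd = np.zeros(2)
for epoch in range(20):
    for i in rng.permutation(len(y)):
        _, g = loss_and_grad(w_sgd, X[i:i + 1], y[i:i + 1])
        w_sgd -= 0.1 * g

print("batch loss:", loss_and_grad(w_batch, X, y)[0])
print("sgd   loss:", loss_and_grad(w_sgd, X, y)[0])
```

Each stochastic epoch costs roughly the same number of gradient evaluations as one full-batch step but applies many weight updates, which is why, on large training sets, per-example updates can reach a low criterion value with far fewer passes over the data.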
Publication Info
- Year: 1992
- Type: article
- Volume: 3
- Issue: 2
- Pages: 232-240
- Citations: 210
- Access: Closed
Identifiers
- DOI: 10.1109/72.125864
- PMID: 18276424