Abstract

Delayed reinforcement learning is an attractive framework for the unsupervised learning of action policies for autonomous agents. Some existing delayed reinforcement learning techniques have shown promise in simple domains, but a number of hurdles must be cleared before they are applicable to realistic problems. This paper describes one such difficulty, the input generalization problem (whereby the system must generalize to produce similar actions in similar situations), and an implemented solution, the G algorithm. The algorithm recursively splits the state space based on statistical measures of differences in the reinforcement received. Connectionist backpropagation has previously been used for input generalization in reinforcement learning; we compare the two techniques analytically and empirically. The G algorithm's sound statistical basis makes it easy to predict when it should and should not work, whereas the behavior of backpropagation is unpredictable. We found that a previously successful use of backpropagation can be explained by the linearity of the application domain, and that in another domain G reliably found the optimal policy, whereas none of a set of backpropagation runs with many combinations of parameters did.
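The abstract only sketches the G algorithm's splitting criterion. As a rough illustration, assuming that the relevance of a candidate input bit is judged by a two-sample t-test on the reinforcement received under its two values, a minimal Python sketch of such a split test might look like the following. The function names, the Welch form of the test, and the threshold are illustrative assumptions, not the paper's exact formulation:

```python
import math
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    """Two-sample t statistic (Welch's form) comparing the reinforcement
    values observed under the two values of a candidate input bit.
    Returns 0.0 when either sample is too small to estimate a variance."""
    na, nb = len(sample_a), len(sample_b)
    if na < 2 or nb < 2:
        return 0.0
    va, vb = variance(sample_a), variance(sample_b)
    denom = math.sqrt(va / na + vb / nb)
    if denom == 0:
        return 0.0
    return abs(mean(sample_a) - mean(sample_b)) / denom

def should_split(rewards_bit_0, rewards_bit_1, threshold=2.0):
    """Split the current state-space region on this bit only if the
    reinforcement received differs significantly between its two values."""
    return welch_t(rewards_bit_0, rewards_bit_1) > threshold

# Example: reinforcement collected while the bit was 0 vs. 1.
print(should_split([0.1, 0.0, 0.2, 0.1], [0.9, 1.0, 0.8, 1.1]))  # True
print(should_split([0.1, 0.0, 0.2, 0.1], [0.1, 0.2, 0.0, 0.1]))  # False
```

In the full algorithm, a region that passes such a test would be split on the offending bit and statistics re-collected recursively within each half; the sketch above shows only the statistical decision at a single candidate split.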

Keywords

Backpropagation, Reinforcement learning, Generalization, Computer science, Artificial intelligence, Set (abstract data type), Machine learning, Stability (learning theory), Artificial neural network, Connectionism, Algorithm, Mathematics

Publication Info

Year: 1991
Type: Article
Pages: 726-731
Citations: 249
Access: Closed

Citation Metrics

249 (OpenAlex)

Cite This

David Chapman, Leslie Pack Kaelbling (1991). Input generalization in delayed reinforcement learning: an algorithm and performance comparisons. In Proceedings of the Twelfth International Joint Conference on Artificial Intelligence (IJCAI-91), 726-731.