Sigmoid-weighted linear units for neural network function approximation in reinforcement learning

Abstract

In recent years, neural networks have enjoyed a renaissance as function approximators in reinforcement learning. Two decades after Tesauro's TD-Gammon achieved near top-level human performance in backgammon, the deep reinforcement learning algorithm DQN achieved human-level performance in many Atari 2600 games. The purpose of this study is twofold. First, we propose two activation functions for neural network function approximation in reinforcement learning: the sigmoid-weighted linear unit (SiLU) and its derivative function (dSiLU). The activation of the SiLU is computed by the sigmoid function multiplied by its input. Second, we suggest that the more traditional approach of using on-policy learning with eligibility traces, instead of experience replay, and softmax action selection can be competitive with DQN, without the need for a separate target network. We validate our proposed approach by, first, achieving new state-of-the-art results in both stochastic SZ-Tetris and Tetris with a small 10 × 10 board, using TD(λ) learning and shallow dSiLU network agents, and, then, by outperforming DQN in the Atari 2600 domain by using a deep Sarsa(λ) agent with SiLU and dSiLU hidden units.

Keywords

Sigmoid functionArtificial neural networkReinforcement learningReinforcementComputer scienceArtificial intelligenceFunction approximationFunction (biology)MathematicsPsychology

MeSH Terms

Deep LearningNeural NetworksComputer

Affiliated Institutions

Okinawa Institute of Science and Technology Graduate University JP

Related Publications

Deep Reinforcement Learning with Double Q-Learning

Hado van Hasselt , Arthur Guez , David Silver

The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are comm...

2016 Proceedings of the AAAI Conference on... 3514 citations

Massively Parallel Methods for Deep Reinforcement Learning

Arun Sukumaran Nair , P. Srinivasan , Sam Blackwell +11 more

We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour...

2015 arXiv (Cornell University) 405 citations

Rainbow: Combining Improvements in Deep Reinforcement Learning

Matteo Hessel , Joseph Modayil , Hado van Hasselt +7 more

The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and ...

2018 Proceedings of the AAAI Conference on... 1630 citations

TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play

Gerald Tesauro

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results, based on the TD(λ) reinforcement le...

1994 Neural Computation 783 citations

Playing Atari with Deep Reinforcement Learning

Alex Graves , Ioannis Antonoglou , Daan Wierstra +4 more

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolu...

2013 arXiv (Cornell University) 5109 citations

Publication Info

Year: 2018
Type: article
Volume: 107
Pages: 3-11
Citations: 1643
Access: Closed

External Links

Download PDF (Free) View on DOI.org arXiv PubMed Semantic Scholar

Social Impact

Altmetric

Sigmoid-weighted linear units for neural network function approximation in reinforcement learning

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1643

OpenAlex

Influential

1440

CrossRef

Cite This

APA Style

                            
                                    Stefan Elfwing, 
                                
                                    Eiji Uchibe, 
                                
                                    Kenji Doya
                                
                            (2018). 
                            Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. 
                            Neural Networks
                            , 107
                            
                            , 3-11.
                            https://doi.org/10.1016/j.neunet.2017.12.012

Identifiers

DOI: 10.1016/j.neunet.2017.12.012
PMID: 29395652
arXiv: 1702.03118

Data Quality

Data completeness: 93%