Abstract

This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain.

Keywords

Reinforcement learningInferenceFree energy principleComputer scienceArtificial intelligencePerceptionAction (physics)Machine learningBenchmark (surveying)Dynamic programmingReinforcementEnergy (signal processing)PsychologyMathematicsSocial psychologyNeuroscienceAlgorithm

Affiliated Institutions

Related Publications

A theory of cortical responses

This article concerns the nature of evoked brain responses and the principles underlying their generation. We start with the premise that the sensory brain has evolved to repres...

2005 Philosophical Transactions of the Roy... 4533 citations

Publication Info

Year
2009
Type
article
Volume
4
Issue
7
Pages
e6421-e6421
Citations
424
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

424
OpenAlex

Cite This

Karl Friston, Jean Daunizeau, Stefan J. Kiebel (2009). Reinforcement Learning or Active Inference?. PLoS ONE , 4 (7) , e6421-e6421. https://doi.org/10.1371/journal.pone.0006421

Identifiers

DOI
10.1371/journal.pone.0006421