Abstract
Large Language Models (LLMs) based on the Transformer architecture have achieved remarkable success, yet their core processing mechanisms remain largely static after training. This static nature limits their ability to dynamically adapt their processing strategy to nuanced contextual cues, task demands, or desired operational modes (e.g., shifting between exploration and exploitation). We propose Neuromodulatory Control Networks (NCNs), a novel architectural modification inspired by the neuromodulatory systems of the vertebrate brain (e.g., those utilizing dopamine, acetylcholine, and norepinephrine). NCNs are small, parallel networks that receive contextual input summarizing the global state, task information, or external control signals, and compute dynamic "modulatory signals". These signals are distributed as layer-specific control vectors that influence the main LLM's computational properties during a forward pass, analogous to how neuromodulators alter neuronal gain, plasticity, and network states across different cortical depths. Rather than merely routing information, NCNs aim to change how information is processed throughout the base model by modulating key components such as attention mechanisms (e.g., via precision scaling), layer gains, and activation functions. Crucially, the architecture allows the model to implicitly learn to self-regulate these parameters via backpropagation, effectively becoming its own "tuning expert." We further introduce formal stability mechanisms, including homeostatic regularization, to prevent collapse of the control manifold. This paper introduces the NCN architecture, details its components and implicit learning mechanism, discusses its conceptual advantages and potential failure modes (such as contextual stereotyping), and provides an open-source PyTorch implementation to facilitate community exploration and future empirical validation.
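To make the mechanism concrete, the sketch below shows one plausible PyTorch realization of the ideas in the abstract: a small controller pools a context summary, emits per-layer attention temperatures (a reading of "precision scaling") and residual gains, and a homeostatic penalty keeps the signals near a neutral set point. All names, shapes, and the softplus parameterization are illustrative assumptions on our part, not the paper's released implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NeuromodulatoryControlNetwork(nn.Module):
    """Illustrative NCN controller (hypothetical; not the paper's code).

    Maps a pooled summary of early hidden states to per-layer modulatory
    signals: an attention temperature (precision scaling) and a residual
    gain for each of the base model's layers.
    """

    def __init__(self, d_model: int, n_layers: int, d_ctrl: int = 64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(d_model, d_ctrl), nn.Tanh())
        # One small head per modulated property, one scalar per layer.
        self.attn_temp_head = nn.Linear(d_ctrl, n_layers)
        self.gain_head = nn.Linear(d_ctrl, n_layers)

    def forward(self, hidden: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
        # hidden: (batch, seq, d_model), e.g. embeddings or an early layer's output.
        ctx = self.encoder(hidden.mean(dim=1))                 # (batch, d_ctrl)
        # Softplus keeps the signals positive; the offset centers them near 1
        # so an untrained controller starts close to "no modulation".
        attn_temp = F.softplus(self.attn_temp_head(ctx)) + 0.5  # (batch, n_layers)
        gain = F.softplus(self.gain_head(ctx)) + 0.5            # (batch, n_layers)
        return attn_temp, gain

def homeostatic_regularizer(signals: torch.Tensor, set_point: float = 1.0) -> torch.Tensor:
    """One possible stability term: penalize drift from a neutral set point,
    discouraging the control manifold from collapsing onto extreme values."""
    return ((signals - set_point) ** 2).mean()

# Conceptual use inside layer i of the base model (pseudocode-level):
#   scores = (q @ k.transpose(-2, -1)) / (sqrt(d_head) * attn_temp[:, i, None, None])
#   attn   = scores.softmax(dim=-1)
#   hidden = hidden + gain[:, i, None, None] * sublayer_output
# The training loss could then add, e.g.:
#   loss = task_loss + 1e-3 * (homeostatic_regularizer(attn_temp)
#                              + homeostatic_regularizer(gain))
```

Dividing each layer's attention logits by a learned temperature sharpens or flattens that layer's attention distribution, and scaling the residual branch implements a per-layer gain; because both signals sit on the forward path, gradients from the task loss reach the controller directly, which is how the model can "implicitly learn to self-regulate" as the abstract describes.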
Publication Info
- Year: 2025
- Type: preprint
- Citations: 1325
- Access: Closed
Identifiers
- DOI: 10.5281/zenodo.17851047