Abstract
We describe a general methodology for the design of large-scale recursive neural network architectures (DAG-RNNs), which comprises three fundamental steps: (1) representation of a given domain using suitable directed acyclic graphs (DAGs) to connect visible and hidden node variables; (2) parameterization of the relationship between each variable and its parent variables by feedforward neural networks; and (3) application of weight-sharing within appropriate subsets of DAG connections to capture stationarity and control model complexity. Here we use these principles to derive several specific classes of DAG-RNN architectures based on lattices, trees, and other structured graphs. These architectures can process a wide range of data structures with variable sizes and dimensions. While the resulting models remain probabilistic overall, their internal deterministic dynamics allow efficient propagation of information, as well as training by gradient descent, to tackle large-scale problems. These methods are used here to derive state-of-the-art predictors for protein structural features such as secondary structure (1D) and both fine- and coarse-grained contact maps (2D). Extensions, relationships to graphical models, and implications for the design of neural architectures are briefly discussed. The protein prediction servers are available over the Web at www.igb.uci.edu/tools.htm.
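To make the three steps concrete, the sketch below builds the simplest member of this family: a 1D chain DAG-RNN of the kind used for secondary structure prediction. The DAG consists of a left-to-right and a right-to-left hidden chain over the sequence (step 1), each hidden and output variable is computed from its parent variables by a small feedforward network (step 2), and the same networks are reused at every sequence position (step 3). The NumPy implementation, single-layer transition networks, tanh units, layer sizes, and three-class softmax output are illustrative assumptions, not the exact architecture from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def layer_params(n_in, n_out):
    """Weights and bias for a single feedforward layer (step 2)."""
    return rng.normal(0.0, 0.1, (n_out, n_in)), np.zeros(n_out)

def layer(params, x):
    W, b = params
    return np.tanh(W @ x + b)

n_in, n_hid, n_out = 20, 8, 3                      # residue encoding -> 3 output classes

# Step (3): one forward net, one backward net, one output net, shared at every position.
fwd = layer_params(n_in + n_hid, n_hid)            # left-to-right hidden chain
bwd = layer_params(n_in + n_hid, n_hid)            # right-to-left hidden chain
out_W, out_b = layer_params(n_in + 2 * n_hid, n_out)

def dag_rnn_1d(X):
    """X: (T, n_in) input sequence -> (T, n_out) class probabilities."""
    T = X.shape[0]
    hf = np.zeros((T + 1, n_hid))                  # hf[0] is the left boundary state
    hb = np.zeros((T + 1, n_hid))                  # hb[T] is the right boundary state
    for t in range(T):                             # step (1): forward DAG edges
        hf[t + 1] = layer(fwd, np.concatenate([X[t], hf[t]]))
    for t in reversed(range(T)):                   # step (1): backward DAG edges
        hb[t] = layer(bwd, np.concatenate([X[t], hb[t + 1]]))
    probs = np.empty((T, n_out))
    for t in range(T):                             # output combines input and both contexts
        z = out_W @ np.concatenate([X[t], hf[t + 1], hb[t]]) + out_b
        probs[t] = np.exp(z) / np.exp(z).sum()     # softmax over the output classes
    return probs

print(dag_rnn_1d(rng.normal(size=(50, n_in))).shape)   # (50, 3) for a 50-residue toy input
```

Because the hidden propagation is deterministic, training such a model amounts to backpropagating a loss (e.g., cross-entropy on the softmax outputs) through the unrolled DAG by gradient descent, as described in the abstract.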
Publication Info
- Year: 2000
- Type: article
- Volume: 1
- Citations: 62
- Access: Closed
Identifiers
- DOI: 10.1162/153244304773936054