Abstract

Hidden Markov models (HMMs) have been successfully applied to the tasks of transmembrane protein topology prediction and signal peptide prediction. In this paper we expand upon this work by making use of the more powerful class of dynamic Bayesian networks (DBNs). Our model, Philius, is inspired by a previously published HMM, Phobius, and combines a signal peptide submodel with a transmembrane submodel. We introduce a two-stage DBN decoder that combines the power of posterior decoding with the grammar constraints of Viterbi-style decoding. Philius also provides protein type, segment, and topology confidence metrics to aid in the interpretation of the predictions. We report a relative improvement of 13% over Phobius in full-topology prediction accuracy on transmembrane proteins, and a sensitivity and specificity of 0.96 in detecting signal peptides. We also show that our confidence metrics correlate well with the observed precision. In addition, we have made predictions on all 6.3 million proteins in the Yeast Resource Center (YRC) database. This large-scale study provides an overall picture of the relative numbers of proteins that include a signal-peptide and/or one or more transmembrane segments as well as a valuable resource for the scientific community. All DBNs are implemented using the Graphical Models Toolkit. Source code for the models described here is available at http://noble.gs.washington.edu/proj/philius. A Philius Web server is available at http://www.yeastrc.org/philius, and the predictions on the YRC database are available at http://www.yeastrc.org/pdr.

Keywords

Computer scienceHidden Markov modelDecoding methodsBayesian probabilityArtificial intelligenceViterbi algorithmTopology (electrical circuits)Pattern recognition (psychology)AlgorithmMathematics

MeSH Terms

Artificial IntelligenceBayes TheoremComputational BiologyFungal ProteinsMarkov ChainsMembrane ProteinsModelsMolecularNeural NetworksComputerProtein ConformationProtein Sorting SignalsReproducibility of ResultsYeasts

Affiliated Institutions

Related Publications

Publication Info

Year
2008
Type
article
Volume
4
Issue
11
Pages
e1000213-e1000213
Citations
262
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

262
OpenAlex
26
Influential
226
CrossRef

Cite This

Sheila M. Reynolds, Lukas Käll, Michael Riffle et al. (2008). Transmembrane Topology and Signal Peptide Prediction Using Dynamic Bayesian Networks. PLoS Computational Biology , 4 (11) , e1000213-e1000213. https://doi.org/10.1371/journal.pcbi.1000213

Identifiers

DOI
10.1371/journal.pcbi.1000213
PMID
18989393
PMCID
PMC2570248

Data Quality

Data completeness: 86%