Abstract

In our attempts to understand cellular function at the molecular level, we must be able to synthesize information from disparate types of genomic data. We consider the problem of inferring gene functional classifications from a heterogeneous data set consisting of DNA microarray expression measurements and phylogenetic profiles from whole-genome sequence comparisons. We demonstrate the application of the support vector machine (SVM) learning algorithm to this functional inference task. Our results suggest the importance of exploiting prior information about the heterogeneity of the data. In particular, we propose an SVM kernel function that is explicitly heterogeneous. We also show how to use knowledge about heterogeneity to aid in feature selection.

Keywords

Computer scienceSupport vector machineKernel (algebra)InferenceFeature selectionArtificial intelligenceSet (abstract data type)Function (biology)Machine learningKernel methodGenomeData miningComputational biologyGeneBiologyMathematicsGenetics

Affiliated Institutions

Related Publications

Publication Info

Year
2001
Type
article
Pages
249-255
Citations
273
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

273
OpenAlex

Cite This

Paul Pavlidis, Jason Weston, Jinsong Cai et al. (2001). Gene functional classification from heterogeneous data. , 249-255. https://doi.org/10.1145/369133.369228

Identifiers

DOI
10.1145/369133.369228