Abstract

We consider the problem of learning probabilistic models for complex relational structures between various types of objects. A model can help us “understand ” a dataset of relational facts in at least two ways, by finding interpretable structure in the data, and by supporting predictions, or inferences about whether particular unobserved relations are likely to be true. Often there is a tradeoff between these two aims: cluster-based models yield more easily interpretable representations, while factorization-based approaches have given better predictive performance on large data sets. We introduce the Bayesian Clustered Tensor Factorization (BCTF) model, which embeds a factorized representation of relations in a nonparametric Bayesian clustering framework. Inference is fully Bayesian but scales well to large data sets. The model simultaneously discovers interpretable clusters and yields predictive performance that matches or beats previous probabilistic models for relational data. 1

Keywords

Computer scienceStatistical relational learningCluster analysisInferenceBayesian probabilityArtificial intelligenceMachine learningRelational databaseProbabilistic logicData miningBayesian inferenceRepresentation (politics)FactorizationAlgorithm

Affiliated Institutions

Related Publications

Publication Info

Year
2009
Type
article
Volume
22
Pages
1821-1828
Citations
241
Access
Closed

External Links

Citation Metrics

241
OpenAlex

Cite This

Ilya Sutskever, Joshua B. Tenenbaum, Ruslan Salakhutdinov (2009). Modelling Relational Data using Bayesian Clustered Tensor Factorization. DSpace@MIT (Massachusetts Institute of Technology) , 22 , 1821-1828.