Subject and citation indexing. Part II: The optimal, cluster-based retrieval performance of composite representations

1991 Journal of the American Society for Information Science 20 citations

Abstract

Measures of cluster-based retrieval effectiveness are computed for five composite representations in the cystic fibrosis (CF) Document Collection. The composite representations are constructed from combinations of two subject representations, based on Medical Subject Headings and subheadings, and two citation representations, consisting of the complete list of cited references and a comprehensive list of citations for each document. Experimental retrieval results are presented as a function of the exhaustivity and similarity of the composite representations and reveal consistent patterns from which optimal performance levels can be identified. The optimal performance values provide an assessment of the absolute capacity of each composite representation to associate documents relevant to the same query and discriminate between documents relevant to different queries in single-link hierarchies. The optimal performance values for all composite representations are completely comparable and are superior to the optimal performance of constituent representations. Optimal performance consistently occurs at low levels of exhaustivity. Exhaustive composite representations that include subject descriptions produce the lowest levels of performance; retrieval results derived from random structures are comparable to the observed results. The effectiveness of the exhaustive representation composed of references and citations is materially superior to the effectiveness of exhaustive composite representations that include subject descriptions. © 1991 John Wiley & Sons, Inc.

Keywords

Subject (documents)Information retrievalSearch engine indexingCitationCluster (spacecraft)Computer scienceComposite numberWorld Wide WebAlgorithm

Affiliated Institutions

Related Publications

Publication Info

Year
1991
Type
article
Volume
42
Issue
9
Pages
676-684
Citations
20
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

20
OpenAlex
0
Influential
9
CrossRef

Cite This

W. M. Shaw (1991). Subject and citation indexing. Part II: The optimal, cluster-based retrieval performance of composite representations. Journal of the American Society for Information Science , 42 (9) , 676-684. https://doi.org/10.1002/(sici)1097-4571(199110)42:9<676::aid-asi6>3.0.co;2-2

Identifiers

DOI
10.1002/(sici)1097-4571(199110)42:9<676::aid-asi6>3.0.co;2-2

Data Quality

Data completeness: 81%