Abstract

An on‐line document retrieval system is described which combines a data base management system with automatic processing of natural language queries and abstracts. Data consists of an abstract, from which index terms are automatically extracted, along with bibliographic and descriptive information. The data base management system is used to store bibliographic and descriptive information, providing direct access to documents with specified bibliographic or descriptor items. Methods originally developed in the SMART project are used for abstract analysis: stemming algorithm, cosine function for query‐document comparisons, ranked output, and clustered document collection. Searches are entered and performed on‐line, with output consisting of document abstracts ranked in decreasing order of similarity with the query. Additional facilities include off‐line searches, SDI, and display of data base statistics. Future plans and improvements are also discussed.
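
The cosine comparison and ranked output mentioned in the abstract can be illustrated with a minimal sketch. This is not the FIRST or SMART implementation: the function names (`term_vector`, `cosine`, `rank`), the raw term-frequency weighting, and the whitespace tokenizer are assumptions made for illustration, and stemming and clustering are left out entirely.

```python
import math
from collections import Counter


def term_vector(text):
    """Build a raw term-frequency vector (hypothetical weighting, no stemming)."""
    tokens = (tok.strip(".,;:!?()\"'").lower() for tok in text.split())
    return Counter(tok for tok in tokens if tok)


def cosine(a, b):
    """Cosine of the angle between two sparse term vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as matching nothing
    return dot / (norm_a * norm_b)


def rank(query, abstracts):
    """Return (doc_id, score) pairs in decreasing order of similarity to the query."""
    q = term_vector(query)
    scored = [(cosine(q, term_vector(text)), doc_id)
              for doc_id, text in abstracts.items()]
    # Sort by descending score; break ties by document id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(doc_id, score) for score, doc_id in scored]


if __name__ == "__main__":
    # Made-up abstracts, used only to exercise the functions above.
    docs = {
        "d1": "clustered document retrieval",
        "d2": "database statistics display",
        "d3": "document retrieval system with ranked output",
    }
    for doc_id, score in rank("document retrieval", docs):
        print(doc_id, round(score, 3))
```

Because the score is normalized by vector length, a short abstract that matches both query terms ranks above a longer abstract with the same two matches.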

Keywords

Computer science, Information retrieval, Cosine similarity, Document retrieval, Index (typography), Base (topology), Knowledge base, Similarity (geometry), Function (biology), Vector space model, Database, Data mining, World Wide Web, Artificial intelligence, Cluster analysis, Mathematics

Publication Info

Year: 1979
Type: article
Volume: 30
Issue: 1
Pages: 9-14
Citations: 30
Access: Closed

Citation Metrics

OpenAlex: 30
Influential: 0
CrossRef: 20

Cite This

Robert T. Dattola (1979). FIRST: Flexible Information Retrieval System for Text. Journal of the American Society for Information Science, 30(1), 9-14. https://doi.org/10.1002/asi.4630300103

Identifiers

DOI: 10.1002/asi.4630300103
