Abstract
Abstract An on‐line document retrieval system is described which combines a data base management system with automatic processing of natural language queries and abstracts. Data consists of an abstract, from which index terms are automatically extracted, along with bibliographic and descriptive information. The data base management system is used to store bibliographic and descriptive information, providing direct access to documents with specified bibliographic or descriptor items. Methods originally developed in the SMART project are used for abstract analysis: stemming algorithm, cosine function for query‐document comparisons, ranked output, and clustered document collection. Searches are entered and performed on‐line, with output consisting of document abstracts ranked in decreasing order of similarity with the query. Additional facilities include off‐line searches, SDI, and display of data base statistics. Future plans and improvements are also discussed.
Keywords
Affiliated Institutions
Related Publications
BLAST+: architecture and applications
Abstract Background Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its ...
The Grid file: A data structure designed to support proximity queries on spatial objects
Abstract : This document describes a technique for storing large sets of spatial objects so that proximity queries are handled efficiently as part of the accessing mechanism. Th...
Document Language Models, Query Models, and Risk Minimization for Information Retrieval
We present a framework for information retrieval that combines document models and query models using a probabilistic ranking function based on Bayesian decision theory. The fra...
Secure statistical databases with random sample queries
A new inference control, called random sample queries, is proposed for safeguarding confidential data in on-line statistical databases. The random sample queries control deals d...
HMMER web server: interactive sequence similarity searching
HMMER is a software suite for protein sequence similarity searches using probabilistic methods. Previously, HMMER has mainly been available only as a computationally intensive U...
Publication Info
- Year
- 1979
- Type
- article
- Volume
- 30
- Issue
- 1
- Pages
- 9-14
- Citations
- 30
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1002/asi.4630300103