Abstract
One of the essential features of the "Meaning <=> Text" model (MTM) developed by I. A. Mel'chuk et. al. is the special lexicon or ECD ('explanatory and combinatory' dictionary). This component can be thought of as a collection independent thesaurus, and can be applied to improve the effectiveness of an information retrieval system.After outlining the MTM and related work, this paper briefly describes the SMART type of information retrieval system. Applicability of the above-mentioned lexicon to such a system is discussed. In particular, the list of lexical relations included in the ECD is expanded and organized to be more effective for retrieval, partially along the lines suggested by Evens and Smith.Finally, an experimental analysis of the utility of lexical relations in an information retrieval system is discussed. It is shown that lexical relations generally enhance system performance. When all lexical relations are considered in the comparison, the resulting performance is shown, by statistical methods, to make a significant improvement (up to 16.5% at a single recall level); when all lexical relations except for antonyms are considered, the improvement is even greater (up to 20.2% at a single recall level).
Keywords
Affiliated Institutions
Related Publications
Word‐word associations in document retrieval systems
Abstract The SMART automatic document retrieval system is used to study association procedures for automatic content analysis. The effect of word frequency and other parameters ...
Access to the Internal Lexicon
In order to understand how a word is read for meaning, we need to know how a reader proceeds from the printed representation of a word to the word's entry in the reader's intern...
Learning Subjective Adjectives from Corpora
Subjectivity tagging is distinguishing sentences used to present opinions and evaluations from sentences used to objectively present factual information. There are numerous appl...
PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more
PolySearch2 (http://polysearch.ca) is an online text-mining system for identifying relationships between biomedical entities such as human diseases, genes, SNPs, proteins, drugs...
Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques
We present sentiment analyzer (SA) that extracts sentiment (or opinion) about a subject from online text documents. Instead of classifying the sentiment of an entire document ab...
Publication Info
- Year
- 1980
- Type
- article
- Volume
- 15
- Issue
- 3
- Pages
- 5-36
- Citations
- 62
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1145/1095403.1095404