Abstract
Surveying a suite of algorithms that offer a solution to managing large document archives.
Keywords
Affiliated Institutions
Related Publications
Probabilistic Latent Semantic Indexing
Probabilistic Latent Semantic Indexing is a novel approach to automated document indexing which is based on a statistical latent class model for factor analysis of count data. F...
quanteda: An R package for the quantitative analysis of textual data
quanteda is an R package providing a comprehensive workflow and toolkit for natural language processing tasks such as corpus management, tokenization, analysis, and visualizatio...
Factored neural language models
We present a new type of neural probabilistic language model that learns a mapping from both words and explicit word features into a continuous space that is then used for word ...
Foundations of statistical natural language processing
Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction to statistical na...
Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques
We present sentiment analyzer (SA) that extracts sentiment (or opinion) about a subject from online text documents. Instead of classifying the sentiment of an entire document ab...
Publication Info
- Year
- 2012
- Type
- article
- Volume
- 55
- Issue
- 4
- Pages
- 77-84
- Citations
- 5358
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1145/2133806.2133826