Abstract

Unsupervised word representations are very useful in NLP tasks both as inputs to learning algorithms and as extra word features in NLP systems. However, most of these models are built with only local context and one representation per word. This is problematic because words are often polysemous and global context can also provide useful information for learning word meanings. We present a new neural network architecture which 1) learns word embeddings that better capture the semantics of words by incorporating both local and global document context, and 2) accounts for homonymy and polysemy by learning multiple embeddings per word. We introduce a new dataset with human judgments on pairs of words in sentential context, and evaluate our model on it, showing that our model outperforms competitive baselines and other neural language models.
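The abstract's second contribution, multiple embeddings per word, corresponds at a high level to clustering each word's occurrence contexts and keeping one vector per cluster. The sketch below illustrates that idea only; the toy embeddings, window size, helper names (context_vector, word_prototypes), and plain k-means (the paper clusters idf-weighted context averages with spherical k-means) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def context_vector(tokens, i, emb, window=5):
    """Average the embeddings of the words within `window` positions of token i."""
    lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
    ctx = [emb[t] for j, t in enumerate(tokens[lo:hi], start=lo) if j != i and t in emb]
    dim = len(next(iter(emb.values())))
    return np.mean(ctx, axis=0) if ctx else np.zeros(dim)

def word_prototypes(corpus, target, emb, k=3, iters=20, seed=0):
    """Cluster the occurrence contexts of `target` and return one vector per cluster
    (plain k-means here; the paper uses spherical k-means on idf-weighted contexts)."""
    X = np.array([context_vector(sent, i, emb)
                  for sent in corpus for i, t in enumerate(sent) if t == target])
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=min(k, len(X)), replace=False)]
    for _ in range(iters):
        # Assign each occurrence to its nearest prototype, then recompute centers.
        assign = np.argmin(((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1), axis=1)
        for c in range(len(centers)):
            if np.any(assign == c):
                centers[c] = X[assign == c].mean(axis=0)
    return centers  # one embedding per induced word sense

# Toy usage: random vectors stand in for embeddings learned by the network.
rng = np.random.default_rng(1)
vocab = ["the", "bank", "approved", "loan", "water", "flowed", "past", "river"]
emb = {w: rng.standard_normal(50) for w in vocab}
corpus = [["the", "bank", "approved", "the", "loan"],
          ["water", "flowed", "past", "the", "river", "bank"]]
prototypes = word_prototypes(corpus, "bank", emb, k=2)
print(prototypes.shape)  # (2, 50): two sense-specific "bank" vectors
```

In this toy run, the two occurrences of "bank" (financial vs. river context) end up in separate clusters, so each receives its own prototype vector; at scale, occurrences are re-labeled with their cluster before training sense-specific embeddings.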

Keywords

Polysemy, Computer science, Word, Natural language processing, Artificial intelligence, Context, Semantics, Context model, Representation, SemEval, Artificial neural network, Linguistics

Related Publications

Word Space

Representations for semantic information about words are necessary for many applications of neural networks in natural language processing. This paper describes an efficient, co...

1992, Neural Information Processing Systems, 212 citations

Publication Info

Year: 2012
Type: Article
Pages: 873-882
Citations: 1099
Access: Closed

Citation Metrics

1099 (OpenAlex)

Cite This

Eric Huang, Richard Socher, Christopher D. Manning, et al. (2012). Improving Word Representations via Global Context and Multiple Word Prototypes, 873-882.