Abstract

Distributed word representations have recently been proven to be an invaluable resource for NLP. These representations are normally learned using neural networks and capture syntactic and semantic information about words. Informa-tion about word morphology and shape is nor-mally ignored when learning word representa-tions. However, for tasks like part-of-speech tag-ging, intra-word information is extremely use-ful, specially when dealing with morphologically rich languages. In this paper, we propose a deep neural network that learns character-level repre-sentation of words and associate them with usual word representations to perform POS tagging. Using the proposed approach, while avoiding the use of any handcrafted feature, we produce state-of-the-art POS taggers for two languages: En-glish, with 97.32 % accuracy on the Penn Tree-bank WSJ corpus; and Portuguese, with 97.47% accuracy on the Mac-Morpho corpus, where the latter represents an error reduction of 12.2 % on the best previous known result. 1.

Keywords

TreebankComputer scienceNatural language processingArtificial intelligenceWord (group theory)Character (mathematics)Feature (linguistics)Part-of-speech taggingPart of speechArtificial neural networkRepresentation (politics)Speech recognitionParsingLinguistics

Affiliated Institutions

Related Publications

Publication Info

Year
2014
Type
article
Pages
1818-1826
Citations
555
Access
Closed

External Links

Citation Metrics

555
OpenAlex

Cite This

Cícero dos Santos, Bianca Zadrozny (2014). Learning Character-level Representations for Part-of-Speech Tagging. , 1818-1826.