Backpropagation training for multilayer conditional random field based phone recognition

Abstract

Conditional random fields (CRFs) have recently found increased popularity in automatic speech recognition (ASR) applications. CRFs have previously been shown to be effective combiners of posterior estimates from multilayer perceptrons (MLPs) in phone and word recognition tasks. In this paper, we describe a novel hybrid Multilayer-CRF structure (ML-CRF), where a MLP-like hidden layer serves as input to the CRF; moreover, we propose a technique for directly training the ML-CRF to optimize a conditional log-likelihood based criterion, based on error backpropagation. The proposed technique thus allows for the implicit learning of suitable feature functions for the CRF. We present results for initial phone recognition experiments on the TIMIT database that indicate that our proposed method is a promising approach for training CRFs.

Keywords

Conditional random fieldCRFSComputer scienceBackpropagationPerceptronTIMITArtificial intelligenceSpeech recognitionPattern recognition (psychology)Multilayer perceptronFeature (linguistics)Feature extractionArtificial neural networkMachine learningHidden Markov model

Affiliated Institutions

The Ohio State University US

Related Publications

Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition

Dong Yu , Sabato Marco Siniscalchi , Li Deng +1 more

Generation of high-precision sub-phonetic attribute (also known as phonological features) and phone lattices is a key frontend component for detection-based bottom-up speech rec...

2012 64 citations

Sparse Multilayer Perceptron for Phoneme Recognition

G. S. V. S. Sivaram , Hynek Heřmanský

This paper introduces the sparse multilayer perceptron (SMLP) which jointly learns a sparse feature representation and nonlinear classifier boundaries to optimally discriminate ...

2011 IEEE Transactions on Audio Speech and... 65 citations

Speech Recognition Using Augmented Conditional Random Fields

Yasser Hifny , Steve Renals

Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although HMMs are a natural choice to warp the time...

2009 IEEE Transactions on Audio Speech and... 82 citations

Improved phone recognition using Bayesian triphone models

Ming Jiang , F.J. Smith

A crucial issue in triphone based continuous speech recognition is the large number of models to be estimated against the limited availability of training data. This problem can...

2002 45 citations

Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling

Brian Kingsbury

Acoustic models used in hidden Markov model/neural-network (HMM/NN) speech recognition systems are usually trained with a frame-based cross-entropy error criterion. In contrast,...

2009 238 citations

Publication Info

Year: 2010
Type: article
Pages: 5534-5537
Citations: 31
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Backpropagation training for multilayer conditional random field based phone recognition

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

OpenAlex

Cite This

APA Style

                            
                                    Rohit Prabhavalkar, 
                                
                                    Eric Fosler‐Lussier
                                
                            (2010). 
                            Backpropagation training for multilayer conditional random field based phone recognition. 
                            
                            , 5534-5537.
                            https://doi.org/10.1109/icassp.2010.5495222

Identifiers

DOI: 10.1109/icassp.2010.5495222