Links between Markov models and multilayer perceptrons

Abstract

The statistical use of a particular classic form of a connectionist system, the multilayer perceptron (MLP), is described in the context of the recognition of continuous speech. A discriminant hidden Markov model (HMM) is defined, and it is shown how a particular MLP with contextual and extra feedback input units can be considered as a general form of such a Markov model. A link between these discriminant HMMs, trained along the Viterbi algorithm, and any other approach based on least mean square minimization of an error function (LMSE) is established. It is shown theoretically and experimentally that the outputs of the MLP (when trained along the LMSE or the entropy criterion) approximate the probability distribution over output classes conditioned on the input, i.e. the maximum a posteriori probabilities. Results of a series of speech recognition experiments are reported. The possibility of embedding MLP into HMM is described. Relations with other recurrent networks are also explained.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>

Keywords

Hidden Markov modelViterbi algorithmPattern recognition (psychology)Computer scienceArtificial intelligenceMaximum-entropy Markov modelMultilayer perceptronDiscriminantConnectionismPerceptronMarkov chainArtificial neural networkMarkov modelMarkov processSpeech recognitionEntropy (arrow of time)Context (archaeology)Maximum a posteriori estimationMachine learningMathematicsVariable-order Markov modelMaximum likelihoodStatistics

Affiliated Institutions

Philips (Finland) FI

Related Publications

Sparse Multilayer Perceptron for Phoneme Recognition

G. S. V. S. Sivaram , Hynek Heřmanský

This paper introduces the sparse multilayer perceptron (SMLP) which jointly learns a sparse feature representation and nonlinear classifier boundaries to optimally discriminate ...

2011 IEEE Transactions on Audio Speech and... 65 citations

Global optimization of a neural network-hidden Markov model hybrid

Yoshua Bengio , Renato De Mori , Giovanni Flammia +1 more

An original method for integrating artificial neural networks (ANN) with hidden Markov models (HMM) is proposed. ANNs are suitable for performing phonetic classification, wherea...

2002 18 citations

Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling

Brian Kingsbury

Acoustic models used in hidden Markov model/neural-network (HMM/NN) speech recognition systems are usually trained with a frame-based cross-entropy error criterion. In contrast,...

2009 238 citations

Backpropagation training for multilayer conditional random field based phone recognition

Rohit Prabhavalkar , Eric Fosler‐Lussier

Conditional random fields (CRFs) have recently found increased popularity in automatic speech recognition (ASR) applications. CRFs have previously been shown to be effective com...

2010 31 citations

Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR

Oriol Vinyals , Suman Ravuri

In this paper, we extend the work done on integrating multilayer perceptron (MLP) networks with HMM systems via the Tandem approach. In particular, we explore whether the use of...

2011 55 citations

Publication Info

Year: 1990
Type: article
Volume: 12
Issue: 12
Pages: 1167-1178
Citations: 340
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Links between Markov models and multilayer perceptrons

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

340

OpenAlex

Cite This

APA Style

                            
                                    H. Bourlard, 
                                
                                    C. Wellekens
                                
                            (1990). 
                            Links between Markov models and multilayer perceptrons. 
                            IEEE Transactions on Pattern Analysis and Machine Intelligence
                            , 12
                            (12)
                            , 1167-1178.
                            https://doi.org/10.1109/34.62605

Identifiers

DOI: 10.1109/34.62605