Minimum prediction residual principle applied to speech recognition

Abstract

A computer system is described in which isolated words, spoken by a designated talker, are recognized through calculation of a minimum prediction residual. A reference pattern for each word to be recognized is stored as a time pattern of linear prediction coefficients (LPC). The total log prediction residual of an input signal is minimized by optimally registering the reference LPC onto the input autocorrelation coefficients using the dynamic programming algorithm (DP). The input signal is recognized as the reference word which produces the minimum prediction residual. A sequential decision procedure is used to reduce the amount of computation in DP. A frequency normalization with respect to the long-time spectral distribution is used to reduce effects of variations in the frequency response of telephone connections. The system has been implemented on a DDP-516 computer for the 200-word recognition experiment. The recognition rate for a designated male talker is 97.3 percent for telephone input, and the recognition time is about 22 times real time.

Keywords

Normalization (sociology)ResidualComputer scienceLinear predictionAutocorrelationSpeech recognitionComputationWord (group theory)SIGNAL (programming language)Dynamic programmingPattern recognition (psychology)AlgorithmArtificial intelligenceMathematicsStatistics

Related Publications

Dynamic programming algorithm optimization for spoken word recognition

Hiroaki Sakoe , Seibi Chiba

This paper reports on an optimum dynamic progxamming (DP) based time-normalization algorithm for spoken word recognition. First, a general principle of time-normalization is giv...

1978 IEEE Transactions on Acoustics Speech... 6280 citations

Comparison of optimal quantizations of speech reflection coefficients

Alfred Gray , Robert M. Gray , J. Markel

Four quantization schemes for the reflection coefficients obtained from linear prediction speech analysis are theoretically compared. The asymptotic performance of each scheme i...

1977 IEEE Transactions on Acoustics Speech... 37 citations

Product code vector quantizers for waveform and voice coding

Michael J. Sabin , Robert M. Gray

Memory and computation requirements imply fundamental limitations on the quality that can be achieved in vector quantization systems used for speech waveform coding and linear p...

1984 IEEE Transactions on Acoustics Speech... 180 citations

Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling

Brian Kingsbury

Acoustic models used in hidden Markov model/neural-network (HMM/NN) speech recognition systems are usually trained with a frame-based cross-entropy error criterion. In contrast,...

2009 238 citations

An Algorithm for Vector Quantizer Design

Y. Linde , A. Buzo , Robert M. Gray

An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data. The bas...

1980 IEEE Transactions on Communications 7180 citations

Publication Info

Year: 1975
Type: article
Volume: 23
Issue: 1
Pages: 67-72
Citations: 1588
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Minimum prediction residual principle applied to speech recognition

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1588

OpenAlex

Cite This

APA Style

                            
                                    Fumitada Itakura
                                
                            (1975). 
                            Minimum prediction residual principle applied to speech recognition. 
                            IEEE Transactions on Acoustics Speech and Signal Processing
                            , 23
                            (1)
                            , 67-72.
                            https://doi.org/10.1109/tassp.1975.1162641

Identifiers

DOI: 10.1109/tassp.1975.1162641