Abstract
Many problems in voice recognition and audio processing involve feature extraction from raw waveforms. The goal of feature extraction is to reduce the dimensionality of the audio signal while preserving the informative signatures that, for example, distinguish different phonemes in speech or identify particular instruments in music. If the acoustic variability of a data set is described by a small number of continuous features, then we can imagine the data as lying on a low dimensional manifold in the high dimensional space of all possible waveforms. Locally linear embedding (LLE) is an unsupervised learning algorithm for feature extraction in this setting. In this paper, we present results from the exploratory analysis and visualization of speech and music by LLE. 1.
Keywords
Affiliated Institutions
Related Publications
Nonlinear Dimensionality Reduction by Locally Linear Embedding
Many areas of science depend on exploratory data analysis and visualization. The need to analyze large amounts of multivariate data raises the fundamental problem of dimensional...
Spectral Distortion Measures for Speech Compression.
In recent years several measures of distortion between speech waveforms have been proposed as substitutes for the traditional but subjectively inadequate mean-squared error. All...
Super-resolution through neighbor embedding
In this paper, we propose a novel method for solving single-image super-resolution problems. Given a low-resolution image as input, we recover its highresolution counterpart usi...
Querying Large Collections of Music for Similarity
We present a system capable of performing similarity queries against a large archive of digital music. Users are able to search for songs which "sound similar" to a given query ...
Learning Eigenfunctions Links Spectral Embedding and Kernel PCA
In this letter, we show a direct relation between spectral embedding methods and kernel principal components analysis and how both are special cases of a more general learning p...
Publication Info
- Year
- 2004
- Type
- article
- Volume
- 3
- Pages
- iii-984
- Citations
- 46
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1109/icassp.2004.1326712