Abstract

The authors describe an experiment in the construction of perfect metrics for minimum-distance classification of character images. A perfect metric is one that, with high probability, is zero for correct classifications and non-zero for incorrect classifications. They promise excellent reject behavior in addition to good rank ordering. The approach is to infer from the training data faithful but concise representations of the empirical class-conditional distributions. In doing this, the authors have abandoned many visual simplifying assumptions about the distributions, e.g., that they are simply-connected, unimodal, convex, or parametric (e.g., Gaussian). The method requires unusually large and representative training sets, which we provide through pseudorandom generation of training samples using a realistic model of printing and imaging distortions. The authors illustrate the method on a challenging recognition problem: 3755 character classes of machine-print Chinese, in four typefaces, over a range of text sizes. In a test on over three million images, the perfect-metric classifier achieved better than 99% top-choice accuracy. In addition, it is shown that it is superior to a conventional parametric classifier.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">&gt;</ETX>

Keywords

Computer scienceArtificial intelligenceClassifier (UML)Parametric statisticsGaussianMetric (unit)TypefacePattern recognition (psychology)MathematicsMachine learningStatistics

Affiliated Institutions

Related Publications

Self-organising multilayer topographic mappings

Minimization of distortion measures requires multilayer mappings to be topographic. The author shows this only for tree-like multilayer networks. He also shows how to modify the...

1988 IEEE International Conference on Neur... 14 citations

Publication Info

Year
2002
Type
article
Pages
593-597
Citations
17
Access
Closed

External Links

Social Impact

Altmetric
PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

17
OpenAlex

Cite This

Tuyen Ho Thi Thanh, Henry S. Baird (2002). Perfect metrics. , 593-597. https://doi.org/10.1109/icdar.1993.395665

Identifiers

DOI
10.1109/icdar.1993.395665