Previous |  Up |  Next

Article

Title: The Bhattacharyya metric as an absolute similarity measure for frequency coded data (English)
Author: Aherne, Frank J.
Author: Thacker, Neil A.
Author: Rockett, Peter I.
Language: English
Journal: Kybernetika
ISSN: 0023-5954
Volume: 34
Issue: 4
Year: 1998
Pages: [363]-368
Summary lang: English
.
Category: math
.
Summary: This paper highlights advantageous properties of the Bhattacharyya metric over the chi-squared statistic for comparing frequency distributed data. The original interpretation of the Bhattacharyya metric as a geometric similarity measure is reviewed and it is pointed out that this derivation is independent of the use of the Bhattacharyya measure as an upper bound on the probability of misclassification in a two-class problem. The affinity between the Bhattacharyya and Matusita measures is described and we suggest use of the Bhattacharyya measure for comparing histogram data. We explain how the chi- squared statistic compensates for the implicit assumption of a Euclidean distance measure being the shortest path between two points in high dimensional space. By using the square-root transformation the Bhattacharyya metric requires no such standardization and by its multiplicative nature has no singularity problems (unlike those caused by the denominator of the chi- squared statistic) with zero count-data. (English)
Keyword: chi-square statistic
MSC: 62B10
MSC: 62G99
MSC: 62H99
MSC: 68T99
idZBL: Zbl 1274.62058
idMR: MR1658937
.
Date available: 2009-09-24T19:17:09Z
Last updated: 2015-03-28
Stable URL: http://hdl.handle.net/10338.dmlcz/135216
.
Reference: [1] Aherne F. J., Thacker N. A., Rockett P. I.: Optimal pairwise geometric histograms.In: Proc. 8th British Machine Vision Conf., Colchester 1997, pp. 480–490
Reference: [2] Bhattacharyya A.: On a measure of divergence between two statistical populations defined by their probability distributions.Bull. Calcutta Math. Soc. 35 (1943), 99–110 Zbl 0063.00364, MR 0010358
Reference: [3] Christensen R.: Linear Models for Multivariate Time Series and Spatial Data.Springer–Verlag, New York 1991 Zbl 0717.62079, MR 1081535
Reference: [4] Fukanaga K.: Introduction to Statistical Pattern Recognition.Second edition. Academic Press, New York 1990 MR 1075415
Reference: [5] Matusita K.: Decision rules based on distance for problems of fit, two samples and estimation.Ann. Math. Statist. 26 (1955), 631–641 MR 0073899, 10.1214/aoms/1177728422
Reference: [6] Thacker N. A., Abraham, I., Courtney P. G.: Supervised learning extensions to the CLAM network.Neural Network J. 10 (1997), 2, 315–326 10.1016/S0893-6080(96)00074-3
.

Files

Files Size Format View
Kybernetika_34-1998-4_2.pdf 927.5Kb application/pdf View/Open
Back to standard record
Partner of
EuDML logo