Title:
|
The Bhattacharyya metric as an absolute similarity measure for frequency coded data (English) |
Author:
|
Aherne, Frank J. |
Author:
|
Thacker, Neil A. |
Author:
|
Rockett, Peter I. |
Language:
|
English |
Journal:
|
Kybernetika |
ISSN:
|
0023-5954 |
Volume:
|
34 |
Issue:
|
4 |
Year:
|
1998 |
Pages:
|
[363]-368 |
Summary lang:
|
English |
. |
Category:
|
math |
. |
Summary:
|
This paper highlights advantageous properties of the Bhattacharyya metric over the chi-squared statistic for comparing frequency distributed data. The original interpretation of the Bhattacharyya metric as a geometric similarity measure is reviewed and it is pointed out that this derivation is independent of the use of the Bhattacharyya measure as an upper bound on the probability of misclassification in a two-class problem. The affinity between the Bhattacharyya and Matusita measures is described and we suggest use of the Bhattacharyya measure for comparing histogram data. We explain how the chi- squared statistic compensates for the implicit assumption of a Euclidean distance measure being the shortest path between two points in high dimensional space. By using the square-root transformation the Bhattacharyya metric requires no such standardization and by its multiplicative nature has no singularity problems (unlike those caused by the denominator of the chi- squared statistic) with zero count-data. (English) |
Keyword:
|
chi-square statistic |
MSC:
|
62B10 |
MSC:
|
62G99 |
MSC:
|
62H99 |
MSC:
|
68T99 |
idZBL:
|
Zbl 1274.62058 |
idMR:
|
MR1658937 |
. |
Date available:
|
2009-09-24T19:17:09Z |
Last updated:
|
2015-03-28 |
Stable URL:
|
http://hdl.handle.net/10338.dmlcz/135216 |
. |
Reference:
|
[1] Aherne F. J., Thacker N. A., Rockett P. I.: Optimal pairwise geometric histograms.In: Proc. 8th British Machine Vision Conf., Colchester 1997, pp. 480–490 |
Reference:
|
[2] Bhattacharyya A.: On a measure of divergence between two statistical populations defined by their probability distributions.Bull. Calcutta Math. Soc. 35 (1943), 99–110 Zbl 0063.00364, MR 0010358 |
Reference:
|
[3] Christensen R.: Linear Models for Multivariate Time Series and Spatial Data.Springer–Verlag, New York 1991 Zbl 0717.62079, MR 1081535 |
Reference:
|
[4] Fukanaga K.: Introduction to Statistical Pattern Recognition.Second edition. Academic Press, New York 1990 MR 1075415 |
Reference:
|
[5] Matusita K.: Decision rules based on distance for problems of fit, two samples and estimation.Ann. Math. Statist. 26 (1955), 631–641 MR 0073899, 10.1214/aoms/1177728422 |
Reference:
|
[6] Thacker N. A., Abraham, I., Courtney P. G.: Supervised learning extensions to the CLAM network.Neural Network J. 10 (1997), 2, 315–326 10.1016/S0893-6080(96)00074-3 |
. |