Bayes error rate

In statistical classification, Bayes error rate is the lowest possible error rate for any classifier of a random outcome (into, for example, one of two categories) and is analogous to the irreducible error.[1][2]

A number of approaches to the estimation of the Bayes error rate exist. One method seeks to obtain analytical bounds which are inherently dependent on distribution parameters, and hence difficult to estimate. Another approach focuses on class densities, while yet another method combines and compares various classifiers.[2]

The Bayes error rate finds important use in the study of patterns and machine learning techniques.[3]

Error determination

In terms of machine learning and pattern classification, the labels of a set of random observations can be divided into 2 or more classes. Each observation is called an instance and the class it belongs to is the label. The Bayes error rate of the data distribution is the probability an instance is misclassified by a classifier that knows the true class probabilities given the predictors. For a multiclass classifier, the Bayes error rate may be calculated as follows:

p=1-\textstyle \sum _{C_{i}\neq C_{\text{max,x}}}\int \limits _{x\in H_{i}}P(C_{i}|x)p(x)\,dx

where x is an instance, C_i is a class into which an instance is classified, H_i is the area/region that a classifier function h classifies as C_i.

The Bayes error is non-zero if the classification labels are not deterministic, i.e., there is a non-zero probability of a given instance belonging to more than one class.

Proof of Minimality

Proof that the Bayes error rate is indeed the minimum possible and that the Bayes classifier is therefore optimal, may be found together on the Wikipedia page Bayes classifier.

gollark: I know all information, actually.

gollark: In most real-world cases.

gollark: Arguments based on definitions are wrong, see.

gollark: There's not really a very agreed-upon "definition" for most political things.

gollark: On many issues.

References

Fukunaga, Keinosuke (1990) Introduction to Statistical Pattern Recognition by ISBN 0122698517 pages 3 and 97
K. Tumer, K. (1996) "Estimating the Bayes error rate through classifier combining" in Proceedings of the 13th International Conference on Pattern Recognition, Volume 2, 695–699
Hastie, Trevor (2009). The Elements of Statistical Learning (2nd ed.). https://web.stanford.edu/~hastie/ElemStatLearn/: Springer. p. 21. ISBN 978-0387848570.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[stat-1] Fukunaga, Keinosuke (1990) Introduction to Statistical Pattern Recognition by ISBN 0122698517 pages 3 and 97

[Tumer-2] K. Tumer, K. (1996) "Estimating the Bayes error rate through classifier combining" in Proceedings of the 13th International Conference on Pattern Recognition, Volume 2, 695–699

[3] Hastie, Trevor (2009). The Elements of Statistical Learning (2nd ed.). https://web.stanford.edu/~hastie/ElemStatLearn/: Springer. p. 21. ISBN 978-0387848570.

Bayes error rate

Error determination

Proof of Minimality

See also

References