Repeated median regression

In robust statistics, repeated median regression, also known as the repeated median estimator, is a robust linear regression algorithm.

The estimator has a breakdown point of 50%.[1] Although it is equivariant under scaling, or under linear transformations of either its explanatory variable or its response variable, it is not under affine transformations that combine both variables.[1] It can be calculated in time by brute force, in time using more sophisticated techniques,[2] or in randomized expected time.[3] It may also be calculated using an on-line algorithm with update time.[4]

Method

The repeated median method estimates the slope of the regression line for a set of points as

where is defined as .[5]

The estimated Y-axis intercept is defined as

where is defined as .[5]

gollark: Well, yes, they're not amazing magic things which can immediately do anything if you plug them in, but they... can do bad things.
gollark: That is true, although my laptop doesn't support Thunderbolt so all I have to worry about is... electronic hardware attacks and rubberducky-type things?
gollark: > Like usb 3.1 supports direct memory access<@689232518125191253> Does it? I thought that was limited to Thunderbolt, which most devices do not actually support. There are meant to be mechanisms for limiting the access of Thunderbolt devices but they seem to be broken all the time and terrible anywya.
gollark: Sure you can, just X-ray-scan them before use.
gollark: I'm considering making potatOS use advanced rot13 encryption for logs and such.

See also

References

  1. Peter J. Rousseeuw, Nathan S. Netanyahu, and David M. Mount, "New Statistical and Computational Results on the Repeated Median Regression Estimator", in New Directions in Statistical Data Analysis and Robustness, edited by Stephan Morgenthaler, Elvezio Ronchetti, and Werner A. Stahel, Birkhauser Verlag, Basel, 1993, pp. 177-194.
  2. Stein, Andrew; Werman, Michael (1992). "Finding the repeated median regression line". Proceedings of the Third Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '92). Philadelphia, PA, USA: Society for Industrial and Applied Mathematics. pp. 409–413. ISBN 0-89791-466-X.
  3. Matoušek, J.; Mount, D. M.; Netanyahu, N. S. (1998), "Efficient randomized algorithms for the repeated median line estimator", Algorithmica, 20 (2): 136–150, doi:10.1007/PL00009190, MR 1484533
  4. Bernholt, Thorsten; Fried, Roland (2003). "Computing the update of the repeated median regression line in linear time". Information Processing Letters. 88 (3): 111–117. doi:10.1016/s0020-0190(03)00350-8. hdl:2003/5224.
  5. Siegel, Andrew (September 1980). "Technical Report No. 172, Series 2 By Department of Statistics Princeton University: Robust Regression Using Repeated Medians" (PDF). Retrieved 20 February 2018.


This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.