John D. Storey

John D. Storey is the William R. Harman '63 and Mary-Love Harman Professor in Genomics at Princeton University. His research is focused on statistical inference of high-dimensional data, particularly genomic data. Storey was the founding director of the Princeton University Center for Statistics and Machine Learning.[1]

John D. Storey
NationalityUnited States
Alma materStanford University Ph.D. (2002)
Known forQ-value
AwardsCOPSS Presidents' Award (2015)
Mortimer Spiegelman Award (2015)
Scientific career
FieldsStatistics
Statistical genetics
Genomics
InstitutionsPrinceton University
Doctoral advisorRobert Tibshirani
Doctoral studentsJeffrey T. Leek
Websitestoreylab.org

Research

Storey's early research focused on the false discovery rate. At the time the false discovery rate had only been studied in the context of sequential p-value methods and it was not yet in widespread use. However, Storey showed that false discovery rates can be approached through point estimation[2] opening up this very active branch of statistics to false discovery rates. He simultaneously proved a result showing that the positive false discovery rate (pFDR) is exactly equal to a Bayesian posterior probability, thereby providing the first direct connection between false discovery rates and Bayesian theory.[3] In these works, he also invented the q-value, which is a false discovery rate analogue of the p-value. Storey then introduced false discovery rates and q-values as widely applicable measures of statistical significance in genomics, shifting the focus from false positive control to false discovery rate control.[4] With Jeff Leek, Storey discovered that "expression heterogeneity", or unmodeled sources of systematic variation in gene expression data, are very prevalent and need to be modeled and corrected when analyzing genome-wide gene expression data.[5] Leek and Storey introduced "surrogate variable analysis", which is a high-dimensional regression model that includes both known and unknown covariates. He has developed a number of methods for estimating this model. Recently, Storey has shifted his focus to population genomics, where he has introduced genome-wide models of allele frequencies, Hardy–Weinberg equilibrium, and F-statistics that hold under arbitrary population structures.

Honors and awards

gollark: That is not it.
gollark: Please open that and show me the contents.
gollark: Oh?
gollark: They're not "super-hard", just VERY ANNOYING.
gollark: ```> print("RF/t:", math.random(0, 100000))RF/t: 84019> print("Heat:", math.random(0, 100000))Heat: 39438```Yes, a "tab".

References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.