Oligotyping (sequencing)

Oligotyping is the process of correcting DNA sequences obtained by DNA sequencing, using frequency data for related sequences across related samples.

History

DNA sequences were originally read from sequencing gels by eye. With the advent of computerized base callers, humans no longer 'called' the bases and instead 'corrected' the called bases. The software called the bases using the relative intensity of each putative base signal and the local spacing of the signals.

With the advent of high throughput sequencing, the volume of sequence to be corrected exceeded human capacity for sequence correction.

Use

Multiple applications require single-base-pair accuracy across populations of closely related sequences. An example is amplicon sequencing to assess the relative contribution of DNA from diverse organisms to a sample.

The requirement for single-base-pair accuracy led to the development of methods that draw on frequency data distributed across several samples to identify variant sequences that share the same frequency profile and are thus likely errors derived from the same original sequence.[1][2] The ability to use higher-order statistics to correct sequences is an important element in decreasing the burden of error in DNA sequence datasets.
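The idea of matching frequency profiles across samples can be illustrated with a minimal Python sketch. It is not the published oligotyping or distribution-based clustering algorithm; it is a simplified stand-in that flags a rare sequence as a likely error when it differs from a more abundant sequence by one base and its per-sample counts track that sequence closely. The function names, the thresholds (`min_corr`, `max_ratio`) and the example counts are all assumptions made for illustration.

```python
from math import sqrt


def hamming(a, b):
    """Count mismatched positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def pearson(u, v):
    """Pearson correlation of two equal-length count profiles (0.0 if either is constant)."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    cov = sum((x - mean_u) * (y - mean_v) for x, y in zip(u, v))
    sd_u = sqrt(sum((x - mean_u) ** 2 for x in u))
    sd_v = sqrt(sum((y - mean_v) ** 2 for y in v))
    if sd_u == 0 or sd_v == 0:
        return 0.0
    return cov / (sd_u * sd_v)


def likely_errors(counts, min_corr=0.95, max_ratio=0.1):
    """Map each suspected error sequence to its likely parent sequence.

    counts: dict of sequence -> list of read counts, one entry per sample.
    min_corr and max_ratio are illustrative thresholds, not published values.
    """
    # Visit sequences from most to least abundant so parents are seen first.
    ranked = sorted(counts, key=lambda s: sum(counts[s]), reverse=True)
    parents = {}
    for i, seq in enumerate(ranked):
        for cand in ranked[:i]:
            if cand in parents:
                continue  # a suspected error cannot itself act as a parent
            if len(cand) != len(seq) or hamming(cand, seq) != 1:
                continue  # only consider single-base differences
            if sum(counts[seq]) > max_ratio * sum(counts[cand]):
                continue  # too abundant to be a plausible error of cand
            if pearson(counts[seq], counts[cand]) >= min_corr:
                parents[seq] = cand
                break
    return parents


# Hypothetical read counts for three sequences across four samples.
counts = {
    "ACGTACGT": [1000, 500, 100, 10],  # abundant sequence
    "ACGTACGA": [10, 5, 1, 0],         # rare, tracks the abundant sequence
    "ACGTTCGT": [2, 20, 60, 50],       # rare, but distributed differently
}
print(likely_errors(counts))
```

In this made-up example both rare sequences are one base away from the abundant one and both fall under the abundance ratio, yet only `ACGTACGA` is flagged, because only its counts rise and fall with those of `ACGTACGT`. `ACGTTCGT` has a different distribution across the samples and is therefore kept as a candidate genuine variant.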

References

  1. Eren, A. Murat; Maignien, Loïs; Sul, Woo Jun; Murphy, Leslie G.; Grim, Sharon L.; Morrison, Hilary G.; Sogin, Mitchell L. (1 December 2013). "Oligotyping: differentiating between closely related microbial taxa using 16S rRNA gene data". Methods in Ecology and Evolution. 4 (12): 1111–1119. doi:10.1111/2041-210X.12114. ISSN 2041-210X. PMC 3864673. PMID 24358444.
  2. Preheim, Sarah P.; Perrotta, Allison R.; Martin-Platero, Antonio M.; Gupta, Anika; Alm, Eric J. (1 November 2013). "Distribution-Based Clustering: Using Ecology To Refine the Operational Taxonomic Unit". Applied and Environmental Microbiology. 79 (21): 6593–6603. doi:10.1128/AEM.00342-13. ISSN 0099-2240. PMC 3811501. PMID 23974136.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.