Epitranscriptome

Within the field of molecular biology, the epitranscriptome includes all the biochemical modifications of the RNA (the transcriptome) within a cell. In analogy to epigenetics that describes "functionally relevant changes to the genome that do not involve a change in the nucleotide sequence", epitranscriptomics involves all functionally relevant changes to the transcriptome that do not involve a change in the ribonucleotide sequence. Thus, the epitranscriptome can be defined as the ensemble of such functionally relevant changes.

There are several types of RNA modifications that impact gene expression. These modifications happen to many types of cellular RNA including, but not limited to, ribosomal RNA (rRNA), transfer RNA (tRNA), messenger RNA (mRNA), and small nuclear RNA (snRNA). The most common and well-understood mRNA modification at present is N⁶-Methyladenosine (m⁶A), which has been observed to occur an average of three times in every mRNA molecule.

Currently, work is focused on determining the types of and location of RNA modifications,[1] determining if these modification have function, and if so, what is their mechanism of action. Similar to the epigenome, the epitranscriptome has "writers" and "erasers" that mark RNA and "readers" that translate those marks into function. One function that has been elucidated involves the enzyme adenosine deaminase (ADAR), which acts on RNA. ADAR affects a series of cellular processes, including alternative splicing, microRNAs, the innate immune system, and leads to protein recoding especially for important receptors in the central nervous system.[2]

Chemical Modifications of RNA

N⁶-Methyladenosine (m⁶A)

m⁶A describes the methylation of the nitrogen at position 6 in the adenosine base within mRNA. Discovered in 1974,[3] m⁶A is the most abundant eukaryotic mRNA modification;[4] most mRNAs contain approximately three m⁶A residues.[5] However, some mRNA transcripts do not contain any m⁶A at all, while others may have 10 or more.[6] The term "epitranscriptome" was coined following transcriptome-wide mappings of m⁶A sites,[7][8] but does not necessarily exclude other post-transcriptional mRNA modifications. How, and in response to what stimulus, the cell endogeneously regulates the level of m⁶A methylation remains unclear at present. However, it is known that the levels of this epitranscriptional mark are dynamically altered during embryonic development. Moreover, environmental stimuli such as stress can also alter the levels of m⁶A.[9]

The m⁶A RNA methylomes of different eukaryotic organisms have two common characteristics. First of all, the mark is usually found in the R[G > A]m⁶AC[U>A>C>] or RRm⁶ACH sequence. Secondly, this mark is enriched in specific regions of the transcriptome; it is mostly found close to stop codons, in 3’-UTRs and in long internal exons.[5] Nevertheless, m⁶A levels vary between different RNAs within a cell and between different cell types of the same organism. The mechanisms controlling the addition of m⁶A to some types of RNA have been described, but others remain unknown.[9]

"Writers," "erasers," and "readers"

The terms, "writer", "eraser" and "reader" have been associated with RNA modification.[10] An "Eraser" is a category of enzymes that demethylate m⁶A. Proteins that recognize and bind to m⁶A are known as "readers". However, some of the mRNA modifications prevent the binding of some RNA binding proteins; these are called "anti-readers".[11] The "writers" and "erasers" of the m⁶A mark are mostly located in nuclear speckles (subnuclear structures enriched in pre-mRNA splicing factors), where mRNA is processed and stored.[11]

The m⁶A mark is added by a m⁶A methyltransferase complex post-transcriptionally. This "writer" complex is composed of METTL3, METTL14, Wilms tumor 1-associated protein (WTAP), KIAA1429 and RBM15.[12] METTL3 is the catalytic subunit, whereas METTL14 is involved in the stability of the complex and RNA recruitment. WTAP is also needed in aiding the recruitment of mRNA, whereas RBM15 and its paralog RBM15B are only involved in the recruitment of lncRNAs. The role RBM15 and RBM15B may have in recruiting other types of RNA to the methyltransferase complex remains unknown.[13] The specific recognition sites of the writers are not known, but the minimal sequence required is 5’-Rm⁶AC-3’.[6] METTL3 has been proposed to also be a "reader" of the m⁶A mark. This function is localized in the cytoplasm, where it promotes the recruitment of eIF3.[11] Discovery of the METTL3 complex proved that m⁶A is a reversible mark, and this fact was crucial for the development of the field of epitranscriptomics.[14]

Members of the YTH domain protein family act as "readers" of m⁶A. The study of these proteins has been key in understanding the functions and effects of mRNA methylations.[12] It has been shown that three members of the human YTH domain family of proteins have higher binding affinities to methylated mRNA.[15] The YTH protein YTHDF2 affects mRNA by directing methylated mRNA from the translational pool to mRNA decay sites. As a result, methylated mRNA has a shorter half-life than unmethylated mRNA.

So far, two "erasers" of the m⁶A mark have been identified. ALKBH5 is a demethylase found in mammals that removes the methyl group of m⁶A.[12] The second one is the fat mass and obesity associated protein (FTO), a demethylase that converts m⁶A back to adenosine. FTO preferentially demethylates the m⁶A found closer to the mRNA cap.[12] This oxidative process has three steps and two intermediates: N⁶-hydroxymethyladenosine (hm⁶A) and N⁶-formyladenosine (f⁶A). FTO is most commonly found in nuclear speckles; however, in some species low levels of FTO can also be found in the cytoplasm.[5] Dysfunctional FTO correlates with alterations in body weight and disease, while Alkbh5 knockout mice have impaired fertility.[15] These two facts reflect how important the proper regulation of the m⁶A modification is for normal body function. Moreover, mutations in FTO can lead to developmental failures, brain atrophy and physiological disorders in adulthood.[9]

Role in the life-cycle of mRNA

mRNA methylation is important throughout the entire life-cycle of the mRNA, starting with the alternative polyadenylation (APA) of some transcripts. m⁶A sites are often located in the last exon, mostly in the 3’-UTR. The presence of m⁶A in the 3’-UTR promotes the use of the proximal APA site, resulting in a shorter 3’-UTR. Splicing of the pre-mRNA transcripts is also greatly affected by m⁶A. Furthermore, nuclear export of mature mRNAs depends on m⁶A; when the m⁶A "writers" are inhibited, there is a delay in the export of the mature mRNAs. However, normal nuclear export does not solely depend on m⁶A, other mRNA marks such as 5'-methylcytosine (m⁵C) are also involved.[11]

The m⁶A mark has a notable effect on translational dynamics.[16] There are various ways in which m⁶A is involved in translational efficiency. For instance, this modification modulates multiple steps in the process of tRNA incorporation. On the one hand, it slows down GTP hydrolysis by EF-Tu by 12-fold and the peptidyl transfer reaction by two-fold. It also causes a 1.5-fold increase in the amount of GTP hydrolyzed per peptidyl transfer, which indicates that a lot of proofreading is required. Moreover, because it is just a modified adenosine base, m⁶A base-pairs with uridine during decoding. However, the adenosine's methylation hinders tRNA accommodation and translation elongation. When a m⁶A-modified codon interacts with its cognate tRNA (the tRNA with the anticodon that is complementary to a particular codon), it acts more like a near-cognate codon interaction instead of the cognate codon interaction. This can be seen in the delay in the tRNA accommodation, which is dependent upon both the position of the m⁶A in the mRNA codons and on how accurate the translation is. Overall, this m⁶A modification leads to a kinetic loss of a factor of 18.[16] To summarize, translation-elongation dynamics are slower for codons with m⁶A and different locations of these modified nucleotides in the mRNA codons affect decoding dynamics in different ways.

However, this mark can also increase translational efficiency. The m⁶A "reader" YTHDF1 induces the association of the modified mRNA with the ribosome. Furthermore, it also recruits the translation initiation factor eIF3 to the mRNA independently of METTL3. Additionally, eIF3 also acts as a "reader" of a m⁶A located in the 5’-UTR of the mRNA, which results in recruitment of the 40S translational preinitiation complex.[14] This interaction is involved in cap-independent translation, which happens during the cellular response to heat shock stress.[11]

m⁶A methylation also modulates mRNA stability.[17] The "reader" YTHDF2 binds to m⁶A-containing mRNAs and decreases their stability by recruiting them to P-bodies, in a process called methylation-dependent mRNA decay.[11] This process is needed to rapidly degrade pluripotency transcription factor transcripts, to enable the commitment of a pluripotent stem cell to a specific cell lineage.[18] Reduced levels of m⁶A in mice embryos lead to embryonic lethality during the early stages of development.[9]

Role of N⁶-Methyladenosine (m⁶A) in alternative splicing

Exons of the pre-mRNA are shown in blue and introns (non-coding sequence) are in red. a) Alternative splicing involves the removal of introns from the pre-mRNA transcript. b) Adenine is methylated, forming the m⁶A modification. c) The modified m⁶A is located in a uridine rich RNA stem loop, reducing the stability of the loop and increasing the accessibility to the single strands. The HNRNPC protein (involved in pre-mRNA processing) can now bind to the more accessible uridine rich region on the loop (the HNRPNC binding site), leading to the excision of intron.

Stem loop structures can sometimes be found in introns. m⁶A residues located in these stem-loops weaken base-pairing interactions within the stem, thus altering the structure of the mRNA. This phenomenon is known as m⁶A-Switch.[12] The m⁶A mark has an important role in alternative splicing, since it increases the accessibility of hnRNPC to its binding site. The heterogeneous nuclear ribonucleoprotein C (hnRNPC) is a RNA-binding protein that complexes with both heterogeneous nuclear RNA (hnRNA) and pre-mRNA to participate in pre-mRNA processing. hnRNPC binds to a uridine-rich region in introns that can usually form stem-loops. The destabilization of the stem-loop exposes the hnRNPC binding site, which increases the accessibility of the protein to the region.[19] Because hnRNPC must be bound to pre-mRNA in order to fulfill its function, increased accessibility means higher activity of hnRNPC. Therefore, m⁶A residues located in stem-loops of introns enhance the activity of hnRNPC, which results in increased alternative splicing. Evidence supporting this claim identified that decreased m⁶A levels in the transcriptome lead to significantly reduced hnRNPC binding.[12]

m⁶A also has additional roles in alternative splicing by acting as the binding site for YTHDC1 (YTHDC1 binds to m⁶A residues located in alternative exons). YTHDC1 has a double role in alternative splicing. First of all, it recruits the serine and arginine-rich splicing factor 3 (SRSF3), which promotes exon inclusion. In addition, YTHDC1 blocks binding of SRSF10, a protein involved in exon-skipping.[12]

Due to the role of m⁶A in alternative splicing, pre-mRNAs have higher levels of m⁶A than mature mRNAs. Moreover, m⁶A is more abundant in mRNAs that undergo alternative splicing compared to genes that code a single isoform. This is because alternatively spliced mRNAs are enriched in METTL3 binding sites. Splicing is affected in Mettl3 knock-out mice, resulting in increased frequency of exon skipping and intron retention.[11] However, m⁶A is not a general unspecific splicing factor, it only participates in the alternative splicing of certain mRNAs and lncRNAs.[9]

Other roles of m⁶A

m⁶A is not only found on mRNAs, various non-coding RNAs also contain this mark. For instance, XIST, the lncRNA that initiates X-inactivation, is enriched in m⁶A. These m⁶A are recognized and bound by the YTH domain protein YTHDC1. XIST mediated silencing of the X chromosome is negatively affected when XIST is not modified with m⁶A.[11]

RNA molecules containing m⁶A are involved in UV-induced DNA damage repair mechanisms. When DNA is damaged, poly(A)+ transcripts containing numerous m⁶A residues accumulate in the region. This facilitates the accessibility of DNA-repairing proteins, such as DNA polymerase K, so that they can fulfil their function.[11]

Disease

Alterations in the pathways leading to the addition of the removal of the m⁶A mark result in impaired gene expression and cellular function, which can lead to disease.

Normal m⁶A levels are altered in a number of cancers. Reduced m⁶A levels due to down regulation of METTL3 and/or METTL14 lead to the activation of a number of oncogenes, such as the gene encoding ADAM metallopeptidase domain 19 (ADAM19). Moreover, loss of m⁶A also results in the down regulation of tumor suppressors like cyclin-dependent kinase inhibitor 2A (CDKN2A) and breast cancer 2 (BRCA2). On the other hand, increased m⁶A levels inhibit tumor progression in certain types of cancer.[14] In addition, single nucleotide polymorphisms (SNPs) on the gene encoding FTO have been associated with increased risk of breast and pancreatic cancer. Altered m⁶A levels also contribute to hypoxia-induced enrichment of breast cancer stem cells phenotype.[20] All things considered, "writers" and "erasers" of the m⁶A mark may be good potential drug targets in cancer therapy.

Metabolic disorders are also affected by the m⁶A mark due to the role of FTO. Overexpression of FTO results in increased body and fat mass, whereas loss of FTO leads to a reduction in lean body mass. However, the mechanisms by which changes in FTO expression affect body and fat mass are not understood.[14]

N1-methyladenosine (m¹A)

N1-methyladenosine is a modified nucleoside in which a methyl group is added to N1 of the adenosine base. This modification introduces a positive charge on the nitrogen atom to which the methyl group is added, because the modified nitrogen donates its lone pair to the carbon atom of the methyl group in order to form a bond. N1-methyladenosine modification is thought to regulate tRNA and rRNA stability, as well as potentially alter protein-RNA interactions or RNA secondary structures. This modification results in the melting of double-stranded RNA, due to alterations in the RNA structure. The N1-methyladenosine modification is less common than the m⁶A modification, with modified transcripts usually only containing a single m¹A modification, whereas they may contain several m⁶A residues.[21]

Studies of these modifications have been slow to advance due to a lack of sound methodology to locate and identify them. A few methods, such as MeRIP-seq and m¹A-ID-seq, have been developed, but the particular adenosine that is modified still cannot be identified. A computational tool based on the data generated from these methods called RAMPed has been developed to try to identify these particular modifications.[22]

5-methylcytosine (m⁵C)

5-methylcytosine, commonly abbreviated as "m⁵C", is a chemical modification first identified in tRNA. Since its initial identification, 5-methylcytosine has been found in a variety of different cellular structures ranging from a variety of RNAs and even DNA. Two different kinds of RNA m⁵C "writers" have been identified: NOP2/SUN RNA methyltransferase (NSUN) and DNA methyltransferase-2. It is important to note that DNMT-2 is a protein that falls under the DNMT family, which contains three other DNMTs (1, 3a, and 3b) known to demonstrate methylation activity in relation to the genome. Uniquely, DNMT-2 is the only DNMT that has been confirmed to methylate both DNA and RNA, although its overall DNA methylation function is significantly less than that of its counterparts.[23] While these writers have been identified, as of now, there are no known m⁵C "erasers"; in a broader sense, this means that reamination, or the conversion of 5-methylcytosine back into cytosine, has not been observed in RNA.[24] 5-methylcytosine modifications are typically found approximately 100 nucleotides downstream of translation initiation sites. This may provide some insight into the purpose of these modifications; for instance, this may indicate that these modifications are important for controlling the fate of the RNA, such as whether it will be translated or not in the case of mRNA. However, the exact purpose of the methylation at specific cytosines in RNA is currently unknown. One possibility may be that m⁵C may be associated with RNA transport, since the Aly/REF export factor is a known m⁵C binding protein.[24] On the other hand, m⁵C modifications could possibly be associated with the regulation of genes involved in energy and lipid metabolism, through modulation of the overall RNA translational fate.[15]

Adenosine-to-Inosine

Adenosine-to-Inosine (A-to-I) modifications were described well before the conception of epitranscriptomics. These modifications are very common in tissues and cells of the nervous system, and malfunctions in this deamination can result in a variety of different human diseases. A-to-I deamination has been shown to cause changes in the overall RNA structure or cause changes to the protein-coding mRNAs, although changes in codons and the amino acid they code for are not commonly seen.[25] A-to-I RNA editing is described in more detail on the RNA editing page.

Queuosine

The chemical structure of queuosine

Queuine (Q) is a modified nucleotide at position 34 in tRNA (queuosine is the name of the nucleoside, while queuine is the name of the nucleotide). Nucleotide modifications in tRNA are not uncommon, as tRNA is one of the most heavily modified types of RNA, and nearly 80 types of modified nucleotides have been identified. Queuosine is a very heavily modified version of guanosine (G).[26] Modifications in tRNA have the well-known ability to control and modulate gene expression. The regulation of gene expression typically comes from some structural changes to the stem-loop structure of the tRNA. The editing that tRNA undergoes may have developed as a response to rare codons, and tRNA counteracts frameshifts by utilizing the modified bases. Other similar modifications to nucleotides impact the ability of tRNA to initiate translation, thus impeding gene expression.[27]

This modification is particularly widespread and found amongst a variety of organisms, indicating that perhaps convergent evolution took place in the development of this nucleoside. Eukaryotic cells cannot synthesize queuosine, so they must rely on prokaryotes of the microbiome to produce and increase the availability of it within the body. Depleted levels of Q34 (queuine at position 34) are associated with the development of tumors.[28]

2′-O-methylation

2'-O-methylation refers to the methylation of the 2' hydroxyl group of the ribose within an RNA nucleotide.[29] 2'-O-methylation is found in the five-prime cap of mRNAs in higher eukaryotes.[30] It is involved in differentiating between self and non-self mRNA.[15][31] Without the 2′-O-methylation mark the immune system triggers higher levels of type 1 interferon activity.[30][15] While this modification is not currently known to be a response to any particular phenomenon, not everything is fully understood about the mechanisms of this modification due to the difficulty of studying small RNA molecules. However, the effect on RNA stability this modification has could be regulated to modulate transcript levels.

Pseudouridylation

Pseudouridine (Ψ, 5-ribosyluracil) is the most abundant RNA modification; in fact, at one time it was considered the "fifth nucleotide". This isomer of uridine is found in various types of RNA, such as snRNA, tRNA, small nucleolar RNA (snoRNA) and many others.[24] Pseudouridine increases the stability of the modified RNA by making the sugar-phosphate backbone more rigid and by facilitating base stacking interactions (pseudouridine contains an extra hydrogen bond donor). When it comes to Watson-Crick base pair interactions, the pseudouridine-adenosine base pair is more stable than the uridine-adenosine base pair; therefore, pseudouridine increases stability.[25] Apart from increasing RNA stability, this modification is also involved in regulation of translation. All eukaryotic stop codons contain one uridine (UAA, UGA and UGA); conversion of this uridine to pseudouridine results in suppression of translational termination and generation of unexpected sense codons.[24][25] The artificial process of pseudouridylation has an effect on the function of mRNA: it changes the genetic code by making non-canonical base pairing possible in the ribosome decoding center.[32]

Pseudouridylation reactions are catalyzed by enzymes that contain the pseudouridine synthase domain; 13 such enzymes have been identified in humans, which are called pseudouridine syntheses (PUS). These enzymes can be either RNA-dependent or RNA-independent depending on whether a small RNA is needed to guide the enzyme to its target or not. Additionally, different PUS enzymes work in different cell compartments. For instance, PUS4 (also known as TruB pseudouridine synthase family member 1, TRUB1) and PUS7, which are responsible for most of the mRNA pseudouridylation, are located in the nucleus or the cytoplasm. On the other hand, several PUS enzymes, such as PUS1 and TRUB2 are located in the mitochondria, modifying a number of mitochondrial mRNAs (mt-mRNAs).[24] In tRNA, PUS1 and PUS7 modify the second uridine in the UGUAR consensus sequence, as long as this sequence is located in a very structured region of the tRNA.[33]

To date, no pseudouridine erasers or readers have been identified. It is thought that pseudouridylation is most probably an irreversible process.[25]

Pseudouridine is most commonly found in tRNAs, with almost all tRNA molecules having at least one pseudouridine. Therefore, because the addition of pseudouridine happens during the normal processing of tRNA, it is not considered an epitranscriptomic mark. However, pseudouridine acts as an epigenetic mark in mRNAs and ncRNAs of the brain, since pseudouridylation in these two RNAs responds dynamically to stress and differentiation in the cell,[20] giving reason to believe that pseudouridylation may act as an important regulatory mechanism for RNA function.[34] Pseudouridylation in mRNA can be conserved, tissue-specific or inducible, which reflects plasticity and regulatory function.[25] Furthermore, expression of TRBU1, which is mostly expressed in the brain, goes up due to fear conditioning. In addition, expression of the ncRNAs needed to guide RNA-dependent PUS enzymes also goes up in response to fear.[20]

Pseudouridine detection and sequencing methods

There are three major techniques for the site-specific mapping of pseudouridine in RNA, called Pseudo-seq, Ψ-seq and PSI-seq. All these methods are based on the unique reaction between pseudouridine and N-cyclohexyl-N'-(2-morpholinoethyl)carbodiimide metho-p-toluenesulfonate (CMCT). The RNA to be analyzed is fragmented and incubated with CMCT. Even if CMCT can form covalent bonds with U, G and Ψ residues, only Ψ-CMC is resistant to alkaline hydrolysis (U-CMC and G-CMC get hydrolyzed). Next, reverse transcription is done to obtain a cDNA library, with the cDNAs terminating one nucleotide downstream the pseudouridine residue. Next generation sequencing of the cDNA library will indicate where the modified pseudouridine residue is located in the RNA. In order to do this, two cDNA libraries are prepared, one in which the RNA has undergone CMC treatment and the other one without CMC treatment. Differences in the length of the reads between the two libraries will indicate where the Ψ residues are.[24][33] Another method is called CeU-Seq, which uses a biotinylated derivative of CMCT. This enables the purification and enrichment of biotinylated transcripts (transcripts modified with pseudouridine) with streptavidin columns, therefore reducing the library size and increasing sensitivity.[25]

Other pseudouridine detection methods include site-specific cleavage and radioactive-labeling followed by ligation-assisted extraction and thin-layer chromatography (SCARLET) and mass spectrometry.

Modifications specific to different types of RNA

Ribosomal RNA (rRNA)

Ribosomal RNA, or rRNA, forms the nucleic acid component of ribosomes. rRNA modifications take place in and around the peptidyl transferasecenter, the active site of the ribosome. Some modifications include pseudouridines, 2′-O-methylations on backbone sugars, and methylated bases. It is not well known what the biological effects of these modifications are on the rRNA molecule, but one hypothesis is that they help stabilize the structure and enhance the function of the ribosome, especially during ribosome formation. Moreover, these modifications may alter the chemical properties of the rRNA such that the correct tertiary structure is favored. 2'-O-methylation prevents backbone hydrolysis; other noted modifications also seem to help with stabilizing rRNA secondary structures and preventing damage to rRNA strands. 2'-O-methylation also helps to increase base stacking forces, stabilizing the secondary and tertiary structure of rRNA even further. Collectively, these modifications in rRNA are indispensable to ribosomal function.[15]

Transfer RNA (tRNA)

A 3D model of the complex cruciform structure of tRNA

Transfer RNAs, which are RNAs that participate in translation, contain the greatest number of modifications of any type of RNA, with up to one-fourth of the nucleosides in these molecules containing some sort of modification in eukaryotes.[15] There are several known reasons for the wide variety of modifications found in tRNA. First of all, such modifications allow for easier differentiation between different tRNA molecules, such as separating the initiator tRNA^Met from elongator tRNA^Met.. Moreover, they increase overall tRNA stability. Some studies have shown that the modifications of tRNA can be dynamic and adaptive to the changes of the environment. Examples include methylation of cytosine groups by tRNA methyltransferase (Trm4) in response to the depletion of nutrients in the body. The tRNA's cruciform structure is incredibly important to its overall function and such a complicated structure is maintained by post-transcriptional modifications. A primary example of this is the methylation of guanosine at junctions within the tRNA structure. These methylguanosine impact the overall tertiary structure by disrupting any potential canonical hydrogen bonding (hydrogen bonds that are conventional Watson-Crick base pairs), thus creating a loop at the core of the tRNA. Other modifications are integral for creating and maintaining the extreme bends in the structure.[35]

Messenger RNA (mRNA)

Messenger RNA is the bridge between the genetic code and the resulting proteins, as it is what carries the necessary information that gets translated into proteins. Modifications to the actual, physical genetic code are likely to be deleterious; therefore, minor modifications, such as methylation, done to mRNA are preferable (nevertheless, modifications are still seen throughout the genome). The four major types of modifications done to mRNA are N7-methylguanine (at the 5′ cap), N⁶-methyladenosine, 5-methylcytosine, and 2′-O-methylation. The modification seen at the 5' cap perfectly demonstrates how modifications to mRNA can impact its function, as the 5' cap is necessary to initiate translation. Therefore, modifications, such as N7-methylguanine during RNA processing, to the 5' cap may effect the ability of the ribosome to initiate translation. It is important to note that not all modifications happening to the mRNA are epigenetic, some, like the N7-methylguanosine cap, are RNA editing.

mRNA molecules demonstrate something known as "modification stoichiometry". Modification stoichiometry is when only a portion of transcripts have a specific modification at a particular modification site. Typically, under normal cell conditions, the modification stoichiometry is very low, there are a very few number of transcripts that have specific modifications. However, as cell conditions change, the fraction of modified transcripts can change as well.[36] As with other types of RNA, modifications impact the overall structure of the mRNA. Altering its structure may cause the mRNA to take different paths. For example, a normal transcript might be fated to be translated; however, the introduction of a modified base can disrupt its structure and send it down a different path, and that particular transcript may now be targeted for degradation.[36]

Short non-coding RNA (sncRNA) modifications

Modifications can also happen in short non-coding RNAs, including small nuclear RNA (snRNA) and microRNA (miRNA).[37][15] However, these modifications are less common than those in mRNA, tRNA, and rRNA.[37]

Short nuclear RNA (snRNA)

Some trans-spliced snRNAs have been observed to have a N²,N²,7-trimethylguanosine cap.[37] This particular modification to the guanosine cap is rare in snRNAs. Trans-splicing is a phenomenon in which exons from two different primary RNA transcripts are ligated together.[38] These rare variants have been seen during development in C.elegans and are associated with polysomes.[37] How this modification is regulated in certain cell types and the exact function of this modification remains largely unknown, although it has been speculated that this modification may help define a special subset of trimethylguanosine-regulated RNAs.[37]

MicroRNA (miRNA)

Some miRNAs in plants have been seen to contain 2'-O-methylation, a modification to the ribose sugar that is added by the methyltransferase HEN1. This modification is thought to protect the miRNA against polyuridylation, which would result in its subsequent degradation.[15]

In addition, pri-miRNAs have been shown to contain m⁶A. This reversible modification may affect their cellular localization and function during miRNA processing.[15]

Long non-coding RNA (lncRNA)

The family of long non-coding RNAs includes a variety of different kinds of RNA, including, but not limited to, circular RNA (circRNA), nuclear lncRNA, long intergenic non-coding RNA, and enhancer RNA. The development of next-generation sequencing has made the study of lncRNA more accessible (because lncRNA is not very common in the cell relative to other types of RNA).

Editing and modifications to lncRNA have demonstrated to result in changes in RNA expression and rate of mutation.[39] 5-methylcytosine (m⁵C), N⁶-methyladenosine (m⁶A), and pseudouridine are the three most common and most studied modifications occurring in lncRNA.[24] Modifications to the nucleotide structure are likely to impact the structure of lncRNAs and modulate their overall function. The study of the reversibility of these modifications is an active area of research. These modifications impact a variety of different qualities including the lncRNA's function and the initiation of translation. Modifications to lncRNAs have been demonstrated to impact where they localize within the cell and while complicated structures, such as the crucifix of tRNA, are not typically found in lncRNA, modifications may alter their structure and impact the overall function and pathway the lncRNA takes.[15]

Viral epitranscriptomics

Viral epitranscriptomics is the field that studies RNA modifications in viral transcripts that do not affect the sequence of the transcript but that are functionally relevant. So far, the studies have been focused on viral transcripts of mammalian viruses. Mammalian viral transcripts must function in a mammalian cell, so they must acquire the same epigenetic marks as the host cell. For this, viruses make use of the numerous mRNA modifying enzymes found in the host cells.[33]

m6A in viral transcripts

The most widely described RNA modification in mammalian viruses is m⁶A, which was first identified in Influenza virus mRNAs, in 1976.[11] The epitranscriptomic analysis of viral transcripts has revealed that m⁶A levels in viral and cellular transcripts are similar. Nevertheless, in some viruses such as adenovirus-2, m⁶A levels are higher in viral mRNAs.[33] As with cellular RNAs, m⁶A is predominantly added in the nucleus by METTL3, with the assistance of several cofactors such as METTL14, WTAP, KIAA1429 and RBM15/RMB15B. A recent study demonstrates the presence of m⁶A in the small T antigen of Merkel cell polyomavirus (MCPyV) in Merkel cell carcinoma, a fatal skin cancer [40].

Studies of the viral m⁶A mark have mostly been conducted with HIV.[13] Despite the high mutagenic rate of this virus, m⁶A sites have been evolutionarily conserved. This is due to the fact that m⁶A is involved in regulating multiple stages in the HIV life-cycle. In addition to the normal functions m⁶A has in pre-mRNA splicing, nuclear export, mRNA stability and translation; this mark also inhibits the recognition of viral transcripts by Toll-like receptors and RIG-1 receptors. As a result, m⁶A positively influences viral replication.[6] On the other hand, HIV also regulates the addition of the m⁶A mark in a number of cellular mRNAs. For instance, 56 cellular transcripts that only contain m⁶A during HIV infection have been identified. The effect this mark has on cellular transcripts during the course of the viral infection remains unknown.[33]

Even if m⁶A-marked viral transcripts are involved in regulating gene expression of a number of different viruses, the mechanisms by which this happens have not been identified. To date, three possible models have been proposed.[13]

Although METTL3 and METTL14 are mostly localized in the nucleus, they can also be found in the cytoplasm, where they methylate the genomes and transcripts of cytoplasmic RNA viruses. As opposed to nuclear viruses, loss of m⁶A on hepatitis C virus (HCV, a cytoplasmic RNA virus) increases the production of infectious HCV virions, which indicates that in this particular virus the m⁶A mark has a negative effect on virus production. Nevertheless, in other cytoplasmic RNA viruses such as dengue virus and yellow fever virus, m⁶A sites have been selected for during evolution, suggesting that the m⁶A mark is beneficial for these viruses.[6]

Since m⁶A enhances viral replication, m⁶A can be used as a target for antiviral therapy. The major challenge is to target this mark in viral transcripts without causing major effects to the host cells, as normally occurring cellular m⁶A marks will also be depleted. The S-adenosylhomocysteine (SAC) hydrolase inhibitor 3-dezaadenosine (DAA) can be used as an antiviral drug, because it inhibits the addition of m⁶A.[6] However, it is yet to be determined whether this drug has any off-target effects.[13]

N⁶,2-O-dimethyladenosine (m⁶A_m)

Other viral transcript modifications

m⁶A is not the only RNA modification that can be found in viral RNAs. For instance, N⁶,2-O-dimethyladenosine (m⁶A_m) can be found in influenza and herpes simplex virus type 1, even though the effect this mark has on the life cycle of these viruses remains unknown. Another modification commonly found in coronaviruses, flaviviruses and poxviruses (all of them are cytoplasmic viruses) is the 2'-O-methylation of ribose moieties. The addition of this mark is catalyzed by a viral methyltransferase. 2'-O-methylation binds to and inhibits Toll-like receptor 7 (TLR-7), which is involved in activating the production of inflammatory cytokines. Moreover, this modification enables viral RNAs to evade the antiviral actions of the IFIT proteins, a family of interferon-induced proteins that limit viral replication.[13]

MODOMICS

MODOMICS is a comprehensive database that contains information about RNA modifications. MODOMICS provides the following information: the chemical structure of the modified RNAs, the RNA modifying pathways, the location of the modifications in the RNA sequences, the enzymes responsible for the modifications and liquid chromatography/mass spectrometry(LC/MS) data of the modified RNAs. As of November 2017, the database contained 163 different RNA modifications, as well as 340 different enzymes and cofactors involved in the modifications. This database classifies RNA modifying pathways according to their starting point. The LC/MS data has been very useful in determining the specific mass of the modified RNAs, which facilitates the identification of the modification.[41]

gollark: Not even RDTSC or whatever?

gollark: But what about the secret hardware apiobeeoids embedded in RDRAND?

gollark: Oh, you're assemblicating?

gollark: `grep rdrand /proc/cpuinfo`

gollark: Oh, it has entropy from other sources even without that.

References

Ross R, Cao X, Yu N, Limbach PA (2016). "Sequence mapping of transfer RNA chemical modifications by liquid chromatography tandem mass spectrometry". Methods. 107 (107): 73–78. doi:10.1016/j.ymeth.2016.03.016. PMC 5014671. PMID 27033178.
Tajaddod M, Jantsch MF, Licht K (March 2016). "The dynamic epitranscriptome: A to I editing modulates genetic information". Chromosoma. 125 (1): 51–63. doi:10.1007/s00412-015-0526-9. PMC 4761006. PMID 26148686.
Desrosiers R, Friderici K, Rottman F (October 1974). "Identification of methylated nucleosides in messenger RNA from Novikoff hepatoma cells". Proceedings of the National Academy of Sciences of the United States of America. 71 (10): 3971–5. Bibcode:1974PNAS...71.3971D. doi:10.1073/pnas.71.10.3971. PMC 434308. PMID 4372599.
Bokar JA (2005). "The biosynthesis and functional roles of methylated nucleosides in eukaryotic mRNA". Fine-Tuning of RNA Functions by Modification and Editing. Topics in Current Genetics. 12. Springer, Berlin, Heidelberg. pp. 141–177. doi:10.1007/b106365. ISBN 9783540244950.
Yue Y, Liu J, He C (July 2015). "RNA N6-methyladenosine methylation in post-transcriptional gene expression regulation". Genes & Development. 29 (13): 1343–55. doi:10.1101/gad.262766.115. PMC 4511210. PMID 26159994.
Kennedy EM, Courtney DG, Tsai K, Cullen BR (May 2017). "Viral Epitranscriptomics". Journal of Virology. 91 (9): e02263–16. doi:10.1128/jvi.02263-16. PMC 5391447. PMID 28250115.
Meyer KD, Saletore Y, Zumbo P, Elemento O, Mason CE, Jaffrey SR (June 2012). "Comprehensive analysis of mRNA methylation reveals enrichment in 3' UTRs and near stop codons". Cell. 149 (7): 1635–46. doi:10.1016/j.cell.2012.05.003. PMC 3383396. PMID 22608085.
Dominissini D, Moshitch-Moshkovitz S, Schwartz S, Salmon-Divon M, Ungar L, Osenberg S, Cesarkas K, Jacob-Hirsch J, Amariglio N, Kupiec M, Sorek R, Rechavi G (April 2012). "Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq". Nature. 485 (7397): 201–6. Bibcode:2012Natur.485..201D. doi:10.1038/nature11112. PMID 22575960.
Noack F, Calegari F (2018). "Epitranscriptomics: A New Regulatory Mechanism of Brain Development and Function". Frontiers in Neuroscience. 12: 85. doi:10.3389/fnins.2018.00085. PMC 5826231. PMID 29515357.
Licht K, Jantsch MF (April 2016). "Rapid and dynamic transcriptome regulation by RNA editing and RNA modifications". The Journal of Cell Biology. 213 (1): 15–22. doi:10.1083/jcb.201511041. PMC 4828693. PMID 27044895.
Peer E, Rechavi G, Dominissini D (December 2017). "Epitranscriptomics: regulation of mRNA metabolism through modifications". Current Opinion in Chemical Biology. 41: 93–98. doi:10.1016/j.cbpa.2017.10.008. PMID 29125941.
Roignant JY, Soller M (June 2017). "6A in mRNA: An Ancient Mechanism for Fine-Tuning Gene Expression" (PDF). Trends in Genetics. 33 (6): 380–390. doi:10.1016/j.tig.2017.04.003. PMID 28499622.
Gonzales-van Horn SR, Sarnow P (June 2017). "Making the Mark: The Role of Adenosine Modifications in the Life Cycle of RNA Viruses". Cell Host & Microbe. 21 (6): 661–669. doi:10.1016/j.chom.2017.05.008. PMC 5555051. PMID 28618265.
Batista PJ (June 2017). "6-methyladenosine and Its Implications in Human Disease". Genomics, Proteomics & Bioinformatics. 15 (3): 154–163. doi:10.1016/j.gpb.2017.03.002. PMC 5487527. PMID 28533023.
Wang X, He C (October 2014). "Dynamic RNA modifications in posttranscriptional regulation". Molecular Cell. 56 (1): 5–12. doi:10.1016/j.molcel.2014.09.001. PMC 7129666. PMID 25280100.
Choi J, Ieong KW, Demirci H, Chen J, Petrov A, Prabhakar A, O'Leary SE, Dominissini D, Rechavi G, Soltis SM, Ehrenberg M, Puglisi JD (February 2016). "N(6)-methyladenosine in mRNA disrupts tRNA selection and translation-elongation dynamics". Nature Structural & Molecular Biology. 23 (2): 110–5. doi:10.1038/nsmb.3148. PMC 4826618. PMID 26751643.
Wang X, Lu Z, Gomez A, Hon GC, Yue Y, Han D, Fu Y, Parisien M, Dai Q, Jia G, Ren B, Pan T, He C (January 2014). "N6-methyladenosine-dependent regulation of messenger RNA stability". Nature. 505 (7481): 117–20. Bibcode:2014Natur.505..117W. doi:10.1038/nature12730. PMC 3877715. PMID 24284625.
Zhao BS, He C (February 2015). "Fate by RNA methylation: m6A steers stem cell pluripotency". Genome Biology. 16: 43. doi:10.1186/s13059-015-0609-1. PMC 4336730. PMID 25723450.
Liu N, Zhou KI, Parisien M, Dai Q, Diatchenko L, Pan T (June 2017). "N6-methyladenosine alters RNA structure to regulate binding of a low-complexity protein". Nucleic Acids Research. 45 (10): 6051–6063. doi:10.1093/nar/gkx141. PMC 5449601. PMID 28334903.
Leighton LJ, Ke K, Zajaczkowski EL, Edmunds J, Spitale RC, Bredy TW (March 2018). "Experience-dependent neural plasticity, learning, and memory in the era of epitranscriptomics". Genes, Brain, and Behavior. 17 (3): e12426. doi:10.1111/gbb.12426. PMC 5858957. PMID 28926184.
Dominissini D, Nachtergaele S, Moshitch-Moshkovitz S, Peer E, Kol N, Ben-Haim MS, Dai Q, Di Segni A, Salmon-Divon M, Clark WC, Zheng G, Pan T, Solomon O, Eyal E, Hershkovitz V, Han D, Doré LC, Amariglio N, Rechavi G, He C (February 2016). "The dynamic N(1)-methyladenosine methylome in eukaryotic messenger RNA". Nature. 530 (7591): 441–6. Bibcode:2016Natur.530..441D. doi:10.1038/nature16998. PMC 4842015. PMID 26863196.
Chen W, Lin H (December 2016). "Recent Advances in Identification of RNA Modifications". Non-Coding RNA. 3 (1): 1. doi:10.3390/ncrna3010001. PMC 5831996. PMID 29657273.
Schaefer M, Lyko F (February 2010). "Solving the Dnmt2 enigma". Chromosoma. 119 (1): 35–40. doi:10.1007/s00412-009-0240-6. PMID 19730874.
Jacob R, Zander S, Gutschner T (November 2017). "The Dark Side of the Epitranscriptome: Chemical Modifications in Long Non-Coding RNAs". International Journal of Molecular Sciences. 18 (11): 2387. doi:10.3390/ijms18112387. PMC 5713356. PMID 29125541.
Shafik A, Schumann U, Evers M, Sibbritt T, Preiss T (January 2016). "The emerging epitranscriptomics of long noncoding RNAs". Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 1859 (1): 59–70. doi:10.1016/j.bbagrm.2015.10.019. PMID 26541084.
Morris RC, Elliott MS (2001-09-01). "Queuosine modification of tRNA: a case for convergent evolution". Molecular Genetics and Metabolism. 74 (1–2): 147–59. doi:10.1006/mgme.2001.3216. PMID 11592812.
Persson BC (June 1993). "Modification of tRNA as a regulatory device". Molecular Microbiology. 8 (6): 1011–6. doi:10.1111/j.1365-2958.1993.tb01645.x. PMID 7689685.
Gustilo EM, Vendeix FA, Agris PF (April 2008). "tRNA's modifications bring order to gene expression". Current Opinion in Microbiology. 11 (2): 134–40. doi:10.1016/j.mib.2008.02.003. PMC 2408636. PMID 18378185.
Kiss T (July 2001). "Small nucleolar RNA-guided post-transcriptional modification of cellular RNAs". The EMBO Journal. 20 (14): 3617–22. doi:10.1093/emboj/20.14.3617. PMC 125535. PMID 11447102.
Züst R, Cervantes-Barragan L, Habjan M, Maier R, Neuman BW, Ziebuhr J, Szretter KJ, Baker SC, Barchet W, Diamond MS, Siddell SG, Ludewig B, Thiel V (February 2011). "Ribose 2'-O-methylation provides a molecular signature for the distinction of self and non-self mRNA dependent on the RNA sensor Mda5". Nature Immunology. 12 (2): 137–43. doi:10.1038/ni.1979. PMC 3182538. PMID 21217758.
Daffis S, Szretter KJ, Schriewer J, Li J, Youn S, Errett J, Lin TY, Schneller S, Zust R, Dong H, Thiel V, Sen GC, Fensterl V, Klimstra WB, Pierson TC, Buller RM, Gale M, Shi PY, Diamond MS (November 2010). "2'-O methylation of the viral mRNA cap evades host restriction by IFIT family members". Nature. 468 (7322): 452–6. Bibcode:2010Natur.468..452D. doi:10.1038/nature09489. PMC 3058805. PMID 21085181.
Meier UT (January 2011). "Pseudouridylation goes regulatory". The EMBO Journal. 30 (1): 3–4. doi:10.1038/emboj.2010.323. PMC 3020123. PMID 21206510.
Pereira-Montecinos C, Valiente-Echeverría F, Soto-Rifo R (April 2017). "Epitranscriptomic regulation of viral replication". Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 1860 (4): 460–471. doi:10.1016/j.bbagrm.2017.02.002. PMID 28219769.
Zaringhalam M, Papavasiliou FN (September 2016). "Pseudouridylation meets next-generation sequencing". Methods. 107: 63–72. doi:10.1016/j.ymeth.2016.03.001. PMID 26968262.
Väre VY, Eruysal ER, Narendran A, Sarachan KL, Agris PF (March 2017). "Chemical and Conformational Diversity of Modified Nucleosides Affects tRNA Structure and Function". Biomolecules. 7 (1): 29. doi:10.3390/biom7010029. PMC 5372741. PMID 28300792.
Lewis CJ, Pan T, Kalsotra A (March 2017). "RNA modifications and structures cooperate to guide RNA-protein interactions". Nature Reviews. Molecular Cell Biology. 18 (3): 202–210. doi:10.1038/nrm.2016.163. PMC 5542016. PMID 28144031.
Li S, Mason CE (2014). "The pivotal regulatory landscape of RNA modifications". Annual Review of Genomics and Human Genetics. 15 (1): 127–50. doi:10.1146/annurev-genom-090413-025405. PMID 24898039.
Bruzik JP, Van Doren K, Hirsh D, Steitz JA (October 1988). "Trans splicing involves a novel form of small nuclear ribonucleoprotein particles". Nature. 335 (6190): 559–62. Bibcode:1988Natur.335..559B. doi:10.1038/335559a0. PMID 2971142.
Wilusz JE, Sunwoo H, Spector DL (July 2009). "Long noncoding RNAs: functional surprises from the RNA world". Genes & Development. 23 (13): 1494–504. doi:10.1101/gad.1800909. PMC 3152381. PMID 19571179.
Orouji, Elias; Wiebke K. Peitsch; Azadeh Orouji; Roland Houben; Jochen Utikal (Jan 2020). "Oncogenic Role of an Epigenetic Reader of m⁶A RNA Modification: YTHDF1 in Merkel Cell Carcinoma". Cancers. 12 (1): 202. doi:10.3390/cancers12010202. PMC 7016651. PMID 31947544.
Boccaletto P, Machnicka MA, Purta E, Piatkowski P, Baginski B, Wirecki TK, de Crécy-Lagard V, Ross R, Limbach PA, Kotter A, Helm M, Bujnicki JM (January 2018). "MODOMICS: a database of RNA modification pathways. 2017 update". Nucleic Acids Research. 46 (D1): D303–D307. doi:10.1093/nar/gkx1030. PMC 5753262. PMID 29106616.

Epitranscriptome

Chemical Modifications of RNA

N⁶-Methyladenosine (m⁶A)

"Writers," "erasers," and "readers"

Role in the life-cycle of mRNA

Role of N⁶-Methyladenosine (m⁶A) in alternative splicing

Other roles of m⁶A

Disease

N1-methyladenosine (m¹A)

5-methylcytosine (m⁵C)

Adenosine-to-Inosine

Queuosine

2′-O-methylation

Pseudouridylation

Pseudouridine detection and sequencing methods

Modifications specific to different types of RNA

Ribosomal RNA (rRNA)

Transfer RNA (tRNA)

Messenger RNA (mRNA)

Short non-coding RNA (sncRNA) modifications

Short nuclear RNA (snRNA)

MicroRNA (miRNA)

Long non-coding RNA (lncRNA)

Viral epitranscriptomics

m6A in viral transcripts

Other viral transcript modifications

MODOMICS

See also

References

Further reading

Epitranscriptome

Chemical Modifications of RNA

N6-Methyladenosine (m6A)

"Writers," "erasers," and "readers"

Role in the life-cycle of mRNA

Role of N6-Methyladenosine (m6A) in alternative splicing

Other roles of m6A

Disease

N1-methyladenosine (m1A)

5-methylcytosine (m5C)

Adenosine-to-Inosine

Queuosine

2′-O-methylation

Pseudouridylation

Pseudouridine detection and sequencing methods

Modifications specific to different types of RNA

Ribosomal RNA (rRNA)

Transfer RNA (tRNA)

Messenger RNA (mRNA)

Short non-coding RNA (sncRNA) modifications

Short nuclear RNA (snRNA)

MicroRNA (miRNA)

Long non-coding RNA (lncRNA)

Viral epitranscriptomics

m6A in viral transcripts

Other viral transcript modifications

MODOMICS

See also

References

Further reading

N⁶-Methyladenosine (m⁶A)

Role of N⁶-Methyladenosine (m⁶A) in alternative splicing

Other roles of m⁶A

N1-methyladenosine (m¹A)

5-methylcytosine (m⁵C)