TALE-likes
Transcription Activator-Like Effector-Likes (TALE-likes) are a group of bacterial DNA binding proteins named for the first and still best-studied group, the TALEs of Xanthomonas bacteria. TALEs are important factors in the plant diseases caused by Xanthomonas, but are known primarily for their role in biotechnology as programmable DNA binding proteins, particularly in the context of TALE nucleases. TALE-likes have additionally been found in many strains of the Ralstonia solanacearum bacterial species complex, in Paraburkholderia rhizoxinica strain HKI 454, and in two unknown marine bacteria. Whether all these proteins form a single phylogenetic grouping is as yet unclear.
TAL effector repeat | |
---|---|
Image caption | Pfam sequence logo for its TALE-like entry, constructed from TALE, RipTAL, and BAT sequences. The repeat starts with the F/L; the RVD is the final N/H. |
Symbol | TAL_effector |
Pfam | PF03377 |
InterPro | IPR005042 |
The unifying feature of the TALE-likes is their tandem arrays of DNA binding repeats. These repeats are, with few exceptions, 33–35 amino acids in length. Each is composed of two alpha-helices on either side of a flexible loop that contains the DNA base binding residues, and neighbouring repeats are joined by flexible linker loops.[1] Evidence for this common structure comes in part from solved crystal structures of TALEs[2] and a Burkholderia TALE-like (BAT),[3] but also from the conservation of the code that all TALE-likes use to recognise DNA sequences. In fact, TALE, RipTAL, and BAT repeats can be mixed and matched to generate functional DNA-binding proteins with varying affinity.[4]
TALEs
TALEs are the first identified, best-studied and largest group within the TALE-likes. TALEs are found throughout the bacterial genus Xanthomonas,[5] which comprises mostly plant pathogens. Those TALEs that have been studied have all been shown to be secreted into host plant cells via the type III secretion system. Once inside the host cell they translocate to the nucleus, bind specific DNA sequences within host promoters and turn on downstream genes. Every part of this process is thought to be conserved across all TALEs. The single meaningful difference between individual TALEs, based on current understanding, is the specific DNA sequence that each TALE binds. TALEs from even closely related strains differ in the composition of repeats that make up their DNA binding domain.[6] Repeat composition determines DNA binding preference; in particular, position 13 of each repeat confers that repeat's DNA base preference. During early research it was noted that almost all the differences between repeats of a single TALE repeat array are found in positions 12 and 13, and this finding led to the hypothesis that these residues determine base preference.[7] Positions 12 and 13, referred to jointly as the Repeat Variable Diresidue (RVD), are still commonly said to confer base specificity, despite clear evidence that position 13 is the base-determining residue.[8] In addition to the repeat domain, TALEs possess a number of conserved features in the domains flanking the repeats. These include domains for type III secretion, nuclear localization and transcriptional activation. These domains allow TALEs to carry out their biological role as effector proteins secreted into host plant cells to activate expression of specific host genes.
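The relationship between repeat positions 12 and 13 and base preference can be sketched in a few lines of Python. This is a minimal illustration rather than a published tool. The RVD-to-base table holds only four widely reported associations (NI with A, HD with C, NG with T, NN with G), which are not listed in this article and are assumed here. The repeat sequences are illustrative 34-residue repeats that differ only at positions 12 and 13, and the names `rvd_of` and `predict_target` are hypothetical.

```python
from typing import List

# Assumed RVD-to-base associations (widely reported, but not given in this article).
RVD_TO_BASE = {"NI": "A", "HD": "C", "NG": "T", "NN": "G"}

# Illustrative repeat backbone; "{}" marks positions 12 and 13 (the RVD).
BACKBONE = "LTPEQVVAIAS{}GGKQALETVQRLLPVLCQAHG"


def rvd_of(repeat: str) -> str:
    """Return residues 12 and 13 (1-indexed) of a single repeat."""
    if len(repeat) < 13:
        raise ValueError("repeat is too short to contain an RVD")
    return repeat[11:13]


def predict_target(repeats: List[str]) -> str:
    """Predict one base per repeat; RVDs absent from the table become 'N'."""
    return "".join(RVD_TO_BASE.get(rvd_of(r), "N") for r in repeats)


# Build a four-repeat array that differs only in its RVDs.
repeats = [BACKBONE.format(rvd) for rvd in ("HD", "NI", "NG", "NN")]
print([rvd_of(r) for r in repeats])
print(predict_target(repeats))
```

Because only the two RVD residues change between repeats in this sketch, the predicted target is determined entirely by the order of RVDs in the array, which mirrors the observation that repeat composition determines binding preference.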
Diversity and evolution
Whilst the RVD positions are commonly the only variable positions within a single TALE repeat array, there are more differences when comparing repeat arrays of different TALEs. The diversity of TALEs across the Xanthomonas genus is considerable, but a particularly striking finding is that the evolutionary history one arrives at by comparing repeat compositions differs from that found when comparing non-repeat sequences.[6] Repeat arrays of TALEs are thought to evolve rapidly, with a number of recombinatorial processes suggested to shape repeat array evolution.[5] Recombination of TALE repeat arrays has been demonstrated in a forced-selection experiment.[9] This evolutionary dynamism is thought to be made possible by the very high sequence identity of TALE repeats, which is a unique feature of TALEs as opposed to other TALE-likes.
T-zero
Another unique feature of TALEs is a set of four repeat structures at the N-terminal flank of the core repeat array. These structures, termed non-canonical or degenerate repeats, have been shown to be vital for DNA binding,[10] though all but one make no contact with DNA bases and thus make no contribution to sequence preference. The one exception is repeat -1, which confers a fixed T-zero preference on all TALEs. This means that the target sequences of TALEs are always preceded by a thymine base. This is thought to be common to all TALEs, with the possible exception of TalC from Xanthomonas oryzae pv. oryzae strain AXO1947 (G1FM79).[11]
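The T-zero rule translates into a simple constraint when searching a DNA sequence for candidate TALE target sites: a match to the bases specified by the core repeats only counts if the base immediately upstream is thymine. The following Python sketch illustrates this under simplifying assumptions. It uses exact string matching on one strand, whereas real binding tolerates some mismatches, and the function name `find_tale_sites`, the core target and the promoter sequence are all hypothetical.

```python
def find_tale_sites(sequence: str, core_target: str, zero_base: str = "T") -> list:
    """Return 0-based start indices of core_target matches preceded by zero_base."""
    sequence = sequence.upper()
    core_target = core_target.upper()
    hits = []
    start = sequence.find(core_target)
    while start != -1:
        # Position zero is the base directly 5' of the first core-repeat base.
        if start > 0 and sequence[start - 1] == zero_base:
            hits.append(start)
        start = sequence.find(core_target, start + 1)
    return hits


# Hypothetical promoter fragment containing the core target twice.
promoter = "GGCATGAATCATGCC"
print(find_tale_sites(promoter, "CATG"))
```

In this made-up fragment the core target occurs twice, but only the second occurrence is preceded by a thymine, so only that one is reported. The `zero_base` parameter is included so that the same check can be run with a different position-zero base.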
RipTALs
TAL effector protein Brg11 | |
---|---|
Organism | |
Symbol | brg11 |
UniProt | Q8XYE3 |
Discovery and molecular properties
It was noted in the 2002 publication of the genome of reference strain Ralstonia solanacearum GMI1000 that its genome encodes a protein similar to Xanthomonas TALEs.[12] Based on similar domain structure and repeat sequences, it was presumed that this gene and homologs in other Ralstonia strains would encode proteins with the same molecular properties as TALEs, including sequence-specific DNA binding. In 2013 this was confirmed by two studies.[13][14] These genes and the proteins they encode are referred to as RipTALs (Ralstonia injected protein TALE-like) in line with the standard nomenclature of Ralstonia effectors.[15] Whilst the DNA binding code of the core repeats is conserved with TALEs, RipTALs do not share the T-zero preference; instead they have a strict G-zero requirement.[13] In addition, repeats within a single RipTAL repeat array have multiple sequence differences beyond the RVD positions, unlike the near-identical repeats of TALEs.
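The contrast between near-identical repeats and more divergent repeats within one array can be quantified as pairwise sequence identity with the RVD positions masked. The Python sketch below shows one way to do this for repeats of equal length. The sequences are illustrative and are not taken from a particular TALE or RipTAL, and the helper names `identity_outside_rvd` and `mean_pairwise_identity` are hypothetical.

```python
from itertools import combinations

RVD_POSITIONS = (11, 12)  # 0-based indices of repeat positions 12 and 13


def identity_outside_rvd(a: str, b: str) -> float:
    """Fraction of identical residues between two equal-length repeats, ignoring the RVD."""
    if len(a) != len(b):
        raise ValueError("this sketch compares equal-length repeats only")
    positions = [i for i in range(len(a)) if i not in RVD_POSITIONS]
    matches = sum(a[i] == b[i] for i in positions)
    return matches / len(positions)


def mean_pairwise_identity(repeats) -> float:
    """Average identity over all pairs of repeats in one array."""
    pairs = list(combinations(repeats, 2))
    if not pairs:
        raise ValueError("need at least two repeats")
    return sum(identity_outside_rvd(a, b) for a, b in pairs) / len(pairs)


# Illustrative arrays: the first differs only at the RVD, the second elsewhere too.
uniform_array = ["LTPEQVVAIASHDGGKQALETVQRLLPVLCQAHG",
                 "LTPEQVVAIASNIGGKQALETVQRLLPVLCQAHG"]
diverse_array = ["LTPEQVVAIASHDGGKQALETVQRLLPVLCQAHG",
                 "LSPAQVVAIASNIGGRQALEAVKALLPVLRQAHG"]
print(mean_pairwise_identity(uniform_array))
print(mean_pairwise_identity(diverse_array))
```

Masking positions 12 and 13 keeps RVD differences, which are expected in any functional array, from lowering the score, so the result reflects only variation in the rest of the repeat.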
RipTALs have been found in all four phylotypes of R. solanacearum, making them an ancestral feature of this clade. Despite differences in their flanking domains, the sequences that their RVDs target are highly similar.[16]
Biological role
Several lines of evidence support the idea that RipTALs function as effector proteins, promoting bacterial growth or disease by manipulating the expression of plant genes. They are secreted into plant cells by the Type III secretion system, which is the main delivery system for effector proteins.[17] They localize to the cell nucleus and are able to function as sequence-specific transcription factors in plant cells.[13] In addition, a strain lacking its RipTAL was shown to grow more slowly inside eggplant leaf tissue than the wild type.[18] Furthermore, a study based on DNA polymorphisms in ripTAL repeat domain sequences and host plants found a statistically significant connection between host plant and repeat domain variants.[19] This is expected if the RipTALs of different strains are adapted to target genes in specific host plants. Despite this, as of June 2019 no target genes had been identified for any RipTAL.
BATs
Burkholderia TALE-like protein 1 | |
---|---|
Organism | |
Symbol | bat1 |
UniProt | E5AV36 |
Discovery
The publication of the genome of the bacterial strain Paraburkholderia rhizoxinica HKI 454 in 2011[20] led to the discovery of a set of TALE-like genes that differed considerably in nature from the TALEs and RipTALs. The proteins encoded by these genes were studied for their DNA binding properties by two groups independently and named the BATs (Burkholderia TALE-likes; E5AV36) or BurrH.[21][22] This research showed that the repeat units of the Burkholderia TALE-likes bind DNA with the same code as TALEs, governed by position 13 of each repeat. There are, however, a number of differences.
Biological role
Burkholderia TALE-likes are composed almost entirely of repeats, lacking the large non-repetitive domains found flanking the repeats in TALEs and RipTALs. Those domains are key to the functions of TALEs and RipTALs, allowing them to enter the plant nucleus and turn on gene expression. It is therefore currently unclear what the biological roles of Burkholderia TALE-likes are. What is clear is that they are not effector proteins secreted into plant cells to act as transcription factors, which is the biological role of TALEs and RipTALs. It is not unexpected that their biological roles may differ from those of TALEs and RipTALs, since the lifestyle of the bacterium they derive from is very unlike that of TALE- and RipTAL-bearing bacteria. B. rhizoxinica is an endosymbiont living inside a fungus, Rhizopus microsporus, which is a plant pathogen. The same fungus is also an opportunistic human pathogen in immunocompromised patients, but whereas B. rhizoxinica is necessary for pathogenicity on plant hosts, it is irrelevant to human infection.[23] It is unclear whether the Burkholderia TALE-likes are ever secreted into the fungus, let alone into host plants.
Uses in Biotechnology
As noted in the publications on Burkholderia TALE-likes, there may be some advantages over TALEs in using these proteins as a scaffold for programmable DNA-binding proteins that function as transcription factors or designer nucleases.[21][22] A Burkholderia TALE-like has been fused with a FokI nuclease, analogous to a TALEN.[3] Advantages include a shorter repeat size, a more compact domain structure (no large non-repeat domains), and greater repeat sequence diversity, which enables the use of PCR on the genes encoding them and makes them less vulnerable to recombinatorial repeat loss. In addition, Burkholderia TALE-likes have no T-zero requirement, relaxing the constraints on DNA target selection. However, few uses of Burkholderia TALE-likes as programmable DNA binding proteins have been published outside of the original characterization publications.
MOrTLs
Discovery
In 2007 the results of a metagenomic sweep of the world's oceans by the J. Craig Venter Institute were made publicly available.[24] A 2014 paper on Burkholderia TALE-likes[22] was also the first to report that two entries from that database resembled TALE-likes, based on sequence similarity. These were further characterized and assessed for their DNA-binding potential in 2015.[25] The repeat units encoded by these sequences were found to mediate DNA binding with base preferences matching the TALE code, and were judged likely to form structures nearly identical to Bat1 repeats, based on molecular dynamics simulations. The proteins encoded by these DNA sequences were therefore designated Marine Organism TALE-likes (MOrTLs) 1 and 2 (GenBank: ECG96325, EBN91409).[25] Similar sequences found in metagenomes include EBN19408 and ECR81667.[26]
Evolutionary relationship to other TALE-likes
Whilst the repeats of MOrTL1 and MOrTL2 both conform structurally and functionally to the TALE-like norm, they differ considerably at the sequence level both from all other TALE-likes and from one another. It is not known whether they are truly homologous to the other TALE-likes, and thus constitute a true protein family together with the TALEs, RipTALs and BATs. Alternatively, they may have evolved independently. It is particularly difficult to judge their relationship to the other TALE-likes because almost nothing is known of the organisms that MOrTL1 and MOrTL2 come from. It is known only that they were found in two separate sea-water samples from the Gulf of Mexico and are likely to be bacteria, based on size exclusion before DNA sequencing.[25]
Legal status
A patent for the use of BATs and marine TALE-likes in protein engineering was filed in July 2012. As of May 2019, it was pending in all jurisdictions.[27]
References
- Deng D, Yan C, Wu J, Pan X, Yan N (April 2014). "Revisiting the TALE repeat". Protein & Cell. 5 (4): 297–306. doi:10.1007/s13238-014-0035-2. PMC 3978159. PMID 24622844.
- Deng D, Yan C, Pan X, Mahfouz M, Wang J, Zhu JK, Shi Y, Yan N (February 2012). "Structural basis for sequence-specific recognition of DNA by TAL effectors". Science. 335 (6069): 720–3. Bibcode:2012Sci...335..720D. doi:10.1126/science.1215670. PMC 3586824. PMID 22223738.
- Stella S, Molina R, López-Méndez B, Juillerat A, Bertonati C, Daboussi F, Campos-Olivas R, Duchateau P, Montoya G (July 2014). "BuD, a helix-loop-helix DNA-binding domain for genome modification". Acta Crystallographica. Section D, Biological Crystallography. 70 (Pt 7): 2042–52. doi:10.1107/S1399004714011183. PMC 4089491. PMID 25004980.
- de Lange O, Schandry N, Wunderlich M, Berendzen KW, Lahaye T (January 2017). "Exploiting the sequence diversity of TALE-like repeats to vary the strength of dTALE-promoter interactions". Synthetic Biology. 2 (1). doi:10.1093/synbio/ysx004.
- Ferreira RM, de Oliveira AC, Moreira LM, Belasque J, Gourbeyre E, Siguier P, Ferro MI, Ferro JA, Chandler M, Varani AM (February 2015). "A TALE of transposition: Tn3-like transposons play a major role in the spread of pathogenicity determinants of Xanthomonas citri and other xanthomonads". mBio. 6 (1): e02505-14. doi:10.1128/mBio.02505-14. PMC 4337579. PMID 25691597.
- Pérez-Quintero AL, Lamy L, Gordon JL, Escalon A, Cunnac S, Szurek B, Gagnevin L (3 August 2015). "QueTAL: a suite of tools to classify and compare TAL effectors functionally and phylogenetically". Frontiers in Plant Science. 6: 545. doi:10.3389/fpls.2015.00545. PMC 4522561. PMID 26284082.
- Boch J, Schornack S (2010). "Unraveling a 20-Year Enigma" (PDF). IS-MPMI Reporter (1): 3–4.
- de Lange O, Binder A, Lahaye T (June 2014). "From dead leaf, to new life: TAL effectors as tools for synthetic biology". The Plant Journal. 78 (5): 753–71. doi:10.1111/tpj.12431. PMID 24602153.
- Yang B, Sugio A, White FF (February 2005). "Avoidance of host recognition by alterations in the repetitive and C-terminal regions of AvrXa7, a type III effector of Xanthomonas oryzae pv. oryzae". Molecular Plant-Microbe Interactions. 18 (2): 142–9. doi:10.1094/MPMI-18-0142. PMID 15720083.
- Gao H, Wu X, Chai J, Han Z (December 2012). "Crystal structure of a TALE protein reveals an extended N-terminal DNA binding region". Cell Research. 22 (12): 1716–20. doi:10.1038/cr.2012.156. PMC 3515758. PMID 23147789.
- Yu Y, Streubel J, Balzergue S, Champion A, Boch J, Koebnik R, Feng J, Verdier V, Szurek B (September 2011). "Colonization of rice leaf blades by an African strain of Xanthomonas oryzae pv. oryzae depends on a new TAL effector that induces the rice nodulin-3 Os11N3 gene". Molecular Plant-Microbe Interactions. 24 (9): 1102–13. doi:10.1094/MPMI-11-10-0254. PMID 21679014.
- Salanoubat M, Genin S, Artiguenave F, Gouzy J, Mangenot S, Arlat M, Billault A, Brottier P, Camus JC, Cattolico L, Chandler M, Choisne N, Claudel-Renard C, Cunnac S, Demange N, Gaspin C, Lavie M, Moisan A, Robert C, Saurin W, Schiex T, Siguier P, Thébault P, Whalen M, Wincker P, Levy M, Weissenbach J, Boucher CA (January 2002). "Genome sequence of the plant pathogen Ralstonia solanacearum". Nature. 415 (6871): 497–502. doi:10.1038/415497a. PMID 11823852.
- de Lange O, Schreiber T, Schandry N, Radeck J, Braun KH, Koszinowski J, Heuer H, Strauß A, Lahaye T (August 2013). "Breaking the DNA-binding code of Ralstonia solanacearum TAL effectors provides new possibilities to generate plant resistance genes against bacterial wilt disease". The New Phytologist. 199 (3): 773–86. doi:10.1111/nph.12324. PMID 23692030.
- Li L, Atef A, Piatek A, Ali Z, Piatek M, Aouida M, Sharakuu A, Mahjoub A, Wang G, Khan S, Fedoroff NV, Zhu JK, Mahfouz MM (July 2013). "Characterization and DNA-binding specificities of Ralstonia TAL-like effectors". Molecular Plant. 6 (4): 1318–30. doi:10.1093/mp/sst006. PMC 3716395. PMID 23300258.
- Peeters N, Carrère S, Anisimova M, Plener L, Cazalé AC, Genin S (December 2013). "Repertoire, unified nomenclature and evolution of the Type III effector gene set in the Ralstonia solanacearum species complex". BMC Genomics. 14 (1): 859. doi:10.1186/1471-2164-14-859. PMC 3878972. PMID 24314259.
- Schandry N, de Lange O, Prior P, Lahaye T (17 August 2016). "TALE-Like Effectors Are an Ancestral Feature of the Ralstonia solanacearum Species Complex and Converge in DNA Targeting Specificity". Frontiers in Plant Science. 7: 1225. doi:10.3389/fpls.2016.01225. PMC 4987410. PMID 27582755.
- Mukaihara T, Tamura N, Iwabuchi M (March 2010). "Genome-wide identification of a large repertoire of Ralstonia solanacearum type III effector proteins by a new functional screen". Molecular Plant-Microbe Interactions. 23 (3): 251–62. doi:10.1094/mpmi-23-3-0251. PMID 20121447.
- Macho AP, Guidot A, Barberis P, Beuzón CR, Genin S (September 2010). "A competitive index assay identifies several Ralstonia solanacearum type III effector mutant strains with reduced fitness in host plants". Molecular Plant-Microbe Interactions. 23 (9): 1197–205. doi:10.1094/MPMI-23-9-1197. PMID 20687809.
- Heuer H, Yin YN, Xue QY, Smalla K, Guo JH (July 2007). "Repeat domain diversity of avrBs3-like genes in Ralstonia solanacearum strains and association with host preferences in the field". Applied and Environmental Microbiology. 73 (13): 4379–84. doi:10.1128/AEM.00367-07. PMC 1932761. PMID 17468277.
- Lackner G, Moebius N, Partida-Martinez LP, Boland S, Hertweck C (May 2011). "Evolution of an endofungal lifestyle: Deductions from the Burkholderia rhizoxinica genome". BMC Genomics. 12 (1): 210. doi:10.1186/1471-2164-12-210. PMC 3102044. PMID 21539752.
- de Lange O, Wolf C, Dietze J, Elsaesser J, Morbitzer R, Lahaye T (June 2014). "Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain". Nucleic Acids Research. 42 (11): 7436–49. doi:10.1093/nar/gku329. PMC 4066763. PMID 24792163.
- Juillerat A, Bertonati C, Dubois G, Guyot V, Thomas S, Valton J, Beurdeley M, Silva GH, Daboussi F, Duchateau P (January 2014). "BurrH: a new modular DNA binding protein for genome engineering". Scientific Reports. 4: 3831. Bibcode:2014NatSR...4E3831J. doi:10.1038/srep03831. PMC 5379180. PMID 24452192.
- Partida-Martinez LP, Bandemer S, Rüchel R, Dannaoui E, Hertweck C (May 2008). "Lack of evidence of endosymbiotic toxin-producing bacteria in clinical Rhizopus isolates". Mycoses. 51 (3): 266–9. doi:10.1111/j.1439-0507.2007.01477.x. PMID 18399908.
- Yooseph S, Sutton G, Rusch DB, Halpern AL, Williamson SJ, Remington K, Eisen JA, Heidelberg KB, Manning G, Li W, Jaroszewski L, Cieplak P, Miller CS, Li H, Mashiyama ST, Joachimiak MP, van Belle C, Chandonia JM, Soergel DA, Zhai Y, Natarajan K, Lee S, Raphael BJ, Bafna V, Friedman R, Brenner SE, Godzik A, Eisenberg D, Dixon JE, Taylor SS, Strausberg RL, Frazier M, Venter JC (March 2007). "The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families". PLoS Biology. 5 (3): e16. doi:10.1371/journal.pbio.0050016. PMC 1821046. PMID 17355171.
- de Lange O, Wolf C, Thiel P, Krüger J, Kleusch C, Kohlbacher O, Lahaye T (November 2015). "DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats". Nucleic Acids Research. 43 (20): 10065–80. doi:10.1093/nar/gkv1053. PMC 4787788. PMID 26481363.
- "Pfam alignment: PF03377 metagenome (auto-generated match)". Retrieved 28 May 2019.
- Bertonati C, Duchateau P, Juillerat A, Silva G, Valton J (24 July 2013). "WO2014018601A2 New modular base-specific nucleic acid binding domains from burkholderia rhizoxinica proteins". Google Patents.