Tudor domain proteins function as molecular adaptors, binding methylated arginine or lysine residues on their substrates to promote physical interactions and the assembly of macromolecular complexes. Here, we discuss the emerging roles of Tudor domain proteins during development, most notably in the Piwi-interacting RNA pathway, but also in other aspects of RNA metabolism, the DNA damage response and chromatin modification.
Introduction
The tudor gene was originally discovered in Drosophila melanogaster in 1985 in a screen for maternal factors that regulate embryonic development or fertility (for a timeline, see Box 1) (Boswell and Mahowald, 1985). The Tudor protein contains 11 repeats of a domain, later termed the Tudor domain, that was subsequently found to be present in many other proteins that function in RNA metabolism (Ponting, 1997; Callebaut and Mornon, 1997). Over the past two decades, Tudor domain proteins have emerged as a class of proteins that regulate not only multiple aspects of RNA metabolism – including RNA splicing and small RNA pathways – but also a number of other processes, such as histone modification and the DNA damage response (reviewed by Chen et al., 2011; Siomi et al., 2010; Lasko, 2010; Thomson and Lasko, 2005). Through these activities, Tudor domain proteins affect a wide variety of processes during development, including cell division, differentiation, genome stability and gametogenesis. We present the current knowledge gained from studies in Drosophila melanogaster, mice and human cell lines regarding how the Tudor domain functions and the roles of different Tudor domain proteins in biological pathways.
Box 1. Timeline of the major discoveries on Tudor domains
1985. Discovery of tudor in Drosophila. The gene was named after an extinct English royal family, the Tudors, for its grandchildless phenotype in which mutant females produce embryos with defects in primordial germ cell specification (Boswell and Mahowald, 1985).
1997. Identification of conserved Tudor domain (Ponting, 1997; Callebaut and Mornon, 1997).
2001. SMN binds symmetric dimethylated arginines in spliceosomal SM proteins (Friesen et al., 2001; Brahms et al., 2001). Tudor domain structure solved (Selenko et al., 2001).
2003. Tudor domain ‘royal’ family defined to also include Tudor, MBT, PWWP, chromo and plant Agenet domains (Maurer-Stroh et al., 2003).
2005. Tudor domain proposed to recognize methyl-arginines (Cote and Richard, 2005).
2006. Tudor domain proposed to recognize methyl-lysine (Kim et al., 2006).
2006. Structure of Tudor domain bound to methyl-lysine solved (Huang et al., 2006; Botuyan et al., 2006).
2009. Germline Tudor domain proteins shown to recognize methylated Piwi proteins (Kirino et al., 2009; Reuter et al., 2009; Wang et al., 2009; Vagin et al., 2009; Chen et al., 2009; Nishida et al., 2009).
2010. Structure of Tudor domain bound to methyl-arginine solved (Liu et al., 2010a; Liu et al., 2010b).
Tudor domain proteins function as adaptor proteins
The Tudor domain is a conserved protein structural motif of ∼60 amino acids that is characterized by a strongly bent anti-parallel β-sheet composed of five β-strands with a barrel-like fold (Sprangers et al., 2003). The Tudor domain has been reported to recognize and bind methylated lysines and arginines of target substrates (Fig. 1A), and this is thought to be the key in their ability to facilitate the assembly of protein complexes at discrete cellular compartments (Friberg et al., 2009; Liu et al., 2010a; Liu et al., 2010b; Tripsianes et al., 2011). A subset of Tudor domain proteins bind methylated lysines, whereas the rest have methyl-arginine-binding capacity. However, the factors that determine this specificity are still poorly understood (Kim et al., 2006; Chen et al., 2011). Here, we focus primarily on the methyl-arginine-binding Tudor domain proteins and their roles in RNA metabolism. The roles of lysine methylation in regulating histone and non-histone proteins have been recently reviewed (Zhang et al., 2012), so we will discuss only some functions for methyl-lysine-binding Tudor domains. A list of Tudor domain proteins discussed here is given in Table 1.
Tudor domain proteins involved in RNA metabolism have been studied extensively, especially for their functions in germline and gonadal development. These Tudor domain proteins harbor an ‘extended Tudor domain’ of ∼180 amino acids, containing the 60 amino acid core Tudor domain, and interact with methylated arginines on target proteins. Structural analysis has shown that the Tudor domain binds to symmetrically dimethylated arginines with a higher affinity than its binding to asymmetrically dimethylated arginines (Liu et al., 2010a; Liu et al., 2010b). Arginine residue methylation is catalyzed by enzymes known as protein arginine methyltransferases (PRMTs) (Fig. 1B). Arginine can be methylated by Type I and II PRMTs to form mono-methylated arginine. This mono-methylated form can then be further methylated by either a Type I or Type II PRMT to form an asymmetric or symmetric di-methyl-arginine, respectively (Gary and Clarke, 1998; Bedford and Richard, 2005; Bedford and Clarke, 2009; Boulanger et al., 2004). Thus, PRMTs establish the methyl-arginine marks that are recognized and bound by Tudor domain proteins. Recently, JMJ20 and JMJ22 (Jumonji domain-containing proteins) have been shown to function as histone arginine demethylases in Arabidopsis (Cho et al., 2012); however, it remains unclear whether such an eraser of arginine methylation exists in animals. Lysine methylation and demethylation are catalyzed by lysine methyltransferases (KMTs) and demethylases (LSDs), respectively (Fig. 1C) (Zhang and Reinberg, 2001; Zhang et al., 2012).
A common function of Tudor domain proteins is to act as an ‘adaptor’ that links methylated arginine or lysine marks to ‘effectors’ with specific catalytic activities. A minority of Tudor domain proteins containing only Tudor domains function as adaptors that recruit downstream effectors (Table 1). The rest of the family contain additional catalytic, enzymatic or functional domains (e.g. nuclease or RNA helicase domains) and may themselves also function as effectors. This theme will be seen recurrently in the following sections, as we discuss the functions of different Tudor domain proteins in various biological processes.
Tudor domain proteins in RNA metabolism
Tudor domain proteins have been found to be involved in many processes that regulate the metabolism of RNA molecules, such as RNA processing, stability and translation. To target RNAs, Tudor domain proteins typically contain RNA-binding motifs themselves or bind to di-methylated arginines of proteins that are directly bound to RNAs. Below, we discuss prominent examples that display the diversity of Tudor domain functions in RNA metabolism.
Splicing
In higher eukaryotes, splicing to remove introns from pre-mRNAs is a highly coordinated and regulated event catalyzed by small nuclear ribonucleoproteins (snRNPs) (Han et al., 2011). snRNPs are assembled in the cytoplasm, before being re-imported into the nucleus where they function. The survival motoneuron (SMN) protein, a Tudor domain protein, mediates the assembly of snRNPs, by binding snRNAs and recruiting asymmetrically di-methylated arginines of SM proteins that form the protein constituent of the snRNPs (Table 1) (Cheng et al., 2007; Fischer et al., 1997; Pellizzoni et al., 1998). Together with Germins 2-8, SMN promotes the subsequent binding of the snRNAs to the SM proteins to form mature snRNPs that function as effectors (Fig. 2). Not only SMN, but also SPF30 (SMNDC1, survivor motoneuron domain containing 1), a paralog of SMN, has been shown to function in promoting spliceosome assembly via an interaction with methylated arginines of SM proteins (Rappsilber et al., 2001; Cote and Richard, 2005). However, it is currently unclear whether SMN and SPF30 perform specific or overlapping functions. The importance of SMN is demonstrated by the fact that mutations in the gene encoding it have been linked to spinal muscular atrophy disease, which is characterized by the degeneration of motoneurons leading to muscle weakness and even death in severe cases (Gubitz et al., 2004).
Small RNA pathways
Many Tudor domain proteins are involved in RNA interference (RNAi) and related pathways, in which small RNAs regulate gene expression, both post-transcriptionally and transcriptionally. Generally, there are three different classes of such small RNAs – microRNAs (miRNAs), small interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs) (Ghildiyal and Zamore, 2009; Kim et al., 2009; Siomi and Siomi, 2009). First, we will discuss the roles of Tudor domain proteins in the miRNA and siRNA pathways, before focusing on the piRNA pathway, where Tudor domain proteins play particularly important roles.
miRNA and siRNA pathways
The miRNA pathway regulates cellular functions ranging from cell division, differentiation and embryogenesis to regeneration (Hatfield et al., 2005; Li and Carthew, 2005; Bushati et al., 2008; Pek et al., 2009; Yin et al., 2008). In this pathway, endogenous hairpin transcripts are processed into mature 22-nucleotide miRNAs. These short single-stranded RNAs target mRNAs via base pairing between the seed sequence of the miRNA and the mRNA – most often but not always at the 3′ UTR. This results in inhibition of gene expression, by either translational repression or mRNA degradation (Fig. 3) (Liu et al., 2005; Guo et al., 2010). Another form of small RNA regulation occurs via the siRNA pathway. Double-stranded RNAs, either exogenous or from endogenous double-stranded RNA encoding loci, are processed into exo- or endo-siRNAs of a specific length (between 20 and 25 nucleotides, depending on species), which silence their targets by post-transcriptional and transcriptional means (Fig. 3). The siRNA/endo-siRNA pathways are important for combating exogenous viral infections and endogenous retrotransposon activities, robust embryonic development and mitotic chromosome segregation (Chung et al., 2008; Czech et al., 2008; Ghildiyal et al., 2008; Kawamura et al., 2008; van Rij et al., 2006; Wang et al., 2006; Galiana-Arnoux et al., 2006; Lucchetta et al., 2009; Pek and Kai, 2011a).
The effector of both the miRNA and siRNA pathways is the RNA-induced silencing complex (RISC), which contains the protein Argonaute as a key catalytic component. In Drosophila, mammals and Caenorhabditis elegans, RISC also contains the Tudor-SN protein, which contains a Tudor domain and four nuclease domains (Table 1) (Caudy et al., 2003). Structural studies on Drosophila Tudor-SN have shown that its Tudor domain recognizes and binds to ligands containing a symmetrically di-methylated arginine (Friberg et al., 2009). However, the specific proteins to which Tudor-SN binds via arginine methylation in vivo are not known. The nuclease domains are required for RNA binding and cleavage during post-transcriptional silencing of target RNAs (Scadden, 2005), thereby functioning as enzymatic effectors. During stress, Tudor-SN localizes to stress granules (SGs), cytoplasmic structures that are involved in RNA metabolism (Box 2), and promotes their assembly (Gao et al., 2010). However, it remains unclear how Tudor-SN functions in SG assembly, or what its substrates are.
Box 2. Processing bodies and stress granules
Processing bodies (P-bodies) and stress granules (SGs) are microscopically visible structures in the cells containing mRNAs that are stalled during translation. P-bodies are always present in cells, whereas SGs appear under stressful conditions. P-bodies are considered to be sites for degradation and miRNA-based silencing of specific mRNAs. However, it is unclear whether the formation of P-bodies is a cause or a consequence of RNA silencing. P-bodies contain cap-binding proteins, decapping enzymes, RNases and proteins involved in miRNA-mediated silencing and general mRNA repression (Thomas et al., 2011).
By contrast, SGs mainly function to achieve global translational silencing, but they also activate synthesis of specific proteins to cope with various forms of stress, such as exposure to chemicals, heat shock and DNA damage. Therefore, SGs formed under different circumstances may differ in their composition; included proteins can range from signal transduction proteins and RNA helicases to apoptotic proteins. SG formation is often triggered by the phosphorylation of EIF2α, the degradation of tRNAs or polysome disassembly, which lead to the aggregation of non-functional translation initiation complexes (Anderson and Kedersha, 2009).
Another Tudor domain protein, Tudor domain containing (TDRD) 3, also localizes to SGs upon stress (Table 1). In human cell lines, TDRD3 binds to methyl-arginines of fragile X mental retardation protein (FMRP), a translational repressor, via its Tudor domain, and promotes SG formation (Goulet et al., 2008; Linder et al., 2008). As FMRP has been linked to mediating translation repression via the miRNA pathway (Caudy et al., 2002; Jin et al., 2004; Edbauer et al., 2010), TDRD3 may function in this context as an adaptor to recruit the effector FMRP.
piRNA pathway
In addition to the miRNA and siRNA pathways that function ubiquitously, a gonad-specific form of silencing, the piRNA pathway, functions primarily to protect the genome from deleterious selfish genetic elements such as transposons (Khurana and Theukauf, 2010; Saito and Siomi, 2010; Pek et al., 2012; Senti and Brennecke, 2010). In Drosophila, piRNAs are transcribed from piRNA cluster loci, which are special regions of the genome that harbor imperfect and fragmented copies of transposons that are incapable of mobilization (Malone and Hannon, 2009). Most clusters are transcribed bi-directionally, but several are transcribed from only one direction. Cluster transcripts in the antisense direction relative to the transposon mRNAs are processed to generate piRNAs against the active transposons. In addition to cluster transcripts, piRNAs are also produced from cellular mRNAs and from transposon mRNAs. piRNAs biogenesis occurs by two mechanisms: primary processing, which is present in both germline and somatic cells (Fig. 4A); and secondary processing, the ping-pong amplification loop that occurs only in germline cells (Fig. 4B). piRNA processing is independent of Dicer – the key endonuclease that processes miRNA precursors (Vagin et al., 2006) and mature piRNAs range from 24 to 29 nucleotides in length (Khurana and Theukauf, 2010; Saito and Siomi, 2010; Pek et al., 2012; Senti and Brennecke, 2010).
The core of the piRNA pathway consists of Piwi clade proteins from the Argonaute family. Piwi proteins are conserved from invertebrates to mammals; for example, the mouse genome encodes MIWI, MILI and MIWI2 (also called PIWIL1, PIWIL2 and PIWIL4, respectively), and Drosophila encodes Piwi, Aubergine (Aub) and Argonaute3 (Ago3). Piwi proteins can be arginine methylated, suggesting that they may be targets for Tudor domain binding (Chen et al., 2009; Kirino et al., 2009; Liu et al., 2010a; Liu et al., 2010b; Nishida et al., 2009; Vagin et al., 2009). During piRNA biogenesis, the 5′ end of piRNAs is generated by the slicer activity of Piwi family proteins (Gunawardane et al., 2007), but their 3′ end is generated by unknown mechanisms (Kawaoka et al., 2011).
Primary processing involves processing of the precursor transcripts in a cytoplasmic structure called the Yb body. The mechanism of primary processing is better studied in the Drosophila soma compared with the germline, but still little is known about the molecular functions of the processing machineries. In the soma, the Piwi-bound piRNA precursors are exported to the Yb body where processing occurs (Fig. 4A). Mature Piwi-bound piRNAs are then imported back to the nucleus to regulate transcription by promoting heterochromatin formation (Olivieri et al., 2010; Saito et al., 2010; Qi et al., 2011). A Tudor domain protein containing a DEAD-box helicase motif, Yb, is required for the primary processing of piRNAs and transposon repression (Table 1) (Szakmary et al., 2009; Olivieri et al., 2010; Saito et al., 2010; Qi et al., 2011). Yb localizes to the Yb body via its Tudor domain and functions as an adaptor by binding and recruiting Armitage (Armi), another primary processing protein, to the Yb body. Without Yb, Piwi is not loaded with piRNAs, remains cytoplasmic in Yb bodies and cannot enter the nucleus. Although the molecular function of the helicase domain is currently unknown, Yb itself may also function as an effector to regulate piRNA processing. Another Tudor domain protein, Vreteno (Vret), also localizes to the Yb body and functions in the primary processing pathway (Table 1) (Zamparini et al., 2011; Handler et al., 2011). Vret contains RRM (RNA recognition motif) and MYND (myeloid, Nervy and DEAF-1) domains, and physically interacts with proteins required for primary processing, such as Piwi, Armi, Yb and Zucchini; therefore, it may act as both an adaptor and an effector in the repression of transposons in somatic follicle cells. However, it is currently unclear whether the binding of Yb and Vret to their partners is dependent on arginine methylation, and the exact molecular roles of the components in the primary processing pathway remain unclear.
Unlike in somatic cells, piRNAs in Drosophila germline cells are generated by both primary and secondary processing pathways. The secondary pathway leads to amplification of mature piRNAs via the ping-pong cycle. The Piwi proteins Aub and Ago3 are central to this cycle, in which Aub-bound piRNAs in antisense orientation trigger the cleavage of the sense target (base paired to the antisense piRNA) to produce piRNAs in sense orientation. These sense piRNAs are loaded onto Ago3 and trigger cleavage of the antisense precursors to yield more antisense piRNAs (Fig. 4B). This secondary processing pathway is conserved in other animals, such as mouse and zebrafish (Siomi et al., 2011). Many of the proteins involved in this amplification cycle contain Tudor domains (Table 1) and exhibit a characteristic localization to the perinuclear nuage (an electron-dense structure) in the cytoplasm of germline cells in Drosophila, zebrafish and mouse (Fig. 4C,D) (Voronina et al., 2011; Pek et al., 2012; van der Heijden et al., 2010). Drosophila Aub and Ago3, and mouse MIWI, MILI and MIWI2 (which are required for the secondary amplification of piRNAs) also localize to the nuage. For this reason, the nuage has been proposed as the site where secondary amplification occurs (Arkov and Ramos, 2010; Pek et al., 2012; Siomi et al., 2011). Interestingly, in both mouse and Drosophila, the interaction of specific Tudor domain proteins, such as mouse TDRD1, TDRD2 and TDRD9 and fly Tudor and Partner of Piwis (Papi), with the Piwi proteins, is regulated by PRMT5 in mammals (or by Capsuleen in flies), which also localizes to the nuage, suggesting that these interactions are dependent on the arginine methylation of their partners (Kirino et al., 2009; Vagin et al., 2009; Anne et al., 2007; Anne and Mechler, 2005; Liu et al., 2011; Nishida et al., 2009).
Some Tudor domain proteins that localize to the nuage may function via their RNA-binding activities. Two Drosophila Tudor domain proteins, Tejas (Tej) and Spindle-E (Spn-E), have been proposed to function together in the assembly of the nuage, the production of germline piRNAs and the repression of transposons (Vagin et al., 2006; Lim and Kai, 2007; Patil and Kai, 2010). Similarly, mouse TDRD5 (a Tej homolog) is also required for nuage formation and retrotransposon Line-1 repression; however, its role in piRNA production is unknown (Yabuta et al., 2011). Tej contains a Tejas/Lotus domain that has been implicated in double-stranded RNA binding, whereas Spn-E contains a DEXD-box helicase domain (Table 1) (Patil and Kai, 2010; Anantharaman et al., 2010; Callebaut and Mornon, 2010; Gillespie and Berg, 1995). In Drosophila, Tej binds to Spn-E and Aub, and its interaction with Aub appears to be independent of arginine methylation (Partil and Kai, 2010). Tej and Spn-E are required for the recruitment of many piRNA pathway proteins, such as Aub, Krimper (Krimp), Ago3 and Maelstrom (Mael) to the nuage (Lim and Kai, 2007; Patil and Kai, 2010; Findley et al., 2003; Pek et al., 2009). Therefore, these proteins may function together as both adaptor and effector proteins in a subcomplex that links transposons and precursor piRNAs to piRNA-loaded Aub/Ago3 for effective ping-pong cycling via RNA-binding domains.
Recently, a unique Tudor domain protein Kumo/Qin has been reported to be involved in transposons silencing by secondary piRNA processing. Drosophila Kumo/Qin contains five Tudor domains and an E3 ligase domain (Table 1), and loss of kumo/qin results in an increase in the sense piRNAs from a few clusters and transposon families (Anand and Kai, 2011; Zhang et al., 2011). The two studies used different alleles and reported different degrees of piRNA loss; in neither case was the depletion as severe as that observed in other nuage components. Kumo/Qin localizes to the nuage and interacts with other piRNA pathway components such as Aub, Ago3, Vasa and Spn-E, and kumo/qin is required for the proper assembly of all the examined nuage components (Anand and Kai, 2011). The analysis of Aub- and Ago3-bound piRNAs in the kumo/qin mutant ovaries by Zhang et al. (Zhang et al., 2011) demonstrated the loss of Aub:Ago3 ping-pong and the increase in Aub:Aub ping-pong, suggesting that kumo/qin may function to maintain the heterotypic ping-pong between Aug and Ago3. The role of the E3 ligase domain of Kumo/Qin remains unknown, but it may regulate protein stability via ubiquitin-mediated protein degradation. The mouse homologue of Kumo/Qin, TDRD4/RNF17 (ring finger protein 17), which also localizes to the nuage, is important for spermatogenesis; however, its requirement for piRNA production is not known (Pan et al., 2005).
In addition to functioning directly in the processing of piRNAs, some Tudor domain proteins that localize to the nuage have been linked to the P-body in Drosophila (Box 2). Two of these proteins are Papi and Krimp (Table 1) (Lim and Kai, 2007; Liu et al., 2011). Papi contains a Tudor domain and two K homology (KH) domains, that may bind RNA (Table 1) (Liu et al., 2011). Papi functions as an adaptor by interacting with symmetric di-methyl-arginines of Ago3 and recruiting them to the nuage for transposon silencing. This function is probably conserved in the mouse Papi homolog TDRD2 (Chen et al., 2009; Vagin et al., 2009). Furthermore, in Drosophila, Papi physically associates with two P-body proteins, Tral and Me31B, suggesting that transposon silencing may be linked to P-body-mediated mRNA degradation. Like Papi, Krimp has also been linked to the Me31B, Dcp1/2 and Pcm P-body components, and it also is required for the generation of germline piRNAs via the ping-pong cycle (Malone et al., 2009; Lim et al., 2009). However, the binding partners, molecular functions and mammalian homolog of Krimp are currently unknown. Interestingly, another conserved Tudor domain protein, Ovarian Tumor (Otu), localizes to the P-body (Keyes and Spradling, 1997), but it has not yet been implicated in the piRNA pathway. The relationship between nuage proteins and P-body components is conserved: the Spn-E homolog TDRD9 forms a complex with MIWI2 and colocalizes with MIWI2 and MAEL in piP-bodies (cytoplasmic structures that also contain P body components), and it is required for Line-1 repression and piRNA production in the mouse male germline (Shoji et al., 2009; Aravin et al., 2009).
Some Tudor domain proteins that repress transposons do not appear to be required for piRNA production. In the mouse, TDRD7 localizes to the nuage and is essential for Line-1 retrotransposon repression but is not required for piRNA production (Table 1) (Lachke et al., 2011; Tanaka et al., 2011; Hosokawa et al., 2007). Similarly, TDRD1 localizes to the nuage and represses the expression of retrotransposons in mice and zebrafish (Table 1) (Chuma et al., 2006; Wang et al., 2009; Reuter et al., 2009; Huang et al., 2011). The Tudor domain of TDRD1 physically interacts with MILI/ZILI via symmetric di-methylarginines, and it is required for the formation of nuage but not for the production of piRNAs (Wang et al., 2009; Reuter et al., 2009; Huang et al., 2011). These data imply that besides piRNA amplification, the nuage may have another distinct role downstream of piRNA biogenesis in transposon silencing.
The protein in which the Tudor domain was initially discovered, Tudor, is a component of the nuage and pole plasm in Drosophila (Table 1). The pole plasm contains electron-dense organelles called polar granules that are composed of many proteins [including Tudor, Vasa, Staufen, Pgc (polar granule component) and Aub] and RNAs (such as oskar, cyclin B and hsp83), which are required for germline formation in the next generation (Thomson and Lasko, 2005). In the absence of Tudor, pole plasm assembly is disrupted, with consequent defects in the germline. More recent studies have suggested a function for Tudor in the nuage for the production of piRNAs in Drosophila (Nishida et al., 2009). Tudor binds to Aub and Ago3 in a symmetric di-methylated arginine-dependent fashion, probably functioning as an adaptor to regulate ping-pong amplification. However, although Tudor regulates piRNA production, it is not required for transposon silencing (Lim and Kai, 2007; Handler et al., 2011).
In summary, many Tudor domain proteins localize to specific structures (such as the nuage and Yb body) in the gonads, and they are required for the assembly of these structures. Especially for piRNA production, it is evident that Tudor domain proteins function as scaffolds for multiple adaptors via binding to methylated arginines on Piwi family proteins (Chen et al., 2009; Chen et al., 2011; Kirino et al., 2009; Liu et al., 2010a; Liu et al., 2010b; Liu et al., 2011; Nishida et al., 2009; Siomi et al., 2010; Vagin et al., 2009). The additional catalytic domains on some Tudor domain proteins appear to be important for processing piRNAs to exert their function as effectors.
Other functions for Tudor domain proteins
We have discussed various Tudor domain proteins that recognize methylated arginines in order to mediate their functions in RNA metabolism. In the next section, we discuss how other Tudor domain proteins function to regulate the DNA damage response and chromatin organization by recognizing methylated histone lysines and arginines.
Tudor domain proteins in the DNA damage response
The DNA damage response (DDR) is an evolutionarily conserved pathway that protects the genome from insults from external stimuli (van Attikum and Gasser, 2009; Polo and Jackson, 2011). This pathway involves a cascade of events that begins with sensing of the DNA damage through adaptor proteins and ends with effectors triggering a set of responses, such as cell cycle arrest, DNA repair, transcriptional changes and/or apoptosis. The human p53-binding protein 1 (53BP1 or TRP53BP1) is a Tudor domain protein that is linked to the DDR. 53BP1 acts downstream of DNA damage sensors such as MRN [MRE11 (meiotic recombination 11)-RAD50-NBS1 (Nijmegen breakage syndrome 1)], PIKKs (phosphoinositide 3-kinase-like kinases), ATM (ataxia telangiectasia mutated) and ATR (ATM and RAD3-related) (FitzGerald et al., 2009). The Tudor domain of 53BP1 mediates its binding to histone H4 dimethylated at Lys20, which are sensors that accumulate at the sites of DNA damage (Botuyan et al., 2006). Ubiquitylation of 53BP1 by the ubiquitin ligases ring finger (RNF) 8 and RNF168 facilitates recruitment of AAA-ATPase valosin-containing protein (VCP, also known as p97) to site of DNA damage, which in turn removes the L3MBTL1 protein from H3K4 Lys20 methylated sites and promotes binding of 53BP1 at these marks (Acs et al., 2011; Doil et al., 2009; Kolas et al., 2007). The Tudor domain of 53BP1 also functions as the adaptor in the DDR by promoting effector recruitment to the site of DNA damage. The loaded effectors, such as check point kinase 1 (CHEK1), check point kinase 2 (CHEK2) and ATM, in turn activate the DDR, thereby linking the sensors to the final output (van Attikum and Gasser, 2009; FitzGerald et al., 2009).
53BP1 also interacts with multiple methylated lysines of p53 during the DDR (Huang et al., 2007; Kachirskaia et al., 2008; Roy et al., 2010). Methylation of p53 by Set9 promotes its binding to 53BP1 and increases its stability and downstream transcriptional activation of genes such as p21 to induce G1 cell cycle arrest and allow DNA repair during the DDR (Chuikov et al., 2004; Kachirskaia et al., 2008). Taken together, 53BP1 plays multiple roles in the DDR by regulating multiple proteins.
Tudor domain proteins that interact with modified histones
Tudor domain proteins also function to regulate DNA packaging during development. Modifications of histone tails and DNA have been reported to be major mechanisms that regulate the transcriptional activity of chromatin (Box 3, Fig. 5). These modifications function as ‘marks’ and are characteristic of transcriptional activation or repression via the so-called histone code (Strahl and Allis, 2000; Berger, 2002; Peterson and Laniel, 2004). Among those, Tudor domain proteins recognize particular methylated arginines and lysines and recruit downstream effectors that, in turn, can promote chromatin activation or silencing in a context-dependent manner. As the roles of Tudor domain proteins that bind to methylated histones have been thoroughly summarized (Adams-Cioaba and Min, 2009), we will focus on a few representative examples.
Box 3. Chromatin modification
Both the DNA and histone protein components of chromatin are subject to chemical modifications: DNA undergoes methylation, whereas histones are subjected to a wide variety of post-translational modifications (PTMs), such as methylation, acetylation, phosphorylation and ubiquitylation. Together, these constitute a code for genome organization and the regulation of gene expression. For example, a specific set of histone PTMs and DNA methylation can lead to tight packaging of DNA, making it inaccessible for transcription; this structure is termed heterochromatin. By contrast, other types of histone PTMs may lead to the opening of DNA to facilitate transcription. For example, methylation of H3K9 and H3K27 are representative of repressive marks, while methylation of H3K4 promotes transcription activation. However, the biological outcomes of PTMs often depend heavily on the chromatin context and the cellular context of the modification. DNA methylation and histone PTMs are dynamic, and are regulated by both intrinsic signals and environmental factors. In addition, some of these modifications are genetically transmitted to the next generation to regulate gene expression in the progeny – a process termed epigenetic inheritance.
As in other examples discussed above, some of the Tudor domain proteins that bind methylated histones possess catalytic domains thereby also functioning as effectors. An example of transcriptional activation by a Tudor domain protein involves human TDRD3 (Table 1), discussed above as a SG component. In addition to this cytoplasmic function, TDRD3 in the nucleus specifically recognizes and binds to methylated arginines of histones to promote transcription of regions such as estrogen-responsive element (Yang et al., 2010). In addition, the Tudor domain of TDRD3 can also bind to a CARM1 (co-activator-associated arginine methyltransferase 1) methylated site in the C-terminal domain of RNA polymerase II (RNAPII), suggesting that TDRD3 may also regulate the activity of RNAPII (Sims et al., 2011).
Unlike TDRD3, many other Tudor domain proteins function in transcription repression. These proteins recruit methyltransferases or function as methyltransferases themselves, and promote lysine methylation of the histone tails. One such factor, the Polycomb-like (PCL) protein contains a Tudor domain and two plant homeodomain (PHD) domains (Table 1). PCL functions as an adaptor to recruit core components of the Polycomb repressive complex 2 (PRC2), a histone H3K27 methyltransferase, to chromatin (Casanova et al., 2011). H3K27 methylation by PRC2 controls many developmental processes, including Drosophila patterning, mouse embryonic stem cell self-renewal and differentiation, and X-chromosome inactivation (Nekrasov et al., 2007; Savla et al., 2008; Walker et al., 2010; Casanova et al., 2011). Histone methylation is also mediated by SETDB1 (Eggless in Drosophila), which contains two Tudor domains, a methyl-DNA-binding domain and a SET domain that methylates histone lysines (Clough et al., 2007). In Drosophila, SetdB1 (Egg) is important for oogenesis and the silencing of chromosome 4 (Clough et al., 2007; Tzeng et al., 2007; Seum et al., 2007). However, the roles of its Tudor domains and their potential binding partners are currently unknown.
Several Tudor domain transcriptional repressors promote the removal of methyl groups from histone lysine tails. One example is the human and mouse Jumonji domain-containing (JHDM3/JMJD2) family of proteins, which harbor two Tudor domains along with a Jumonji N (JmjN) domain, a Jumonji C (JmjC) domain and two PHD domains (Table 1). The Tudor domains of these proteins have been found to be essential for their interaction with methylated H3K4, H3K9 and H4K20, whereas the JmjN and JmjC domains have a demethylase activity that demethylates tri-methylated lysines at H3K9 and H3K36 (Huang et al., 2006; Whetstine et al., 2006; Klose et al., 2006; Cloos et al., 2006; Fodor et al., 2006; Kim et al., 2006). This family of proteins functions as effectors by binding to the co-repressor complex NCoR and has also been shown to repress E2F target genes by interacting with RB (retinoblastoma protein) (Gray et al., 2005; Zhang et al., 2005).
Besides regulating histone methylation, Tudor domain proteins can also regulate histone acetylation. PHF20 (plant homeo finger protein 20) contains a Tudor domain that binds dimethylated histone lysines. Although the precise mechanism is unclear, it is proposed to mediate crosstalk between histone lysine methylation and acetylation by recruiting the histone acetyltransferase MOF (male absent on the first) (Badeanx et al., 2012).
Finally, Tudor domain proteins can also promote transcriptional repression by mediating DNA methylation. For example, UHRF1 (ubiquitin-like, containing PHD and RING finger domains 1; also known as NP95 and ICBP90) functions to maintain DNA methylation during S phase in mammalian cells (Bostick et al., 2007). UHRF1 binds to H3K9me3 via its tandem Tudor domains and recruits DNMT1 [DNA (cytosine-5)-methyltransferase 1] to the chromatin to promote DNA methylation and heterochromatin assembly (Bostick et al., 2007; Rottach et al., 2010; Nady et al., 2011). In this way, UHRF1 functions to link histone lysine methylation to DNA methylation during the assembly of heterochromatin.
Conclusions
Tudor domain proteins function in diverse cellular processes and have wide-ranging effects in developmental processes and cellular events. Recent studies have shown that a large class of Tudor domain proteins functions in the gonads to regulate gametogenesis and genome stability via the piRNA pathway. Despite the recent discovery that Tudor domains recognize methylated arginines/lysines on their substrates, there is still much to learn about how methyl-arginine/lysine binding relates to the known cellular functions of these proteins. Moreover, it is clear that many Tudor domain proteins cooperate in the same macromolecular complexes and processes, and the redundancies and interactions between them are still not well understood. Analysis of the spatial associations of individual Tudor domain proteins with other members of the family, with themselves (as oligomers) and with their binding partners should shed light on the molecular mechanisms underlying the functioning of Tudor domain protein complexes.
Acknowledgements
We thank members of the Ito and Kai Laboratories for discussion.
Funding
Research in the Kai laboratory was supported by the Temasek Life Sciences Laboratory and the Singapore Millennium Foundation.
References
Competing interests statement
The authors declare no competing financial interests.