ABSTRACT
G2/M progression requires coordinated expression of many gene products, but little is known about the transcriptional regulators involved. We recently identified human Cdc5, a positive regulator of G2/M in mammalian cells. We also demonstrated the presence of a latent activation domain in its carboxyl terminus, suggesting that human Cdc5 regulates G2/M through transcriptional activation. Despite the presence of a DNA binding domain, studies by others have failed to identify a preferential binding site for Cdc5 family members. In addition, Cdc5 recently has been associated with the splicesome in several organisms, suggesting that it may not act through DNA binding. We now report the identification of a 12 bp sequence to which human Cdc5 binds specifically and with high affinity through its amino terminus. We show that this DNA-protein interaction is capable of activating transcription. We also used a selection system in yeast to identify human genomic fragments that interact with human Cdc5. Several of these contained sequences similar to the binding site. We demonstrate that these bind human Cdc5 with similar specificity and affinity. These experiments provide the first evidence that Cdc5 family members can act as site-specific DNA binding proteins, and that human Cdc5 may interact with specific, low abundance sequences in the human genome. This raises the possibility that Cdc5 proteins may participate in more than one process necessary for regulated cell division.
INTRODUCTION
The eukaryotic cell division cycle includes a series of checkpoints, which ensure that some events are completed before others begin (Murray and Hunt, 1993). One set of controls determines whether the cell replicates its genetic material in preparation for division (G1/S), while another checks that DNA replication is complete and that the genome has not been damaged (G2/M) (Nasmyth, 1996). Cyclins and cyclin-dependent kinases mediate these events, and are themselves regulated through multiple mechanisms (Morgan, 1995).
The biochemical events controlling G2/M transit in mammalian cells are remarkably similar to those in lower eukaryotes. Mammalian Cdc2 kinase accumulates in S phase (Shimizu et al., 1995) and is regulated by Wee1 kinase (Parker and Piwnica-Worms, 1992) and Cdc25 phosphatase (Lee et al., 1992; Millar et al., 1991). Recent studies also have begun to elucidate the pathways by which G2 arrest occurs in response to DNA damage in mammalian cells (Nurse, 1997). Chk1 kinase, which is activated by phosphorylation in response to DNA damage, in turn phosphorylates Cdc25C (Sanchez et al., 1997). This potentiates the binding of 14-3-3 to Cdc25C, which prevents Cdc25C from dephosphorylating and activating Cdc2 (Peng et al., 1997). In addition, it has been shown that expression of 14-3-3 is regulated by p53, a DNA binding protein already implicated in p21WAF1/CIP1-mediated G1 arrest (Hermeking et al., 1997).
While G2/M progression clearly requires the coordinated expression of many gene products, less is known about how these are regulated at the level of transcription. Characterization of transcription factors regulating G2 progression and mitotic entry would significantly advance our understanding of the mechanisms controlling this portion of the cell cycle.
S. pombe cdc5p was first described as a putative DNA binding protein implicated in G2/M transit (Nasmyth and Nurse, 1981; Ohi et al., 1994). We subsequently identified a cDNA encoding a protein with limited homology to S. pombe cdc5p (Bernstein and Coughlin, 1997). Its widespread expression in human tissues and homology with expressed sequences in other eukaryotes suggested an evolutionarily conserved general function (Bernstein and Coughlin, 1997). Nuclear import upon serum stimulation of mammalian cells suggested a possible role in cell proliferation (Bernstein and Coughlin, 1997).
More recently, we described the function of human Cdc5 (hCdc5) in mammalian cells (Bernstein and Coughlin, 1998). Overexpression of hCdc5 shortened G2 and reduced cell size. A dominant negative mutant of hCdc5 consisting of the putative DNA binding domain slowed G2 progression and delayed entry into mitosis. These results demonstrated that hCdc5 regulates G2 progression and mitotic entry, and suggested that it functions through DNA binding (Bernstein and Coughlin, 1998).
While hCdc5 appears to positively regulate G2/M in mammalian cells, its function through DNA binding and transcriptional regulation remains to be confirmed. Effector genes regulated by hCdc5 or other members of the Cdc5 family in S. cerevisiae (Ohi et al., 1998), A. thaliana (Hirayama and Shinozaki, 1996), C. elegans (Bernstein and Coughlin, 1997), D. melanogaster (Katzen et al., 1998; Ohi et al., 1998), and M. musculus (Bernstein and Coughlin, 1997) have not been identified. hCdc5 and its relatives contain tandem helix-turn-helix DNA binding motifs at their amino termini, similar to that seen in Myb-related proteins (Bernstein and Coughlin, 1997). In contrast with c-Myb, however, Cdc5-related proteins contain only two repeats of the helix-turn-helix motif, whereas Myb family members possess three (Bernstein and Coughlin, 1997; Hirayama and Shinozaki, 1996; Katzen et al., 1998; Ohi et al., 1998; Ohi et al., 1994). Moreover, within this domain hCdc5 and its relatives bear a Val→Leu substitution at a position critical for DNA binding specificity (Carr et al., 1996; Ogata et al., 1996). Cdc5 family members, therefore, likely differ from Myb in their DNA binding properties.
Recently, a 7-base-pair (bp) nucleotide sequence identified by random oligonucleotide binding site selection was shown to interact with the DNA binding domain of A. thaliana Cdc5 in vitro; however, binding with this sequence was reduced with non-specific, competitor DNA (Hirayama and Shinozaki, 1996). Similar experiments to identify a consensus binding site for the highly conserved DNA binding domains of D. melanogaster Cdc5 and the Cdc5 homologue in S. cerevisiae, Cef1p, failed to identify any preferential site, nor did they interact with the 7 bp sequence identified for A. thaliana (Ohi et al., 1998). In addition, others have shown that Cdc5 does not activate the transcription of candidate genes known to be upregulated during G2/M, for example cdc2 and String in D. melanogaster (Katzen et al., 1998), Clb1 and Swi5 in S. cerevisiae (Ohi et al., 1998), and cdc13+ and cdc25+ in S. pombe (Ohi et al., 1998). These results raised the question of whether Cdc5 family members serve functions other than as site-specific DNA binding proteins (Ohi et al., 1998), even though they possess a Myb-like DNA binding domain and the human homologue experimentally has been shown to contain a latent transcriptional activation domain (Bernstein and Coughlin, 1998). In addition, recent studies in fission and budding yeast have implicated a role for cdc5p and Cef1p, respectively, in pre-mRNA splicing (McDonald et al., 1999; Tsai et al., 1999), and hCdc5 has been identified as a component of the mammalian splicesome (Burns et al., 1999; Neubauer et al., 1998).
We now report that human Cdc5 may serve as a site-specific, double-stranded DNA binding protein capable of forming high affinity complexes with specific sequences through its DNA binding domain. We employed cyclic amplification and selection from a pool of random oligonucleotides to identify preferential binding sites for the hCdc5 DNA binding domain. A high-affinity, double-stranded, consensus sequence was identified, which contains a helix-turn-helix binding motif. We also showed that this DNA-protein interaction is capable of activating transcription. We then used a selection system in yeast to identify human genomic sequences that bind hCdc5 through its helix-turn-helix domain. These experiments demonstrate that a member of the Cdc5 family interacts with specific sequences through direct DNA-protein interaction. They also suggest that mammalian Cdc5 family members may participate in more than one process necessary for regulated cell division.
MATERIALS AND METHODS
Materials and strains
Yeast expression plasmids pBM2389 and pBM2463, and S. cerevisiae strain YM4271 (MATaura3-52 his3-_200 ade2-101 lys2-801 trp1-901 gal4-Δ512 gal80-Δ538 ade5::hisG), were generously provided by Mark Johnston (Washington University School of Medicine).
Expression and purification of recombinant hCdc5
In order to express the DNA binding domain of hCdc5, unique EcoRV and SalI sites were inserted into pSK67 (Bernstein and Coughlin, 1997) using site-directed mutagenesis (Kunkel et al., 1987). The 1500 bp cDNA fragment encoding 500 amino acids from the amino terminus of hCdc5 was then subcloned into EcoRV/SalI sites in pET29c(+) (Novagen). The resulting plasmid was transformed into E. coli strain BL21 (DE3), and individual colonies were inoculated into 500 ml liquid LB medium containing 50 μg/ml kanamycin. Cultures were grown at 37°C to OD600 0.2-0.6. Cultures were induced with isopropyl β-D-thiogalactopyranoside (IPTG) at final concentration of 1 mM for 2 hours. Bacterial pellets were collected by centrifugation and lysed in 50 mM NaH2PO4, pH 8.0, 300 mM NaCl, 10 mM imidazole, 1 mg/ml lysozyme at 4°C for 1 hour followed by sonication. The lysate supernatant then was added to 0.5 ml 50% Ni-NTA resin (Qiagen) and mixed gently at room temperature for 60 minutes. The resin was washed with 50 mM NaH2PO4, pH 8.0, 300 mM NaCl, 20 mM imidazole, and the His6-tagged protein eluted with 50 mM NaH2PO2, pH 8.0, 300 mM NaCl, 70 mM imidazole. The protein then was dialyzed in 10 mM Tris, pH 7.9, 60 mM KCl, 4 mM MgCl2 at 4°C overnight. Dialysed protein was analyzed by SDS-polyacrylamide gel electrophoresis and immunoblot using monoclonal anti-His antibody (Invitrogen) according to previously described methods (Bernstein and Coughlin, 1997).
Oligonucleotide selection
Oligonucleotide selection was performed with a DNA fragment that contained a 15 bp random sequence flanked on either side by 20 bp of nonrandom sequence (5′-CGCTCGAGGGATCCGAATTC(N15)T-CTAGAAAGCTTGTCGACGC-3′). The single-stranded oligonucleotide was made double-stranded with Taq DNA polymerase and a primer complementary to the 3′ flanking sequence (5′-GCGTCGA-CAAGCTTTCTAGA-3′). After extraction with phenol/chloroform/isoamyl alcohol and precipitation with glycogen/ethanol, 60 picomoles of the double-stranded oligonucleotide pool were incubated with 0.5 μg purified His6-hCdc5 in binding buffer (10 mM Hepes, pH 7.8, 75 mM KCl, 2.5 mM MgCl2, 3% Ficoll, 40 μg/ml poly(dI-dC), in the presence of 1 μg/ml phenylmethylsulfonyl fluoride, 0.4 ng/ml aprotinin, 1 ng/ml leupeptin, 0.4 ng/ml soybean trypsin inhibitor, 0.4 ng/ml pepstatin A, 10 ng/ml benzamidine) at room temperature for 30 minutes. The binding solution then was incubated with 50% Ni-NTA agarose at room temperature for 10 minutes. After washing the resin extensively with binding buffer, the DNA-protein complexes were eluted with binding buffer containing 250 mM imidazole.
Bound oligonucleotides were amplified with Taq polymerase over 15-20 cycles by annealing to flanking primers (5′-GCGTCGAC-AAGCTTTCTAGA-3′ and 5′-CGCTCGAGGGATCCGAATTC-3′) at 55°C for 20 seconds and extending at 72°C for 10 seconds. Subsequent rounds of binding were accomplished as above with 0.1 μg recombinant protein and 1/10th of the unpurified PCR product. After 3-8 cycles of binding, selection and amplification, PCR products were cloned into a TA vector (Invitrogen) and sequenced.
Electrophoretic mobility shift assay
DNA-protein binding was accomplished by incubating 0.8 pmoles recombinant protein with 0.01 pmole double-stranded oligonucleotide, labeled with γ-32P to a specific activity of 106 cpm/pmole, in binding buffer at room temperature for 30 minutes. The same molar ratio of protein/DNA was used in all samples. After separation of bound and free DNA by 5% non-denaturing acrylamide gel electrophoresis, DNA binding was quantitated with a Storm 860 Phosphor/Fluor Imager and ImageQuant software (Molecular Dynamics) and calculated as a percentage of DNA bound divided by total DNA (bound/bound+free). Each DNA binding experiment was repeated at least three times, and representative assays are shown.
Equilibrium dissociation constants (Kd) were determined by incubating a constant amount of labeled DNA (0.01 pmole at 106 cpm/pmole) with increasing amounts of protein (0.2-2.0 pmoles) at room temperature for 30 minutes. After binding activity was quantitated as described above, the Kd was calculated with the equation Kd=[D][P]/[DP], where [D] is the concentration of free DNA, [P] is the concentration of free protein, and [DP] is the concentration of the DNA-protein complex. The DNA concentration was limiting relative to the protein concentration to allow the approximation [DP]≈[DPtotal].
Transcriptional reporter assay
p5xCdc5 was constructed by cloning five, tandem copies of the 12 bp, hCdc5 binding sequence in place of the GAL4 binding element in pFR-Luc by PCR using primers 5′-CAAGCTTGCATGCCTGC-AGGTGATTTAACATAAGATTTAACATAAGATTTAACATAAGAT-TAACATAAGATTTAACATAAACTCTAGAG-3′ and 5′-CGTGTA-CATCGACTGAAATCCC-3′. p3xGAL4-3xCdc5 was constructed by cloning three copies of the GAL4 binding site in tandem with three copies of the hCdc5 binding site in place of the GAL4 binding site of pFR-Luc using primers 5′-CAAGCTTGCATGCCTGCAGGTCGGA-GTACTGTCCTCCGCCGGAGTACTGTCCTCCGCCGGAGTACT-GTCCTCCGCGATTTAACATAAGATTTAACATAAGATTTAACA-TAAACTCTAGAG-3′ and 5′-CGTGTACATCGACTGAAATCCC3′?????
pFR-Luc (Stratagene) contains the entire coding region of Photinus pyralis luciferase downstream of five tandem repeats of the GAL4 binding element and a basic promoter (TATATA).
HeLa cells were transiently transfected using LipofectAMINE Plus (Gibco) according to the manufacturer’s instructions, and luciferase activity was assayed in whole cell lysates using the Dual-Luciferase Reporter Assay System (Promega) as previously described (Bernstein and Coughlin, 1998). pM3-VP16 (Clontech), encoding a GAL4-VP16 fusion, was used as a positive control. A plasmid encoding the Δ675 hCdc5 mutant has previously been described (Bernstein and Coughlin, 1998). pRL-TK (Promega), encoding Renilla reniformis luciferase downstream of a Herpes TK promoter, was included in each transfection for normalization of transfection efficiency.
Binding site selection in yeast
The method for selection of human genomic binding sites was adapted from a similar system used to define the binding specificity of NGFI-B (Wilson et al., 1991). The library or UAS (upstream activating sequence)-reporter plasmid was constructed by inserting Sau3AI-digested, human genomic DNA into the unique BamHI site of pBM2389 (Liu et al., 1993), upstream of an inactive GAL1 promoter. The activator plasmid was constructed by cloning sequence encoding the first 120 amino acids of hCdc5 (Bernstein and Coughlin, 1997) into unique NotI/XhoI sites of pBM2463 (Liu et al., 1993), placing it in-frame between the LexA DNA binding domain and GAL4 activating domain.
Yeast were manipulated according to standard protocols (Guthrie and Fink, 1991). UAS and activator plasmids were cotransformed into S. cerevisiae strain YM4271 (Liu et al., 1993), to allow for red-white selection with ADE5, and counterselection with 5-FOA. Non-sectoring, His+ colonies were patched to medium containing 0.1% 5-FOA, and library plasmids that allowed activator-dependent growth in His− medium, as confirmed by growth failure in 5-FOA, were selected for sequencing.
RESULTS AND DISCUSSION
The DNA binding domain of hCdc5 binds specifically and with high affinity to a 12-bp, double-stranded DNA sequence
To determine whether hCdc5 interacts with specific consensus sequences through its consensus DNA binding domain, we performed selection and amplification from a pool of random oligonucleotides (Blackwell, 1995). We first expressed and purified the amino terminus of hCdc5 (amino acids 1-500) in E. coli, then used this to select a preferential binding site from a degenerate pool of 15 bp oligonucleotides flanked by conserved sequences to facilitate amplification and cloning. After 8 cycles, a consensus 12 bp sequence, GATTTAACATAA, was identified (Fig. 1).
We then used an electrophoretic mobility-shift assay to evaluate the DNA-protein interaction (Fig. 2). Competition with unlabeled, consensus oligonucleotide demonstrated that the DNA binding domain of hCdc5 binds the double-stranded target sequence specifically (Fig. 2A). Binding did not occur with single-stranded sense or antisense oligomer (data not shown). Titration with increasing amounts of purified, recombinant protein enabled us to calculate an equilibrium dissociation constant of approx. 10−8 M (Fig. 2B). This value is comparable to other helix-turn-helix DNA binding domains, such as that published for c-Myb (Krieg et al., 1995). These experiments demonstrated for the first time that hCdc5 can bind with high affinity to a specific, double-stranded DNA sequence. It also confirms that the binding specificity for Cdc5 in higher eukaryotes is markedly distinct from even the extended consensus sequences reported for c-Myb, YAACKGHH (Weston, 1992) and YGRCVGTTR (Howe and Watson, 1991). It is not clear why others were unable to identify preferential binding sites for Cef1p, the putative Cdc5 homologue in S. cerevisiae (Ohi et al., 1998). While Cef1p shares sequence homology at its amino terminus with Cdc5 family members, it was unable to complement the temperature-sensitive growth defect of S. pombe cdc5-120 at its restrictive temperature (Ohi et al., 1998). In addition, human and fission yeast Cdc5 proteins were unable to rescue the lethality of S. cerevisiae cef1Δ cells (Ohi et al., 1998). These findings suggest that Cdc5 family members perhaps have multiple or even divergent functions in lower eukaryotes.
The hCdc5 target sequence contains a helix-turn-helix binding motif
To determine which nucleotides within the consensus sequence were necessary for protein binding, we next performed scanning mutagenesis of the sequence (Fig. 2C). This revealed that the core ‘ANCA’ motif, commonly found in the binding site for other Myb-related, helix-turn-helix DNA binding proteins (McIntosh et al., 1998; Oda et al., 1998; Ording et al., 1996; Sala et al., 1999; Suzuki et al., 1998) was essential for binding. It also demonstrated that the symmetrical, flanking sequences, TTA/TAA, increased binding affinity (Fig. 2C).
To further investigate whether DNA binding was specific for the helix-turn-helix motif, as suggested by the ‘ANCA’ core sequence, we expressed a truncation of the hCdc5 amino terminus (amino acids 1-120), containing only the helix-turn-helix domain (Bernstein and Coughlin, 1997), as well as a Cdc5 mutant in which the helix-turn-helix motif was disrupted by W→G substitutions at positions 31, 53 and 82 in the human peptide sequence (Bernstein and Coughlin, 1997). These substitutions replace the core tryptophan residues essential to the helix-turn-helix structure, and correspond to the W33G W52G W84G cef1 mutant previously reported as unable to rescue growth of a cef1-Δ1::HIS3 mutant (Ohi et al., 1998). While the helix-turn-helix domain alone bound the target sequence with the same specificity and affinity as the longer, amino-terminal peptide (amino acids 1-500), the W31G W53G W82G mutant did not bind the consensus sequence by mobility shift assay (data not shown). Together with the data demonstrating that the ‘ANCA’ core sequence was essential for hCdc5 binding (Fig. 2C), these results suggest that binding is specific for the helix-turn-helix motif.
Our observation that a T→G substitution at position 4 or A→C substitution at position 12 abolished the DNA-protein interaction suggested that the symmetry of the flanking sequences was important. To determine whether the specific sequences or the symmetry provided by the palindrome was critical for binding, we tested additional mutants that created new palindromes flanking the core sequence, namely GATATAACATAT and GATGTAACATAC. Neither of these mutants bound Cdc5 in a mobility-shift assay (data not shown), suggesting that the specific, palindromic sequence was critical for binding.
The presence of a core, helix-turn-helix binding motif in the target sequence is consistent with a DNA binding function for hCdc5. The symmetry observed in the sequence suggests that hCdc5 may adopt a higher order structure in binding to its target. We have demonstrated that specific domains of hCdc5 can homodimerize in vitro (X.-H. Lei and H. S. Bernstein, unpublished observations); however, whether dimerization is important for hCdc5 function in vivo remains to be demonstrated.
hCdc5 binding to its target sequence can activate transcription
To test whether interaction between hCdc5 and its binding site could activate transcription, we transfected HeLa cells (known to express hCdc5; Bernstein and Coughlin, 1997) with a plasmid (p5xCdc5) containing the hCdc5 binding site upstream of a luciferase reporter (Fig. 3). Luciferase activity was increased 28-fold in HeLa cells transfected with the plasmid containing the hCdc5 binding site, while activity was not detected in cells transfected with the same plasmid in which the binding site for GAL4 was substituted for the hCdc5 binding site (pFR-Luc). To determine whether binding by endogenous hCdc5 was responsible for the observed reporter activity, we coexpressed a dominant-negative hCdc5 mutant (Bernstein and Coughlin, 1998) containing only the amino-terminal DNA binding domain (Δ675). We previously demonstrated that this mutant was transcriptionally inactive, and delayed G2/M in transfected cells (Bernstein and Coughlin, 1998). In HeLa cells, the Δ675 mutant abrogated reporter activity, presumably by competing with endogenous hCdc5 for binding to the consensus site, and blocking activation of the reporter. These data demonstrated that specific interaction of endogenous hCdc5 with its selected binding sequence is capable of activating transcription.
hCdc5 interacts preferentially with specific sequences in the human genome
To determine whether hCdc5 binds specific sites in the human genome, we employed a selection system in yeast that previously was used to define the binding specificity of the nerve growth factor induced-B gene product (Wilson et al., 1991). We constructed two plasmids to express in S. cerevisiae (Fig. 4A). The library or UAS (upstream activating sequence)-reporter plasmid consisted of the HIS3 gene placed downstream of an inactive GAL1 promoter missing the GAL1 UAS (Lorch and Kornberg, 1985). We then inserted size-selected, human genomic DNA (1-4 kb) upstream of the inactive GAL1 promoter as a source of potential DNA binding sites. The activator plasmid encoded an epitope-tagged fusion of the amino terminus of hCdc5 with the activating domain of GAL4 (Brent and Ptashne, 1985) under control of a strong, constitutive promoter (i.e., ADH1). Yeast cells expressing the hybrid transcriptional activator and carrying a library plasmid containing binding sites for hCdc5 expressed HIS3 and were able to grow in histidine-deficient medium (Fig. 4A,B).
To avoid the anticipated expression of HIS3 as a result of UAS recognition by an endogenous yeast transcriptional activator, we took advantage of the ADE5 and URA3 markers on the activator plasmid. Yeast colonies bearing null mutations in the ade2 and ade5 genes turn red in the presence of ADE5 (Liu et al., 1993). Colonies that were able to grow in histidine-deficient medium in the absence of the activator plasmid formed red colonies with white sectors, indicating loss of the activator plasmid. We discarded such ‘sectoring’ colonies as false positives (Fig. 4B). In addition, yeast expressing the URA3 gene product die in the presence of 5-fluoro-orotic acid (5-FOA), which is metabolized to a toxic metabolite in the presence of the enzyme encoded by URA3 (Sikorski and Boeke, 1991). Yeast transformants that required the presence of the activator plasmid for function of the library UAS grew in histidine-deficient medium only in the absence of 5-FOA. We also discarded transformants that grew in histidine-deficient medium in the presence of 5-FOA as false positives. Of approx. 5×105 transformants screened (representing approximately one-third of the human genome), 12 out of 218 non-sectoring colonies were sensitive to 5-FOA. These were sequenced and analyzed for homology with known sequences, as well as with each other. All 12 plasmids contained novel human genomic sequence without homology to any sequences available in GenBank, including the Eukaryotic Promoter Database. A subset of these had short regions of limited homology (<500 bp) between them, and two in this subset appeared to be overlapping clones from the same genomic region with >3kb of overlap. The observation that selected clones had extended regions of homology suggested that hCdc5 interacts preferentially with certain sequences within the human genome.
Interaction of hCdc5 with specific sites in the human genome is mediated by binding to its target sequence
To determine whether hCdc5 interacts with specific genomic sequences through binding to its consensus target sequence, we first surveyed the 12 genomic sequences (totaling approx. 36,200 bp) identified by binding site selection in yeast for the consensus sequence identified by cyclic amplification and selection (Fig. 2E). An exact match was not found, although this was not surprising as the likelihood of finding a unique 12 bp sequence is 6×10−8, and is predicted to occur by chance approx. 180 times in the human genome.
We then used the information obtained by scanning mutagenesis (Fig. 2C,D) to look for sites containing the core ‘ANCA’ motif flanked by AT-containing palindromes within the genomic clones. Six such sites were identified in clones 6, 8, 12, 25 and 30 (Fig. 5A), of which three were able to interact with the amino terminus of hCdc5, as demonstrated by electrophoretic mobility gel-shift assay (Fig. 5B). Two of these were found in the two clones that represented overlapping sequences from the same region of the genome (clones 6 and 30). All three sites shared features of the consensus binding sequence: a core ‘ANCA’ motif, AT-rich flanking palindrome, and a distance between core and palindrome of no more than 3 bp.
To test whether the sequence we abstracted from clone 6 was the one responsible for Cdc5/GAL4-dependent activation in yeast, we cloned subfragments of clone 6 into the UAS-reporter plasmid, and tested their ability to confer activator-dependent growth in selective media (data not shown). A 500 bp subfragment containing the identified 15 bp sequence was necessary for Cdc5/GAL4-dependent activation in yeast, while removing the 15 bp sequence from this larger fragment abolished activator-dependent growth. Smaller fragments containing the 15 bp sequence failed to confer growth in selective media, suggesting that while the 15 bp sequence is necessary for activation, other elements contained in the 500 bp clone 6 subfragment may be required for Cdc5/GAL4-dependent activation in yeast.
These experiments suggested that the binding site for hCdc5 contains a core ‘AWCA’ flanked within 3 bp by an AT-rich region of symmetry, although deduction of the ‘real’ consensus binding sequence for hCdc5 will require the identification of its downstream target genes and the analysis of their promoters. This remains, however, the first demonstration that a member of the Cdc5 family can interact with specific genomic sequences through direct DNA-protein binding.
The role of Cdc5 in cell division
Genetic and biochemical studies of Cdc5 family members in fission yeast, fly and human confirm a positive regulatory role at the G2/M transition in dividing cells (Bernstein and Coughlin, 1998; Ohi et al., 1998; Ohi et al., 1994). The presence of a consensus DNA binding domain, and experiments with the human protein that have identified a latent transcriptional activation domain, suggest that Cdc5 might function in the transcriptional regulation of gene expression (Bernstein and Coughlin, 1998; Ohi et al., 1998).
We now demonstrate that the amino terminus of hCdc5 binds specifically and with high affinity to a 12 bp, double-stranded DNA sequence, and that this sequence contains a helix-turn-helix binding motif, as well as rotational symmetry. We demonstrate that binding of this sequence by endogenous hCdc5 in cell extracts is capable of activating transcription. We also demonstrate that hCdc5 may interact with specific sequences in the human genome by binding to its consensus target sequence. These studies provide the first evidence that human Cdc5 can act as a site-specific DNA binding protein.
Recently, studies with hCdc5 and homologues in S. cerevisiae and S. pombe have shown that members of this protein family associate with the pre-mRNA splicing complex, or ‘spliceosome’ (Burns et al., 1999; McDonald et al., 1999; Tsai et al., 1999). The spliceosome is known to consist of small nuclear ribonucleic acids (Varani and Nagai, 1998) in complex with a number of essential, associated proteins (Staley and Guthrie, 1998). These proteins have been postulated to function in a variety of regulatory roles, including substrate specificity, splice site recognition, catalytic activity and developmental and tissue-specific control of alternative splicing (Lopez, 1998; Will and Luhrmann, 1997). The heterogeneous nuclear ribonucleoprotein complex also has been implicated in transcriptional regulation through site-specific DNA binding (Krecic and Swanson, 1999). The specific role of Cdc5 in this complex, or the mechanism by which spliceosome function regulates the G2/M transition, are not yet known.
Our previous finding that overexpression of hCdc5 accelerates G2/M (Bernstein and Coughlin, 1998) is difficult to explain solely from its putative role in pre-mRNA splicing. In addition, we have now identified a high-affinity, consensus DNA sequence that specifically binds hCdc5 and appears at low frequency in the human genome. We also have found this sequence in the 5′ untranslated regions of several cell cycle regulatory protein genes in both D. melanogaster (Uba1; Canning and Finnegan, 1999) and mts (Berry and Gehring, 1999; Berkeley Drosophila Genome Project) and C. elegans (cya-2 and cyb-2; Kreutzer et al., 1995; Durbin and Mieg (1996). Taken together, these observations leave open the possibility that Cdc5 family members have multiple, perhaps convergent functions, including the control of cell division through both transcriptional and post-transcriptional mechanisms.
ACKNOWLEDGEMENTS
This work was support by grants from the National Institutes of Health, the American Heart Association National Center, the March of Dimes Birth Defects Foundation, and the Child Health Research Center at UCSF to H.S.B. X.H.L. was supported by a postdoctoral fellowship from the National Institutes of Health. We thank Dr Mark Johnston for providing us with plasmids, yeast strains and advice; Dr Raghavendra Mirmira for sharing technical expertise; Drs Shaun Coughlin, Ira Herskowitz, Synthia Mellon, Michael German and Brian Black for helpful discussions; and the anonymous referees for their insightful comments on the manuscript.