ABSTRACT
We describe an unusual Antennapedia class homeobox gene from the grasshopper Schistocerca gregaria (Orthoptera, African Plague Locust). Its sequence is not sufficiently similar to that of any other insect Hom-Hox gene to identify it unambiguously, but short conserved elements suggest a relationship to the segmentation gene fushi-tarazu, (ftz). We term it Sg Dax (divergent Antenna-pedia class homeobox gene). Antibodies raised against the protein encoded by this gene reveal two phases of expression during embryogenesis. In the early embryo, it is a marker for the posterior part of the forming embryonic primordium, and subsequently for the posterior part of the growing germ band. In older embryos, it labels a subset of neural precursor cells in each trunk segment, very similar to that defined by the expression of fushi tarazu (ftz) in Drosophila. We suggest that Schistocerca Dax and Drosophila ftz are homologous members of a gene family whose members are diverging relatively rapidly, both in terms of sequence and role in early development.
INTRODUCTION
Among the arthropods, homeotic genes of the Antennapedia (Antp) class are generally well conserved. Homeobox exons of these genes have been sequenced in a range of insects (Walldorf et al., 1989; Tear et al., 1990; Nagy et al., 1991; Eggleston, 1992; Ueno et al., 1992; Stuart et al., 1993), and in a single crustacean (Averof and Akam, 1993). In virtually all cases, these sequences can be identified as unique homologues (orthologues) of specific Drosophila genes, with near-identical homeodomains (>58/60 amino acids) and conserved flanking sequences. In the few cases where it has been tested, orthologues are expressed in similar spatial domains in different insects, implying a conserved role in development (Tear et al., 1990; Nagy et al., 1991; Stuart et al., 1993).
We report here the isolation of a gene from the Locust, Schistocerca gregaria, that does not fit this pattern. On the basis of homeodomain sequence, this gene is clearly of the Antp class (Scott et al., 1989), but resembles no particular Drosophila gene more closely than it does the Antp class consensus. It has a pattern of expression during early embryogenesis unlike that of any Drosophila Antp class gene, but it’s expression during neurogenesis is very similar to that of the Drosophila gene fushi tarazu (Carroll and Scott, 1985; Doe et al., 1988a).
Fushi tarazu (ftz) is an exception among the set of Drosophila Antp class genes. Like all other Antp class genes, it is located in a Hom/Hox cluster, in this case between the homeotic genes Sex combs reduced (Scr) and Antennapedia (Antp) of the Antennapedia complex (Wakimoto and Kaufman, 1981). However, it does not contain a canonical YPWM motif characteristic of these Hox genes, and is not generally considered to be a homeotic gene. It is expressed in a pattern of stripes in the Drosophila blastoderm (Hafen et al., 1984; Carroll and Scott, 1985), where it is needed to define the pattern of segments (Wakimoto et al., 1984) and to establish the pattern of homeotic gene expression (Duncan, 1986; Ingham and Martinez-Arias, 1986). Later, in the nervous system, it is expressed in the same subset of neural cells in each segment; its role here appears to be the specification of cell type, not of segment differences (Carroll and Scott, 1985; Hiromi et al., 1985; Doe et al., 1988a).
Of these two processes in which the ftz gene is involved, neurogenesis has been well conserved in evolution, but segment formation differs markedly in different insect groups. The pattern of neural precursor cells, and the initial fate of their progeny, is very similar in insects as different as flies and locusts, allowing homologous cells to be identified unambiguously (Thomas et al., 1984). However, there is no simple way to equate the stages of early development in these insects prior to the formation of the segmented germ band. Flies (Diptera), are ‘long germ’ insects. They specify the complete pattern of segments at the syncytial blastoderm stage, virtually simultaneously. Grasshoppers and locusts (Orthoptera) are ‘short germ’ insects. They form a cellular embryonic primordium, which becomes segmented only as it grows. This short-germ mode of development is shared with the less specialised members of other arthropod groups, and is therefore likely to be the primitive mode of embryogenesis for the phylum (Sander, 1976; Patel et al., 1994).
We suggest that the gene that we have isolated is a homologue of ftz, but that this gene family is diverging relatively rapidly, both in terms of sequence and role in early development. The expression of the Schistocerca gene suggests that it plays a role in the specification of the embryonic axis in this short germ insect.
MATERIALS AND METHODS
Locust embryos
Schistocerca gregaria (Forskål) eggs were kindly supplied by Cambridge University Zoology field station. They were collected overnight, laid in moist sand at 26°C, and then incubated in the laboratory at 28°C. All staging of locust embryos in terms of % development is based on that of Bentley et al. (1979).
Isolation of genomic clones
A Schistocerca gregaria partial Sau 3A library of genomic DNA (Tear et al., 1990) was screened with short fragments containing the Drosophila Ubx, Antp and Dfd homeoboxes (Dawson, 1989). Probes were hybridised in 42% deionised formamide, 5× SSPE, 5× Denhardts, 0.1% SDS with 250 µg/ml sheared, denatured salmon sperm DNA at 37°C. Washes were taken to a stringency of 65°C in 2× SSPE. Homeobox homologous fragments were subcloned and sequenced by standard techniques (Dawson, 1989). The Dax homeobox is contained on a 2.7 kb EcoRI/SalI genomic fragment (subclone p2313RS2.7 derived from phage λ2313).
RACE PCR
Coding sequence 5′ of the homeobox exon of p2313RS2.7 was obtained by RACE PCR. First strand cDNA was made from 1 µg of poly(A)-enriched RNA isolated from embryos at 35% development, using 12.5 pmol of Dax-specific primer RED5 (see below for primer sequences) and Superscript™ RNase H– Reverse Transcriptase (GIBCO BRL). First strand cDNA was size selected on Size Select-400 Spun Columns (Pharmacia) then tailed and used as substrate for RACE PCR, carried out as described by Frohman (1990), except that DMSO was not used.
For the first PCR, reactions containing 1/20th to 1/200th of the first strand were amplified with Adapter primer, dT18-Adapter primer and Dax-specific primer RED6. Southern analysis of the first PCR reactions, using a Dax homeobox probe, identified positive bands. The largest of these products were gel purified and 1/50th of the removed gel slice was used as template for a second round of RACE PCR (using dT18-Adapter primer and nested Dax-specific primer RED203; 40 cycles). A further round of Southern analysis, size selection and PCR (20 rounds) were carried out, using nested Dax-specific primer RED3. Bands were subcloned into the SmaI site of pBluescript II SK+ (Stratagene).
Three independent plasmids deriving from the final PCR reaction (pA1, pA2 and pA3) containing inserts of 1060 bp were sequenced using the Sequenase® Version 2.0 DNA Sequencing Kit (United States Biochemical). The full sequence of both strands was obtained by primer walking. The sequence of pA1, pA2 and pA3 showed differences at 9 positions over 1 kb. The consensus of these three sequences was taken to reduce PCR errors in the sequence.
Primers
RED5: 5′ ACTTCAGAATGGTAGTCAGGTGTGGCA 3′ (reverse) RED6: 5′ GAGGGTCGGTCTCTGACGGGATTCGAGT 3′ (reverse) RED203: 5′ GACCGCGAGACACTCCGGGTCGCGGACGCT 3′
(reverse)
RED3: 5′ GCCGCTCAGTCCATCGCCAA 3′ (reverse)
dT18-Adapter primer: 5′ AGAGAACTAGTGTCGACGCGGCCGC(T)18 3′ (forward)
Adapter primer: 5′ AGAGAACTAGTGTCGACGCGG 3′ (forward)
In situ hybridisation to embryo sections
Sg Dax probe was prepared by random priming the 2.7 kb EcoRI-SalI fragment from p2313.RS2.7 in the presence of [35S]dATP. Hybridization was carried out as described by Tear et al. (1990).
Production of fusion proteins
A 1.42 kb PstI fragment from p2313RS2.7 (Dawson, 1989) contains the homeobox with 21 bp upstream and 1.2 kb downstream. This was subcloned into the PstI site of pUR292 to produce an in frame fusion of the Sg Dax homeodomain coding region downstream of the lacZ gene to give plasmid pUR292.2313.
A second fusion construct was made by subcloning the 1.48 kb BamHI-EcoRI homeobox containing fragment of pUR292.2313 into the BamHI-EcoRI sites of pGEX3X (Smith and Johnson, 1988) (gift from S. Russell, Department of Genetics, Cambridge). The resulting plasmid, pGEX3X.2313, consists of the Sg Dax homeodomain coding region fused downstream of the glutathione S-transferase gene.
Fusion proteins from both these plasmids were produced in E. coli strain TG1 and isolated by SDS-polyacrylamide gel electrophoresis followed by electroelution as described by Tear et al. (1990). Protein was snap frozen in liquid nitrogen and stored at –70°C.
Antibody production
Two rabbits were injected with 200 µg of fusion protein produced from the pGEX3X.2313 construct in complete Freund’s adjuvant. Five boosts of 200 µg protein in incomplete Freund’s adjuvant were given to each rabbit at 2 week intervals. All injections were subcutaneous at multiple sites. Blood was taken from the rabbits after 5, 7, 9 and 11 weeks and sera collected and stored with 0.02% sodium azide at –20°C (Harlow and Lane, 1988).
Sera were tested for the ability to detect both the β-galactosidase and glutathione S-transferase fusion proteins on western blot strips made from protein extracts of E. coli expressing the fusion protein. Reaction to the fusion protein was seen in both rabbits after 5 weeks.
Purification of antisera
The sera were affinity purified against the second fusion protein (pUR292.2313) to select for antibodies that bind specifically to the Sg Dax fragment of the protein. 400 µg of fusion protein produced from plasmid pUR292.2313 was dialysed (room temperature, 3 hours) against 0.1 M Hepes-KOH, pH 7.5, bound to Affi-Gel 10 gel (Bio-Rad) and transferred to a column. Sera from the 11th week bleed of both rabbits were passed over the column and bound antibodies were eluted with 0.1 M glycine-HCl, pH 2.5 and neutralised with 0.1 volume 2 M Tris-HCl, pH 7.4. Bovine serum albumin (BSA) was added to 1% and the serum dialysed against PBS (overnight, 4°C) and stored at 4°C with 0.02% sodium azide.
Immunohistochemistry
Locust embryos were dissected from their eggs under locust embryo saline (150 mM NaCl, 3 mM KCl, 5 mM TES (Sigma), 2 mM CaCl2, 1 mM MgSO4) and fixed in 3.7% formaldehyde in PBS at 0°C for 40 minutes. Embryos were then washed in PTx (PBS+1% Triton X-100), blocked in PBTx (PTx+0.1% BSA) at 4°C for 2-4 hours and agitated at 4°C overnight in a 1:50 dilution of affinity-purified serum in PBTx. After the primary incubation, the serum was washed off with three changes of PBTx at room temperature over 3 hours and left in a 1:400 dilution of biotin-conjugated goat anti-rabbit antibody (Vector Laboratories, Peterborough, England) in PBTx, at 4°C on a rotating platform for 90 minutes. The embryos were then washed in three changes of PT (PBS+0.1% Tween 20) over 30 minutes at room temperature and incubated in avidin-biotin complex (ABC Elite, Vector Laboratories) for 30 minutes at room temperature. Excess ABC complex was washed off with five changes of PT in 30 minutes. They were stained in 0.25 mg/ml diaminobenzidine (Sigma), 0.1% hydrogen peroxide (from 30% solution, Sigma) and 0.03% nickel chloride in PT. Embryos were dehydrated in ethanol, cleared in xylene and mounted in DPX mountant (BDH Ltd., Poole, England).
Whole Schistocerca eggs were placed in 3.7% formaldehyde in PBS and pricked with a very fine needle (15 µm diameter) about 10 times through the chorion, avoiding the most posterior region where the embryo will develop. The eggs were left in formaldehyde overnight at 4°C. The following day, the chorion was peeled off the eggs, which were then washed in PBTx for 2 hours. Thereafter the whole eggs were treated in the same way as dissected embryos that have been fixed and blocked.
Drosophila embryos were dechorionated in bleach, fixed in 4% paraformaldehyde, devitellinised with methanol, washed in PBTx and then treated in the same way as the fixed Schistocerca embryos. The primary antibody was a mouse monoclonal anti-ftz antibody (Krause et al., 1988) kindly provided by David Ish-Horowicz and was used at a 1:1000 dilution. Biotinylated horse anti-mouse (Vector Laboratories) was used as the secondary antibody.
RESULTS
Library screens and characterisation of the Dax gene
Genomic clones carrying homeoboxes were isolated by further screens of the library described by Tear et al. (1990). Of four different Hom/Hox class homeoboxes sequenced, three were immediately identifiable as homologues of specific Drosophila genes (Sex combs reduced (Scr) (Akam et al., 1988; Dawson, 1989), abdominal-A (abd-A), (Tear et al., 1990) and Abdominal-B (Abd-B), (Kelsh et al., 1993)). The fourth homeobox was not immediately identifiable. It provides the focus for this study.
The complete coding sequence of the gene containing this homeobox was determined from genomic DNA spanning the homeodomain and carboxyl terminal region, and from RACE PCR clones covering the amino terminal portion of the protein. The gene encodes a protein of 314 amino acids, showing the characteristic structure of Hox/Hom genes, with the conserved YPWM motif lying 13 amino-acids upstream of the homeodomain, and separated from it by an intron (Fig. 1A). The homeodomain matches the Antp class consensus sequence (Scott et al., 1989; Burglin, 1994) at all 35 amino acid positions but differs from each specific Drosophila Antp class gene by 9-13 residues (Fig. 1B).
Sequences flanking the homeodomain and the YPWM motif are frequently diagnostic for specific gene families within the Hox clusters (Bürglin, 1994). Residues flanking the homeodomain of this Schistocerca protein show no striking similarities to other Hox gene products, but the amino acids immediately flanking the YPWM motif are identical to those found in the Tribolium ftz gene (Fig. 1C, Brown and Denell, personal communication). Although this short sequence cannot be considered diagnostic, the extended FYPWMK motif is found in no other Hox/Hom protein. The remainder of the coding sequence shows only low levels of similarity with other sequences in the EMBL database.
We believe that this gene is a ftz homologue (see below), but as sequence alone does not identify it unambiguously, we refer to it as Sg Dax (divergent Antp-class homeobox gene).
To test whether this sequence is expressed in Schistocerca, we hybridised a probe spanning the putative homeobox exon with sections of Schistocerca embryos at 30% development. We observed labelling over the developing nervous system, but not over more lateral regions of the embryo (Fig. 2). Other Antp class genes are strongly expressed in the lateral epidermis at this stage (Tear, 1990; Tear et al., 1990), suggesting that the signal observed in this experiment is specific, despite the presence of a homeobox in the probe.
To examine the pattern of expression of Sg Dax in more detail, we raised antibodies to a fusion protein containing the homeodomain and carboxy terminal sequences of the putative protein. Two rabbits were immunised. Affinity-purified sera from both stained the same pattern of cells in the developing Schistocerca embryo. These sera reveal a pattern unlike that of the other Antp class genes that we have studied, but consistent with the results obtained by in situ hybridisation.
Expression of Dax in the early embryo
Figs 3 and 4 illustrate the early development of the Schistocerca embryo. The zygote nucleus is initially located near the posterior pole of the egg. After fertilisation, cleavage nuclei disperse to form a syncytial blastoderm, but only at the posterior pole do these nuclei coalesce to form a dense monolayer. This will become the embryonic primordium. Initially, this polar cap of nuclei is circular, but it rapidly extends along the dorsal side of the egg, forming first a heart shape, and then a ‘tadpole’, thereby revealing the anteroposterior axis of the embryo (Roonwal, 1936; Bentley et al., 1979). Sg Dax is expressed in posterior regions of this primordium from its initial formation until about 23% of development.
While the nuclei are still condensing to form the embryonic disc, our antibody labels a crescent of nuclei near the posterior pole, on the dorsal side of the egg (Fig. 3B). We presume that these nuclei will form the posterior part of the embryonic axis, though this has not been demonstrated directly by fate mapping. It is not known precisely where the boundary between embryonic and extraembryonic tissue lies at this stage, or indeed whether such a boundary has yet been defined. Once the embryonic primordium becomes visibly asymmetric, the region of nuclear staining can be seen to fill the posterior part of the presumptive germ band and to extend into the forming amnion. At 15% of development, the whole of the trunk is stained, from the anterior of the gastrulation groove backwards (Figs 3D, 4A). As far as we can tell this staining is both mesodermal and ectodermal. The head lobes are not stained. Shortly thereafter, staining in the anterior part of the growing trunk fades rapidly, transiently leaving a diffuse anterior stripe (Fig. 4B), but persisting only near the posterior tip (Fig. 4C,D). Between 18 and 20% development, cells at the very posterior tip of the embryo cease to stain, leaving only a subterminal domain of labelled cells in the presumptive abdomen. By 23%, this too has disappeared.
Segmentation does not become visible until the tadpole stage (17% development), when first stripes of Engrailed protein (Patel et al., 1989) and then segmentation of the mesoderm, define the three thoracic segments. Thereafter stripes of Engrailed protein appear sequentially down the abdomen and visible segmentation follows. In each region, Dax expression fades 2-3% of development (10-15 hours) before the first detection of Engrailed protein (compare Fig. 4 with Fig. 1 of Patel et al. (1989)). Thereafter, Dax expression is limited to the neurectoderm and its neural derivatives, as described below.
Expression in the central nervous system
At later developmental stages, our antibody stains a re-iterated pattern of cells within each neuromere of the Schistocerca nervous system (Fig. 5). When segmentation is completed, this staining extends from the posterior half of the first gnathal (mandibular) segment to the last abdominal segment (A11). Other tissues of the embryo are not stained, with the exception of a few isolated cells in the gnathal and thoracic appendages, and a single cell or pair of lateral cells in each abdominal segment (Fig. 5C). These may be peripheral neurons and/or glial cells.
The central nervous system of insects forms from the mid-ventral ectoderm of each segment. This region is termed the neurectoderm. Shortly after gastrulation, neuronal and glial precursor cells delaminate from the neurectoderm, moving dorsally and then enlarging. In Drosophila, this process occurs almost simultaneously in each trunk segment. In Schistocerca, segments develop sequentially, so that segments in a single embryo present a sequence of stages of neurogenesis (Fig. 5); (Bate, 1976; Bate and Grunewald, 1981; Doe and Goodman, 1985; Shepherd and Bate, 1990).
In each segment, the later phase of Dax expression proceeds in concert with neurogenesis. Thus staining in the neurectoderm is first seen in the thorax at about 20% embryogenesis (Fig. 4), but not until 30% in the posterior abdominal segments (Fig. 5) (Doe and Goodman, 1985). Blocks of about 40 cells stain on either side of the midline just as the first neuroblasts are beginning to delaminate. The number and intensity of stained nuclei varies considerably from embryo to embryo, reflecting dynamic or variable expression.
As the neuronal precursor cells delaminate, they form a stereotyped array that varies little from segment to segment, and even from one insect species to another (Thomas et al., 1984). These neuronal precursor cells consist of neuroblasts (NBs) and midline precursor cells (MPs). Neuroblasts divide to produce a fixed lineage of ganglion mother cells (GMCs) which themselves divide once to give rise to two neurons (Bate, 1976; Doe and Goodman, 1985). Midline precursor cells divide only once, giving rise to two neurons (Bate and Grunewald, 1981). The remaining cells of the neurectoderm give rise to non-neural support cells and contribute to the ventral epidermis (Doe and Goodman, 1985).
Shortly after the first generalised expression of Dax in the Schistocerca neurectoderm, staining appears in more dorsally located cells that have delaminated from the neurectoderm (Fig. 5B,C). Two large nuclei stain first, one on either side of the midline. The size and position of these cells identifies them as the midline precursor-2 cells (MP2) (Bate and Grunewald, 1981). In Drosophila, the MP2 cells are the first cells to express ftz in the neurectoderm (Doe et al. (1988a) and Fig. 6B).
About 1% of development after expression first appears in the MP2 nuclei, staining appears in a single nucleus lying between, and in the same dorsoventral plane, as the MP2 cells (Fig. 5C). From the size, position and subsequent behaviour of this cell, we identify it as the midline precursor-1 cell (MP1) (Bate and Grunewald, 1981). Shortly after staining, this cell divides, and its progeny move laterally until they lie over the progeny of the MP2 cells, which by this time have also divided.
By the time the MP1 cell has divided, several more nuclei in each hemisegment are strongly stained (Fig. 5C). One of these prominent cells is located among the anterior row of neuroblasts and their progeny, at the end of the row closest to the midline. It is too small to be a neuroblast. We identify it as a ganglion mother cell that is probably descended from neuroblast 1.1. It divides symmetrically about 2% of development later giving rise to two neurons which both stain. In Drosophila, the first ganglion mother cell produced by neuroblast 1.1 expresses ftz prominently, as do its daughters (Doe et al. (1988a) and Fig. 6C).
Subsequently, the pattern of expression becomes more complex. About 20% of the nuclei in each neuromere stain. These stained nuclei are concentrated in the anterior half of each neuromere. None of the labelled cells appear to be large enough to be neuroblasts; only midline precursor cells, ganglion mother cells and neurons stain. In all these respects, the pattern resembles that of ftz staining in the Drosophila CNS (Doe et al., 1988a; Fig. 6).
Even at these later stages some cells are still identifiable, especially in regions where the concentration of stained nuclei is lower. For example, less than 1% of development after the MP2 and MP1 progeny have formed bilateral clusters, a large nucleus between the two clusters begins to stain. This cell divides about 3% of development later. Its progeny remain on the midline until at least 35% development. This cell is likely to be the MP-3 cell (Bate and Grunewald, 1981; Goodman et al., 1981).
The description above applies to all trunk segments from the maxillary to the tenth abdominal segment. The pattern of staining is different in the mandibular segment. Only cells in the posterior half of this neuromere stain. Also, because MP2 cells are not present in the mandibular neuromere (Doe and Goodman, 1985), these cells are missing from the pattern. The eleventh abdominal (A11) neuromere is reduced in comparison with the other trunk segments (Doe and Goodman, 1985). Dax is expressed in the A11 neuromere, but we have not examined this pattern in detail.
Overall, the parallels between Dax and ftz expression in the nervous system are striking, and extend to both general and specific characteristics. Among the early, easily identifiable cells that express these genes, there is virtually complete overlap. The only clear difference is that Dax is expressed in MP3 cells as well as MP1 and MP2, whereas Drosophila has no homologous MP3 cell (Thomas et al., 1984). The anterior limit of both ftz and Dax expression lies in the mandibular neuromere. Posteriorly, the Drosophila CNS is compressed, making comparisons difficult, but ftz expression certainly extends into the fused terminal neuromere that represents the ninth and all more posterior segments of the Drosophila CNS.
DISCUSSION
A ftz/Dax gene family derived from Hox genes
Sg Dax is clearly a member of the Antp class of homeobox genes (Scott et al., 1989). In Drosophila the Antp class comprises four homeotic genes (Antennapedia (Antp), Sex combs reduced (Scr), abdominal-A (abd-A) and Ultrabithorax (Ubx)) and the pair rule segmentation gene fushi tarazu (ftz). Clear homologues for all four of these homeotic genes have now been identified in Schistocerca. These can easily be identified by sequence conservation in the homeobox exon. In each case this identification is consistent with a well conserved pattern of gene expression (Akam et al., 1988; Tear et al., 1990; Kelsh et al., 1993; Patel, 1993 and Kelsh et al., 1994). No clear homologue, however, has been identified for ftz.
Previous searches have failed to identify ftz homologues outside of the Diptera (Walldorf et al., 1989). This is probably because the sequences of ftz and its homologues in other insects have been less constrained than those of the canonical homeotic genes: even comparing species of Drosophila, ftz shows appreciable divergence within and flanking the homeobox (D. Maier, personal communication). However, the recent identification of a ftz gene from Tribolium (Brown and Denell, personal communication) enables us to recognize conserved features that link Dax with the ftz genes.
Tribolium and Drosophila are both endopterygotes, and therefore more closely related to one another than either is to Schistocerca (Kristensen, 1981, 1991). Even so, the Tribolium ftz gene is barely identifiable by sequence alone as a ftz homologue. Unambiguous recognition of this gene depends not only on sequence, but also on knowledge of its position in the Tribolium Hom-C, and on its expression pattern, which resembles that of Drosophila, both in the blastoderm and during neurogenesis (Brown and Denell, personal communication).
Three characteristics link Schistocerca Dax with the ftz genes. First, Tribolium ftz and Schistocerca Dax retain identical flanking amino acids around the YPWM motif (AFYPWMK). These residues are normally well conserved in Hox proteins of the same class. This motif has diverged almost beyond recognition in Drosophila ftz (Fig. 1C). Second, despite the overall divergence of their homeodomains, comparison of these three genes reveals one common feature: the homeodomains of all three are identical at the amino terminal end (residues 2-10), and are unique among Antp class genes in showing a T at residue 4 (Fig. 1C). This N-terminal region of the homeodomain is essential to specify functional differences between other Antp-class proteins, which all share identical sequences in helix three: swapping residues 1, 4, 6 and 7 alone is sufficient to switch the specificity of Antp to Scr and vice versa (Furukubo-Tokunaga et al., 1993; Zeng et al., 1993). Thus conservation of the ftz/Dax genes in this region is likely to be significant. Finally, Dax and ftz show a conserved pattern of expression in the same subset of cells in the developing nervous system. Thus we have no doubt that Dax is a homologue of ftz. However, because Dax shows no trace of the pair-rule pattern for which ftz is famous, and may well not play the same role in segmentation (see below), we retain a different name for the gene.
All three of the insect ftz/Dax genes map within or close to Hox clusters. The Drosophila ftz gene lies between Scr and Antp. Although its transcription orientation is reversed with respect to the flanking Hox genes, this appears to result from a recent inversion within one Drosophila subgenus; other Drosophila species carry ftz at the same location, but in the same orientation as flanking genes (Maier et al., 1993). The Tribolium ftz gene has been mapped by deletion to a region that includes Scr, Antp, Ubx and abd-A homologues (Stuart et al., 1991 and personal communication). We do not know the precise position of Dax, but it maps to the same chromosomal locus as Scr and abd-A, to within the limits of resolution of chromosome in situ hybridization (D. Ferrier and M. Akam, unpublished data).
The inclusion of the Drosophila segmentation gene ftz in the Hox clusters of Drosophila has always been somewhat enigmatic. Identifying the Schistocerca Dax and Tribolium ftz genes throws some light on this. It seems most likely that ftz originated as a typical Hox cluster gene, and has since lost, by sequence divergence, features that are strongly conserved in other Hox genes (e.g. the YPWM motif). In this context, it is interesting that the expression of Dax in Schistocerca displays a generic resemblance to that of a homeotic, Hom/Hox gene. First, it is expressed only in the trunk (including the mouth-parts), not in the head. This is true in very early development, but also during neurogenesis — it is striking that a relatively large fraction of cells in the ventral nerve cord express Dax, but none do so in the brain. Second, it shows a sharp anterior limit in its domain of expression. In the case of Dax, this limit is virtually at the anterior of the trunk — but in the nervous system, it is very clear that Dax is not expressed in the anterior part of the first gnathal neuromere (the equivalent of parasegment 0 in Drosophila). We note that both these characteristics are also shared by ftz, although the parallel has not previously been stressed.
Sg Dax, however, differs from the Drosophila homeotic genes in other important ways. In the later embryo expression is restricted (or virtually restricted) to the nervous system. Also, Dax shows no trace of down-regulation by other, posteriorly expressed homeotic genes — the protein is present in the same cells and at similar levels in all segments from the maxillary back. Finally, no Drosophila homeotic gene has an anterior limit of expression at the parasegment 0/1 boundary. Thus the expression pattern of Dax provides no evidence to suggest that it derives from a duplication of any known insect homeotic gene. It may represent a relatively old family of Hox genes that has, in arthropods, acquired functions rather different from those of the other homeotic genes.
Specific homologues of this ftz/Dax family may extend to the Crustacea. Among six Hox genes isolated from Artemia (Averof and Akam, 1993), five were immediately recognizable as orthologues of insect genes, but one was more divergent. This divergent sequence, AfHx1, shares with the ftz/Dax family conservation within the amino-terminus of the homeodomain (Fig. 1C). Tree-building based on sequence comparisons cluster the AfHx1 homeobox specifically with Dax and the ftz genes, in preference to other insect Hox sequences (Averof, 1994).
Dax expression and early patterning
Whatever its origins, Sg Dax provides us with a molecular marker to define patterning at blastoderm stages in these short germ embryos. This is of particular interest, because the cellular context of early development differs markedly between Drosophila and Schistocerca.
In Drosophila, the nurse cells provision the egg with maternal information in the form of specific RNA and protein molecules that become localised to the poles of the egg. This information is used to define the anteroposterior axis of the developing embryo (St. Johnston and Nusslein-Volhard, 1992). In Schistocerca, no nurse cells are present in the egg tubes of the ovary (Mahowald, 1972). The only RNA in the egg is presumed to be provided by the oocyte nucleus. It is unclear to what extent maternal information is important for defining the axes of the embryo. The anterior half of the egg is unnecessary, for isolated posterior fragments of the egg can still develop a normally segmented embryo (Moloo, 1971; Sander, 1976). Any relevant maternal information must be located at the posterior tip of the egg.
The early expression of Dax shows that regional differences amongst the cleavage nuclei have already been established while the nuclei are still aggregating to form the embryonic primordium at the posterior pole. Without a fate map, it is not possible to say which parts of the prospective germ band this early Dax expression represents. It is clear, however, that by 15% development (heart stage), Sg Dax expression defines a distinction between head and trunk.
Shortly after the trunk begins to grow, Dax expression becomes restricted to just the posterior part, where segments have not yet formed. Later still, Dax expression is excluded from the posterior tip of the abdomen, but remains in a subterminal zone. Ablation experiments suggest that the extreme posterior abdominal segment is determined before segmentation of the abdomen is complete (Heinig, 1967). Thus there is some correlation between Dax expression, and regions that remain in the ‘growth zone’.
This early pattern of Dax expression is strikingly different to that of ftz, in that it shows no trace of resolution into periodic stripes at double segment intervals.
Segmentation mechanisms in short germ insects
Ftz is one of a set of ‘pair-rule’ genes, all of which are required to elaborate segment pattern in the syncytial blastoderm of Drosophila (Akam, 1987; Ingham, 1988). When any one of these genes is mutant, embryos develop with alternate segments missing. One other gene from this pair-rule set has been characterised in Schistocerca — the homeobox gene even-skipped (eve).
Like ftz, Drosophila eve plays several roles in development. In the blastoderm, eve is expressed in a pattern of stripes exactly complementary to those of ftz (Carroll and Scott, 1985; MacDonald et al., 1986) and serves to define the anterior boundaries of odd-numbered parasegments (Lawrence and Johnston, 1989). In later development, it is expressed in a subset of muscle precursors, and in a small subset of ganglion mother cells and neurons, where it is required for neural specification (MacDonald et al., 1986; Doe et al., 1988b).
Schistocerca eve is expressed in the same specific neural and muscle cells as its Drosophila counterpart but, like Dax, it shows no trace of ‘pair-rule’ expression during early development (Patel et al., 1992). It, like Dax, is expressed in the posterior part of the growing germ band from gastrulation onwards. Earlier eve expression, in the germinal disc, has not been described.
In Drosophila, the spatially patterned transcription of eve and ftz are central to the mechanism that generates positional signals in the blastoderm. Moreover, eve is directly involved in patterning the transcription of ftz (Carroll and Scott, 1986). The observation that neither Sg eve nor Dax show periodic stripes, implies that the molecular mechanisms generating segments in Drosophila are substantially different from those in Schistocerca. We discuss this question more fully elsewhere (Akam and Dawes, 1992).
The precise role of eve and Dax in the early Schistocerca embryo remains an open question. In vertebrates, even-skipped homologues are linked to the Hox gene cluster (Bastian and Gruss, 1990; Boncinelli et al., 1991). The vertebrate eve homologues are expressed in the posterior regions of early embryos, and have been implicated in the specification of ‘posterior identities’ (Ruiz i Altaba and Melton, 1989; Bastian and Gruss, 1990). The expression of Dax and eve at the heart stage onwards in Schistocerca also suggests a role in regionalisation of the embryo.
In Tribolium, pair-rule patterns of eve (Patel et al., 1994; Brown and Denell, personal communication), hairy (Sommer and Tautz, 1993) and ftz (Brown and Denell, personal communication) stripes appear sequentially along the length of the embryo, suggesting that a pair-rule patterning mechanism exists in beetles (Patel et al., 1994). However, a deletion of the Tribolium Hom-C (Stuart et al., 1991) spanning the ftz gene (Brown and Denell, personal communication) does not result in pair-rule segmentation defects. Thus in Tribolium, ftz appears to be regulated by the pair-rule patterning system, but not required for segmentation. No modern insect represents an actual intermediate in the evolution of insect developmental mechanisms, but in these comparisons we may be seeing intermediate stages in the process whereby ftz acquired a role in defining segment boundaries.
ACKNOWLEDGEMENTS
We thank Alfonso Martinez-Arias for enthusiatically supporting the initiation of this project, Michael Bate for patiently helping us to interpret the Schistocerca nervous system, Nipam Patel, Sue Brown, Rob Dennell and Dieter Maier for communicating their results prior to publication and David Micklem for his comments on the manuscript. This work was supported by grants from the Medical Research Council and the Wellcome Trust.