A cDNA library has been constructed in the expression vector gt11 from mRNA isolated from squid (Loligo forbesi) optic lobes. The library was screened with anti-bodies generated against purified squid neurofilaments. A positive clone was isolated, which harboured a gt11 recombinant having an insert size of 3.5 kb. Hybridiz-ation analysis by Southern and northern blotting showed that the corresponding protein is encoded by a single gene that gives rise to a transcript of 2.6 kb. Translation of the full nucleotide sequence of the gene revealed an open reading frame covering 557 amino acids. This squid-neurofilament-like protein, SNLK, bears the characteristic N-terminal head, rod and C-ter-minal tail domains present in all intermediate filament (IF) proteins. The rod has the classical heptad repeats indicating coiled-coil-forming ability, and the predicted lengths of the coils are similar to coils 1a, 1b and 2 of intermediate filaments. At the C-terminal end of the rod there is a strongly conserved IF epitope, and a fusion protein containing SNLK is recognised by the pan-specific intermediate filament antibody, IFA. A poly-clonal antibody raised against SNLK has been used to show that the protein is present only in neuronal tissues and that it is immunologically related to neurofilaments from Myxicola but not from mammals.

Intermediate filament proteins are expressed co-ordinately during development and cell differentiation. They are probably involved in special functions related to the differenti-ated state of cells such as the maintenance of resistance to mechanical stress. Some cells lack intermediate filaments, so they do not perform essential ‘house keeping’ functions (Hedberg and Chen, 1986; Bartnik and Weber, 1989).

Intermediate filaments in the cytoplasm of vertebrate cells are composed of different but related polypeptides. Five subgroups for these proteins have been distinguished: type I and II keratins; type III proteins, i.e. vimentin, desmin, GFAP and peripherin; the neurofilament proteins (NFL, NFM, NFH and α-internexin), which constitute the type IV proteins; and the nuclear lamins, which form the type V. All these filamentous proteins share basic structural principles, which are a central α-helical rod region that is flanked by N- and C-domains that are essentially non-helical (Steinert and Roop, 1988). The rod regions form coiled coils-the structural building blocks of the filament back-bone. The nuclear lamins follow similar principles for their construction and are thought to be related through evolution to the intermediate filament family of proteins (Weber et al., 1989; Döring and Stick, 1990).

Sequence information obtained from the intermediate fil-ament proteins of invertebrates shows that they are related more closely to nuclear lamins than to their vertebrate coun-terparts. For example the nuclear lamins of both vertebrates and invertebrates have six extra heptads at the ends of coil 1b and a similar extension to this region has been found in epithelial intermediate filaments from Helix pomatia (Weber et al., 1988), and cytoplasmic intermediate filaments from Ascaris lumbricoides (Weber et al., 1989). Lamin-like tail domains are also found in these invertebrate intermediate filament proteins.

The pattern of similarities is shared by neuronal inter-mediate filaments as well. The two low molecular mass neu-rofilament proteins (60 kDa and 70 kDa) from the squid, Loligo pealei, which have had their sequences deduced from the cDNA sequences, show the basic features of the group of invertebrate intermediate filament proteins (Szaro et al., 1991). The squid neurofilament proteins, like all the vertebrate neurofilaments (Eagles et al., 1990), have long C-terminal extensions to their polypeptides, and this region has clusters of charged residues.

We describe in this paper a new neuronal protein, the cDNA sequence of which was obtained from a squid optic lobe cDNA library. The protein is related to the interme-diate filament class of proteins, yet it is quite distinct from the squid neurofilament sequences so far examined. We have called this protein SNLK (squid neurofilament-like).

Preparation and screening of the library

The Amersham (UK) cDNA synthesis and cloning system kit (λgt11) was used for the construction of the library. Total RNA was isolated from squid (Loligo forbesi) optic lobes using guani-dinium isothiocyanate as described by Kaplan et al. (1979). Poly(A)+ RNA was obtained from the total RNA by oligo(dT) cel-lulose chromatography (Aviv and Leder, 1972). Double-stranded cDNA with EcoRI linkers was prepared by the procedure of Gubler and Hoffman (1983) and inserted into the unique EcoRI site of λgt11 (Huynh et al., 1985); it was then packaged. The total packaged library, before amplification, contained 2.8×108 pfu/ml. Plaques were screened for the presence of neurofilament epitopes essentially as described by Young and Davis (1983). Plaque lifts on nitrocellulose were screened with antibodies generated against purified squid neurofilaments that had been previously dephos-phorylated.

Sequencing and analysis of the 3.5 kb cDNA

The 3.5 kb cDNA was subcloned into pUC19, and a set of uni-directional nested deletions was generated using exonuclease III then mung bean nuclease (Stratagene, UK) as described by Henikoff (1984). Double-stranded sequencing was carried out using the dideoxy chain-termination method (Sanger et al., 1977). The sequence was analysed with computer programs implemented within the University of Wisconsin Genetics Computer Group (GCG) software on the VAX (Devereaux et al., 1984).

Preparation of gt11 fusion proteins

The clone harbouring the 3.5 kb cDNA was used to infect the lysogenic strain E. coli Y1089 as described by Huynh et al. (1985). Fusion proteins were produced by induction with 10 mM IPTG. Bacterial pellets were boiled in sample buffer (1% SDS, 100 mM Tris-HC1, pH 6.8) and spun briefly. The proteins were resolved by SDS-PAGE on a 5-15% gel (Laemmli, 1970) then either stained with Coomassie Blue or electrophoretically transferred onto nitrocellulose (Towbin et al., 1979).

Immunoblotting

Nitrocellulose filters containing proteins were blocked for 1 hour with 2% (w/w) non-fat milk with TBS (150 mM NaC1, 200 mM Tris-HCl, pH 7) then incubated with primary antibody for 1 hour at room temperature. After three washes with TBS/0.2% Tween-20, peroxidase-conjugated donkey anti-rabbit antibodies were added for polyclonal antibodies or antimouse for monoclonal antibodies (Amersham, UK) diluted 1:1000 in 2% non-fat milk and further incubations carried out for 1 hour. The filters were then washed three times with TBS/0.2% Tween-20 and finally developed with a solution containing 3-amino-9-ethylcarbazole and hydrogen peroxide (Kaplow, 1974).

Antibody preparation

Anti-SNLK serum and polyclonal antibodies to dephosphorylated squid NFL and NFH were raised in Dutch rabbits by subcutaneous injections of purified antigen. Squid neurofilaments were prepared as described previously (Brown and Eagles, 1986) and dephos-phorylated enzymatically (Chin et al., 1989). Filaments contain-ing all neurofilament-associated polypeptides were used directly for injection without further purification. The antibodies were tested by western blotting and they recognised squid NFH polypeptides and a number of proteins in the region around a rel-ative molecular mass of 60 kDa. The polyclonal serum was freed of any E. coli reacting antibodies by immunoprecipitation with E. coli lysates. For the preparation of anti-SNLK, source gels of the fusion protein β-gal-SNLK were run and the region of the gel con-taining the protein removed. The gel was macerated in adjuvant and then injected. Antigens were injected together with Freund’s complete adjuvant followed by one or two booster injections in incomplete adjuvant. β-galactosidase-reacting antibodies were removed from the serum by immunoprecipitation. IFA (Pruss et al., 1981) was obtained from the ascites fluid of cell lines express-ing this antibody.

Preparation of DNA and RNA for hybridization analysis

Genomic DNA from squid optic lobes was prepared as described by Jeffreys et al., 1977. DNA (5 μg) was digested with restric-tion enzymes then electrophoresed on a 0.8% agarose gel at 90 V. Lambda DNA cut with HindIII was used as markers. Elec-trophoresis of RNA (10 μg) was carried out on a 0.5% agarose/formaldehyde gel. Transfer of DNA and RNA onto nitro-cellulose membranes was as described (Southern, 1975; Thomas, 1980). Filters were prehybridized at 42°C for 3 hours in 50% deionised formamide, 5× SSC, 0.2% SDS, 5 × Denhardt’s solution and 100 μg/ml denatured salmon sperm DNA. For hybridizations, the 3.5 kb cDNA was labelled with [32P]dCTP (Amersham, UK) by the oligonucleotide-priming procedure of Feinberg and Vogel-stein (1984). Hybridizations were carried out overnight at 42°C. Filters were washed at a final stringency of 0.1× SSC/0.1% SDS at 65°C. Amersham Hyperfilm was used for autoradiography.

Immunocytochemistry

Giant axons were dissected out intact from fresh squid, embed-ded in paraffin wax then snap-frozen in liquid nitrogen. Sections (10 μm thick) were cut on a cryostat maintained at a temperature of −20°C. Air-dried sections were fixed with 95% ethanol/5% acetic acid for 10 minutes at −10°C. Sections were blocked in 2% non-fat milk in TBS for 1 hour, washed several times in TBS and exposed for 1 hour at room temperature to anti-SNLK (1:1000 dilution). Sections were then washed with TBS and further incu-bated for 1 hour with rhodamine-labelled goat anti-rabbit anti-bodies (Amersham, UK) diluted 1:50 in blocking solution. After further washes, the sections were layered with an antiphoto-bleaching agent (ascorbic acid, pH 7.0, 5 mg/ml), covered with a coverslip, then viewed using fluorescence microscopy and pho-tographed.

Screening of the gt11 library

The screening of the λgt11 expression library with squid neurofilament antibodies identified nine positive clones. Of these, three were purified by two rounds of rescreening, and on analysis their insert sizes ranged between 3.0 and 3.5 kb. The clone containing the largest insert was taken for further investigation.

The β-galactosidase fusion protein specified by the 3.5 kb cDNA was generated in amounts sufficient for analysis by SDS-PAGE. Crude protein lysates from the wild-type lysogen clearly showed a protein of around 114 kDa. This protein was identified as β-galactosidase by its binding of an anti-β-galactosidase monoclonal antibody. In the lysate from the recombinant lysogen, the 114 kDa β-galactosidase protein was replaced by a recombinant protein of approximately 180 kDa, indicating that about 66 kDa of polypep-tide is specified by the cloned cDNA. Analysis by Southern blotting (Fig. 1) indicated that the squid genome contained a single copy of the gene. Analysis by northern blotting of squid optic lobe mRNA (Fig. 2) with the 3.5 kb cDNA as probe revealed one major transcript, which had a size of around 2.6 kb.

Fig. 1.

Southern blot analysis of squid genomic DNA. Optic lobe DNA (10 μg) was digested to completion with BglII (lane A), and BamHI (lane B). After digestion, the fragments were resolved on a 0.5% agarose gel and then transferred to nitrocellulose. The 3.5 kb cDNA was used as probe and the blots washed to a final stringency of 0.1× SSC, 65°C. The exposure was for 3 days using Kodak AR film. The size (kb) of the markers is indicated at the right of the Figure. A single band of around 4 kb was seen in both digests. A single band was also seen in digests using PstI, HindIII, and EcoRI.

Fig. 1.

Southern blot analysis of squid genomic DNA. Optic lobe DNA (10 μg) was digested to completion with BglII (lane A), and BamHI (lane B). After digestion, the fragments were resolved on a 0.5% agarose gel and then transferred to nitrocellulose. The 3.5 kb cDNA was used as probe and the blots washed to a final stringency of 0.1× SSC, 65°C. The exposure was for 3 days using Kodak AR film. The size (kb) of the markers is indicated at the right of the Figure. A single band of around 4 kb was seen in both digests. A single band was also seen in digests using PstI, HindIII, and EcoRI.

Fig. 2.

Northern blot analysis of squid optic lobe RNA. Total RNA (5 μg) was run on a 1.0% agarose/formaldehyde gel and transferred to nitrocellulose. The 3.5 kb cDNA was labelled with [32P]dCTP and used as probe. Autoradiography was performed for 24 hours. The size of the transcript (kb) is indicated on the right.

Fig. 2.

Northern blot analysis of squid optic lobe RNA. Total RNA (5 μg) was run on a 1.0% agarose/formaldehyde gel and transferred to nitrocellulose. The 3.5 kb cDNA was labelled with [32P]dCTP and used as probe. Autoradiography was performed for 24 hours. The size of the transcript (kb) is indicated on the right.

The fusion proteingal-SNLK is recognised by IFA

Crude lysates from the recombinant lysogen were subjected to western blot analysis. The antibodies used were of two types: IFA, a monoclonal antibody that recognises a highly conserved epitope at the end of the rod region of all inter-mediate filaments, and anti-SNF-L, a polyclonal antibody that was raised against the fusion protein specified by the 3.5 kb cDNA. The results are shown in Fig. 3.

Fig. 3.

Western blot analysis of the crude lysate from the recombinant lysogen. Total lysates of E. coli Y1089 lysogens containing the 3.5 kb SNLK sequence in λgt11 were subjected to SDS-PAGE, western blotted and probed using the following reagents (lane a) biotin/streptavidin system alone, (lane b) preimmune serum, (lane c) IFA, (lane d) anti-SNLK. Lane (e) shows a Coomassie Blue-stained strip. The arrowhead indicates the position of the β-gal-SNLK polypeptide.

Fig. 3.

Western blot analysis of the crude lysate from the recombinant lysogen. Total lysates of E. coli Y1089 lysogens containing the 3.5 kb SNLK sequence in λgt11 were subjected to SDS-PAGE, western blotted and probed using the following reagents (lane a) biotin/streptavidin system alone, (lane b) preimmune serum, (lane c) IFA, (lane d) anti-SNLK. Lane (e) shows a Coomassie Blue-stained strip. The arrowhead indicates the position of the β-gal-SNLK polypeptide.

The polyclonal serum, after having had all the β-galactosidase antibodies absorbed out, recognizes the fusion protein (Fig. 3, lane d) and the preimmune serum is negative (lane b). The fusion protein is also recognised by IFA (lane c), which indicates that it is related to the intermediate fil-ament class of proteins.

Nucleotide sequence analysis of the 3.5 kb cDNA

To facilitate sequencing and analysis of the cloned cDNA, the 3.5 kb EcoRI fragment was subcloned into pUC19. Eleven unidirectional deletion mutants were generated by exonuclease III digestion to allow sequencing from vector primers. The complete nucleotide sequence of the cDNA together with the predicted amino acid composition of the protein is shown in Fig. 4. The nucleotide sequence reveals an uninterrupted reading frame covering 557 amino acids. The open reading frame starts with the ATG codon at nucleotides 37 to 39. This ATG is located in an environment propitious for translation because it is located within a sequence differing in only three out of nine positions from the consensus sequence CC(G/A)CCATGG, considered to be a favourable context for eukaryotic translation (Kozak, 1986).

Fig. 4.

Complete nucleotide sequence of the cDNA, and predicted amino acid sequence of SNLK. Amino acid residues are represented in the single letter code below the third base of each codon. The translational start is assigned to the ATG triplet at nucleotide positions 37 to 39 (underlined). An in-frame stop codon (TAG) is located at positions 1708 to 1710 (asterix). Canonical polyadenylation signals are underlined (Birnstiel et al., 1985). A poly(A) tail is absent from this clone. Residues 58-60, RCS, and 498-501, DDDEE, are in bold and their significance is discussed in the text.

Fig. 4.

Complete nucleotide sequence of the cDNA, and predicted amino acid sequence of SNLK. Amino acid residues are represented in the single letter code below the third base of each codon. The translational start is assigned to the ATG triplet at nucleotide positions 37 to 39 (underlined). An in-frame stop codon (TAG) is located at positions 1708 to 1710 (asterix). Canonical polyadenylation signals are underlined (Birnstiel et al., 1985). A poly(A) tail is absent from this clone. Residues 58-60, RCS, and 498-501, DDDEE, are in bold and their significance is discussed in the text.

Plots of the α-helical content for the protein and the probability of forming heptad repeats indicated a potential rod region covering residues 88-424. The α-helical pattern search (McLachlan, 1977; Beavil et al., 1992) was also employed to find periodicities within the sequence. This method of analysis is particularly suited to patterns that repeat at regular intervals and which are imperfectly repeated, such as intermediate filaments. Three α-helical regions were defined by the analysis as being significant in terms of coiled-coil-forming ability (Table 1) and they occupied positions consistent with their being coils 1A, 1B and 2 of the rod domain. Of the residues occupying positions ‘a’ and ‘d’ in the rod domain, 63% are hydrophobic. Comparison of the predicted protein sequence of SNLK with other intermediate filament proteins showed that SNLK has many features in common (Table 2 and Fig. 5). A series of heptad repeats covering 337 amino acids define the rod region, leaving a head of 87 amino acids (residues 1 to 87) and a tail extension of 133 amino acids (residues 425 to 557). The rod can be subdivided into smaller sections: coil 1A, 6 heptads; coil 1B, 14 heptads; and coil 2, 21 heptads. Discontinuities in the heptad repeats fall in regions where discontinuities are found in other intermedi-ate filament sequences.

Table 1.

Comparison of-helical coiled-coil forming probabilities of the various subdomains in the rod region

Comparison of-helical coiled-coil forming probabilities of the various subdomains in the rod region
Comparison of-helical coiled-coil forming probabilities of the various subdomains in the rod region
Table 2.

Table of lengths of the head, tail and the subdomains within the rod regions of various intermediate filament proteins

Table of lengths of the head, tail and the subdomains within the rod regions of various intermediate filament proteins
Table of lengths of the head, tail and the subdomains within the rod regions of various intermediate filament proteins
Fig. 5.

Sequence alignment of rod domains. The predicted amino acid sequence of the rod domain of SNLK is aligned with the corresponding region of Ascaris lumbricoides IF B (Weber et al., 1989), hamster desmin (Quax et al., 1985), mouse NFL (Lewis and Cowan, 1986) and squid NF60 (Szaro et al., 1991). Identical amino acid matches between SNLK and one or more of the IF protein sequences are indicated by bold type, and related residues are underlined. The sequences were aligned manually with dashes inserted within the linker regions to optimize correspondence.

Fig. 5.

Sequence alignment of rod domains. The predicted amino acid sequence of the rod domain of SNLK is aligned with the corresponding region of Ascaris lumbricoides IF B (Weber et al., 1989), hamster desmin (Quax et al., 1985), mouse NFL (Lewis and Cowan, 1986) and squid NF60 (Szaro et al., 1991). Identical amino acid matches between SNLK and one or more of the IF protein sequences are indicated by bold type, and related residues are underlined. The sequences were aligned manually with dashes inserted within the linker regions to optimize correspondence.

A comparison of the lengths of the various rod domains found in SNLK and other intermediate filaments is shown in Table 2. Coil 1A in SNLK is longer than that seen in SNF60, desmin, intermediate filament from Ascaris, and vertebrate NFL. Coil 1B of SNLK is longer than desmin and NFL but shorter than the lamin-like domains found in the invertebrate intermediate filaments. The length of coil 2 is similar to that found in both the vertebrate and inver-tebrate low molecular mass neurofilament polypeptides and to invertebrate intermediate filament polypeptides.

Most notably the rod region of SNLK has part of the consensus sequence LNDRFANY at the N-terminal end of coil 1A (LNFNNQVI), and at the C-terminal end of coil 2 is found the sequence SRRLLEGSG similar to the IFA epi-tope now identified with the consensus motif YRKL-LEGEE. In summary the predicted protein sequence of SNLK shows the molecule to have all the hallmarks of a typical intermediate filament polypeptide.

Tissue-specific expression of SNLK

To characterise further the protein SNLK, its distribution in various tissues was investigated by using the polyclonal serum raised against it. Western blot analysis was per-formed on squid axoplasm, optic lobe, heart and skin (Fig. 6). The polyclonal serum bound to a single protein of mass around 60 kDa in axoplasm and optic lobe, and no reac-tion was observed with proteins derived from the other tissues. The 60 kDa protein in the neuronally derived tissues also reacted with IFA. SNLK is therefore expressed in cells having a neuronal origin.

Fig. 6.

Tissue-specific expression of SNLK. Total proteins were prepared by direct homogenization in 2% SDS of tissues from squid axoplasm (lane A), optic lobe (lane B), heart (lane C), and skin derived from tentacles (lane D). The proteins were separated by 5-15% SDS-PAGE and blotted onto nitrocellulose. Blot 1 is stained with anti-SNLK serum; blot 2 is stained with IFA; blot 3 shows a Coomassie Blue-stained gel of similar samples. The position of SNLK in 1, 2 and 3 is marked by a black arrowhead, the position corresponds to a relative molecular mass of around 60 kDa. In 2 and 3, the squid axoplasm sample (lane A) has been marked to show the positions of the squid neurofilament polypeptides (open arrowheads). The upper arrowhead marks the position of NF200, the middle one marks the position of NF70, and the lower one, NF60. The positions of NF60 and SNLK overlap.

Fig. 6.

Tissue-specific expression of SNLK. Total proteins were prepared by direct homogenization in 2% SDS of tissues from squid axoplasm (lane A), optic lobe (lane B), heart (lane C), and skin derived from tentacles (lane D). The proteins were separated by 5-15% SDS-PAGE and blotted onto nitrocellulose. Blot 1 is stained with anti-SNLK serum; blot 2 is stained with IFA; blot 3 shows a Coomassie Blue-stained gel of similar samples. The position of SNLK in 1, 2 and 3 is marked by a black arrowhead, the position corresponds to a relative molecular mass of around 60 kDa. In 2 and 3, the squid axoplasm sample (lane A) has been marked to show the positions of the squid neurofilament polypeptides (open arrowheads). The upper arrowhead marks the position of NF200, the middle one marks the position of NF70, and the lower one, NF60. The positions of NF60 and SNLK overlap.

The polyclonal serum was also used to stain tissue sec-tions cut from the giant axon (Fig. 7). Thread-like staining within the axoplasm is seen, which indicates that SNLK is probably distributed as filaments in axoplasm.

Fig. 7.

Longitudinal section through squid (L. forbesi) giant axon, stained with anti-SNLK. A frozen section of giant axon was air dried and then reacted with the polyclonal serum against SNFL. Bar, 50 μm.

Fig. 7.

Longitudinal section through squid (L. forbesi) giant axon, stained with anti-SNLK. A frozen section of giant axon was air dried and then reacted with the polyclonal serum against SNFL. Bar, 50 μm.

Reactivity of anti-SNLK serum with polypeptides from other species

In order to probe the distribution of molecules resembling SNLK in other species, the antiserum to SNLK was used on western blots containing axoplasm from squid, axoplasm from the marine worm Myxicola infundibulum, partially purified preparations of vertebrate neurofilaments from ox, homogenates from the optic nerve and brain of fish (Oreochromis) and from the clawed toad Xenopus (Fig. 8).

Fig. 8.

Reactivity of anti-SNLK serum with polypeptides from other species. Western blot analysis of nervous tissue homogenates from squid axoplasm (lane A), fanworm axoplasm (lane B), ox spinal nerve (lane C), fish optic nerve and brain (lanes D and E, respectively) and Xenopus optic lobe (lane F). Proteins were resolved on 5-15% SDS-PAGE, transferred onto nitrocellulose, and probed with anti-SNLK and IFA (panels 1 and 2, respectively). Panel 3 is the corresponding Coomassie Blue-stained gel. The position of SNLK is indicated by the black arrowhead at the side of each panel and corresponds to a relative molecular mass of around 60 kDa. In 2 and 3, the squid axoplasm sample, (lane A), has been marked to show the positions of the neurofilament polypeptides NF200, NF70 and NF60 (see Fig. 5 for details).

Fig. 8.

Reactivity of anti-SNLK serum with polypeptides from other species. Western blot analysis of nervous tissue homogenates from squid axoplasm (lane A), fanworm axoplasm (lane B), ox spinal nerve (lane C), fish optic nerve and brain (lanes D and E, respectively) and Xenopus optic lobe (lane F). Proteins were resolved on 5-15% SDS-PAGE, transferred onto nitrocellulose, and probed with anti-SNLK and IFA (panels 1 and 2, respectively). Panel 3 is the corresponding Coomassie Blue-stained gel. The position of SNLK is indicated by the black arrowhead at the side of each panel and corresponds to a relative molecular mass of around 60 kDa. In 2 and 3, the squid axoplasm sample, (lane A), has been marked to show the positions of the neurofilament polypeptides NF200, NF70 and NF60 (see Fig. 5 for details).

The antiserum reacted with a polypeptide in the squid sample that had an apparent molecular mass of 60 kDa, thus identifying the SNLK molecule. The antiserum also cross-reacted with two polypeptides with molecular masses of around 160 kDa and 170 kDa, corresponding to Myxicola neurofilament polypeptides (Fig. 8, lane B, panel 1). No reaction was observed with mammalian neurofilaments. In addition the antiserum cross-reacted with polypeptides from fish optic nerve, one of which had a molecular mass around 60 kDa.

It therefore seems that SNLK shares immunological similarity with Myxicola neurofilaments but not with mam-malian neurofilaments.

We have cloned the cDNA of a new gene from squid neuronal tissue. The predicted amino acid sequence from the cDNA indicates that the protein is like an intermediate fil-ament polypeptide. On northern blots the major transcript had a size of around 2.6 kb.

Although the serum that we used to screen the library was raised against dephosphorylated squid neurofilaments, it presumably contained antibodies to SNLK as well. Because SNLK and SNF60 have similar molecular masses, the polyclonal serum, when used for western blotting, is not able to distinguish between them and only a single band is seen. Staining with IFA of preparations containing SNLK and SNF60 also shows only one band on blots at the position corresponding to a relative molecular mass of 60 kDa. The relationship between SNLK and the other squid neu-rofilament proteins NF200, NF70 and NF60 is unclear at present. A sequence comparison using the data from Szaro et al. (1991) shows that there is some homology, but there is clearly more similarity between NF200, NF70, and NF60 than between any member of this group and SNLK. SNLK therefore may belong to a separate family. This situation, where a neuron-specific intermediate filament protein is expressed together with a type IV neurofilament protein, is not new and examples include peripherin, preferentially found in neurones of the PNS (Thompson and Ziff, 1989), and XNIF, a developmentally expressed protein in neurones of Xenopus (Charnas et al., 1992). We do not know as yet whether there is a change in the expression of SNLK during development, but we are investigating this possibilty.

Comparison of the predicted amino acid sequence of SNLK with that for other IF proteins demonstrates that SNLK is a true IF protein, with the characteristic head, rod and tail domains, well established for this class of protein. The N-terminal head domain of SNLK is non-α-helical.

This is similar to the situation in invertebrate and vertebrate IF proteins and shows the previously recognised sequence hypervariability (Geisler and Weber, 1982, 1986). This domain also contains a feature that may be important for filament assembly-the sequence RCS at position 58-60.

The motif RXS (where X can be any other residue) is recog-nised by protein kinase A and is found in the same domain of several IF proteins. Mouse NFL for example contains the sequence RTS at position 53-55 and hamster desmin the sequence RTS at position 59-61. With desmin and vimentin, phosphorylation at this head domain site by pro-tein kinase A or protein kinase C blocks subunit polymer-ization in vitro and disassembles pre-existing filaments (Geisler and Weber, 1988; Inagaki et al., 1987). In hamster desmin, mouse NFL, and SNLK, the serine residue is imme-diately preceded by argininesgroups known to be impor-tant in filament assembly (Traub and Vorgias, 1984).

Further evidence in support of the role of the head domain of NFL for filament growth is provided by Gill et al. (1990) who concluded that part of the head domain (between residues 31 and 87) is required for the early steps in assembly. It therefore seems likely that the function served by this site may be conserved in intermediate fila-ments.

In SNLK, charged residues are occasionally found interspersed with hydrophobic residues at postions ‘a’ and ‘d’ within the heptads of SNLK. This, however, is not unique to SNLK because it has been found that periodic bands of alternating positive and negative charge with a period close to 28/3 appear as significant features in the sequences of alpha-keratins (Parry et al., 1977), myosin rod (McLachlan and Karn, 1982) and other intermediate filament proteins. All these proteins form regular assemblies in vivo in which coiled-coils pack side by side. Since there is no extensive sequence homology between these molecules, the 28/3-residue periodicity may reflect a similar pattern of three-dimensional packing.

The percentage homology between each subdomain within the rod region of SNLK and the corresponding domain of the other IF proteins was analysed. Coil 1a had the highest overall sequence homology of all the proteins analysed. The best score of 47% was with the IF B protein from Ascaris lumbricoides. SNLK has two extra heptads in this domain compared to the nuclear lamins and inverte-brate intermediate filaments (Fisher et al., 1986; McKoen et al., 1986; Doring and Stick, 1990; Weber et al., 1989; Szaro et al., 1991). The length of this domain is however similar to the corresponding domain in hamster vimentin, peripherin and α-internexin (Quax et al., 1983; Leonard et al., 1988; Fliegner et al., 1990).

Coil 1b had the lowest overall homology with the other IF proteins. The highest score of 26% is found in mouse NFL and squid (L. pealei) NF60. This domain is peculiar in that nuclear lamins and invertebrate IF proteins possess an extra six heptads (Weber et al., 1989; Doring and Stick, 1990; Fisher et al., 1986; McKeon et al., 1986; Krohne et al., 1987; Gruenbaum et al., 1988; Szaro et al., 1991) when compared to similar domains in vertebrate IFs. SNLK has one heptad more than vertebrate IFs and the recently iden-tified Branchiostoma IF-1 protein (Riemer et al., 1992) and four less than the nuclear lamins and invertebrate cyto-plasmic IFs. The abnormal length of the coil 1b in SNLK suggests that SNLK is a novel neurofilament-like protein from an invertebrate.

Coil 2 of SNLK has 21 heptads. This is in accord with all IF polypeptides sequenced to date. The percentage homology is fairly low but higher than that found for coil 1b. The heptad repeat changes phase at aspartate 303 and glycine 367. Similar positions are found for this phase change in coil 2 for all IF proteins (Weber et al., 1989).

A characteristic feature of the low molecular mass neurofilament polypeptides in vertebrates is a glutamic acid-rich, carboxy-terminal domain (Lewis and Cowan, 1986). These negatively charged residues may be involved in inter-actions with other cytoskeletal proteins within the neurone. These residues are also useful diagnostically because Bodian’s silver stain, a specific marker for neurofilaments, binds to this region (Philips et al., 1983). A group of charged residues is also found in the carboxy-terminal domain of SNLK at positions 498-501. The group consists of three aspartic and two glutamic residues (Fig. 4). This cluster of negatively charged groups may perform similar functions to that found in vertebrate NFL.

In the squid, SNLK is expressed exclusively in tissues having a neuronal origin and here the protein has a fila-mentous distribution within the axoplasm. This distribution is not dissimilar to that observed for neurofilaments, though whether SNLK is attached to the neurofilament network is currently unknown. We searched for the presence of immunologically similar proteins to SNLK in other species. Whilst cross-reactivity was found with neurofilaments from Myxicola, Xenopus and Oreochromis, indicating shared epi-topes with SNLK, no reactivity was found with mammalian neurofilament polypeptides. The epitopes, therefore, have a selected distribution amongst neurofilament species.

To conclude, we have cloned and sequenced the cDNA of a gene for a new protein, SNLK, from squid neuronal tissue. The deduced amino acid sequence of the protein shows that, whilst it shares all the features commonly regarded as being hallmarks of intermediate filaments, it is quite distinct from the neurofilaments sequences examined so far, not only from squid but also from higher species. Further work on SNLK is continuing in order to understand the function of this protein and the structure of the gene.

We thank Mrs R. Rao for providing technical assistance. We also thank Pankaj Marya, Paul Fraylich, Alison Maggs, Bob Hannon, Hitesh Patel, Andrew Beavil and Huub Dodemont for help and discussions throughout this study. We are grateful for the MBA Plymouth for supplying squid and for facilities for car-rying out the initial stages of this study. We also thank the refer-ees for pointing out one of the polyadenylation signals previously missed by us. This work was supported by the SERC and the MRC. The sequence data are available from EMBL/GenBank/DDBJ under the accession number X66695.

Aviv
,
H.
and
Leder
,
P.
(
1972
).
Purification of biologically active globin messenger RNA by chromatography on oligothymidylic acid-cellulose
.
Proc. Nat. Acad. Sci. USA
69
,
1408
1412
.
Bartnik
,
E.
and
Weber
,
K.
(
1989
).
Widespread occurrence of intermediate filaments in invertebrates; common principles and aspects of diversion
.
Eur. J. Cell Biol.
50
,
17
33
.
Beavil
,
A. J.
,
Edmeades
,
R. L.
,
Gould
,
H. L
and
Sutton
,
B. J.
(
1992
).
Alpha-helical coiled-coil stalks in the low-affinity receptor for IgE (FcεRII/CD23) and related C-type lectins
.
Proc. Nat. Acad. Sci. USA
86
,
753
757
.
Birnstiel
,
M. L.
,
Busslinger
,
M.
and
Strub
,
K.
(
1985
).
Transcription termination and 3′ processing: the end is in site
.
Cell
41
,
349
359
.
Brown
,
A.
and
Eagles
,
P. A. M.
(
1986
).
Squid neurofilaments - phosphorylation and calcium-dependent proteolysis in situ
.
Biochem. J
.
239
,
191
197
.
Charnas
,
L. R.
,
Szaro
,
B. G.
and
Gainer
,
H.
(
1992
).
Identification and developmental expression of a novel low molecular weight neuronal intermediate filament protein expressed in Xenopus laevis
.
J. Neurosci.
12
,
3010
3024
.
Chin
,
T. K.
,
Harding
,
S.
and
Eagles
,
P. A. M.
(
1989
).
Characterization of two proteolytically derived soluble polypeptides from the neurofilament triplet components NFM and NFH
.
Biochem. J
.
264
,
53
60
.
Devereaux
,
J.
,
Haekerli
,
P.
and
Smithies
,
O.
(
1984
).
A comprehensive set of sequence programs for the VAX
.
Nucl. Acids Res.
11
,
524
545
.
Doring
,
V.
and
Stick
,
R.
(
1990
).
Gene structure and nuclear lamin L111 of Xenopuslaevis; a model for the evolution of IF proteins from a lamin-like ancestor
.
EMBO J
.
9
,
4073
4081
.
Eagles
,
P. A. M.
,
Pant
,
H. C.
and
Gainer
,
H.
(
1990
).
In Cellular and Molecular Biology of Intermediate filaments
(ed.
R. D.
Goldman
and
P. M.
Steinert
), pp.
37
94
.
Plenum Publishing Corporation
,
New York
.
Feinberg
,
A. P.
and
Vogelstein
,
B.
(
1983
).
A technique for radiolabelling DNA restriction endonuclease fragments to high specific activity
.
Anal. Biochem.
132
,
6
13
.
Fisher
,
D.
,
Chaudhary
,
N.
and
Blobel
,
G.
(
1986
).
cDNA sequencing of nuclear lamins A and C reveals primary and secondary homolgy to intermediate filament proteins
.
Proc. Nat. Acad. Sci. USA
83
,
6450
6454
.
Fliegner
,
K. H.
,
Ching
,
G. Y.
and
Liem
,
R. K.
(
1990
).
The predicted amino acid sequence of α-internexin is that of a novel neuronal intermediate filament protein
.
EMBO J
.
9
,
749
755
.
Geisler
,
N.
and
Weber
,
K.
(
1982
).
The amino acid sequence of chicken muscle desmin provides a common structural model for intermediate filament proteins
.
EMBO J
.
1
,
1649
1656
.
Geisler
,
N.
and
Weber
,
K.
(
1986
).
In Cellular and Molecular Biology of the Cytoskeleton
(ed.
J. W.
Shay
), pp.
15
20
.
Plenum Press
,
New York
.
Geisler
,
N.
and
Weber
,
K.
(
1988
).
Phosphorylation of desmin in vitro inhibits formation of intermediate filaments; identification of three kinase A sites in the aminoterminal head domain
.
EMBO J
.
7
,
15
20
.
Gill
,
S. R.
,
Wong
,
P. C.
,
Monteirom
,
M.
and
Cleveland
,
D. W.
(
1990
).
Assembly properties of dominant and recessive mutations in the small mouse neurofilament (NFL) subunit
.
J. Cell Biol
.
111
,
2005
2019
.
Gruenbaum
,
Y.
,
Landerman
,
Y.
,
Drees
,
B.
,
Bare
,
J. W.
,
Saumweber
,
H.
,
Paddy
,
M. R.
,
Sedat
,
J. W.
,
Smith
,
D. E.
,
Benton
,
B. M.
and
Fisher
,
P. A.
(
1988
).
Drosophila nuclear lamin precursor Dm-o is translated from either of two developmentally regulated mRNA species apparently encoded by a single gene
.
J. Cell Biol
.
106
,
585
596
.
Gubler
,
U.
and
Hoffman
,
B. J.
(
1983
).
A simple and very efficient method for generating cDNA libraries
.
Gene
25
,
263
269
.
Hedberg
,
K. K.
and
Chen
,
L.-B.
(
1986
).
Absence of intermediate filaments in a human adrenal cortex carcinoma-derived cell line
.
Exp. Cell Res
.
163
,
509
517
.
Henikoff
,
S.
(
1984
).
Unidirectional digestion with exonuclease III creates targeted breakpoints for DNA sequencing
.
Gene
28
,
351
359
.
Huynh
,
T. V.
,
Young
,
R. A.
and
Davis
,
R. W.
(
1985
).
In DNA Cloning, Volume 1
(ed.
D. M.
Glover
), pp.
49
78
.
IRL Press, Oxford and Washington DC
.
Inagaki
,
M.
,
Nishi
,
Y.
,
Nishizawa
,
K.
,
Matsuyama
,
M.
and
Sato
,
C.
(
1987
).
Site-specific phosphorylation induces disassembly of vimentin filaments in vitro
.
Nature
328
,
649
652
.
Jeffreys
,
A. J.
and
Flavell
,
R. A.
(
1977
).
A physical map of DNA regions flanking the rabbit β-globin gene
.
Cell
12
,
429
433
.
Kaplan
,
B. B.
,
Bernstein
,
S. L.
and
Giolo
,
A. E.
(
1979
).
An improved method for the rapid isolation of brain ribonucleic acid
.
Biochem. J
.
183
,
181
184
.
Kaplow
,
L. S.
(
1974
).
Substitute for benzidine in myeloperoxidase stains
.
Amer. J. Clin. Path
.
63
,
451
.
Kozak
,
M.
(
1986
).
Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes
.
Cell
44
,
283
292
.
Krohne
,
G.
,
Wolin
,
S. L.
,
Mckeon
,
F. D.
,
Franke
,
W. W.
and
Kirschner
,
M. W.
(
1987
).
Nuclear lamin L1 of Xenopus laevis: cDNA cloning, amino acid sequence and binding specificity of a number of the lamin B subfamily
.
EMBO J
.
6
,
3801
3808
.
Laemmli
,
U. K.
(
1970
).
Cleavage of structural proteins during the assembly of the head of bacteriophage T4
.
Nature
227
,
680
685
.
Leonard
,
D. G. B.
,
Gorham
,
J. D.
,
Cole
,
P.
,
Greene
,
L. A.
and
Ziff
,
E. B.
(
1988
).
A nerve growth factor-regulated messenger RNA encodes a new intermediate filament protein
.
J. Cell Biol
.
106
,
181
193
.
Lewis
,
S. A.
and
Cowan
,
N. J.
(
1986
).
Anomalous placement of introns in a member of the intermediate filament multigene family: An evolutionary conundrum
.
Mol. Cell Biol
.
6
,
1529
1534
.
McKeon
,
F. D.
,
Kirschner
,
M. W.
and
Caput
,
D.
(
1986
).
Homologies in both primary and secondary structure between nuclear envelope and intermediate filament proteins
.
Nature
319
,
463
468
.
McLachlan
,
A. D.
(
1977
).
Analysis of periodic patterns in amino acid sequences: collagen
.
Biopolymers
16
,
1271
1297
.
McLachlan
,
A. D.
and
Karn
,
J.
(
1982
).
Periodic charge distributions in the myosin rod amino acid sequence match cross-bridge spacings in muscle
.
Nature
299
,
226
231
.
Parry
,
D. A. D.
,
Crewther
,
W. G.
,
Fraser
,
R. D. B.
and
MacRae
,
T. P.
(
1977
).
Structure of α-keratin: structural implications of the amino acid sequences of the type 1 and type 11 chain segments
.
J. Mol. Biol.
113
,
449
454
.
Philips
,
L. L.
,
Autilio-Gambetti
,
L.
and
Lasek
,
R. J.
(
1983
).
Bodians’s silver method reveals molecular variations in the evolution of neurofilament proteins
.
Brain Res.
278
,
219
223
.
Pruss
,
R. M.
,
Mirsky
,
R.
,
Raff
,
M. C.
,
Thorpe
,
R.
,
Dowding
,
A. J.
and
Anderton
,
B. H.
(
1981
).
All classes of intermediate filaments share a common antigenic determinant defined by a monoclonal antibody
.
Cell
27
,
419
428
.
Quax
,
W.
,
Egbert
,
W. V.
,
Hendricks
,
W.
,
Quax-Jeuken
,
Y.
and
Bloemendal
,
H.
(
1983
).
The structure of the vimentin gene
.
Cell
35
,
215
223
.
Quax
,
W.
,
Van den Broek
,
L.
,
Egbert
,
W. V.
,
Ramaekers
,
F.
and
Bloemendal
,
H.
(
1985
).
Characterization of the hamster desmin gene: expression and formation of desmin filaments in non-muscle cells after gene transfer
.
Cell
43
,
327
338
.
Riemer
,
D.
,
Dodemont
,
H.
and
Weber
,
K.
(
1992
).
Analysis of the cDNA and gene encoding a cytoplasmic intermediate filament (IF) protein from the cephalochordate Branchiostoma lanceolatum; implications for the evolution of the IF protein family
.
Eur. J. Cell Biol.
58
,
128
135
.
Sanger
,
F.
,
Nickelin
,
S.
and
Coulson
,
A. R.
(
1977
).
DNA sequencing with chain-terminating inhibitors
.
Proc. Nat. Acad. Sci. USA
74
,
5463
5467
.
Southern
,
E. M.
(
1975
).
Detection of specific sequences among DNA fragments separated by gel electrophoresis
.
J. Mol. Biol
.
98
,
503
517
.
Steinert
,
P. M.
and
Roop
,
D. R.
(
1988
).
Molecular and cellular biology of intermediate filaments
.
Annu. Rev. Biochem.
57
,
593
625
.
Szaro
,
B. G.
,
Pant
,
H. C.
,
Way
,
J.
and
Battey
,
J.
(
1991
).
Squid low molecular weight neurofilament proteins are a novel class of neurofilament protein
.
J. Biol. Chem
.
23
,
15035
15041
.
Thomas
,
P. S.
(
1980
).
Hybridization of denatured RNA and small DNA fragments transferred to nitrocellulose
.
Proc. Nat. Acad. Sci. USA
77
,
5201
5205
.
Thompson
,
M. A.
and
Ziff
,
E. B.
(
1989
).
Structure of the gene encoding peripherin: an NGF-regulated neuronal-specific type III intermediate filament protein
.
Neuron
2
,
1043
1053
Towbin
,
H.
,
Staehlin
,
T.
and
Gordon
,
J.
(
1979
).
Electrophoretic transfer of proteins from polyacrylamide gels to nitrocellulose sheets: procedure and some applications
.
Proc. Nat. Acad. Sci. USA
76
,
4350
4354
.
Traub
,
P.
and
Vorgias
,
C. E.
(
1984
).
Differential effect of arginine modification with 1,2-cyclohexanedione on the capacity of vimentin and desmin to assemble into intermediate filaments and to bind to nucleic acids
.
J. Cell Sci.
65
,
1
20
.
Weber
,
K.
,
Plessmann
,
U.
,
Dodemont
,
H.
and
Kossmagk-Stephan
,
K.
(
1988
).
Amino acid sequences and homopolymer-forming ability of the intermediate filament proteins from an invertebrate epithelium
.
EMBO J.
7
,
2995
3001
.
Weber
,
K.
,
Plessmann
,
U.
and
Ulrich
,
W.
(
1989
).
Cytoplasmic intermediate filament proteins of invertebrates are closer to nuclear lamins than are vertebrate intermediate filament proteins; sequence characterization of two muscle proteins of a nematode
.
EMBO J
.
8
,
3221
3227
.
Young
,
R. A.
and
Davis
,
R. W.
(
1983
).
Efficient isolation of genes by using antibody probes
.
Proc. Nat. Acad. Sci. USA
80
,
1194
1198
.