A brain consists of numerous distinct neurons arising from a limited number of progenitors, called neuroblasts in Drosophila. Each neuroblast produces a specific neuronal lineage. To unravel the transcriptional networks that underlie the development of distinct neuroblast lineages, we marked and isolated lineage-specific neuroblasts for RNA sequencing. We labeled particular neuroblasts throughout neurogenesis by activating a conditional neuroblast driver in specific lineages using various intersection strategies. The targeted neuroblasts were efficiently recovered using a custom-built device for robotic single-cell picking. Transcriptome analysis of mushroom body, antennal lobe and type II neuroblasts compared with non-selective neuroblasts, neurons and glia revealed a rich repertoire of transcription factors expressed among neuroblasts in diverse patterns. Besides transcription factors that are likely to be pan-neuroblast, many transcription factors exist that are selectively enriched or repressed in certain neuroblasts. The unique combinations of transcription factors present in different neuroblasts may govern the diverse lineage-specific neuron fates.
INTRODUCTION
The complex brain consists of numerous distinct types of neuron. Generating such cellular diversity involves heterogeneous neural stem cells (NSCs) (Greig et al., 2013). A mouse brain consists of ∼75 million neurons (Oh et al., 2014) that arise from an undefined number of NSCs. Conserved neurogenic programs with stem cell-like progenitors underlie the generation of ∼30,000 neurons from ∼200 neuroblasts (NBs) in the much smaller Drosophila cerebrum (Urbach and Technau, 2004; Yu et al., 2013). Moreover, Drosophila NBs, like mammalian NSCs, exhibit both basic and complex patterns of neurogenesis (Homem and Knoblich, 2012). In the basic type I lineages, one progenitor buds off a series of intermediate precursors, called ganglion mother cells (GMCs), that either divide once to produce two neurons or directly differentiate into a single mature neuron. By contrast, the complex type II lineages are further amplified through intermediate neural progenitors (INPs) that, like the founding progenitor, self-renew to produce an independent series of offspring. Given the genetic, cellular and molecular conservations, studying Drosophila NBs should shed light on the development of the brains of more complex organisms.
Drosophila NBs are fated early in development to produce specific lineages of post-mitotic neurons and/or glia. Approximately 100 cerebral NBs arise per hemisphere, with each expressing a unique combination of early patterning genes (Urbach and Technau, 2003). Labeling the offspring made by individual cerebral NBs has revealed the composition of ∼100 discrete NB clones per hemisphere with characteristic morphologies (Yu et al., 2013). Mapping individual neurons serially made by one NB provides a detailed description of the developmental fate of that NB. Such single-cell lineage analyses, enabled by twin-spot MARCM (Yu et al., 2009), have not only substantiated the hypothesis that specific neuron sets are produced by specific NBs, but also revealed the distinct patterns of neuronal diversification characteristic of different NB lineages. Striking distinctions are observed among the extensively studied cerebral lineages, which include, per hemisphere, the four ‘equivalent’ lineages of mushroom body (MB) intrinsic neurons (Ito et al., 1997), four of the five known antennal lobe (AL) lineages (Jefferis et al., 2001; Lai et al., 2008), and the eight complex type II NB lineages (Bello et al., 2008; Boone and Doe, 2008; Bowman et al., 2008). The MB is the insect learning and memory center, and the AL is the primary olfactory center. Notably, the MB NBs, as the sole NBs that divide incessantly until fly eclosion (Truman and Bate, 1988), yield only three major classes of MB neurons, with no sister fate diversification in the paired neurons made by one GMC (Lee et al., 1999). By contrast, the AL NBs end proliferation around pupation but can generate ∼40 neuron types from a single hemilineage, as their GMCs make daughter cells with distinct A/B fates due to a Notch-mediated binary sister fate decision (Lin et al., 2010, 2012; Yu et al., 2010). Additional offspring diversities arise in the complex type II lineages through the production of variant INP sublineages by each type II NB and the derivation of distinct neuron/glia types from each INP (Wang et al., 2014). Transcriptome analysis of such diverse NBs should reveal the transcriptional networks governing distinct NB developmental fates as opposed to general NB programs.
Molecular profiling of distinct NBs requires means for targeting the NB(s) of interest throughout neurogenesis. Despite extensive lineage analysis, one cannot recognize specific NBs without markers. Probably owing to the combinatorial and dynamic nature of cell fate-determining gene expression, it is rare to see drivers that label a small number of specific NBs exclusively and persistently in the developing nervous system. Even with such drivers in hand, it is challenging to obtain enough pure lineage-specific NBs for genome-wide molecular profiling. Bulk fluorescently labeled Drosophila NBs have been successfully collected by fluorescence-activated cell sorting (FACS) for RNA sequencing (RNA-seq) (Berger et al., 2012). However, it is very challenging to find an appropriate gating condition for sorting rare cell populations with FACS. Manual picking of dissociated cells under the microscope was therefore adopted to recover rare target cells without contamination (Nagoshi et al., 2010; Okaty et al., 2011). However, such procedures are labor intensive, require very fine motor control, and become unreliable in the detection of dim cells. These technical constraints limit systematic efforts in the molecular characterization of NSC heterogeneity.
Here we report effective strategies for engineering lineage-specific NB drivers, and describe a custom-built, single-cell picking device for robotically recovering rare target cells from dissociated tissues. Transcriptome analyses of various NB subsets compared with non-selective NBs revealed novel pan-NB transcription factors and distinct sets of lineage-specific transcription factors expressed in different NB subsets. The lineage-specific transcription factors may govern the specification of lineage-characteristic offspring. The robotic single-cell picking device we describe should allow efficient purification of any cell type that can be uniquely labeled. The same strategy can be readily adopted for cell type-specific molecular profiling in diverse tissues.
RESULTS
Lineage-specific NB drivers
The first challenge in molecular profiling of NB heterogeneity and dynamics was to mark various NB subsets specifically and persistently through development. To achieve permanent NB labeling we engineered recombinase-dependent NB drivers in which a chimeric dpnEE promoter (active in all cerebral NBs) is separated from an exogenous transcriptional activator (GAL4 or LexA::P65) (Awasaki et al., 2014) by a site-specific recombination cassette carrying transcription/translation stop signals (Pfeiffer et al., 2010). Only in dpnEE-active cells can excision of the intervening cassette lead to activation of the driver. Thus, one can transform complex, often dynamic, patterns of recombinase expression, driven by various cis-regulatory elements, into clean, permanent NB drivers covering distinct NB subsets (Fig. 1). However, most isolated cis-regulatory elements show not only high activities in some restricted patterns but also weaker expression in other, often larger, domains (Awasaki et al., 2014). Such low-level ‘background’ activities could activate the recombinase-dependent NB driver in many more NBs at low frequencies, and drastically reduce the targeting specificity of otherwise sparse NB drivers.
Therefore, to target NBs of interest in small subsets with minimal background, we further incorporated various intersectional strategies to refine driver patterns and eliminate all ‘off-target’ expression. We successfully made three clean NB drivers that specifically label the four equivalent MB NBs, four of the five diverse AL NBs, and the eight complex type II NBs, respectively (Fig. 1, Fig. S1). We achieved unique and permanent MB NB labeling by restricting recombinase induction using split GAL4 (Luan et al., 2006; Pfeiffer et al., 2010) with R14D11 and R41D10 (Jenett et al., 2012), which show common MB NB activities and non-overlapping backgrounds (Fig. 1A-C). To exclusively label the four major AL NBs (ALad1, ALl1, ALv1 and ALv2), we first drove a recombinase with R44F03 (Awasaki et al., 2014), which confined the pan-NB driver primarily to the AL. Then we used another characterized AL promoter, ems (Lichtneckert et al., 2008; Lin et al., 2013), to drive a second recombinase for activation of the GFP reporter only in the AL NBs (Fig. 1D-F). For targeting type II NBs, we started with the stg14 enhancer (Wang et al., 2014) that selectively labels type II NBs but also occasionally labels type I NBs and INPs. We further restricted the stg14-patterned dpnEE-GAL4 activity by inhibiting GAL4 activity in type I NBs with asense-GAL80 (Fig. 1G-I).
A custom-built, single-cell picking device
The next challenge was to isolate a sufficient number of GFP-marked NBs from many developing brains of the same age. We accomplished this by building an integrated microscopy system for rapid target cell detection and robotic single-cell picking. The hardware of our custom-built, single-cell picking device includes three main components (Fig. 2A). The imaging component was built with the Leica M205 FA fluorescence stereomicroscope, a motorized XY stage for scanning a slide and tracking xy coordinates automatically, and an electron-multiplying CCD camera/sensor for capturing both fluorescence and bright-field images with enhanced sensitivity. The picking component was assembled with a metal capillary needle, a syringe pump and a micromanipulator. The stainless steel needle of internal diameter 35 µm was mounted via a holder onto a piezo micromanipulator with a positioning range of 20×20×20 mm3, 0.5 μm resolution and 3 μm repeatability (Fig. 2B). The end of the capillary needle was connected to a 100 µl syringe via a PEEK tube and tubing fitting. The syringe was mounted on a standard infuse or withdraw pump that drives fluid flow through the metal capillary needle. The control component was built upon a desktop computer with essential input/output peripherals.
The software that monitors and controls the imaging and picking components was coded with LabView. The user interface provides control buttons for operating (1) the motorized XY stage to scan through the slide or move to specific xy coordinates, (2) the microscope to adjust fluorescence versus bright-field imaging, (3) the micromanipulator to align and place the capillary tip at the selected position, and (4) the syringe pump to control the fluid flow direction, duration and rate for collecting, releasing or separating objects (Fig. 2C). In addition, distinct views of real-time images are shown in three windows. The CCD camera/sensor continuously reads both bright-field and green or red fluorescence images. The overlay image with a field of view of 1×1 mm is shown in the left large window, while enlarged views of the overlay and fluorescence images around the ‘red cross’ (positioned by the user's mouse click) are displayed in the other two windows (Fig. 2C). Moreover, the system automatically calculates independent object diameters based on their fluorescence signals and selectively highlights those objects that fall within a predefined size range. This enabled us to select the larger NBs of interest and not the daughter cells that had perdurance of the fluorescent label.
To eliminate background fluorescence, which is crucial for locating target cells expressing modest levels of fluorescent proteins (to avoid toxicity), we placed dissociated cells on glass slides without ‘autofluorescent’ plastic substrates. Moreover, the slide surface was prepared with PLL-PEG to reduce adhesiveness. We created two chambers on the microscope slide using transparent silicon clear rubber carved into two 30×30 mm2 chambers. Each chamber was pre-wet with 1300 µl medium (AHS buffer containing 2% FBS). The dissociated cells were concentrated via centrifugation in 150 µl medium and then loaded into one chamber for the initial cell picking; the other chamber was reserved for cleaning the isolated cells. We picked the cells using flexible metal capillaries, as opposed to fragile glass micropipettes.
Prompt recovery of Drosophila NBs for RNA-seq
The engineered MB NB driver stochastically labels about four of the eight MB NBs that make the bilaterally symmetric adult MBs throughout larval and pupal development (Ito et al., 1997; Lee et al., 1999). We practiced picking the GFP-labeled MB NBs, ranging from 14 to 22 μm in size (Fig. 2D), dissociated from larval CNS using the custom-built, single-cell picking device. We established a two-step procedure that allows efficient, clean recovery of most of the 60 or so targeted NBs that can be reliably dissociated from 30 larvae carrying ∼120 marked MB NBs in total (Fig. S2, Movie 1).
We collected over 300 MB NBs isolated ∼50 h after larval hatching (ALH). We explored whether an extra wash step can enhance the purity, and we collected another 300 MB NBs following two rounds of cleaning. We also examined how critical the timing of the procedure is, and we collected another 300 MB NBs that had been left in medium for an additional 2 h after cleaning, a total of 4 h after tissue disruption. We used 100 NBs for each RNA/cDNA preparation, and three replicates were executed for each condition. Using qPCR, we assessed the expression of various genes characteristic of NBs and of potential neuronal and glial contaminants. Compared with the RNA of late larval CNS, we observed substantial enrichment of various known NB genes, including deadpan (dpn) (Bier et al., 1992) and miranda (mira) (Ikeshima-Kataoka et al., 1997; Shen et al., 1997), and minimal expression of neuronal Synaptobrevin (nSyb) (Südhof et al., 1989) and glial reversed polarity (repo) (Xiong et al., 1994) in all conditions of isolated MB NBs (Fig. 2E). This lends support to the purity of the picked NBs. In addition, both qPCR and RNA-seq results showed no detectable effect on the transcriptomes of the isolated MB NBs due to additional cleaning or prolonged incubation (Fig. 2E,F).
Taken together, our 2 h, two-step protocol of single-cell picking is adequate to recover a pure population of rare NBs, freshly dissociated from highly heterogeneous neural tissues, that are apparently maintained in a stable state through the procedure.
Distinguishable transcriptomes of different NBs
Distinct NBs carry out analogous neurogenic programs but yield different lineage-specific neuron types in diverse lineage-characteristic patterns. Transcriptome analysis of discrete NB subsets should reveal the transcriptional networks governing individual NB developmental fates as opposed to general NB programs. Using the lineage-specific NB drivers, we isolated 300 AL and 300 type II NBs at 50 h ALH. We also picked 300 non-selective NBs marked with a pan-NB driver (Awasaki et al., 2014), as a reference pool. Three replicates (100 NBs per RNA/cDNA preparation) were executed for each NB group. We confirmed the fidelity of picking different sets of NBs by qPCR. The qPCR analysis showed selective enrichment of the type I NB-specific gene asense (ase) (Bowman et al., 2008) in the MB and AL NBs and of the type II-specific gene pointed (pnt) (Zhu et al., 2011) in type II NBs, as compared with late larval CNS (Fig. 3A). In further support of minimal contamination or sampling bias, we obtained high replicate consistency in the NB RNA-seq data that unambiguously showed analogous, yet distinguishable, transcriptomes for different NB collections, highlighting their clear distinctions from the transcriptomes of post-mitotic neurons and glia (Fig. 3B, Table S1).
Pairwise comparisons among the transcriptomes of the age-matched MB, AL or type II NBs uncovered 585 differentially expressed genes with average transcripts per million (TPM) greater than 10 and q-value <0.05 (see Materials and Methods) (Fig. 3C). About 12% of the differentially expressed genes were annotated as transcription factors (TFs), making the TF class the most over-represented gene class with differential expression among distinct NB subsets (Fig. S3). Nonetheless, only 68 out of 445 TFs with average TPM>10 were recovered as differentially expressed TFs, supporting the presence of shared neurogenic programs.
Abundantly expressed TFs common to diverse NBs
Universal NB TFs should be expressed in the three distinct NB subsets as well as in the non-selective NB pool. A Venn diagram of the expressed TFs among these four groups of NB collections revealed 348 common TFs with average TPM>10 and 70 common TFs with average TPM>100 (Fig. 4A). Sixty-one of the 70 strong common TFs are comparably expressed with less than a 2-fold change (FC<2) in the expression level between different collections (Fig. 4B). However, nine abundant NB TFs (TPM>100) show varied expression levels (FC>2) in the sampled NBs (Fig. 4C). These include: HmgZ, E(spl)mγ-HLH, dm (Myc), apt, E(spl)m8-HLH, dj-1β, E(spl)m7-HLH, CG12911 and tai. The three E(spl) transcripts are selectively lower in MB NBs, which might reflect substantially weaker Notch activities in the MB NBs, as E(spl) genes are well-known Notch downstream targets (Delidakis et al., 2014). Fifty-two of the 70 abundant NB TFs (TPM>100 in all four NB groups) were independently recovered by comparing NB samples with neuron/glia samples using the limma-voom package with the q-value <0.05 to identify 195 NB-enriched TFs in total (Fig. S4).
ase and pros, two well-known type I NB genes, were not recovered as universal NB TFs due to minimal expression in type II NBs. Notably, there are five TFs with TPM>100 in all, except type II, NB groups. These additional potentially pan-type I NB TFs include emc, ase, CG2199, pros and CG8378 (Smyd4-4) (Fig. 4C). Of all 75 TFs abundantly expressed in most, if not all, NBs (Fig. 4), 11 [CG5343 (Bug22), CycH, E(spl)m8-HLH, E(spl)mγ-HLH, HmgD, ase, dj-1β, dpn, grh, sna and wor] are greatly enriched in NBs (FC>10) as compared with post-mitotic neurons and glia (Fig. 4, asterisks). Given the known involvement of wor, dpn, grh, sna, ase and pros (not recovered as NB-specific TFs due to significant glia expression) in NB proliferation versus differentiation (Ashraf and Ip, 2001; Cai et al., 2001; Almeida and Bray, 2005; Maurange et al., 2008; Lai et al., 2012; Zhu et al., 2012; Lai and Doe, 2014; Yasugi et al., 2014), these general NB-specific TFs are likely to constitute the transcriptional networks that orchestrate the expression of gene batteries required for NB-characteristic programs, including asymmetric cell division and neuron/glia production.
TFs differentially expressed in distinct NBs
In addition to pan-NB TFs, distinct NBs express different combinations of lineage-specific TFs, as evidenced by the recovery of 68 differentially expressed TFs from the above pairwise transcriptome comparisons among the MB, AL and type II NBs. We do not know the expression patterns of those differentially expressed TFs in the remaining, highly heterogeneous NBs. To reveal TFs that are more specifically enriched or repressed in particular NBs, we next compared a given NB subset with all other NB collections, including the non-selective NB pool. We also compared the non-selective NB pool with the three distinct NB subsets that we sampled. We used the limma-voom package with q-value <0.05 to recover TFs specifically enriched or repressed in the MB NBs, the AL NBs, the type II NBs, and the non-selective NBs (Fig. 5).
We uncovered seven TFs that were specifically enriched in the MB group comprising a homogenous population of NBs (Fig. 5A). Among the heterogeneous populations of NBs, we identified six different TFs enriched in the AL group (Fig. 5B) and another six TFs enriched in the type II group (Fig. 5C). Furthermore, we detected seven TFs that stood out in the non-selective NB pool (Fig. 5D) despite modest to low TPM values, apparently due to their minimal expressions in the MB, AL and Type II NBs. It is unclear how restricted their expression is in the remaining NBs. We also uncovered eight, two and three TFs that are specifically repressed in the MB, AL and type II NBs, respectively. We did not recover any TF specifically repressed in the non-selective NB pool, consistent with its composition of all diverse NBs.
We examined the expression patterns of five differentially expressed TFs by immunostaining (Fig. 6). We detected Rx expression in all MB NBs (Fig. 6A′, arrows), Mid expression in one of four AL NBs (Fig. 6D′, arrow), and Dr expression in another AL NB (Fig. 6E′, arrow). Notably, Dr is abundant in the progeny of two additional AL lineages (Fig. 6E′, yellow asterisks). We also confirmed that there is no detectable expression of SoxN in the MB NBs (Fig. 6B′, asterisk) but that there is strong SoxN expression in many brain cells, including the progeny of type II lineages (outlined in Fig. 6B′). By contrast, Tll is abundant in the offspring of MB NBs (Kurusu et al., 2009) (Fig. 6C, arrows) and the developing optic lobe (Li et al., 2013), but could not be detected within the AL lineages (Fig. 6C,C′, green). These immunostaining results substantiate the expression of Rx, and not SoxN, in the MB NBs, the absence of Tll in AL NBs, and the expression of Mid and Dr in various AL NBs, which further attests to the molecular heterogeneity of the AL NBs.
The repression of otherwise broadly expressed SoxN in the MB NBs is crucial for normal MB neurogenesis. Ectopic expression of SoxN altered MB NB fate as evidenced by gradual loss of Tll expression (Fig. 6F). Consistent with the known requirement of Tll for MB neurogenesis (Kurusu et al., 2009), the SoxN-positive MBs showed reduced neuron numbers and aberrant morphologies (data not shown). This exemplifies the importance of lineage-specific expression or repression of various TFs in programming distinct NBs and their production of lineage-characteristic progenies.
DISCUSSION
Single-cell analysis has been shedding much new light on the development and function of complex tissues. Molecular profiling of single cells will further revolutionize biomedical research. For instance, a large-scale, single-cell RNA-seq study has allowed the molecular classification of cell types in the mouse cortex and hippocampus and revealed that TFs form complex layered codes in diverse cell types (Zeisel et al., 2015). However, it is challenging to capture a particular cell for a single-cell RNA-seq reaction. Moreover, single-cell RNA-seq techniques remain exploratory. One practical solution is RNA-seq of a few ‘identical’ cells obtained from different individuals. In addition to the reliability or otherwise of RNA/cDNA preparation, there are two major concerns over the quality of the sample that can affect the consistency and interpretation of the data. One concern is the purity of the collected cells, since even trace contamination with cells of other identities could have a significant impact when the sample size is small. The other concern is the complexity of sample composition, as heterogeneous target cells might not be adequately sampled if only a few cells are picked. Additional concerns include the well-being of the collected cells, cell type/status-dependent variations in RNA content, and possible contamination with RNAs released from damaged cells during tissue disruption. In summary, absolute accuracy becomes important when dealing with a small number of target cells.
We addressed these concerns by sampling a small number of specific NBs using robotic single-cell picking. Remarkable consistency exists in the RNA-seq data of all MB NB samples collected at the same developmental time (50 h ALH) with three replicates per collection scheme. The high reproducibility (correlation coefficient of 0.928±0.0039, mean±s.d.) demonstrates the fidelity of our single-cell picking device as well as the homogeneity of larval MB NBs. Only six genes (RpL18, RpL23, RpLP1, RpLP2, sna, CG34300) differed significantly in the extra wash condition, and no gene showed a significant difference in the extra 2 h incubation condition based on the criteria of FC>2, TPM>10, q-value <0.05. This suggests that the established 2 h one-wash protocol is well within the time limits for recovering pure target cells by single-cell picking without progressive fate or viability changes. Even the replicates of heterogeneous AL or type II NBs show excellent reproducibility in the transcriptome of the same group (correlation coefficients of 0.94 and 0.95, respectively), indicating no sampling bias in the coverage of the four or eight targeted NBs with collections of 100 NBs.
Pairwise comparisons among various NB subsets revealed 585 genes with average TPM>10 and q-value <0.05 (Fig. 3C). TFs account for ∼12% of the differentially expressed genes. We uncovered a small number of TFs that were strongly enriched in, or absent from, a particular NB subset as compared with other NB subsets and the non-selective NBs. The selective expression or repression of such lineage-specific TFs may modify core NB programs and/or govern the specification of lineage-characteristic offspring. For instance, in the type II lineages, the expression of pnt, but not ase, maintains the NB fate, and the expression of btd, but not pros, promotes the INP cell fate (Bowman et al., 2008; Bayraktar et al., 2010; Zhu et al., 2011; Komori et al., 2014; Xie et al., 2014). In addition, tll is essential for efficient and extended production of MB neurons (Kurusu et al., 2009), and lack of SoxN ensures proper Tll expression in MB NBs and young progenies (Fig. 6F). Moreover, retn and ems have been shown to govern MB and AL neuronal differentiation, respectively (Ditch et al., 2005; Lichtneckert et al., 2008). The expression of multiple lineage-specific TFs in a given NB, possibly as a combinatorial TF code, contrasts with the recent report of otd (oc) as a master lineage cell fate-determining gene in a particular NB lineage (Sen et al., 2014).
Our RNA-seq data also reveal 75 TFs abundantly expressed in most, if not all, NBs. A putative transcriptional network of 28 TFs enriched in ase-positive type I NBs, as compared with their neuronal offspring, has been depicted (Berger et al., 2012). Only seven of the TFs [E(spl)mγ-HLH, wor, dpn, crc, Ssrp, grh and ken] appear on our list of abundantly expressed NB TFs. Six additional genes (mod, Ssb-c31a, klu, CG4570, CG15715 and CG10565) are not annotated as TFs and would otherwise have been included in our list. The remaining 15 previously identified type I NB-characteristic TFs fail to pass our arbitrary abundance threshold of average TPM>100 in the non-selective NB group (Fig. S5); five of these TFs show an average TPM of less than 30. Although they might be significantly enriched in NBs as compared with post-mitotic neurons, these weakly expressed TFs show similarly low levels of expression in all our sampled NB groups and are thus unlikely to play central roles in core NB programs. By contrast, the ∼80 strongly expressed, pan-NB TFs reported here, especially the seven uncharacterized NB-enriched TFs [CG5343, CycH, E(spl)m8-HLH, E(spl)mγ-HLH, HmgD, dj-1β and mod], along with the six well-known NB-specific TFs (ase, dpn, grh, sna, wor and pros), are likely to govern various aspects of core NB programs.
In conclusion, we have established an integrated microscopy system for robotic single-cell picking and demonstrated how it facilitates the identification and prompt recovery of rare target cells from dissociated tissues. The system is ideal for picking genetically marked cells. Three specific subsets of Drosophila NBs, targeted via various intersectional transgenic tactics, were collected for RNA-seq, which revealed the molecular heterogeneities of neural progenitors. Transcriptome analysis of lineage-specific precursors promises to elucidate the gene regulatory networks underlying the development of complex tissues.
MATERIALS AND METHODS
Genetic models for picking lineage-specific NBs
Transgenes used in this study are listed in Table 1. To label MB NBs specifically, we crossed dpnEE>KDRTs-stop-KDRTs>LexA::P65; lexAop-myrGFP female flies with UAS-KD; R41A10-ZpGDBD, R14D11-P65ADZ male flies. To label the four AL (ALad1, ALl1, ALv1 and ALv2) NBs specifically, we crossed dpnEE>KDRTs-stop-KDRTs>LexA::P65; lexAop>FRT-stop-FRT>myrGFP; R44F03-KD female flies with lexAop>FRT-stop-FRT>myrGFP; EMS-GAL4 2.6D, UAS-FLP male flies. To label the eight type II NBs specifically, we crossed dpn>KDRTs-stop-KDRTs>GAL4; UAS-mCD8::GFP (46), UAS-mCD8::GFP (48); stg14-KD, lexAop-myr::tdTomato female flies with R9D11-LexA::P65, 20XUAS-mCD8::GFP; R9D11-GAL80, asense-GAL80 male flies.
For ectopic expression of SoxN, GAL4-OK107 was used to drive UAS-SoxN (Overton et al., 2002) throughout MB neurogenesis.
Dissection and trituration of larval CNS
Drosophila larvae hatched within a 2 h time window were cultured at 25°C for 50 h. We dissected out the synchronized CNS in freshly prepared adult hemolymph saline (AHS) buffer (108 mM NaCl, 5 mM KCl, 2 mM CaCl2, 8.2 mM MgCl2, 4 mM NaHCO3, 1 mM NaH2PO4, 5 mM trehalose, 10 mM sucrose, 5 mM HEPES, pH 7.0). As much as possible of the ventral nerve cord was removed. A collection of ∼30 larval brain samples was immediately transferred to an Eppendorf tube (1.5 ml) containing 200 µl AHS with Pronase (1 mg/ml; P5147, Sigma-Aldrich) and Dispase (1 mg/ml; LS02104, Worthington Biochemical Corporation). After tissue digestion for 30 min at 25°C, the enzyme medium was removed, followed by two brief, gentle washes with fresh AHS containing 50 μM AP-5 (76326-31-3, Sigma-Aldrich), 20 μM DNQX (2379-57-9, Sigma-Aldrich), 0.1 μM TTX citrate (1069, Tocris) and 2% FBS. The brain samples were then triturated by gently pipetting in 1 ml AHS through four fire-polished Pasteur pipettes (13-678-20B, Fisher Scientific) with descending pore sizes for ∼20 min. Next, the dissociated cells were centrifuged for 5 min at 49 RCF and the AHS was replaced to minimize any RNA content released from damaged cells during the trituration process. Finally, the dissociated cells were concentrated by centrifuging 5 min at 49 RCF before single-cell picking.
Single-cell picking
Parts of our custom-built single-cell picking device, including specifications and sources, are listed in Table 2.
The 75×50 mm microscope slide was cleaned with double-distilled water and the surface was dried with compressed air (Air'it; 23-022523, Fisher Scientific). A 75×40 mm piece of transparent silicone clear rubber (thickness ∼1.6 mm), carved into two 30×30 mm chambers, was placed on the center of the slide. To reduce adhesion, chamber surfaces were treated with 500 μl 300 μg/ml PLL-PEG [PLL(20)-g[3.5]- PEG(2); SuSoS] for 2 h at room temperature. Each chamber was rinsed twice with AHS then pre-wet by adding 400 μl AHS. Concentrated cells were resuspended in 150 μl AHS then gently added into the center of the pre-wet left chamber. A final volume of 1300 μl AHS was added to each chamber. The two-step picking procedure was able to collect ∼60 NBs in 2 h (Fig. S2). Picked cells were released directly into a PCR tube (PCR-02-C, Corning Life Sciences-Axygen Scientific), and 9.5 μl PicoPure extraction buffer (12317-03, Life Technologies) was immediately added. The mixture was incubated at 42°C for 30 min before storing at −80°C.
For the extra wash experiment, pure target cells were transferred to a fresh chamber, ten cells at a time, using a new capillary needle. After all the pure target cells were transferred, ten target cells were collected using another capillary needle and then released directly into a PCR tube for RNA extraction and storage. For the extra 2 h incubation experiment, pure target cells were incubated in the collection chamber for 2 h after removing non-target cells.
RNA/cDNA preparation and qPCR
Each PCR tube containing approximately ten cells was stored at −80°C until RNA preparation and amplification. Approximately 100 NBs of the identical experimental condition were combined from ten tubes and used as an input for RNA extraction with the PicoPure RNA Isolation Kit (KIT0204, Life Technologies). The RNA solution in 11 μl elution buffer (Life Technologies) was concentrated to 4 μl with a SpeedVac. The quality of RNA extracted from ∼100 NBs was examined on a bioanalyzer using the RNA 6000 Pico Kit (5067-1513, Agilent), with a yield of ∼500 pg (Fig. S6). After adding 1 μl ERCC RNA Spike-in Mix (1/100,000 of the original concentration; 4456740, Life Technologies), the RNA was immediately converted to cDNA and underwent further amplification with the Ovation RNA-Seq System V2 (7102-A01, NuGEN). The final cDNA was dissolved in 30 μl TE buffer (75793, Affymetrix). Total RNA from larval CNS was isolated by TRIzol purification (15596-026, Invitrogen).
Before cDNA fragmentation, qPCR was performed to determine the expression levels of known NB-specific, neuron-specific and glia-specific genes, as a preliminary quality control. For each gene, at least three qPCR primers, either as suggested in http://www.flyrnai.org/FlyPrimerBank or as used in previous publications, were screened through standard curve analysis and the best primers were chosen based on their efficiency and specificity. For qPCR, 2.5 ng cDNA was used for inputs, which is within the best range of DNA input suggested by the standard curve analysis.
After confirming cDNA quality, ∼2.25 μg cDNA from each sample was fragmented by ultrasonicator (LE220, Covaris), followed by final library construction with the Ovation Rapid DR Multiplex System 1-96 kit (0328-96, NuGEN). Each of the final fragmented cDNA libraries was dissolved in 9 μl buffer EB (19086, Qiagen) and sequenced (single-end 100 bp read) using the HiSeq 2500 system (Illumina).
RNA-seq and transcriptome analysis
A total of 778 million reads (average 43.2 million reads per sample) were obtained. The FASTQ data were first processed with cutadapt (Martin, 2011) to remove Illumina adaptor and NuGEN SPIA adaptor. Then, reads mapping to ribosomal and other abundant or low complexity sequences (polyA, polyC, phiX, mitochondrial sequences) were removed using Bowtie 2 (Langmead and Salzberg, 2012). On average, 77.2% of the reads were removed at this stage. The remaining reads were then mapped to UCSC dm3 genome using STAR (Dobin et al., 2013). On average, 22.3% of the initial reads were mapped to dm3.
CPM (counts per million fragments mapped) and FPKM (fragments per kilobase of exon per million fragments mapped) were calculated from the mapped reads using HTSeq (Anders et al., 2015) supplied with Illumina iGenome UCSC dm3 gene annotation gtf. FPKM values were converted to TPM (transcripts per million) for downstream analysis (Table S1).
For differential expression analysis (Fig. 3C, Fig. 5, Fig. S4), the limma-voom package (Law et al., 2014) was used as supplied with CPM filtered with the criteria that at least four samples should have CPM>1 (9968 selected out of 15340). q-values were calculated using the Benjamini-Hochberg method.
As TFs we used a combined list of genes annotated in Gene Ontology (term: GO:0001071) of QuickGo (Binns et al., 2009) and genes annotated as ‘transcription factor' in PANTHER (Mi et al., 2013). PANTHER classification was used for the enrichment analysis in Fig. S3.
RNA-seq data are available in NCBI Gene Expression Omnibus with accession number GSE71104.
Immunohistochemistry and confocal imaging
Brain tissues dissected from larvae at a specific stage were fixed with 2% paraformaldehyde and immunostained as described previously (Lin et al., 2012). The following primary antibodies were used: rabbit anti-GFP 1:1500 (A11122, Life Technologies), rat anti-GFP 1:500 (GF090R, Nacalai), rabbit anti-DsRed 1:500 (632496, Clontech), mouse nc82 mAb 1:100 (Developmental Studies Hybridoma Bank), rabbit anti-SoxN 1:400 (gift from S. Crews, University of North Carolina at Chapel Hill), guinea pig anti-Tll 1:500 (gift from C. Desplan, New York University), rabbit anti-Rx 1:300, rabbit anti-Mid 1:1000 and rabbit anti-Dr 1:500 (gifts from H. Lacin, HHMI). Corresponding fluorescent secondary antibodies (1:500) were purchased from Life Technologies. Images were collected on a Zeiss LSM 710 confocal microscope.
Acknowledgements
We thank the Janelia Neuro-Seq Project Team for essential technical support; Dr Haluk Lacin for Rx, Mid and Dr antibodies; Dr Steven Russell for UAS-SoxN and SoxN antibody; Dr Stephen Crews for SoxN antibody; Dr Claude Desplan for Tll antibody; R. Miyares for input and critical reading of the manuscript; and C. Sullivan for administrative support.
Footnotes
Author contributions
C.-P.Y., Z.L. and T.L. designed the study. C.-P.Y., K.S. and Z.L. performed experiments. C.-C.F. constructed the cell-picking device with suggestions from L.P.L. and K.S. L.-Y.L. contributed to fly brain dissection and cell dissociation. Q.R. engineered the MB NB-specific driver. X.Y. made various DNA constructs. C.-P.Y., Z.L., K.S. and T.L. analyzed the data and wrote the manuscript. C.-P.Y., C.-C.F., K.S. and Z.L. contributed equally to this work.
Funding
This work was supported by the Howard Hughes Medical Institute. Deposited in PMC for release after 6 months.
References
Competing interests
The authors declare no competing or financial interests.