Intestinal stem cell (ISC) plasticity is thought to be regulated by broadly permissive chromatin shared between ISCs and their progeny. Here, we have used a Sox9EGFP reporter to examine chromatin across ISC differentiation. We find that open chromatin regions (OCRs) can be defined as broadly permissive or dynamic in a locus-specific manner, with dynamic OCRs found primarily in loci consistent with distal enhancers. By integrating gene expression with chromatin accessibility at transcription factor (TF) motifs in the context of Sox9EGFP populations, we classify broadly permissive and dynamic chromatin relative to TF usage. These analyses identify known and potential regulators of ISC differentiation via association with dynamic changes in chromatin. Consistent with computational predictions, Id3-null mice exhibit increased numbers of cells expressing the ISC-specific biomarker OLFM4. Finally, we examine the relationship between gene expression and 5-hydroxymethylcytosine (5hmC) in Sox9EGFP populations, which reveals 5hmC enrichment in absorptive lineage-specific genes. Our data demonstrate that intestinal chromatin dynamics can be quantitatively defined in a locus-specific manner, identify novel potential regulators of ISC differentiation and provide a chromatin roadmap for further dissecting cis regulation of cell fate in the intestine.

Intestinal stem cells (ISCs) maintain the intestinal epithelium, which is replaced once a week throughout adult life. To preserve digestive and barrier function, ISCs balance proliferation with differentiation into post-mitotic intestinal epithelial cells (IECs): absorptive enterocytes, goblet cells, Paneth cells, enteroendocrine cells (EECs) and tuft cells. Models for cellular hierarchy in the intestinal epithelium have been complicated by observations of extensive cellular plasticity, wherein secretory and absorptive progenitors, as well as post-mitotic Paneth cells, can de-differentiate following damage to or ablation of the ISC pool (Buczacki et al., 2013; Schmitt et al., 2018; Tetteh et al., 2016). The genetic mechanisms that allow IECs to adopt metastable differentiated fates while exhibiting facultative ISC function are not fully understood.

Chromatin landscapes consist of histone post-translational modifications, DNA modifications and higher-order structural organization, and are known to exert a regulatory control on cell fate. Chromatin is progressively remodeled during differentiation, and is highly dynamic in embryonic stem cells as well as in tissue-specific stem cells in the hair follicle and hematopoietic system (Dixon et al., 2015; Lara-Astiaso et al., 2014; Lien et al., 2011). Recent studies of chromatin in ISCs and differentiated IECs suggest that intestinal chromatin is largely homogenous, with similar enhancer, open chromatin and DNA methylation profiles observed in ISCs and differentiated progeny (Kaaij et al., 2013; Kim et al., 2014). The observed broadly permissive nature of chromatin has been proposed as a mechanism for cell fate plasticity in the intestine (Kim et al., 2014). However, these studies relied on pharmacological or genetic modulation of intestinal epithelial populations to force over-production of absorptive or secretory cells, or have compared Lgr5+ ISCs with bulk differentiated populations (Barker et al., 2007; Kaaij et al., 2013; Kim et al., 2014). The nature of IEC chromatin has been further complicated by other studies demonstrating histone methylation dynamics between ISCs and differentiated progeny, as well as chromatin remodeling during de-differentiation of facultative ISCs following radiation injury (Jadhav et al., 2017; Kazakevych et al., 2017). The extent to which chromatin is dynamic across ISCs, intermediate transit-amplifying (TA) progenitors and post-mitotic IECs remains controversial.

Differential expression levels of Sox9, observed via a Sox9EGFP BAC transgene, are associated with phenotypically distinct IEC populations (Formeister et al., 2009). Histological and gene expression studies have shown that Sox9low cells are consistent with ISCs, Sox9sublow (Sox9sub) cells with TA progenitors, and Sox9high cells with label-retaining secretory progenitors/facultative ISCs in the crypts, and mature EECs and tuft cells in the villus; Sox9neg cells consist of the remaining post-mitotic populations (goblet, Paneth and absorptive enterocyte) (Formeister et al., 2009; Gracz et al., 2010, 2015; Roche et al., 2015). We reasoned that this model could be applied to omics-scale assays to understand dynamics in native IEC chromatin. Here, we profile the transcriptome, open chromatin (Omni ATAC-seq) and 5hmC (Nano-hmC-Seal) to interrogate the relationship between gene expression changes and intestinal chromatin dynamics in the Sox9EGFP model. Through the application of quantitative approaches, our analyses demonstrate clear examples of both broadly permissive and dynamic chromatin in IECs that can be associated with specific transcriptional regulatory networks.

Sox9 expression levels define transcriptomically distinct populations

We isolated Sox9neg, Sox9sub, Sox9low and Sox9high IEC populations from adult Sox9EGFP mice by fluorescence-activated cell sorting (FACS) and subjected them to RNA-seq (Fig. 1A, Fig. S1A), which confirmed that Sox9EGFP levels faithfully recapitulate endogenous Sox9 expression (Fig. S1B) (Formeister et al., 2009). Previous studies have defined Sox9EGFP populations by RT-qPCR and immunofluorescence, and we next sought to confirm our model with transcriptomic data (Formeister et al., 2009; Gracz et al., 2010). To identify genes enriched in each population, we conducted differential expression analysis and filtered for genes significantly upregulated in a single Sox9EGFP population. We identified 2852, 253, 1499 and 3521 genes that were upregulated in Sox9neg, Sox9sub, Sox9low and Sox9high populations, respectively (Fig. 1B and Table S1). These included genes known to be associated with expected cell types in each Sox9EGFP population: absorptive enterocyte Fabp1 and goblet cell Muc5b/6 in Sox9neg, secretory progenitor Dll1 in Sox9sub, ISC Ascl2, Olfm4 and Smoc2 in Sox9low, and EEC Reg4 and tuft cell Il25 in Sox9high (Fig. 1B). Our data show enrichment of Paneth cell-specific Lyz1 and Lyz2 in Sox9neg, consistent with previous observations that the EGFP transgene is silenced in Paneth cells, which express endogenous Sox9 (Formeister et al., 2009; Gracz et al., 2015). As our analysis was designed for stringency, Sox9EGFP population gene lists excluded genes expressed highly in more than one population, such as Lgr5, which is expressed in Sox9low active ISCs as well as in Sox9high secretory progenitors (Fig. S1C) (Buczacki et al., 2013; Roche et al., 2015). Gene ontology (GO) analysis identified expected relationships between Sox9EGFP populations and cellular function, including mitosis in Sox9sub progenitors, stem cell maintenance in Sox9low ISCs, secretory and hormone processes in EEC-enriched Sox9high, and absorption and secretion in Sox9neg, which contains absorptive enterocytes as well as secretory goblet and Paneth cells (Fig. S1E and Table S2).

Fig. 1.

Sox9EGFP populations capture the spectrum of IEC differentiation. (A) Sox9EGFP is expressed at distinct levels in the crypt and as rare ‘high’-expressing cells in the villi. Scale bar: 100 µm. (B) Sox9EGFP levels define distinct cell populations bearing unique transcriptomic signatures (heat map represents differentially expressed genes by RNA-seq). (C) Comparison with cell type-specific gene expression signatures from scRNA-seq experiments demonstrates relative enrichment of differentiated cell types in Sox9neg, progenitors in Sox9sub, ISCs in Sox9low, and EECs and tuft cells in Sox9high (each point represents the expression of one gene). (D) Colocalization between cell type-specific protein markers and Sox9EGFP by immunofluorescence. (E) Quantification of EGFP signal confirms these relationships. Scale bar: 25 µm. Each point in E represents an individual cell. *P<0.005; **P<0.0001.

Fig. 1.

Sox9EGFP populations capture the spectrum of IEC differentiation. (A) Sox9EGFP is expressed at distinct levels in the crypt and as rare ‘high’-expressing cells in the villi. Scale bar: 100 µm. (B) Sox9EGFP levels define distinct cell populations bearing unique transcriptomic signatures (heat map represents differentially expressed genes by RNA-seq). (C) Comparison with cell type-specific gene expression signatures from scRNA-seq experiments demonstrates relative enrichment of differentiated cell types in Sox9neg, progenitors in Sox9sub, ISCs in Sox9low, and EECs and tuft cells in Sox9high (each point represents the expression of one gene). (D) Colocalization between cell type-specific protein markers and Sox9EGFP by immunofluorescence. (E) Quantification of EGFP signal confirms these relationships. Scale bar: 25 µm. Each point in E represents an individual cell. *P<0.005; **P<0.0001.

We then compared gene expression levels from each Sox9EGFP population against gene sets that define distinct IEC subtypes in previously published scRNA-seq data (Fig. 1C) (Haber et al., 2017). Consistent with our previous characterization of Sox9EGFP populations, genes that define absorptive enterocytes and Paneth cells were enriched in Sox9neg, TA and enterocyte progenitors (EP) in Sox9sub, Lgr5-positive ISCs in Sox9low, and EECs and tuft cells in Sox9high. Similar trends of enrichment between Sox9EGFP populations and IEC-specific gene sets were observed in comparisons with a second, independently generated scRNA-seq dataset (Fig. S1D) (Yan et al., 2017). Interestingly, some goblet cell genes were enriched in Sox9sub TAs, which may reflect lineage ‘priming’ of post-mitotic gene biomarkers in progenitor populations, consistent with previous reports (Kim et al., 2016b).

Finally, we validated the correlation between Sox9EGFP levels and protein biomarkers of IEC populations by semi-quantitative confocal microscopy, which demonstrated that cells identified by cell type-specific markers express EGFP levels predicted by RNA-seq (Fig. 1D,E). Importantly, MUC2+ goblet cells were found in the Sox9neg population, despite enrichment of some goblet cell genes in Sox9sub TAs. Collectively, our RNA-seq data support the distinct identities of Sox9EGFP populations, define the transcriptional output of each population, and establish the Sox9EGFP model as a viable, single-biomarker approach for isolating ISCs, progenitors and post-mitotic cells for studies of chromatin dynamics.

Open chromatin regions are dynamic across Sox9 populations

Next, we sought to determine the contribution of chromatin dynamics to the unique transcriptomic identities of Sox9EGFP populations. We mapped open chromatin regions (OCRs) in Sox9neg, Sox9sub, Sox9low and Sox9high by Omni-ATAC-seq (Corces et al., 2017). High confidence OCRs (81,919) were identified across all four populations (Fig. S2A and Table S3). We observed a strong correlation between biological replicates within each Sox9EGFP population, and distinct clustering of Sox9neg, Sox9sub, Sox9low and Sox9high by OCRs (Fig. 2A). Sox9sub TAs and Sox9low ISC populations were more similar when analyzed using Pearson correlation than were Sox9neg and Sox9high populations, consistent with their identity as proliferative undifferentiated IECs (Fig. 2A). We classified peaks found in all four Sox9EGFP populations as ‘shared’ and compared shared peaks with peaks found in single populations relative to genomic features. The majority of peaks present in a single population were found in introns or intergenic regions, while promoters represented a greater relative proportion of shared peaks (Fig. 2B). Site-specific analysis of genome browser tracks confirmed dynamic changes in OCRs, with open chromatin found in loci encoding genes associated with function in Sox9EGFP subpopulations. Lct, which is expressed in absorptive enterocytes, was enriched for OCRs specific in Sox9neg (Fig. 2C), with a similar correlation between expression and chromatin status observed for Neurod1 in Sox9high (Fig. 2D) and Olfm4 in Sox9low (Fig. 2E). Some intragenic OCRs were found in Sox9EGFP populations that did not highly express the corresponding gene, as observed for Olfm4-associated OCRs in Sox9sub cells (Fig. 2E). This could either reflect chromatin ‘priming’ to facilitate activation of the corresponding gene under stimulus, or the presence of an enhancer with impacts on a distal gene.

Fig. 2.

IEC populations have unique and dynamic OCR signatures. (A) Sox9EGFP populations cluster reproducibly when assessed by Pearson's correlation of OCRs (n=3 biological replicates per population). (B) Intergenic regions and introns represent the majority of OCRs present in single populations, while promoters represent a larger proportion of OCRs shared across all four populations. (C-E) Representative browser tracks demonstrate relationships between OCR activity and gene expression for (C) Sox9neg-associated Lct, (D) Sox9high-associated Neurod1 and (E) Sox9low-associated Olfm4. (F) UMAP dimensionality reduction and Gini index analysis of OCRs reveals broadly permissive and dynamic chromatin, and predictable clustering by genome feature, including broadly permissive promoters and dynamic introns and intergenic loci (n=81,919 OCRs). (G) Quantitative ranking of genome features by Gini index confirms that the highest degree of chromatin dynamics are associated with introns and intergenic regions, but demonstrates dynamic loci in all features, including promoters. (H) UMAP visualization of dynamic OCRs (Gini≥0.4) by highest accessibility demonstrates distinct clustering by the Sox9EGFP population.

Fig. 2.

IEC populations have unique and dynamic OCR signatures. (A) Sox9EGFP populations cluster reproducibly when assessed by Pearson's correlation of OCRs (n=3 biological replicates per population). (B) Intergenic regions and introns represent the majority of OCRs present in single populations, while promoters represent a larger proportion of OCRs shared across all four populations. (C-E) Representative browser tracks demonstrate relationships between OCR activity and gene expression for (C) Sox9neg-associated Lct, (D) Sox9high-associated Neurod1 and (E) Sox9low-associated Olfm4. (F) UMAP dimensionality reduction and Gini index analysis of OCRs reveals broadly permissive and dynamic chromatin, and predictable clustering by genome feature, including broadly permissive promoters and dynamic introns and intergenic loci (n=81,919 OCRs). (G) Quantitative ranking of genome features by Gini index confirms that the highest degree of chromatin dynamics are associated with introns and intergenic regions, but demonstrates dynamic loci in all features, including promoters. (H) UMAP visualization of dynamic OCRs (Gini≥0.4) by highest accessibility demonstrates distinct clustering by the Sox9EGFP population.

A limitation of peak overlap-based methods for assessing chromatin dynamics in a multi-population experiment is how to classify peaks that are found in two or more populations, but not in all populations. In order to more quantitatively assess chromatin dynamics, we analyzed OCR variability across Sox9EGFP populations using the Gini index, a statistical measure of inequality that assigns higher values to more variable distributions (Yoshida et al., 2019). We visualized the Gini index by Uniform Manifold Approximation and Projection (UMAP), where each point on the plot represents a single OCR and the Gini index indicates the extent of variable accessibility across all four Sox9EGFP populations (Fig. 2F) (McInnes et al., 2018 preprint). High Gini index values are associated with dynamic chromatin, while low Gini index values indicate broadly accessible OCRs (Fig. S2B). As predicted by genome feature analysis of shared and single OCRs, promoters were mostly associated with OCRs bearing low Gini index values, while introns were associated with a greater number of OCRs bearing high Gini index values (Fig. 2F,G). Additionally, promoters were clustered together in the UMAP representation of the data, whereas intronic OCRs were spread throughout the plot, highlighting similarities in accessibility between promoter-associated OCRs. Comparison of Gini values at different genomic features confirmed that the highest degree of dynamic chromatin is found in introns and intergenic regions (Fig. 2G). Despite the overall low diversity of promoters, we identified 738 promoter-associated OCRs with Gini≥0.4 (Fig. 2F,G and Fig. S2; Gini index values for all 81,919 OCRs are listed in Table S3). These dynamic promoters were found in all four Sox9EGFP populations (n=56 Sox9neg, 97 Sox9sub, 216 Sox9low, 341 Sox9high) and were associated with genes enriched in their respective population (Fig. S2C,D). Applying Gini index values facilitated a more quantitative assessment and visualization of chromatin dynamics, which allowed us to identify OCRs with potential regulatory significance that may have been missed by conventional peak comparisons.

Next, we visualized OCR intensity and distribution by Sox9EGFP population. OCRs for Sox9low ISCs and Sox9sub progenitors were distributed across UMAP plots, with overlapping adjacent regions of density (Fig. S2E). Sox9neg and Sox9high populations demonstrated highest intensity OCRs at opposite ends of the plot, suggesting that the most significant differences in OCRs are found between populations containing post-mitotic IEC subtypes (Fig. S2E). To focus our analysis, we plotted dynamic OCRs (Gini>0.4) by Sox9EGFP population in which accessibility was highest. OCRs clustered by Sox9EGFP population, suggesting that many OCRs shared between two or more populations are subject to regulatory activity that reaches a maximum in a single cell type (Fig. 2H). Globally, ATAC-seq in Sox9EGFP populations demonstrates that chromatin in IEC differentiation is dynamic, associated with population-defining genes, and exhibits the most significant global changes in introns and intergenic regions in post-mitotic cells.

Chromatin dynamics are linked to transcription factor networks

We next sought to establish a framework to further examine the relationship between chromatin dynamics and gene expression in Sox9EGFP populations. We reasoned that changes in TF motif usage could be used to identify TFs associated with either broadly permissive or dynamic IEC chromatin, and further define the cis-regulatory landscape relative to potential trans-regulatory interactions. ChromVAR, which quantifies motif-dependent gain or loss of OCRs, identified altered motif accessibility across Sox9EGFP populations (Schep et al., 2017). Ranking TF motifs by variability in Sox9EGFP populations revealed motifs associated with broadly permissive chromatin (n=406 motifs, chromVAR value<5) as well as dynamic chromatin (n=195 motifs, chromVAR value ≥5) (Fig. 3A and Table S4).

Fig. 3.

Identification of regulatory networks by OCR dynamics. (A) Analysis of motif use in Sox9EGFP populations identifies TFs associated with dynamic (n=195, chromVAR≥5) and broadly permissive (406 motifs, chromVAR value<5) OCRs. (B) A subset of 69 TFs demonstrates significant correlation with dynamic motif use. TFs were selected based on a significant association between accessibility and expression, chromVAR>5 and TF expression higher than the 25th percentile of all genes. (C-E) Tn5 insertion relative to motif center (left) and relationships between TF expression and motif accessibility (right). (C) The majority of gene expression and motif relationships are positively correlated, including Foxa1 and Hnf4g. (D) Negative correlations are observed for transcriptional repressors, including Klf12. (E) TFs lacking correlation between motif accessibility and expression, like Stat3, are associated with broadly permissive chromatin. (F) Mapping motif counts with respect to UMAP space plots of Sox9EGFP population-specific OCRs reveals distributions predicted by gene expression/motif accessibility relationships (n loci=12,117 Hnf4g; 7877 Klf12). (G) Motifs for Foxa1, Foxa2, Rfx3 and Rfx6 are differentially distributed within Sox9high OCRs, suggesting regulatory roles in subpopulations of the same parent Sox9EGFP population (number of loci=7472 for Foxa1, 3302 for Foxa2, 3680 for Rfx3 and 4862 for Rfx6).

Fig. 3.

Identification of regulatory networks by OCR dynamics. (A) Analysis of motif use in Sox9EGFP populations identifies TFs associated with dynamic (n=195, chromVAR≥5) and broadly permissive (406 motifs, chromVAR value<5) OCRs. (B) A subset of 69 TFs demonstrates significant correlation with dynamic motif use. TFs were selected based on a significant association between accessibility and expression, chromVAR>5 and TF expression higher than the 25th percentile of all genes. (C-E) Tn5 insertion relative to motif center (left) and relationships between TF expression and motif accessibility (right). (C) The majority of gene expression and motif relationships are positively correlated, including Foxa1 and Hnf4g. (D) Negative correlations are observed for transcriptional repressors, including Klf12. (E) TFs lacking correlation between motif accessibility and expression, like Stat3, are associated with broadly permissive chromatin. (F) Mapping motif counts with respect to UMAP space plots of Sox9EGFP population-specific OCRs reveals distributions predicted by gene expression/motif accessibility relationships (n loci=12,117 Hnf4g; 7877 Klf12). (G) Motifs for Foxa1, Foxa2, Rfx3 and Rfx6 are differentially distributed within Sox9high OCRs, suggesting regulatory roles in subpopulations of the same parent Sox9EGFP population (number of loci=7472 for Foxa1, 3302 for Foxa2, 3680 for Rfx3 and 4862 for Rfx6).

To further explore relationships between TF expression and chromatin dynamics, we analyzed correlation between these two readouts and generated a subset of TFs demonstrating: (1) significant correlation between gene expression and chromatin accessibility over motifs and (2) chromVAR values >5 (Fig. 3B). Increased chromatin accessibility over a given motif was mostly associated with increased expression of the corresponding TF. This category included known regulators of IEC differentiation, which validated the computational approach. Foxa1, which drives EEC and goblet cell differentiation, is most highly expressed with the greatest degree of motif accessibility in Sox9high cells and Hnf4g, which drives absorptive differentiation, is most highly expressed with the greatest degree of motif accessibility in Sox9neg cells (Fig. 3C) (Chen et al., 2019; Ye and Kaestner, 2009). We also observed several cases where TF expression and motif accessibility were inversely correlated. Klf7, Klf8 and Klf12 were observed to be highly expressed but have the lowest level of motif accessibility in Sox9high cells, consistent with known roles for Klf family TFs as repressors (Fig. 3D and Fig. S3A) (McConnell and Yang, 2010). In addition to Klf TFs, we identified other TFs with no established role in ISC differentiation, including nuclear receptor superfamily genes in Sox9neg, and Ets family genes in Sox9high (Fig. 3B). Targeted examination of expression and accessibility relationships for known ISC (Fig. S3B) and early progenitor (Fig. S3C) TFs revealed a mix of expected and unexpected relationships. Sox9 and Ascl2 both demonstrated increased expression and motif accessibility in their expected Sox9EGFP populations (Fig. S3B). However, some secretory associated TFs, including Atoh1 and Spdef, demonstrated intermediate accessibility coincident with their highest expression (Fig. S3C). These relationships may suggest competition for or cooperation between multiple TFs at the same motif and require direct detection of TF binding in order to fully understand TF-chromatin relationships.

Finally, we visualized TF motif usage in OCRs relative to the Sox9EGFP population by UMAP to determine whether OCRs enriched for specific motifs formed clusters. Motif accessibility correlated with populations, as predicted by accessibility and TF expression relationships, and revealed differential distribution of TF motif usage within Sox9EGFP population clusters (Fig. 3F,G). Foxa1 and Foxa2 demonstrated similar distribution within Sox9high-associated OCRs, while Rfx3 and Rfx6 motifs were also enriched within the Sox9high cluster, but with unique distribution (Fig. 3G). These data suggest the existence of multiple cis-regulatory landscapes within each population and are supported by differential regulation of EEC subpopulations by Foxa1/2 (L- and D-cells) versus Rfx3/6 (K-cells) (Suzuki et al., 2013; Ye and Kaestner, 2009). By analyzing OCRs in Sox9EGFP populations relative to associated TF motifs, our data infer trans-regulatory interactions with dynamic chromatin that are confirmed by identification of TFs with known roles in IEC differentiation. Furthermore, the TF-motif dependent variability in OCR dynamics suggests that different trans-regulatory pathways may be associated with either broadly permissive or dynamic chromatin.

Chromatin dynamics are predictive of expanded ISC biomarker expression in Id3 null intestines

To test whether our computational analyses were robust in identifying novel ISC regulatory factors, we focused on Id3, which demonstrates an inverse correlation between TF expression and motif accessibility (Fig. 4A). Id3 is a known transcriptional repressor with functional significance in colon cancer, but its role in ISC biology remains unknown (Benezra et al., 1990; O'Brien et al., 2012). We acquired Id3 null mice, which carry a red fluorescent protein (RFP) inserted in the first exon of Id3, resulting in loss of expression (McMahon et al., 2008). Strong RFP expression was observed in the intestinal submucosa of Id3RFP/RFP jejunum, with weaker expression throughout the villus epithelium (Fig. 4B). Epithelial RFP expression patterns were therefore consistent with Id3 expression in differentiated Sox9neg and Sox9high populations by RNA-seq. Crypts in Id3RFP/RFP intestines were larger compared with wild-type controls (Fig. 4C) and immunofluorescence revealed increased numbers of crypt-based cells expressing OLFM4, an ISC-specific biomarker (Fig. 4D) (Schuijers et al., 2014). Additionally, we found that OLFM4+ cells extended further up the crypt axis (Fig. 4E). We observed no change in the percentage of KI67+ proliferating cells in Id3RFP/RFP crypts, suggesting that increased proliferation does not drive the observed increase in ISCs (Fig. 4F). Collectively, our data demonstrate an expansion of cells expressing the ISC biomarker OLFM4 in Id3RFP/RFP knockout mice, as predicted by Id3 expression and motif dynamics in Sox9EGFP populations. The inverse relationship between Id3 expression and motif accessibility, combined with the observed phenotype, suggests that Id3 may be involved in the repression of ISC-specific OCRs important for maintaining the stem cell state.

Fig. 4.

Loss of Id3 results in crypt hyperplasia and expansion of ISC biomarkers. (A) Expression and motif accessibility relationships suggest a repressive role for Id3 in ISC (Sox9low)-associated OCRs (n=5664 loci). (B) Id3RFP/RFP null mice exhibit RFP expression in villus epithelium, consistent with observed expression of Id3 in Sox9neg and Sox9high populations. Scale bar: 100 µm. (C,D) Id3RFP/RFP crypts are significantly larger compared with wild-type controls (C) and demonstrate increased OLFM4+ ISC numbers (D). Scale bar: 50 µm. *P<0.05; n=3 biological replicates, 12 crypts per mouse. (E) OLFM4+ cells are observed significantly further up the crypt-villus axis in Id3RFP/RFP intestines, suggesting delayed differentiation of ISCs (*P<0.05; n=3 biological replicates, 12 crypts per mouse). (F) Despite increased OLFM4+ cells and crypt depth, the percentage of KI67+ proliferating cells is unchanged between wild-type and Id3RFP/RFP crypts. Scale bar: 50 µm. n=3 biological replicates, 10 crypts per mouse. Data are mean±s.e.m. and individual points represent biological replicates.

Fig. 4.

Loss of Id3 results in crypt hyperplasia and expansion of ISC biomarkers. (A) Expression and motif accessibility relationships suggest a repressive role for Id3 in ISC (Sox9low)-associated OCRs (n=5664 loci). (B) Id3RFP/RFP null mice exhibit RFP expression in villus epithelium, consistent with observed expression of Id3 in Sox9neg and Sox9high populations. Scale bar: 100 µm. (C,D) Id3RFP/RFP crypts are significantly larger compared with wild-type controls (C) and demonstrate increased OLFM4+ ISC numbers (D). Scale bar: 50 µm. *P<0.05; n=3 biological replicates, 12 crypts per mouse. (E) OLFM4+ cells are observed significantly further up the crypt-villus axis in Id3RFP/RFP intestines, suggesting delayed differentiation of ISCs (*P<0.05; n=3 biological replicates, 12 crypts per mouse). (F) Despite increased OLFM4+ cells and crypt depth, the percentage of KI67+ proliferating cells is unchanged between wild-type and Id3RFP/RFP crypts. Scale bar: 50 µm. n=3 biological replicates, 10 crypts per mouse. Data are mean±s.e.m. and individual points represent biological replicates.

Profiling chromatin features shared between active and facultative ISC populations

The precise genetic regulation of IEC progenitor de-differentiation following epithelial injury remains poorly defined, but is associated with rearrangement of chromatin landscapes in facultative ISCs to more closely resemble active ISC chromatin (Jadhav et al., 2017). Previous studies have demonstrated that a subpopulation of Sox9high cells is consistent with label-retaining facultative ISCs (Roche et al., 2015). To assay for common chromatin features between active and facultative ISCs in uninjured intestinal epithelium, we identified OCRs found in Sox9low (active) and Sox9high (facultative) populations, but excluded from Sox9neg and Sox9sub (n=5092 of 81,919 peaks). Sox9low/high shared OCRs were found primarily in introns and intergenic regions, consistent with dynamic OCRs found across all four Sox9EGFP populations (Fig. 5A). Plotting shared OCRs with respect to UMAP space demonstrated overlap between Sox9low and Sox9high population distributions (Fig. 5B, compare with Fig. 2H), and shared OCRs exhibited generally higher Gini index values relative to all OCRs (Fig. 5C). To determine whether Sox9low/high shared OCRs were associated with increased transcriptional activity specific to these populations, we identified the nearest gene to each shared OCR and plotted expression across all Sox9EGFP populations. This gene set was significantly upregulated in Sox9low and Sox9high populations, further validating our analytical approach to identify shared cis-regulatory features with regulatory significance (Fig. 5D and Fig. S4A).

Fig. 5.

Shared chromatin accessibility in Sox9low and Sox9high populations is associated with enteroendocrine and ISC TFs. (A) Sox9low/high shared OCRs are enriched in introns and intergenic regions, consistent with dynamic chromatin characteristics (n=5092 OCRs). (B) Shared OCRs localize to overlapping Sox9low and Sox9high distributions on UMAP, and (C) demonstrate increased Gini index values relative to all identified IEC OCRs. (D) Genes nearest to shared OCRs are more highly expressed in Sox9low and Sox9high populations relative to Sox9neg and Sox9sub, consistent with functional significance (see Fig. S4A for statistical analysis). (E) Thirty TFs demonstrated motif enrichment and expression in Sox9low/high shared OCRs, including genes with known roles in ISC self-renewal and differentiation. (F) ISC signature genes Ascl2 and Tgif1 exhibit motif accessibility and gene expression patterns consistent with active transcriptional function in Sox9low active ISCs and chromatin ‘priming’ in Sox9high cells, which contain a known facultative ISC subpopulation. Plots show bias-normalized motif accessibility deviations from chromVAR (left) or z-scored RNA expression (right) in Sox9low and Sox9high populations for the 30 TFs in E. (G) ISC TF motifs are enriched across OCRs shared with or more exclusively associated with Sox9low and Sox9high cells, suggesting common and distinct regulatory roles in each population (n=5734 loci for Ascl2, 3281 loci for Tgif1).

Fig. 5.

Shared chromatin accessibility in Sox9low and Sox9high populations is associated with enteroendocrine and ISC TFs. (A) Sox9low/high shared OCRs are enriched in introns and intergenic regions, consistent with dynamic chromatin characteristics (n=5092 OCRs). (B) Shared OCRs localize to overlapping Sox9low and Sox9high distributions on UMAP, and (C) demonstrate increased Gini index values relative to all identified IEC OCRs. (D) Genes nearest to shared OCRs are more highly expressed in Sox9low and Sox9high populations relative to Sox9neg and Sox9sub, consistent with functional significance (see Fig. S4A for statistical analysis). (E) Thirty TFs demonstrated motif enrichment and expression in Sox9low/high shared OCRs, including genes with known roles in ISC self-renewal and differentiation. (F) ISC signature genes Ascl2 and Tgif1 exhibit motif accessibility and gene expression patterns consistent with active transcriptional function in Sox9low active ISCs and chromatin ‘priming’ in Sox9high cells, which contain a known facultative ISC subpopulation. Plots show bias-normalized motif accessibility deviations from chromVAR (left) or z-scored RNA expression (right) in Sox9low and Sox9high populations for the 30 TFs in E. (G) ISC TF motifs are enriched across OCRs shared with or more exclusively associated with Sox9low and Sox9high cells, suggesting common and distinct regulatory roles in each population (n=5734 loci for Ascl2, 3281 loci for Tgif1).

Next, we sought to leverage the Sox9low/high shared OCR dataset to identify potential transcriptional regulators of stemness. We identified 30 TF motifs enriched in Sox9low/high shared OCRs and compared motif accessibility with gene expression of the corresponding TF (Fig. 5E). This analysis identified two known ISC signature genes: Ascl2 and Tgif1 (Muñoz et al., 2012). Compellingly, both TFs demonstrated similar motif accessibility between Sox9low and Sox9high populations, but were expressed at significantly higher levels in Sox9low active ISCs (Fig. 5F). Ascl2 and Tgif1 motifs were also enriched in shared OCRs as well as OCRs that clustered more strongly with Sox9low or Sox9high populations (Fig. 5G). These data are consistent with ISC cis-regulatory elements being ‘primed’ in both active and facultative ISCs, and suggest the presence of shared and unique ASCL2- and TGIF1-binding sites in each population. Conceptually, this would facilitate rapid reactivation of Ascl2 and Tgif1 target genes in Sox9high facultative ISCs in the setting of damage to the active ISC pool. Similar patterns of accessibility and expression were observed for TFs with no known role in ISC function, including Atf1 and Atf7 (Fig. 5E). Future functional assays could be employed to determine whether these factors play novel roles in progenitor plasticity and stemness.

We reasoned that, in addition to shared ISC regulatory features, Sox9low and Sox9high populations would also share OCRs associated with early differentiation from ISCs to EECs. To this end, we identified EEC-associated TFs in our analyses. Unlike ISC TFs, EEC-associated TFs demonstrated substantial increases in both motif accessibility and expression between Sox9low and Sox9high populations, consistent with the activation of Sox9high-specific EEC phenotypes (Fig. S4B,C). By examining OCR and gene expression profiles shared between Sox9low and Sox9high populations, we characterize chromatin dynamics associated with ISC and EEC TFs and demonstrate proof of concept for parsing TF and chromatin networks associated with specific subpopulations from bulk genome/transcriptome analyses.

5-Hydroxymethylation is dynamic across ISC differentiation

We next sought to examine a specific chromatin modification in Sox9EGFP populations to further assay cis-regulatory dynamics. 5hmC is correlated with active gene expression and can be mapped in small numbers of primary cells using Nano-hmC-Seal (Han et al., 2016). Our transcriptomic data demonstrated that Tet1, Tet2 and Tet3, which catalyze hydroxymethylation of 5mC, are differentially expressed in Sox9EGFP populations, suggesting the potential for dynamic regulation of 5hmC across ISC differentiation (Fig. S5A). As 5hmC has been associated with both proximal and distal regulatory elements, we first examined differences in Nano-hmC-Seal signal across Sox9EGFP populations relative to promoters, gene bodies and distal OCRs (>2 kb from TSS) (Fig. 6A). Raw 5hmC signal was enriched over gene bodies (Fig. S5B), but reads were evenly distributed across gene bodies, OCRs and promoters when normalized for feature length (Fig. S5C). We identified differentially hydroxymethylated regions (DMHRs) in all four Sox9EGFP populations (Table S5). Sox9neg and Sox9high populations demonstrated more DMHRs relative to Sox9sub and Sox9low, consistent with increased 5hmC in differentiated IECs (Fig. 6B) (Kim et al., 2016a). Furthermore, a majority of DHMRs were found over gene bodies in each Sox9EGFP population (Fig. 6B). 5hmC varied across populations with particularly distinct hydroxymethylated promoters and gene bodies in Sox9neg and Sox9high populations, while OCRs demonstrated more shared or potentially progressive hydroxymethylation across all Sox9EGFP populations (Fig. 6C).

Fig. 6.

5hmC is associated with OCR dynamics in putative distal regulatory regions. (A) Selection criteria for the three classes of genomic features examined. Read counts from each region were used as input for differential hydroxymethylation analysis. (B) Number of statistically significant DHMRs found in each Sox9EGFP population (adjusted P-value<0.05). (C) Heatmap of z-scored 5hmC signal at DHMRs from each genomic class. (D) Relationships between 5hmC and gene expression in different genomic classes demonstrate correlation between gene body hydroxymethylation and gene expression (Pearson's r coefficient is reported for each association). (E) Browser tracks of 5hmC signal at Dgki show that intragenic 5hmC is dynamic across Sox9EGFP populations and associated with changes in gene expression. (F) The majority of genes demonstrate concordant relationships between 5hmC and expression.

Fig. 6.

5hmC is associated with OCR dynamics in putative distal regulatory regions. (A) Selection criteria for the three classes of genomic features examined. Read counts from each region were used as input for differential hydroxymethylation analysis. (B) Number of statistically significant DHMRs found in each Sox9EGFP population (adjusted P-value<0.05). (C) Heatmap of z-scored 5hmC signal at DHMRs from each genomic class. (D) Relationships between 5hmC and gene expression in different genomic classes demonstrate correlation between gene body hydroxymethylation and gene expression (Pearson's r coefficient is reported for each association). (E) Browser tracks of 5hmC signal at Dgki show that intragenic 5hmC is dynamic across Sox9EGFP populations and associated with changes in gene expression. (F) The majority of genes demonstrate concordant relationships between 5hmC and expression.

To determine the regulatory significance of 5hmC distribution by genomic element, we compared 5hmC levels with expression levels of genes located nearest to 5hmC-enriched loci. While OCRs and promoters demonstrated little correlation with gene expression, gene body 5hmC correlated positively with gene expression (Fig. 6D). Interestingly, we also noted a positive correlation between promoter and gene body 5hmC levels, suggesting that, despite relationships between promoter and genic 5hmC, the latter is more predictive of gene expression levels (Fig. S5D). We noted the same moderate positive correlation between expression and genic 5hmC in all Sox9EGFP populations, with Sox9high and Sox9neg cells displaying the strongest relationship, consistent with increased DHMRs in these populations and previous reports of 5hmC enrichment in villus epithelium (Fig. S5E) (Kim et al., 2016a). Accordingly, genome browser tracks revealed broad, dynamic domains of 5hmC across gene bodies that correlate with gene expression between Sox9EGFP populations (Fig. 6E). As genic 5hmC has been linked to increased transcription in previous studies, we compared fold change in gene expression with fold change of gene body 5hmC in each Sox9EGFP population (Han et al., 2016; Tsagaratou et al., 2014; Wu et al., 2011). We found that the vast majority of genes in each Sox9EGFP population were concordantly regulated relative to changes in genic 5hmC (gene expression increasing with genic 5hmC) (Fig. 6F). Together, our Nano-hmC-seal data demonstrate that 5hmC is differentially enriched between Sox9EGFP populations across the genome, but that gene body 5hmC is most predictive of gene expression levels.

Genic 5-hydroxymethylation correlates with absorptive gene expression

To determine whether 5hmC dynamics correlate with predictable patterns in ISC differentiation, we assessed the enrichment of 5hmC signal over defined marker genes for cell types identified by scRNA-seq (Haber et al., 2017). Consistent with previous reports, this analysis showed that 5hmC is depleted near the TSS, with high levels of the mark found throughout the gene body (Fig. 7A and Fig. S6A) (Han et al., 2016). We observed predictable relationships between enrichment of 5hmC in gene bodies associated with specific IEC subpopulations and their corresponding Sox9EGFP population. For example, Sox9neg cells were enriched for high 5hmC signal over genes associated with absorptive enterocytes, while Sox9high cells demonstrated modest enrichment of 5hmC over genes associated with tuft cells (Fig. 7A and Fig. S6A). Most cell type-specific marker gene sets were devoid of 5hmC enrichment associated with any one Sox9EGFP population. This suggests that 5hmC may have regulatory significance in some IEC populations, while being less important for cell type-specific gene expression in others.

Fig. 7.

5hmC dynamics over gene bodies are predictive of a role in absorptive gene expression. (A) Enrichment of 5hmC over scRNA-seq-defined IEC gene sets reveals increased signal in absorptive enterocyte genes in the Sox9neg population and tuft cell genes in the Sox9high population. (B) Organoids treated with dimethyl succinate to suppress global 5hmC levels do not demonstrate changes in organoid size (n=30 organoids per 0 mM and 10 mM dimethyl succinate groups, from three biological replicates). Scale bar: 100 µm. (C) Three days of succinate treatment results in a pronounced downregulation of the absorptive enterocyte markers Sis and Lct, as well as decreased expression of Chga and Tff3 (n=3 0 mM and 10 mM dimethyl succinate; *P<0.05, **P<0.005). Data are mean and individual points represent biological replicates.

Fig. 7.

5hmC dynamics over gene bodies are predictive of a role in absorptive gene expression. (A) Enrichment of 5hmC over scRNA-seq-defined IEC gene sets reveals increased signal in absorptive enterocyte genes in the Sox9neg population and tuft cell genes in the Sox9high population. (B) Organoids treated with dimethyl succinate to suppress global 5hmC levels do not demonstrate changes in organoid size (n=30 organoids per 0 mM and 10 mM dimethyl succinate groups, from three biological replicates). Scale bar: 100 µm. (C) Three days of succinate treatment results in a pronounced downregulation of the absorptive enterocyte markers Sis and Lct, as well as decreased expression of Chga and Tff3 (n=3 0 mM and 10 mM dimethyl succinate; *P<0.05, **P<0.005). Data are mean and individual points represent biological replicates.

To functionally test the role of 5hmC, we sought to modulate enzymatic activity of TETs in intestinal organoid cultures. TET1, TET2 and TET3 are expressed in an overlapping manner in IECs, and site-specific roles, potential redundancy and non-enzymatic function complicate 5hmC functional analyses relying on TET knockdown. As all TET enzymatic activity is sensitive to intracellular succinate/α-ketoglutarate ratios, we treated organoids with cell-permeable succinate to inhibit oxidation of 5mC to 5hmC. Although increased succinate is associated with intestinal inflammation and likely impacts IECs beyond α-ketoglutarate sensitive enzymes, this approach has previously been applied to broadly repress 5hmC and dissect its function in ES and neuroblastoma cells (Carey et al., 2015; Connors et al., 2018; Laukka et al., 2016). Dot blot analysis revealed a significant reduction in global 5hmC levels in organoids treated with 10 mM succinate for 5 days in vitro (Fig. S6B,C). Succinate-treated organoids appeared normal following treatment, with no obvious morphological defects or changes in size (Fig. 7B). To assess lineage allocation, we conducted RT-qPCR for IEC subtype-specific markers in control and succinate-treated organoids. We observed a significant and reproducible downregulation of the absorptive enterocyte-specific genes Lct and Sis at both 3 and 5 days of succinate treatment (Fig. 7C and Fig. S6D). Although Chga and Tff3 demonstrated slight downregulation within 3 days of treatment, this effect was not observed at 5 days, owing perhaps to increasing organoid heterogeneity and variability in gene expression between biological replicates over time. Succinate treatment had no impact on ISC gene expression (Fig. S6E). Our organoid experiments suggest that Nano-hmC-Seal assays demonstrating the greatest degree of 5hmC enrichment over absorptive enterocyte genes are predictive of the sensitivity of these genes to decreased global 5hmC.

Prior studies have arrived at conflicting conclusions regarding the extent to which chromatin in IECs is broadly permissive (Kaaij et al., 2013; Kim et al., 2014) or dynamic, with cell type-specific patterns in accessibility and chromatin modifications (Jadhav et al., 2017; Kazakevych et al., 2017; Lindeboom et al., 2018). Broadly permissive chromatin has been proposed as the genomic mechanism that facilitates progenitor de-differentiation during intestinal regeneration (Kim et al., 2014). However, focused studies of a secretory progenitor subpopulation have demonstrated significant chromatin reorganization during this same process, suggesting that chromatin must also undergo a significant degree of reorganization during normal differentiation (Jadhav et al., 2017). In an attempt to resolve these apparently conflicting results, we ‘mapped’ OCRs and 5hmC in IEC populations identified by a Sox9EGFP transgene. Although the Sox9EGFP model does not isolate ‘pure’ IEC subpopulations, our transcriptomic analyses demonstrate that distinct EGFP expression levels are capable of reproducibly defining ISCs, TAs and differentiated populations, capturing the longitudinal spectrum of lineage specification and facilitating ‘tissue wide’ chromatin profiling in an unperturbed system. This study establishes the Sox9EGFP transgene as tool for studying chromatin regulation in multiple IEC subpopulations, including intermediate progenitors, without requiring genetic modulation to enrich for different lineages.

Defining the relative extent of inter-population chromatin differences as ‘broadly permissive’ or ‘dynamic’ remains a significant conceptual challenge. Logically, all cell types within a single tissue can be expected to share a large number of broadly permissive cis-regulatory elements foundational to tissue identity. It is also expected that some degree of cis-regulatory elements define requirements for cellular subpopulation identity, and that these would appear more dynamic across populations. To dissect broadly permissive and dynamic elements, we applied Gini index as a quantitative approach for objectively defining changes in OCRs. The Gini index was developed for economic dispersion analysis, but was recently applied to quantify chromatin dynamics in the hematopoietic system (Yoshida et al., 2019). Our data affirm the Gini index as a valuable approach for quantifying chromatin dynamics in studies involving three or more groups, where pairwise analyses may be insufficient. We found that a majority of OCRs with relatively high Gini index values were present in intronic or intergenic regions, consistent with the positioning of enhancer elements and genome features associated with highly dynamic chromatin in the hematopoietic system (Yoshida et al., 2019). Functional association between specific enhancers and their associated genes would help further define the significance of these cis-regulatory elements in establishing and maintaining IEC cell fates.

To further characterize chromatin in Sox9EGFP populations, we combined analysis of (1) OCR activity, (2) TF motifs found within OCRs and (3) corresponding TF expression to identify transcriptional networks and their associated cis-regulatory landscape. Most identified TF motifs (n=406 motifs) were associated with OCRs that exhibited little variation in accessibility across ISC differentiation, consistent with a model of broadly permissive chromatin, where IEC chromatin is ‘primed’ to respond to diverse transcriptional regulatory signals (Kim et al., 2014). However, we identified an additional 195 motifs that were associated with dynamic chromatin accessibility, including known and potentially novel regulators of specific IEC lineage differentiation. We validated our computational approach by demonstrating a previously uncharacterized expansion of OLFM4+ IECs in Id3 knockout mice. Although further experiments are required to validate if loss of Id3 results in increased functional stemness, the specificity of OLFM4 as an ISC marker strongly suggests that computationally determined relationships between ID3 motif accessibility and expression across Sox9EGFP populations were robust in predicting TF function in a specific IEC subpopulation. Intriguingly, Id2 has been shown to negatively regulate ISC expansion in development, further supporting our phenotypic data in Id3RFP/RFP mice and suggesting common functional roles for Id proteins in intestinal biology (Nigmatullina et al., 2017). A notable limitation of our computational approach is the reliance on predicted TF motifs, which may overlap between multiple TFs. In the case of ID3, which does not directly bind DNA, the repressive effect on chromatin accessibility may be due to interaction with multiple distinct TFs, the motifs of which likely define predicted ID3 motifs found in public databases (Benezra et al., 1990). Direct mapping of TF-DNA interactions in Sox9EGFP populations will be required to further understand the interaction between TF networks and broadly permissive or dynamic IEC chromatin. Our data provide a comprehensive resource of expression and chromatin accessibility that can be used as a foundation for functional and mechanistic studies that uncover new regulators of IEC function.

To address the limitation of Sox9EGFP populations containing multiple subpopulations of different IECs, we demonstrated that ‘multi-omic’ analyses can be used to parse useful regulatory information from bulk datasets. Specifically, we applied quantitative analyses to identify shared OCRs, TF motifs and corresponding TF expression between Sox9low and Sox9high populations. This approach allowed us to characterize expression and motif accessibility dynamics for ISC-associated and EEC-associated TFs. In line with proposed mechanisms for progenitor plasticity, we found that ISC-associated TFs Ascl2 and Tgif1 exhibit characteristics consistent with broadly permissive chromatin, with variable expression but relatively invariable accessibility between Sox9low and Sox9high populations. In contrast, EEC-associated TFs are associated with both highly variable expression and motif accessibility, suggesting different cis- and trans-regulatory mechanisms underlying cellular processes of plasticity and lineage specification.

To understand a specific chromatin modification in ISC differentiation, we examined 5hmC dynamics. In agreement with previous studies in embryonic stem cells and the hematopoietic system, we observed enrichment of 5hmC in putative distal enhancers as well as promoters and gene bodies, but found that only genic 5hmC correlates with gene expression levels (Han et al., 2016; Tsagaratou et al., 2014; Wu et al., 2011). To determine the functional significance of this correlation, we combined Sox9EGFP population identities with gene sets identified in published scRNA-seq to plot enrichment of 5hmC over cell type-specific gene sets. This revealed increased hydroxymethylation over absorptive enterocyte genes, which we functionally confirmed by observed downregulation of Lct and Sis following succinate treatment of intestinal organoids. Although succinate treatment results in global reduction of 5hmC, it has broad effects, including inhibition of other alpha-ketoglutarate-dependent chromatin-modifying enzymes (Carey et al., 2015). Although outside of the scope of the present study, our organoid experiments raise questions about the individual and collective role of TET enzymes and 5hmC in ISC differentiation. While no current mechanism explains the link between genic 5hmC and increased gene expression, it has been speculated that this is secondary to TET2/3 association with RNA Pol II via SET/COMPASS (Deplus et al., 2013). Therefore, it remains unknown whether the observed downregulation of 5hmC-enriched absorptive genes in our experiments is a direct consequence of, or a correlation with, globally reduced 5hmC. Previous studies have shown that Tet1 is required for proper ISC numbers (Kim et al., 2016a), and further work dissecting the 5hmC-dependent and -independent functions of TET enzymes is needed to fully understand the role of 5hmC in ISC biology.

Large-scale chromatin mapping projects, including the Epigenome Roadmap, are crucial for expanding our understanding of the functional genome, but often focus on whole tissues at the expense of cell-level resolution (Roadmap Epigenomics Consortium, 2015). Next-generation studies require higher resolution approaches to avoid population averaging effects and to fully understand the regulatory significance of chromatin landscapes. Our study provides a chromatin ‘roadmap’ of a continuously renewing adult tissue that will facilitate further mechanistic dissection of the chromatin-modifying enzymes and TF networks involved in establishing functional genomes in IECs. Our quantitative analyses demonstrate that the IEC chromatin landscape can be both broadly permissive and highly dynamic, and that the extent of chromatin changes in ISC differentiation can be classified relative to predicted TF networks.

Mice

Sox9EGFP mice were maintained as heterozygotes on the C57Bl/6 background. Id3RFP/RFP mice were obtained from the Zhuang lab at Duke University and were also maintained on the C57Bl/6 background. Wild-type C57Bl/6 mice were used as controls in Id3 knockout experiments. All genomics experiments were carried out on adult males between 8 and 16 weeks of age. Id3 knockout experiments were carried out on adult male and female mice between 8 and 20 weeks of age, and control and Id3RFP/RFP mice were age and sex matched. Mice were co-housed, littermates were used when possible and age-matched non-littermates when not. All mice received Tekland Global Soy Protein-Free Extruded Rodent Diet (Envigo, 2020SX) and water ad libitum. Phenotyping for Sox9EGFP was carried out by observation of EGFP expression in hair follicles of tail clippings, as previously described (Formeister et al., 2009). The Institutional Animal Care and Use Committee (IACUC) of the University of North Carolina at Chapel Hill reviewed and approved all protocols.

Intestinal epithelial dissociation and FACS

Full-length intestine (duodenum, jejunum and ileum) was used for all sequencing-based experiments, and jejunum was used for organoid experiments. Intestinal epithelial dissociations were carried out as previously described (Zwarycz et al., 2018). For full-length preps, the entire small intestine was dissected and opened longitudinally before being rinsed in 1×Dulbecco's phosphate-buffered saline (DPBS) (Gibco) to remove fecal matter. For jejunal preps, the proximal 5 cm of intestine was removed and discarded, and the proximal half of the remaining intestine was opened longitudinally and rinsed in 1× DPBS to remove fecal matter. Rinsed intestine was transferred to a 50 ml conical tube containing 10 ml of 3 mM EDTA (Corning) and incubated for 15 min at 4°C with gentle nutation (RPM=40). Intestinal tissue was retrieved and transferred to a glass plate prior to being ‘brushed’ with a pipette tip to remove ∼50% of villi. The tissue was then minced to ∼3-5 mm pieces and transferred to a new 50 ml conical flask containing 10 ml of 3 mM EDTA and incubated for an additional 35 min at 4°C with gentle nutation (RPM=40). Tissue fragments were transferred to 10 ml DPBS in a third 50 ml conical flask and shaken by hand for 4 min to remove crypt-enriched epithelium. To obtain single cells, crypt epithelium was rinsed once with 25 ml DPBS then resuspended in 10 ml HBSS with 0.3 U/ml dispase (Corning) and 10 µM Y27632 (Selleck Chem), and incubated at 37°C for 12-16 min with gentle shaking and observation by light microscopy every 2 min until ∼80% of prep consisted of single cells. Cells were rinsed twice with DPBS containing 10% FetalPlex (Gemini Biosciences) to quench dispase activity.

Dissociated single cells were resuspended in 1× Sort Media [Advanced DMEM/F12 (Gibco), N2 (Gibco), B27 (Gibco), 10 mM HEPES (Corning), Glutamax (Gibco), Pen/Strep (Gibco), and 10 µM Y27632 (Selleck Chem)] and stained with CD31-APC (Biolegend) and CD45-APC (Biolegend) for 45 min on ice. Stained cells were rinsed once with Advanced DMEM/F12 and resuspended in 1× Sort Media. 7-AAD (Biolegend) and Annexin V-APC (Biolegend) were added immediately prior to sorting. Cells were sorted on a Sony SH800 FACS instrument with a 100 µm disposable nozzle chip’. Gating was carried out as previously described by Gracz et al. (2018) and shown in Fig. S1A. Doublet discrimination was visually validated by sorting ∼500 cells directly on to a glass slide and observing by light microscopy.

Organoid culture and dimethyl succinate treatment

Organoids were cultured from jejunal crypts in Cultrex Type II Growth Factor-Reduced matrix (R&D Biosystems). For 5hmC dot blot experiments, ∼300 crypts were seeded in 30 µl Cultrex droplets in 48-well plates and overlaid with 200 µl organoid media. For gene expression experiments, ∼100 crypts were seeded in 10 µl Cultrex droplets in 96-well plates and overlaid with 100 µl organoid media. Media consisted of Advanced DMEM/F12 (Gibco), N2 (Gibco), B27 (Gibco), 10 mM HEPES (Corning), Glutamax (Gibco), Pen/Strep (Gibco), 10% RSPO1-conditioned media (RSPO1 HEK293 cells were kindly provided by Calvin Kuo, Stanford University), 50 ng/ml recombinant murine EGF (Gibco) and 100 µg/ml recombinant human noggin (Peprotech). Y27632 (Selleck Chem, 10 µM) was added at initial plating. Organoids were allowed to establish themselves for 48 h prior to treatment with dimethyl succinate (Sigma). Media and succinate were replaced every 48 h.

RNA isolation

Twelve-thousand cells from each Sox9EGFP population were sorted directly into 500 µl of Lysis Buffer (Ambion) and stored at −80°C until RNA isolation. Total RNA was prepared using the RNAqueous Micro Kit (Ambion) according to manufacturer instructions and eluted in 15 µl molecular-grade H2O (Corning). RNA was treated with Ambion RNAqueous Micro Kit DNase I for 30 min at 37°C to eliminate gDNA, and DNase was inactivated using DNase Inactivation Beads. gDNA-free RNA was transferred to low-binding tubes and stored at −80°C until QC and library preparation.

gDNA isolation for Nano-hmC-Seal-seq and dot blot

For FACS-isolated Sox9EGFP populations, cells were collected directly into 250 µl 1× sort media, pelleted by centrifugation at 6000 g for 5 min, then lysed in 200 µl gDNA Lysis Buffer [10 mM Tris HCl (pH 8.0), 100 mM EDTA (pH 8.0), 0.5% SDS] with 1 mg/ml Proteinase K (Ambion) and incubated at 55°C in a water bath overnight. For organoids, growth media was isolated from wells, and Cultrex and organoids were lysed in 500 µl gDNA lysis buffer with 1 mg/ml proteinase K (Ambion) and incubated at 55°C in a water bath overnight. gDNA from both FACS-isolated cells and organoids was purified using a Quick DNA MicroPrep kit (Zymo) and assayed for concentration using the Qubit dsDNA High Sensitivity kit (Thermo Fisher) and a Qubit Fluorometer (Thermo Fisher).

RNA-seq: QC, library prep and sequencing

RNA from 12,000 cells per Sox9EGFP population was collected for each RNA-seq sample. From each sample, 5 µl out of 15 µl in total were allocated for RT-qPCR validation. cDNA was prepared as described in the ‘RT-qPCR’ section and sequential enrichment of Sox9 mRNA across EGFP-sorted populations was validated by RT-qPCR with Taqman probes against Sox9 (Life Technologies) using 18S as an internal control. To validate RNA quality, 1 µl of a 15 µl total from each sample was subjected to Agilent Bioanalyzer analysis using the Total Eukaryotic RNA Pico kit (Agilent). All samples demonstrated significant enrichment of Sox9 across the EGFP population and had RIN values of at least 8.0. Libraries were prepared using the SMARTer Stranded Total RNA-seq Kit v2 (Clontech) and 8 µl of 15 µl total RNA per sample, according to the manufacturer's instructions. Libraries were sequenced on a NextSeq500 (Illumina) with 75 bp single-end reads, v2 chemistry. Each library was sequenced to ≥2.9×107 reads.

Omni-ATAC-seq: library prep and sequencing

Twelve-thousand cells per Sox9EGFP population were collected directly into 250 µl 1× sort media, pelleted by centrifugation at 2000 g for 5 min, and subjected to Omni-ATAC as previously described by Corces et al. (2017). Libraries were sequenced on a NextSeq500 (Illumina) with 75 bp paired-end reads, v2 chemistry. Each library was sequenced to ≥5.0×107 reads.

Nano-hmC-Seal-seq: library prep and sequencing

Twelve-thousand cells per Sox9EGFP population were collected via FACS (above). gDNA was isolated as described above (see ‘gDNA isolation for Nano-hmC-Seal-seq and dot blot’ section) and subjected to Nano-hmC-Seal as previously described (Han et al., 2016). Libraries were sequenced on a NextSeq500 (Illumina) with 75 bp single-end reads, v2 chemistry. Each library was sequenced to ≥2.2×107 reads.

RNA-seq: analysis

Quantification was performed using salmon (version 0.8.2) (Patro et al., 2017) and differential expression testing was performed using DESeq2 (version 1.2.22) with tximport (1.10.1) to summarize transcript data at the gene level in R. Expression values in the Sox9-defined populations were then compared with single cell data from two independent sources (Haber et al., 2017; Yan et al., 2017). Genes for Fig. 1C were selected by filtering for genes significantly upregulated in a single population only. Ontology enrichment analysis was performed using clusterProfiler R Package (3.12.0).

Omni-ATAC-seq: analysis

Paired-end 76 bp reads were trimmed using trim_galore (version 0.4.3), aligned using bowtie2 (version 2.3.4) with -X 2000. Then duplicates were removed using picard (version 2.10.3) and peaks were called with Macs2 (version 2.1.2) –q 0.001 –f BAMPE –keep-dup-all. For visualization, signal tracks were computed using Deeptools (version 2.5.2) bamCoverage –binSize 1 –extendReads –normalizeUsingRPKM –ignoreForNormalization chrM chrX chrY –minFragmentLength 40 –maxFragmentLength 130. Comparison of replicates was performed using bamCompare in the deeptools package. Normalized Tn5 insertions were calculated using pyatac (Schep et al., 2015).

To identify differences in motif use, the R package chromVAR was used (Schep et al., 2017). To compare differences in accessibility at individual sites, UMAP dimensionality reduction was used (McInnes et al., 2018preprint) (R package version 0.2.1.0; CRAN.R-project.org/package=umap) and the gini index for each site across the four populations (three replicates each) was calculated using the gini function in the edgeR package. Motif counts were identified using the motifmatcher R package, using the default P-value cutoff of 5e-5 and projected into UMAP space. For TFs selected in Fig. 3B, we: (1) applied a filter for overall expression (>25th percentile of all gene expression), (2) required the TF be significantly differentially expressed in at least one Sox9EGFP population, and (3) required that the correlation between motif accessibility and TF expression be significant (adjusted P-value<0.05). The TFs identified in Fig. 5D were selected based on their enrichment using Homer (findMotifsGenome.pl version 4.8.3) comparing peaks shared between Sox9low and Sox9high with those peaks that were found in Sox9neg and Sox9sublow. They were further filtered by the overall expression amount (>25th percentile of all gene expression).

Nano-hmC-Seal-seq: analysis

5hmc data were trimmed using trim_galore and aligned to mm9 using bowtie2 (version 2.3.4) –sensitive. Following duplicate removal using picard (version 2.10.3), signal tracks were generated using deeptools (version 2.5.2) bamCoverage –binSize 10 –extendReads 150 –normalizeUsingRPKM. Signal over genic regions was calculated using deeptools computeMatrix function and gencode annotations (vM1). Pearson correlation between expression and 5hmc levels was calculated in R using the cor.test function and statistical inferences were adjusted for multiple testing using the p.adjust function. For DHMR analysis, reads in the different genomic bins (Fig. 6A) were counted using the R package featureCounts and then used as input to a DESeq2 analysis as described in the RNA-seq analysis section.

5hmC dot blot

gDNA prepared from control and dimethyl succinate-treated organoids was diluted to 400 ng per sample in 20 µl TE. Sodium hydroxide (40 µl and 0.1 M) was added and samples were incubated at 95°C for 10 min in thermal cycler to denature DNA. Ice-cold ammonium acetate (60 µl of 1 M solution) was added and serial dilutions were made: 60 µl sample diluted with 60 µl TE. Denatured gDNA was spotted onto Hybond N+ membranes (GE Amersham) using a BioDot SF Microfiltration apparatus (Bio-Rad). Membranes were dried for 5 min at 80°C, UV crosslinked and blocked overnight in 5% non-fat dry milk in PBS. Membranes were incubated in anti-5-hydroxymethylcytidine (Active Motif, mouse monoclonal, 59.1, 0.4 µg/ml) for 3 h at room temperature in 5% non-fat dry milk in PBS, then rinsed three times in PBS. Secondary detection was carried out by incubating membranes in Gt-anti-Ms HRP (Jackson ImmunoResearch, 1:5000) for 3 h at room temperature in 5% non-fat dry milk in PBS. Membranes were rinsed three times in PBS, HRP was detected using Clarity ECL (Bio-Rad), and blots were imaged using a Protein Simple gel box/imaging station. Following 5hmC detection, blots were rinsed three times in PBS and incubated for 10 min at room temperature in 0.02% methylene blue (Sigma) in 0.3 M sodium acetate (pH 5.2) to detect total gDNA. Methylene blue-stained blots were rinsed five times in molecular-grade H2O and imaged using a Protein Simple gel box/imaging station. Signal intensity for 5hmC was measured for 100 ng dots in ImageJ and changes in signal for succinate-treated organoids were calculated as percentage change relative to the untreated control for each biological replicate (n=4 biological replicates).

RT-qPCR

cDNA was prepared using iScript cDNA Synthesis Kit (Bio-Rad) and diluted 1:10 with molecular grade H2O (Corning) prior to use in RT-qPCR assays with SsoAdvanced Universal Master Mix (Bio-Rad). Taqman primers (Life Tech) were used to detect: Ascl2 (Mm01268891_g1), Chga (Mm00514341_m1), Cox1 (Mm04225243_g1), Defa-rs1 (Mm00655850_m1), Dclk1 (Mm00444950_m1), Lct (Mm01285112_m1), Lgr5 (Mm00438890_m1), Lyz2 (Mm00727183_s1), Muc2 (Mm00545872_m1), Sis (Mm01210305_m1), Sox9 (Mm00448840_m1), Tac1 (Mm00436880_m1) and Tff3 (Mm00495590_m1). Fold change was calculated using ΔΔCT with 18S (Hs99999901_s1) as the internal reference (Pfaffl, 2001).

Immunofluorescence and image quantification

Sox9EGFP, Id3RFP/RFP or wild-type C57Bl/6 control jejunum was dissected, fixed overnight in 4% PFA, then transferred to 30% sucrose for an additional 16-18 h prior to embedding in Tissue-Tek OCT (Sakura) and frozen at −80°C. Immunofluorescence for IEC lineage markers was carried out on 10 µm sections. Tissue was permeabilized with 0.3% Triton X-100 (Sigma) in PBS for 20 min at room temperature, followed by blocking in 1× Animal-Free Block (Cell Signaling Technologies) for 40 min. Primary antibodies (Table S6) were diluted at specified concentrations in 1× Animal-Free Block and incubated for 2 h at room temperature. Secondary antibodies were diluted 1:500 in 1× Animal-Free Block and incubated for 45 min at room temperature. Nuclear counterstain was carried out using bisbenzimide (Sigma).

For quantification of EGFP fluorescence in IEC lineages, images were taken as 1 µm optical sections using a Zeiss LSM 700 confocal microscope. EGFP fluorescence intensity was measured in ImageJ. Briefly, CD326 was used as an epithelial-specific membrane marker and cells positive for lineage-specific markers were outlined using the freehand drawing tool for pixel intensity measurement. Background signal was measured and subtracted from the raw value of the measured cells from the same image. Enteroendocrine cells were marked by CHGA (n=36), tuft cells were marked by DCLK1 (n=33), absorptive enterocytes were marked by FABP1 (n=97), Paneth cells were marked by LYZ (n=78), goblet cells were marked by MUC2 (n=62), intestinal stem cells (ISCs) were marked by OLFM4 (n=78), and transit-amplifying progenitor cells were considered to be in the +4 position and not marked by OLFM4 (n=78). Statistical significance was assessed by one-way ANOVA with post-hoc comparisons using Tukey's test in Prism 8 (GraphPad).

For OLFM4+ and KI67+ cells in Id3RFP/RFP and wild-type control mice, 1 µm optical sections were acquired using a Zeiss LSM 700 confocal microscope to ensure accurate quantification. OLFM4+ cells were quantified and localized based on crypt position, designating the first cell position at the base of each crypt as ‘0’. KI67 was quantified by counting all bisbenzimide-positive nuclei in each crypt and determining the percentage of KI67+ cells. Crypt depth was determined by measuring the distance from the basolateral membrane of the crypt base to the crypt-villus junction in ImageJ. Statistical significance was assessed by unpaired t-test in Prism 8 (GraphPad). The representative Id3RFP/RFP OLFM4 immunofluorescent image in Fig. 4D was rotated to achieve proper crypt orientation for comparison with the wild-type control.

The authors thank Drs Mei-fang Dai and Yuan Zhuang of Duke University for providing Id3RFP mice. Brian Golitz and Noah Sciaky for technical support with sequencing and deconvolution of sequencing files. Qiancheng You and Dr Chuan He of the University of Chicago for technical input and training on Nano-hmC-Seal. Dr Pablo Arial and the Microscopy Services Laboratory in the Department of Pathology and Laboratory Medicine at UNC Chapel Hill for assistance with confocal microscopy. Dr Bailey Zwarycz, Dr Joseph Burclaff, and Othmane Jadi for helpful conversations and critical reading of the manuscript. Dr Terry Magnuson and members of the Magnuson lab at UNC Chapel Hill for helpful conversations and feedback. The Microscopy Services Laboratory, Department of Pathology and Laboratory Medicine, is supported in part by P30 CA016086 Cancer Center Core Support Grant to the UNC Lineberger Comprehensive Cancer Center.

Author contributions

Conceptualization: S.T.M., A.D.G.; Methodology: J.R.R., A.D.G.; Validation: A.D.G.; Formal analysis: J.R.R., D.Y.T., K.E.W., J.M.M., A.D.G.; Investigation: J.R.R., D.Y.T., K.E.W., J.M.M., A.D.G.; Resources: S.T.M., A.D.G.; Data curation: J.R.R., D.Y.T.; Writing - original draft: J.R.R., A.D.G.; Writing - review & editing: J.R.R., S.T.M., A.D.G.; Visualization: J.R.R.; Supervision: A.D.G.; Project administration: A.D.G.; Funding acquisition: S.T.M., A.D.G.

Funding

This work was funded by the National Institutes of Health (K01 DK111709 and R03 DK122111 to A.D.G.; R01 DK091427 to S.T.M.), the American Gastroenterological Association (Research Scholar Award to A.D.G.), the National Institute of Diabetes and Digestive and Kidney Diseases (P30 DK34987 to A.D.G.), the Burroughs Wellcome Fund (Collaborative Research Travel Grant to A.D.G.), and the U.S. Department of Defense (W81XWH-19-1-0423 to J.R.R.). Deposited in PMC for release after 12 months.

Data availability

RNA-seq, Omni-ATAC-Seq and Nano-hmC-Seal data have been deposited in GEO under accession number GSE131442.

Barker
,
N.
,
van Es
,
J. H.
,
Kuipers
,
J.
,
Kujala
,
P.
,
van den Born
,
M.
,
Cozijnsen
,
M.
,
Haegebarth
,
A.
,
Korving
,
J.
,
Begthel
,
H.
,
Peters
,
P. J.
, et al. 
(
2007
).
Identification of stem cells in small intestine and colon by marker gene Lgr5
.
Nature
449
,
1003
-
1007
.
Benezra
,
R.
,
Davis
,
R. L.
,
Lockshon
,
D.
,
Turner
,
D. L.
and
Weintraub
,
H.
(
1990
).
The protein Id: a negative regulator of helix-loop-helix DNA binding proteins
.
Cell
61
,
49
-
59
.
Buczacki
,
S. J. A.
,
Zecchini
,
H. I.
,
Nicholson
,
A. M.
,
Russell
,
R.
,
Vermeulen
,
L.
,
Kemp
,
R.
and
Winton
,
D. J.
(
2013
).
Intestinal label-retaining cells are secretory precursors expressing Lgr5
.
Nature
495
,
65
-
69
.
Carey
,
B. W.
,
Finley
,
L. W. S.
,
Cross
,
J. R.
,
Allis
,
C. D.
and
Thompson
,
C. B.
(
2015
).
Intracellular alpha-ketoglutarate maintains the pluripotency of embryonic stem cells
.
Nature
518
,
413
-
416
.
Chen
,
L.
,
Toke
,
N. H.
,
Luo
,
S.
,
Vasoya
,
R. P.
,
Fullem
,
R. L.
,
Parthasarathy
,
A.
,
Perekatt
,
A. O.
and
Verzi
,
M. P.
(
2019
).
A reinforcing HNF4–SMAD4 feed-forward module stabilizes enterocyte identity
.
Nat. Genet.
51
,
777
-
785
.
Connors
,
J.
,
Dawe
,
N.
and
Van Limbergen
,
J.
(
2018
).
The role of succinate in the regulation of intestinal inflammation
.
Nutrients
11
,
E25
.
Corces
,
M. R.
,
Trevino
,
A. E.
,
Hamilton
,
E. G.
,
Greenside
,
P. G.
,
Sinnott-Armstrong
,
N. A.
,
Vesuna
,
S.
,
Satpathy
,
A. T.
,
Rubin
,
A. J.
,
Montine
,
K. S.
,
Wu
,
B.
, et al. 
(
2017
).
An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues
.
Nat. Methods
14
,
959
-
962
.
Deplus
,
R.
,
Delatte
,
B.
,
Schwinn
,
M. K.
,
Defrance
,
M.
,
Méndez
,
J.
,
Murphy
,
N.
,
Dawson
,
M. A.
,
Volkmar
,
M.
,
Putmans
,
P.
,
Calonne
,
E.
, et al. 
(
2013
).
TET2 and TET3 regulate GlcNAcylation and H3K4 methylation through OGT and SET1/COMPASS
.
EMBO J.
32
,
645
-
655
.
Dixon
,
J. R.
,
Jung
,
I.
,
Selvaraj
,
S.
,
Shen
,
Y.
,
Antosiewicz-Bourget
,
J. E.
,
Lee
,
A. Y.
,
Ye
,
Z.
,
Kim
,
A.
,
Rajagopal
,
N.
,
Xie
,
W.
, et al. 
(
2015
).
Chromatin architecture reorganization during stem cell differentiation
.
Nature
518
,
331
-
336
.
Formeister
,
E. J.
,
Sionas
,
A. L.
,
Lorance
,
D. K.
,
Barkley
,
C. L.
,
Lee
,
G. H.
and
Magness
,
S. T.
(
2009
).
Distinct SOX9 levels differentially mark stem/progenitor populations and enteroendocrine cells of the small intestine epithelium
.
Am. J. Physiol. Gastrointest. Liver Physiol.
296
,
G1108
-
G1118
.
Gracz
,
A. D.
,
Ramalingam
,
S.
and
Magness
,
S. T.
(
2010
).
Sox9 expression marks a subset of CD24-expressing small intestine epithelial stem cells that form organoids in vitro
.
Am. J. Physiol. Gastrointest. Liver Physiol.
298
,
G590
-
G600
.
Gracz
,
A. D.
,
Williamson
,
I. A.
,
Roche
,
K. C.
,
Johnston
,
M. J.
,
Wang
,
F.
,
Wang
,
Y.
,
Attayek
,
P. J.
,
Balowski
,
J.
,
Liu
,
X. F.
,
Laurenza
,
R. J.
, et al. 
(
2015
).
A high-throughput platform for stem cell niche co-cultures and downstream gene expression analysis
.
Nat. Cell Biol.
17
,
340
-
349
.
Gracz
,
A. D.
,
Samsa
,
L. A.
,
Fordham
,
M. J.
,
Trotier
,
D. C.
,
Zwarycz
,
B.
,
Lo
,
Y.-H.
,
Bao
,
K.
,
Starmer
,
J.
,
Raab
,
J. R.
,
Shroyer
,
N. F.
, et al. 
(
2018
).
SOX4 promotes ATOH1-independent intestinal secretory differentiation toward tuft and enteroendocrine fates
.
Gastroenterology
155
,
1508
-
1523.e10
.
Haber
,
A. L.
,
Biton
,
M.
,
Rogel
,
N.
,
Herbst
,
R. H.
,
Shekhar
,
K.
,
Smillie
,
C.
,
Burgin
,
G.
,
Delorey
,
T. M.
,
Howitt
,
M. R.
,
Katz
,
Y.
, et al. 
(
2017
).
A single-cell survey of the small intestinal epithelium
.
Nature
551
,
333
-
339
.
Han
,
D.
,
Lu
,
X.
,
Shih
,
A. H.
,
Nie
,
J.
,
You
,
Q.
,
Xu
,
M. M.
,
Melnick
,
A. M.
,
Levine
,
R. L.
and
He
,
C.
(
2016
).
A highly sensitive and robust method for genome-wide 5hmC profiling of rare cell populations
.
Mol. Cell
63
,
711
-
719
.
Jadhav
,
U.
,
Saxena
,
M.
,
O'Neill
,
N. K.
,
Saadatpour
,
A.
,
Yuan
,
G.-C.
,
Herbert
,
Z.
,
Murata
,
K.
and
Shivdasani
,
R. A.
(
2017
).
Dynamic reorganization of chromatin accessibility signatures during dedifferentiation of secretory precursors into Lgr5+ intestinal stem cells
.
Cell Stem Cell
21
,
65
-
77.e5
.
Kaaij
,
L. T. J.
,
van de Wetering
,
M.
,
Fang
,
F.
,
Decato
,
B.
,
Molaro
,
A.
,
van de Werken
,
H. J. G.
,
van Es
,
J. H.
,
Schuijers
,
J.
,
de Wit
,
E.
,
de Laat
,
W.
, et al. 
(
2013
).
DNA methylation dynamics during intestinal stem cell differentiation reveals enhancers driving gene expression in the villus
.
Genome Biol.
14
,
R50
.
Kazakevych
,
J.
,
Sayols
,
S.
,
Messner
,
B.
,
Krienke
,
C.
and
Soshnikova
,
N.
(
2017
).
Dynamic changes in chromatin states during specification and differentiation of adult intestinal stem cells
.
Nucleic Acids Res.
45
,
5770
-
5784
.
Kim
,
T.-H.
,
Li
,
F.
,
Ferreiro-Neira
,
I.
,
Ho
,
L.-L.
,
Luyten
,
A.
,
Nalapareddy
,
K.
,
Long
,
H.
,
Verzi
,
M.
and
Shivdasani
,
R. A.
(
2014
).
Broadly permissive intestinal chromatin underlies lateral inhibition and cell plasticity
.
Nature
506
,
511
-
515
.
Kim
,
R.
,
Sheaffer
,
K. L.
,
Choi
,
I.
,
Won
,
K.-J.
and
Kaestner
,
K. H.
(
2016a
).
Epigenetic regulation of intestinal stem cells by Tet1-mediated DNA hydroxymethylation
.
Genes Dev.
30
,
2433
-
2442
.
Kim
,
T.-H.
,
Saadatpour
,
A.
,
Guo
,
G.
,
Saxena
,
M.
,
Cavazza
,
A.
,
Desai
,
N.
,
Jadhav
,
U.
,
Jiang
,
L.
,
Rivera
,
M. N.
,
Orkin
,
S. H.
, et al. 
(
2016b
).
Single-cell transcript profiles reveal multilineage priming in early progenitors derived from Lgr5+ intestinal stem cells
.
Cell Rep.
16
,
2053
-
2060
.
Lara-Astiaso
,
D.
,
Weiner
,
A.
,
Lorenzo-Vivas
,
E.
,
Zaretsky
,
I.
,
Jaitin
,
D. A.
,
David
,
E.
,
Keren-Shaul
,
H.
,
Mildner
,
A.
,
Winter
,
D.
,
Jung
,
S.
, et al. 
(
2014
).
Immunogenetics. chromatin state dynamics during blood formation
.
Science
345
,
943
-
949
.
Laukka
,
T.
,
Mariani
,
C. J.
,
Ihantola
,
T.
,
Cao
,
J. Z.
,
Hokkanen
,
J.
,
Kaelin
,
W. G.
, Jr
,
Godley
,
L. A.
and
Koivunen
,
P.
(
2016
).
Fumarate and succinate regulate expression of hypoxia-inducible genes via TET enzymes
.
J. Biol. Chem.
291
,
4256
-
4265
.
Lien
,
W.-H.
,
Guo
,
X.
,
Polak
,
L.
,
Lawton
,
L. N.
,
Young
,
R. A.
,
Zheng
,
D.
and
Fuchs
,
E.
(
2011
).
Genome-wide maps of histone modifications unwind in vivo chromatin states of the hair follicle lineage
.
Cell Stem Cell
9
,
219
-
232
.
Lindeboom
,
R. G. H.
,
van Voorthuijsen
,
L.
,
Oost
,
K. C.
,
Rodríguez-Colman
,
M. J.
,
Luna-Velez
,
M. V.
,
Furlan
,
C.
,
Baraille
,
F.
,
Jansen
,
P. W. T. C.
,
Ribeiro
,
A.
,
Burgering
,
B. M. T.
, et al. 
(
2018
).
Integrative multi-omics analysis of intestinal organoid differentiation
.
Mol. Syst. Biol.
14
,
e8227
.
McConnell
,
B. B.
and
Yang
,
V. W.
(
2010
).
Mammalian Krüppel-like factors in health and diseases
.
Physiol. Rev.
90
,
1337
-
1381
.
McInnes
,
L.
,
Healy
,
J.
and
Melville
,
J.
(
2018
).
Umap: uniform manifold approximation and projection for dimension reduction
.
arXiv
preprint arXiv:1802.
03426
.
McMahon
,
A. P.
,
Aronow
,
B. J.
,
Davidson
,
D. R.
,
Davies
,
J. A.
,
Gaido
,
K. W.
,
Grimmond
,
S.
,
Lessard
,
J. L.
,
Little
,
M. H.
,
Potter
,
S. S.
,
Wilder
,
E. L.
, et al. 
(
2008
).
GUDMAP: the genitourinary developmental molecular anatomy project
.
J. Am. Soc. Nephrol.
19
,
667
-
671
.
Muñoz
,
J.
,
Stange
,
D. E.
,
Schepers
,
A. G.
,
van de Wetering
,
M.
,
Koo
,
B.-K.
,
Itzkovitz
,
S.
,
Volckmann
,
R.
,
Kung
,
K. S.
,
Koster
,
J.
,
Radulescu
,
S.
, et al. 
(
2012
).
The Lgr5 intestinal stem cell signature: robust expression of proposed quiescent ‘+4’ cell markers
.
EMBO J.
31
,
3079
-
3091
.
Nigmatullina
,
L.
,
Norkin
,
M.
,
Dzama
,
M. M.
,
Messner
,
B.
,
Sayols
,
S.
and
Soshnikova
,
N.
(
2017
).
Id2 controls specification of Lgr5+ intestinal stem cell progenitors during gut development
.
EMBO J.
36
,
869
-
885
.
O'Brien
,
C. A.
,
Kreso
,
A.
,
Ryan
,
P.
,
Hermans
,
K. G.
,
Gibson
,
L.
,
Wang
,
Y.
,
Tsatsanis
,
A.
,
Gallinger
,
S.
and
Dick
,
J. E.
(
2012
).
ID1 and ID3 regulate the self-renewal capacity of human colon cancer-initiating cells through p21
.
Cancer Cell
21
,
777
-
792
.
Patro
,
R.
,
Duggal
,
G.
,
Love
,
M. I.
,
Irizarry
,
R. A.
and
Kingsford
,
C.
(
2017
).
Salmon provides fast and bias-aware quantification of transcript expression
.
Nat. Methods
14
,
417
-
419
.
Pfaffl
,
M. W.
(
2001
).
A new mathematical model for relative quantification in real-time RT-PCR
.
Nucleic Acids Res.
29
,
e45
.
Roadmap Epigenomics Consortium
,
Kundaje
,
A.
,
Meuleman
,
W.
,
Ernst
,
J.
,
Bilenky
,
M.
,
Yen
,
A.
,
Heravi-Moussavi
,
A.
,
Kheradpour
,
P.
,
Zhang
,
Z.
,
Wang
,
J.
, et al. 
(
2015
).
Integrative analysis of 111 reference human epigenomes
.
Nature
518
,
317
-
330
.
Roche
,
K. C.
,
Gracz
,
A. D.
,
Liu
,
X. F.
,
Newton
,
V.
,
Akiyama
,
H.
and
Magness
,
S. T.
(
2015
).
Sox9 maintains reserve stem cells and preserves radioresistance in mouse small intestine
.
Gastroenterology
149
,
1553
-
1563.e10
.
Schep
,
A. N.
,
Buenrostro
,
J. D.
,
Denny
,
S. K.
,
Schwartz
,
K.
,
Sherlock
,
G.
and
Greenleaf
,
W. J.
(
2015
).
Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions
.
Genome Res.
25
,
1757
-
1770
.
Schep
,
A. N.
,
Wu
,
B.
,
Buenrostro
,
J. D.
and
Greenleaf
,
W. J.
(
2017
).
chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data
.
Nat. Methods
14
,
975
-
978
.
Schmitt
,
M.
,
Schewe
,
M.
,
Sacchetti
,
A.
,
Feijtel
,
D.
,
van de Geer
,
W. S.
,
Teeuwssen
,
M.
,
Sleddens
,
H. F.
,
Joosten
,
R.
,
van Royen
,
M. E.
,
van de Werken
,
H. J. G.
, et al. 
(
2018
).
Paneth Cells respond to inflammation and contribute to tissue regeneration by acquiring stem-like features through SCF/c-kit signaling
.
Cell Rep.
24
,
2312
-
2328.e7
.
Schuijers
,
J.
,
van der Flier
,
L. G.
,
van Es
,
J.
and
Clevers
,
H.
(
2014
).
Robust cre-mediated recombination in small intestinal stem cells utilizing the olfm4 locus
.
Stem Cell Rep.
3
,
234
-
241
.
Suzuki
,
K.
,
Harada
,
N.
,
Yamane
,
S.
,
Nakamura
,
Y.
,
Sasaki
,
K.
,
Nasteska
,
D.
,
Joo
,
E.
,
Shibue
,
K.
,
Harada
,
T.
,
Hamasaki
,
A.
, et al. 
(
2013
).
Transcriptional regulatory factor X6 (Rfx6) increases gastric inhibitory polypeptide (GIP) expression in enteroendocrine K-cells and is involved in GIP hypersecretion in high fat diet-induced obesity
.
J. Biol. Chem.
288
,
1929
-
1938
.
Tetteh
,
P. W.
,
Basak
,
O.
,
Farin
,
H. F.
,
Wiebrands
,
K.
,
Kretzschmar
,
K.
,
Begthel
,
H.
,
van den Born
,
M.
,
Korving
,
J.
,
de Sauvage
,
F.
,
van Es
,
J. H.
, et al. 
(
2016
).
Replacement of Lost Lgr5-positive stem cells through plasticity of their enterocyte-lineage daughters
.
Cell Stem Cell
18
,
203
-
213
.
Tsagaratou
,
A.
,
Aijo
,
T.
,
Lio
,
C. W. J.
,
Yue
,
X.
,
Huang
,
Y.
,
Jacobsen
,
S. E.
,
Lahdesmaki
,
H.
and
Rao
,
A.
(
2014
).
Dissecting the dynamic changes of 5-hydroxymethylcytosine in T-cell development and differentiation
.
Proc. Natl. Acad. Sci. USA
111
,
E3306
-
E3315
.
Wu
,
H.
,
D'Alessio
,
A. C.
,
Ito
,
S.
,
Wang
,
Z.
,
Cui
,
K.
,
Zhao
,
K.
,
Sun
,
Y. E.
and
Zhang
,
Y.
(
2011
).
Genome-wide analysis of 5-hydroxymethylcytosine distribution reveals its dual function in transcriptional regulation in mouse embryonic stem cells
.
Genes Dev.
25
,
679
-
684
.
Yan
,
K. S.
,
Janda
,
C. Y.
,
Chang
,
J.
,
Zheng
,
G. X. Y.
,
Larkin
,
K. A.
,
Luca
,
V. C.
,
Chia
,
L. A.
,
Mah
,
A. T.
,
Han
,
A.
,
Terry
,
J. M.
, et al. 
(
2017
).
Non-equivalence of Wnt and R-spondin ligands during Lgr5+ intestinal stem-cell self-renewal
.
Nature
545
,
238
-
242
.
Ye
,
D. Z.
and
Kaestner
,
K. H.
(
2009
).
Foxa1 and Foxa2 control the differentiation of goblet and enteroendocrine L- and D-cells in mice
.
Gastroenterology
137
,
2052
-
2062
.
Yoshida
,
H.
,
Lareau
,
C. A.
,
Ramirez
,
R. N.
,
Rose
,
S. A.
,
Maier
,
B.
,
Wroblewska
,
A.
,
Desland
,
F.
,
Chudnovskiy
,
A.
,
Mortha
,
A.
,
Dominguez
,
C.
, et al. 
(
2019
).
The cis-regulatory atlas of the mouse immune system
.
Cell
176
,
897
-
912.e20
.
Zwarycz
,
B.
,
Gracz
,
A. D.
and
Magness
,
S. T.
(
2018
).
Organoid cultures for assessing intestinal epithelial differentiation and function in response to Type-2 inflammation
.
Methods Mol. Biol.
1799
,
397
-
417
.

Competing interests

The authors declare no competing or financial interests.

Supplementary information