In the past decade, two major advances in our understanding of nuclear organization have taken the field of gene regulation by storm. First, technologies that can analyze the three-dimensional conformation of chromatin have revealed how the genome is organized and have provided novel insights into how regulatory regions in the genome interact. Second, the recognition that many proteins can form membraneless compartments through liquid-liquid phase separation (LLPS) has challenged long-standing notions of how proteins within the nucleus are organized and has offered a tantalizing general mechanism by which many aspects of nuclear function may be regulated. However, the functional roles of chromatin topology and LLPS in regulating gene expression remain poorly understood. These topics were discussed with great fervor during an open discussion held at a recent workshop titled ‘Chromatin-based regulation of development’ organized by The Company of Biologists. Here, we summarize the major points covered during this debate and discuss how they tie into current thinking in the field of gene regulation.
The cell nucleus is a crowded environment. This is easily appreciated by considering that the almost 2 m long DNA polymer that comprises the human genome is packaged into a ∼6-10 µm diameter cell nucleus along with all the biological macromolecules required to replicate, maintain and interpret genetic information. The challenges posed by this crowding are appreciated when considering that distal regulatory elements that modulate transcription, such as enhancers, can be located hundreds of kilobases away from the genes that they act on, and that regulatory proteins, such as transcription factors, have to find their specific binding sites among billions of non-target sites. In the face of these challenges, elegant mechanisms have evolved that take advantage of this crowded environment and organize the nucleus, both in terms of the three-dimensional architecture of DNA packaged into chromatin, and of the spatial and temporal distributions of the macromolecular complexes that act on chromatin to regulate the genome. Understanding how such organization is achieved, its functional implications, and how it changes during development and disease progression has become a major focus of inquiry in the life sciences.
Driven by advances in technology, two major paradigms have emerged that dominate emerging models of nuclear organization. First, that the genome is compartmentalized at multiple spatial scales and, second, that some biological macromolecules can form membraneless compartments through the creation of two distinct phases, most commonly through liquid-liquid phase separation (LLPS) as occurs in the familiar case of oil drops in water. However, despite a tremendous amount of research on both genome topology and LLPS, the underlying mechanisms and functional implications of both remain poorly understood. This obscurity is due to a lack of experimental evidence and appropriate tools, as well as contradictory findings in the literature. These issues were discussed at great length in a recent workshop, organized by The Company of Biologists, which took place in April 2019 and was titled ‘Chromatin-based regulation of development’. A facilitated open discussion was held at the workshop, motivated by two timely questions: (1) does LLPS enable gene regulation within a crowded nucleus? (moderated by Mustafa Mir and Geeta Narlikar); and (2) is chromatin topology important for gene expression? (moderated by Wendy Bickmore and Eileen Furlong). Here, we summarize the major points raised during this discussion, which we think reflect ongoing debates in the field, and provide some perspective on how they relate to current thinking on the regulation of gene expression.
What is LLPS and why are people so excited by it?
The notion that the interior of cells resembles and behaves as an emulsion, composed of both membrane-bound and membraneless organelles, has existed in the literature since the 19th century (Wilson, 1899; Montgomery, 1898). More recently, renewed interest in the mesoscopic organization of proteins within the cell has led to the discovery that such membraneless compartments can form through the well-understood mechanism of LLPS (Brangwynne et al., 2009, 2011; Li et al., 2012; Kato et al., 2012). This has led to an explosion of research into which proteins can undergo LLPS, the physico-chemical properties of the resulting compartments, and the mechanistic principles that govern these properties.
In general, LLPS offers a dynamic and versatile mechanism by which the nucleoplasm may be spontaneously organized (i.e. without requiring additional energy) into distinct compartments across a broad range of spatial and temporal scales. The potential functional implications of such compartmentalization are far reaching, from creating distinct local high-concentration environments that increase the probability of specific biochemical reactions, to buffering the levels of free protein in the nucleoplasm, and to rapidly mediating chromatin organization into active and inactive regions (Holehouse and Pappu, 2018). These possibilities have captured the imagination of many biologists, with LLPS often being used brazenly to explain many currently mysterious phenomenon despite the fact that the functional roles of LLPS compartments remain largely theoretical. In our discussion at the workshop, we reviewed the current evidence for LLPS occurring in nuclei and its implications for the regulation of gene expression.
Identifying liquid-liquid phase separated compartments in vivo: seeing is not believing
To reconcile the general excitement around LLPS with current experimental evidence, our discussion was initiated by surveying the room for ‘validated’ examples of phase separation in nuclei. The responses comprised classical and commonly cited examples, such as nucleoli (Brangwynne et al., 2011), and more recent examples, such as heterochromatin (Strom et al., 2017; Larson et al., 2017) and Polycomb bodies (Tatavosian et al., 2019; Plys et al., 2019). Objections were quickly raised in all cases, necessitating a conversation about how we define phase separation and what counts as sufficient evidence to claim that a specific protein is incorporated into a membraneless compartment through LLPS in vivo.
LLPS occurs when the concentration of a molecule reaches a critical point beyond which it is supersaturated. Under such conditions, it may be energetically favorable for the molecule in question to partition into low- and high-concentration phases; these distinct phases are viewed by biologists as membraneless compartments. The critical concentration at which LLPS occurs depends on a number of factors, including temperature, the properties of the solvent (such as ionic strength mediated by salt concentration), and the properties of the molecule being considered, as is best summarized by a phase diagram (Fig. 1). For example, proteins that tend to undergo LLPS typically have large, low-complexity and/or intrinsically disordered regions (IDRs) that enable hetero- and homotypic multivalent interactions (Chong et al., 2018), or have other tendencies that lead to oligomeric states. The existence of such domains in a protein are often motivation to suggest that a protein may be capable of LLPS (Banani et al., 2017).
However, the rigorous definitions of LLPS that have emerged from classical physics and chemistry research are at odds with the way LLPS and its associated terminology are used in the life sciences. Moreover, the inability to modulate important parameters, such as concentration or temperature, in vivo has led to an overwhelming reliance on less-rigorous, qualitative descriptors of LLPS. This reliance on qualitative descriptors was revealed when the question was posed of what experimental evidence exists for the case of nucleoli being formed by LLPS. The overwhelming response was that they are round in shape (indicating surface tension-driven morphology) and that they undergo fusion and fission events indicative of liquid-like properties. Similar criteria were described for P granules from Caenorhabditis elegans in which fusion, dripping and wetting behaviors have contributed substantially to conceptualizations of membraneless organelles (Brangwynne et al., 2009). At the same time, it was also discussed how such qualitative descriptors do not suffice as evidence for LLPS in the absence of measuring concentration or temperature dependence to determine the phase diagram for a given system.
A frequently used, but commonly misunderstood, quantitative assay, fluorescence recovery after photobleaching (FRAP), was also brought up as a more rigorous test beyond qualitative descriptors. This assay is based on the idea that the rapid recovery rate of fluorescence within a droplet, caused by rapid internal rearrangement, is indicative of its liquid-like nature (Alberti et al., 2019). It was quickly pointed out that the idea of ‘rapid liquid-like recovery’ is also qualitative, as recovery times spanning from sub-seconds to minutes have been reported in the literature as being diagnostic of LLPS. Furthermore, rapid recovery in FRAP can result from a variety of mechanisms (Mueller et al., 2010; Sprague et al., 2006), and unless proper controls are conducted and multiple models are tested, the use of recovery time as a marker of LLPS is insufficient.
We agreed (mostly) upon a set of criteria for defining LLPS in vivo, including: evidence for a membraneless compartment, a different set of chemical rules inside and outside the compartment, and concentration- and temperature-dependent properties. For practical reasons, measurements such as concentration and temperature dependence are often tested in vitro and these results are used to claim in vivo LLPS. However, there was general agreement that, although in vitro experiments are informative, concluding that a molecule can undergo LLPS in vivo requires quantitative measurements in the endogenous cellular context. Thus, although qualitatively ‘seeing’ evidence for in vivo LLPS using microscopy assays is an essential starting step, it is not sufficient in the absence of quantitative measurements. The conclusion for this part of the discussion can thus be best summarized as ‘just seeing is not believing’.
Does LLPS enable gene regulation within a crowded nucleus?
Part of the appeal of LLPS is that it can enable distinctive regulatory mechanisms. One often-used example is the potential to regulate local concentrations of transcriptional activators and repressors in a locus-dependent manner. We thus considered a thought experiment in which multiple enhancers are in spatial proximity to a single gene, all working synergistically with multiple DNA-binding proteins in a single phase-separated body, without requiring actual physical contact between individual enhancers and promoters. Such a mechanism would confer robustness, as the disruption of a single enhancer would not lead to loss of activation. At the same time, the plasticity and reversibility of the multivalent interactions that lead to LLPS could provide a way to tune the components and thus reactions ‘permitted’ in the compartment and also provide a mechanism to shut-off activation or repression rapidly, for example through post-translational modifications that make LLPS energetically unfavorable. Also discussed was another unique advantage that LLPS potentially holds: to pull together and organize big swaths of the genome without expending energy by relying on the thermodynamically favorable fusion of smaller droplets. This mechanism may explain how small droplets of heterochromatin regions undergo coalescence to create larger domains of repression, as has been observed in Drosophila embryos (Strom et al., 2017). In the process of discussing the dynamic nature of phase-separated states, we came up with an analogy to ‘pop-up’ restaurants. In this analogy, phase-separated states get assembled at different times and locations to serve specific functions and then get disassembled when their purpose has been served. Thus, the limited nuclear space can be used multiple times to carry out distinct functions.
The types of experiments needed to determine whether such mechanisms exist were also briefly discussed. A common approach is to show that a protein requires a certain disordered domain to phase separate in vitro, and that deletion or mutation of that domain leads to loss of nuclear puncta and a measurable effect on transcription in vivo. However, it is difficult to separate direct and indirect effects in such experiments, as the disordered regions of proteins have likely evolved to play multiple roles (such as in the activation domains of transcription factors) independent of their ability to enable LLPS in vitro. It was suggested that trying to understand the evolutionary pressures that have led to the ability of some proteins to phase separate may provide some insight into the functional roles of LLPS.
In summary, it was clear that there is a lot of interest in LLPS but very limited understanding of how it is defined and what it can do functionally. Although there are some clear-cut examples of LLPS occurring in vivo, there are many more for which the evidence is weak. Furthermore, recent studies have described regulatory hubs within nuclei that qualitatively resemble LLPS compartments but appear to form via independent mechanisms (McSwiggen et al., 2019; Chong et al., 2018; Mir et al., 2018). For example, viral replication compartments exhibit many of the usual qualitative descriptors of LLPS (such as roundness, the ability to coalesce and high local concentrations) but form through transient non-specific DNA binding rather than LLPS and allow for unrestricted diffusion across boundaries (McSwiggen et al., 2019). Such alternative mechanisms that lead to high local concentrations of proteins may also enable many of the possible functions associated with LLPS and thus must also be considered when LLPS is suspected. Therefore, to address the question of how ubiquitous LLPS is and its functional roles with necessary rigor, we must incorporate what we can from physical sciences and form interdisciplinary teams to design new experimental approaches and tools.
What do we mean by chromatin topology?
Metazoan genomes are organized hierarchically, at multiple size scales. The smallest structural feature is the nucleosome, around which ∼140 bp of DNA are wrapped. At the ∼100 kb to ∼1 Mbp scale, chromatin is organized into topological associating domains (TADs) (Nora et al., 2012; Dixon et al., 2012; Sexton et al., 2012). These domains exhibit a higher than expected interaction frequency between the genomic regions within them, as measured by chromosome conformation capture (3C)-based methods such as Hi-C. In a large number of cases, architectural proteins and promoters are present at TAD boundaries (Dixon et al., 2012; Rao et al., 2014). At these scales and higher, chromatin is also organized into ‘active and inactive’ compartments that are thought to form independently of TADs (Nora et al., 2017; Schwarzer et al., 2017; Haarhuis et al., 2017). At an even larger scale, chromosomes fall into their own distinct territories (Cremer et al., 2014).
In our discussion of the functional role of chromatin topology, we agreed to focus on TADs and compartments in interphase nuclei, as these are the features of chromatin topology that have been unveiled most recently. We also placed emphasis on recent evidence from single cell studies using 3C-based methods (Stevens et al., 2017; Flyamer et al., 2017; Nagano et al., 2013) and imaging approaches (Bintu et al., 2018; Finn et al., 2019; Cattoni et al., 2017), both of which have suggested that chromatin topology exhibits significant cell-cell variability and is likely far more dynamic than previously imagined (Hansen et al., 2018; Finn et al., 2019). Keeping this hierarchical and dynamic topology in mind, we discussed the role of chromatin topology in gene repression and activation.
Is chromatin topology and packing important for gene repression?
The relationship between chromatin accessibility or compaction in heterochromatic regions of the genome has been long established. We discussed if the way constitutive heterochromatin is folded is important, or whether compaction alone is sufficient for repression. The point was raised that heterochromatic regions are known to have sub-structure (Maison et al., 2010), but it is unclear whether any sort of deterministic folding is necessary, or even if the substructure is the cause or consequence of large-scale compaction. These considerations become more important when considering facultative heterochromatin formation, which is dynamic during early development. In this context, the discussion turned to the role of large membraneless compartments formed by Polycomb-group (PcG) proteins in gene repression. These compartments, which are termed ‘PcG bodies’, are known to contain many repressed genes. This results in an increased spatial proximity of co-repressed genes and suggests that higher order chromatin structure could play an important role in repression (for a recent review, see Schuettengruber et al., 2017).
We discussed whether the presence of a Polycomb response element (PRE) is sufficient for repression, or whether large domains and topology are necessary. Occupancy of a single PRE by a PcG protein close to a promoter or enhancer can be sufficient for repression (Papp and Muller, 2006), suggesting that topology is not always necessary. However, at other loci, looping is required for PcG-mediated repression (Ogiyama et al., 2018; Cheutin and Cavalli, 2018), suggesting that some level of higher order structure is needed. Likewise, if PcG body formation is inhibited, only subtle phenotypes are observed in the absence of other perturbations (Bantignies et al., 2011). These and related observations led to the tentative conclusion that although the formation of large PcG bodies might not be necessary for repression, they have likely evolved as a mechanism to provide additional robustness to the system. Furthermore, there was no consensus on how to design experiments that distinguish the role of local binding of PcG proteins to PREs from a role in the formation of large PcG bodies.
What is the role of chromatin topology in gene activation by distal enhancers?
Enhancers provide a powerful lens through which to consider the role of chromatin topology in gene regulation because of their many interesting properties: they can be located at great genomic distances from their target genes, they do not always regulate the gene that is most proximal to them, they function independently of their orientation or location, and they are classically thought to function by making physical contact with their target genes' promoter through protein-protein interactions, although chromosome topologies inconsistent with this idea have been reported (Benabdallah et al., 2019). These observations implicate the regulation of chromatin topology in enhancer function but require further investigation.
Our discussion of distal regulation by enhancers was initiated by the question of what happens if you systematically increase the distance between an enhancer and a promoter. Evidence in Drosophila embryos suggests that although spatial patterns of expression do not change as a function of enhancer-promoter distance, transcription levels (i.e. the probability of enhancer function) are decreased (Mikhaylichenko et al., 2018; Fukaya et al., 2016). This is in contrast to observations in the mouse genome where distance dependence for enhancers appears less evident (Anderson et al., 2014). However, we were reminded repeatedly that in the context of the nucleus, linear genomic distance becomes less relevant than 3D spatial proximity (although they are generally correlated), which may be regulated by a variety of mechanisms, including currently known topological features such as TADs and loops. Thus, to understand better the role of topology in enhancer function it was prudent to first discuss the evidence for the functional roles of TADs.
What are the functional roles of TADs?
Measurements of genome-wide interaction frequencies indicate that the spatial distance between genomic loci is generally reduced within TADs compared with loci at equal linear distance between TADs. Enhancers and their target genes are generally contained within the same TAD (Symmons et al., 2014) and thus are more likely to interact with higher frequency than if they are contained in different domains. However, TAD boundaries do not represent strict physical barriers – contact probabilities within a TAD are only 2- to 3-fold greater than contact frequencies across a TAD boundary. It is also important to bear in mind that, although the presence of TADs has been confirmed by microscopy, they are essentially data features that emerge from Hi-C analyses, and how TADs are defined largely depends on the resolution of the sequencing performed. With higher-resolution maps, more nested structures emerge with large TADs containing weaker sub-TADs. However, a large number of TAD boundaries are well conserved over evolutionary time scales and are marked by genomic features such as promoters and the binding of architectural proteins such as CTCF in mammals (Dixon et al., 2012).
Importantly, it was pointed out that there is contradictory evidence on the phenotypic consequences of disrupting TADs on gene expression, or of moving genes and enhancers into separate TADs at a given locus (Paliou et al., 2019; Kane et al., 2019 preprint; Despang et al., 2019; Laugsch et al., 2019; Ghavi-Helm et al., 2019). If TAD formation is widely disrupted, for example through the removal or depletion of cohesin (Sofueva et al., 2013; Wutz et al., 2017; Schwarzer et al., 2017; Rao et al., 2017; Haarhuis et al., 2017) or CTCF (Nora et al., 2017), the activity of only a subset of genes is disrupted, whereas the expression of many genes appears unperturbed. Similarly, extensive genetic rearrangements or deletions that fuse and shuffle TADs have only modest effects on gene expression, as seen for many TADs in Drosophila (Ghavi-Helm et al., 2019) and for the TFAP2A (Laugsch et al., 2019), HoxD (Rodríguez-Carballo et al., 2019), Sox9-Kcnj2 (Despang et al., 2019) and Shh (Williamson et al., 2019) containing TADs in vertebrates.
These results converge with emerging data showing cell-to-cell variability in TAD structure: evidence from imaging experiments and single cell 3C-based studies indicate that, far from being composed of strict stable structures, as suggested for long-range high-frequency loops, chromatin topology is highly heterogeneous from cell to cell and likely very dynamic (Cattoni et al., 2017; Finn et al., 2019; Flyamer et al., 2017; Nagano et al., 2013; Stevens et al., 2017). Given that transcription factor residence times on DNA and cohesin-mediated loop extrusion (thought to bring enhancers and promoters together) are in the order of tens of seconds and are thought to occur frequently and repetitively (Hansen et al., 2018), it is likely that transient but very frequent interactions may be sufficient to initiate and maintain transcription at a given locus. In such a model, the probability of interactions between some enhancer-promoter pairs may be high enough without requiring a constrained topology. For some pairs, however, a well-regulated topology might be crucial to force a higher frequency of interaction. Such a model may explain the mild effects of disrupting processes, such as extrusion (e.g., by deleting CTCF or cohesin), on transcription levels at the genome-wide scale, as well as the large effects observed at specific loci. Although such models still imply that regulating chromatin topology is important, they call into question current textbook models of stable and hierarchical chromatin organization.
Results indicating that the presence of TADs is not necessary, or permissive, for the expression of many genes (at least as measured by standard in situ hybridization and RNA-seq) question why TADs are so well conserved over evolution. Perhaps TADs provide an additional layer of robustness or precision to gene expression, with some genes being more dependent on this layer compared with others. The genomic regions spanning TADs are highly correlated with blocks of conserved synteny (Engstrom et al., 2007; Harmston et al., 2017), suggesting that they serve as conserved units during recombination to avoid separating enhancers and promoters. TADs may also reflect gene transcription, being stabilized or reinforced during gene expression, which could explain their conserved synteny. Recent data have also linked TADs to DNA replication (Pope et al., 2014; Sima et al., 2019) and to DNA damage (Canela et al., 2017), implying alternative and more general roles for TADs in genome functions.
In summary, it is clear that we have only just begun to appreciate the role of chromatin topology. Although some principles, such as extrusion and insulation, may apply broadly, it is likely that more nuanced and locus-specific considerations will be necessary to elucidate the functional role of genome topology.
Although extraordinary progress is being made in understanding 3D genome architecture and nuclear organization, it was clear from our discussions that further studies are required to convince scientists of the functional role of LLPS and TADs in the regulation of gene expression. At this point, refraining from creating generalized models is prudent, as they are easily challenged by contradictory data. Although LLPS- and chromatin topology-based models may or may not provide the unifying principles to explain gene regulation that we are desperately searching for, both phenomena are highly interconnected and likely play important roles by virtue of their existence in multiple organisms spanning large evolutionary time scales. Understanding these roles will continue to require interdisciplinary studies and the development of new technologies that allow us to both visualize and dissect the molecular mechanisms that dictate nuclear organization and function.
We thank all participants of the 2019 The Company of Biologists Workshop on ‘Chromatin-based regulation of development’ for the lively discussions that provided the material and motivation for this article; Joanna Wysocka (Stanford) and Benoit Bruneau (UCSF) for organizing the workshop; and The Company of Biologists for sponsoring it. We also thank Anders S. Hansen and David T. McSwiggen for useful comments on the manuscript.
W.B. acknowledges support from the UK Medical Research Council (MC_UU_00007/2). G.N. acknowledges support from the National Institutes of Health (R35 GM127010). E.E.M.F. acknowledges support from the European Research Council (ERC) Advanced Grant DeCRyPT (787611). Deposited in PMC for release after 12 months.
The authors declare no competing or financial interests.