Regulatory machinery for replication and gene expression is punctately organized in supramolecular complexes that are compartmentalized in nuclear microenvironments. Quantitative approaches are required to understand the assembly of regulatory machinery within the context of nuclear architecture and to provide a mechanistic link with biological control. We have developed `intranuclear informatics' to quantify functionally relevant parameters of spatially organized nuclear domains. Using this informatics strategy we have characterized post-mitotic reestablishment of focal subnuclear organization of Runx (AML/Cbfa) transcription factors in progeny cells. By analyzing point mutations that abrogate fidelity of Runx intranuclear targeting, we establish molecular determinants for the spatial order of Runx domains. Our novel approach provides evidence that architectural organization of Runx factors may be fundamental to their tissue-specific regulatory function.
The architectural organization of nucleic acids and cognate factors in subnuclear microenvironments is linked to gene regulation, replication and repair (Stein et al., 2000a; Stein et al., 2000b; Lemon and Tjian, 2000; Dundr and Misteli, 2001; Iborra and Cook, 2002; Spector, 2003; Stein et al., 2003). Spatio-temporal changes in this subnuclear organization accompany cell cycle progression and cell differentiation (Ma et al., 1998; Francastel et al., 2000). Perturbations in subnuclear organization have been functionally related with compromised gene expression that accompanies the onset and progression of disease (Dyck et al., 1994; Karpuj et al., 1999; McNeil et al., 1999). Traditionally, biological control of gene expression has been experimentally addressed by the identification and characterization of promoter elements and cognate regulatory and co-regulatory proteins, as well as by mechanistically defining the dynamics of chromatin structure and nucleosome organization. It is becoming increasingly evident that regulatory parameters of gene expression are operative within a higher-order subnuclear organization of nucleic acids and regulatory proteins. Observations made by epifluorescence and confocal microscopy have provided the initial insight into assembly of nuclear microenvironments that support the combinatorial compartmentalization of regulatory factors and chromosomal domains (Cook, 1999; Stein et al., 2000a; Stein et al., 2000b; Stein et al., 2003; Spector, 2003). Quantitative strategies are necessary to mechanistically associate the subnuclear organization of regulatory factors with biological control.
Here we describe a novel approach, intranuclear informatics, to examine the subnuclear organization of regulatory factor domains from digital microscopic images. Intranuclear informatics utilizes parameters with biologically relevant variability to characterize subnuclear organization. We have developed an image-processing algorithm to acquire and evaluate these parameters of subnuclear organization. The result is a multivariable data-set that can be used for exploratory analysis techniques and for quantitatively testing specific biological hypotheses.
Runx transcription factors provide a paradigm for compartmentalization of gene expression and nuclear matrix association of regulatory proteins (Lian and Stein, 2003). A conserved intranuclear targeting signal (NMTS) within the C-terminus directs Runx factors to matrix associated subnuclear sites that support transcriptional control in the interphase nucleus (Choi et al., 2001; Zaidi et al., 2001; Zeng et al., 1997). By the application of intranuclear informatics we elucidate that Runx regulatory proteins exhibit an interphase architectural signature that is restored following mitosis. Furthermore, our analysis of NMTS mutant proteins provides evidence that architectural association of Runx factors may be fundamental to their tissue-specific gene regulatory functions. Thus, intranuclear informatics quantitatively bridges the spatial organization of protein domains with regulatory determinants of biological control.
Materials and Methods
Cell culture and transfections
ROS 17/2.8 osteosarcoma cells were maintained in F12 with penicillin and streptomycin, 2 mM L-glutamine and 5% fetal bovine serum (FBS). Hela cells were maintained in DMEM with PS, 2 mM L-glutamine and 10% FBS. Exponentially growing HeLa cells were transfected with 500 ng of either HA-tagged wild-type Runx2, an HA-tagged C-terminal deletion, or one of the four HA-tagged NMTS point mutants for 24 hours with Superfectamine (Invitrogen, San Diego, CA).
Hela and Ros cells were grown on gelatin-coated coverslips (BD Biosciences, Lexington, KY). Cells were processed for in situ immunofluorescence as described (Javed et al., 2000). In brief, cells were rinsed twice with ice-cold PBS and fixed in 3.7% formaldehyde in PBS for 10 minutes on ice. After rinsing once with PBS, the cells were permeabilized in 0.1% Triton X-100 in PBS, and rinsed twice with PBSA [0.5% bovine serum albumin (BSA) in PBS] followed by antibody staining. Antibodies and their dilutions used are as follows: rabbit polyclonal antibodies against Runx2 (1:200; Oncogene, Carlsbad, CA) and rabbit polyclonal antibodies against HA-epitope (1:500, Santa Cruz Biotechnology, Santa Cruz, CA). The secondary antibodies used were either anti-rabbit or -mouse Alexa568 or Alexa488 (1:800, Molecular Probes, Eugene, OR).
Image acquisition and restoration
Immunostaining of cell preparations was recorded using a CCD camera attached to an epifluorescence Zeiss Axioplan 2 microscope (Zeiss, Thorwood, NY). For Runx2 interphase/telophase studies single image planes were acquired and deconvoluted using the Metamorph Imaging Software (Universal Imaging, Dowingtown, PA). For NMTS mutation experiments Z-series image stacks were acquired at 0.25 μm intervals with 67 nm/pixel (xy). Restoration of images was carried out by 3D deconvolution using a measured point-spread function as described previously (Carrington et al., 1995).
We have developed an image processing algorithm that automatically performs image segmentation, feature extraction and parameter computation. Our algorithm requires the input of any number of image pairs and a text-file that lists the name of the images to be analyzed. For each pair of images, one is the digital micrograph and a second is the nuclear mask image. The nuclear mask, which is generated using Metamorph imaging software or Adobe Photoshop (Adobe Systems, San Jose, CA), is utilized to eliminate intensity data that is located outside the nucleus and restrict analysis to intensity data within the nucleus. For mitosis studies we analyzed a single image plane per cell. For NMTS mutation studies we analyzed z-section images from deconvoluted Z-series stacks. Image segmentation is carried out using a threshold technique, where the selected threshold is the intensity value that maximizes the number of detectable nuclear domains. The image analysis is implemented using the MATLAB® image processing and statistics toolboxes (The Mathworks, Natick, MA) and Metamorph Imaging Software (Universal Imaging).
Image feature extraction
Our algorithm extracts the total number of domains within the nucleus, the size of each domain, the location of each domain-centroid in image pixel coordinates, the nuclear cross-sectional area and the nuclear cross-sectional perimeter from the segmented and mask images. From these measurements we determine the following statistics for both domain size and nearest-neighbor distances: mean, median, variance, standard deviation, index of dispersion, coefficient of variation, skewness and kurtosis. The index of dispersion and coefficient of variation are mean normalization measures of variation and standard deviation, respectively. Skewness reflects the degree of asymmetry in the distribution with positive values indicating right skewness and negative values indicating left skewness. Kurtosis is a measure of the peakedness of the distribution: positive values indicate a tall peak and negative values indicate a flat peak (or plateau) (Norman, 2000). To assess the spatial domain randomness we measured Euclidean nearest-neighbor distances (NN distance) between domain centroids. The mean and variance of the Euclidean nearest-neighbor distances between domains is compared with a Poisson point-process of an equivalent density (i.e. domains per unit nuclear area) (Clark and Evans, 1954). Expected nearest-neighbor distance parameters are corrected for edge effects (Donnelly, 1978; Sinclair, 1985). The ratio of observed (Ro) to expected (Re) mean nearest-neighbor distances is referred to as the Clark and Evans statistic (Ro/Re<1, clustered; Ro/Re=1, random; Ro/Re>1, ordered) (Clark and Evans, 1954). We also examined the radial position of domains within the nucleus. This statistic is determined by measuring the mean distance from each domain centroid to the nuclear centroid (mean domain radius) and the mean distance from the nuclear centroid position to the each perimeter pixel (mean perimeter radius); for a circle this would be the radius. The ratio of the two values is the mean relative domain radius. Values between 0 and 0.5 reflect a tendency for domains to be positioned in the nuclear interior and values between 0.5 and 1 reflect a tendency for domains to be positioned toward the nuclear periphery.
For mitosis studies ANOVA tests were conducted on subnuclear organization data to determine the significance of observed differences in each parameter. Asterisks indicate parameters with differences that are considered to be statistically significant on a 0.05 level. P-values were adjusted to account for the false-discovery rate; asterisks are indicative of this adjustment. Analysis was performed using the general linear model (GLM) procedures in SAS/STAT (Sas Institute, Cary, NC). These statistical tests were conducted to compare each telophase (T1 and T2) nuclei as well as to compare telophase nuclei with interphase (I) nuclei. We analyzed 60 nuclei for Runx2; 20 for each nucleus (see supplementary material for the complete dataset). For NMTS studies, statistical tests were conducted to compare wild-type Runx2 with each of the five mutants. In total, 330 Z-sections were analyzed, 55 for each protein from two independent experiments (see supplementary material for the complete dataset). Five Z-sections were analyzed per cell to account for within cell variability. Thus, the effect of NMTS mutation was assessed using a repeated measure ANOVA at a 0.05 level.
Factor analysis was performed on parameters of subnuclear organization for each of the wild-type Runx and the five mutant proteins using the data obtained from 330 nuclear images. This analysis represents the observed subnuclear organization parameters in terms of a smaller number of uncorrelated `factors' (or groups of parameters) that account for most of the information contained in the complete data set (Norman, 2000). Factors are extracted using principal component analysis and rotated using the varimax method. Factors scores were computed for each image and represent the sum of the standardized subnuclear organization parameters multiplied by their respective factor loadings. Factor loading refers to the correlation of each subnuclear organization parameter with a particular factor. Factor loadings greater than 0.65 were considered to be significant. This analysis was carried out using the factor procedure in SAS/STAT.
Hierarchical cluster analysis was performed on mean subnuclear organization parameters from wild-type Runx and the five mutant proteins using the data from 330 nuclear images. Cluster analysis was performed using the Euclidean distance metric with complete linkage. Clusters were displayed using a dendrogram. Cluster analysis was carried out using the cluster procedure in SAS/STAT.
Intranuclear informatics: a signature of nuclear architecture for regulatory proteins
We have developed intranuclear informatics to characterize spatially organized protein domains within the nucleus in terms of parameters with inherent biological variability. The conceptual framework for quantifying nuclear organization is outlined in Fig. 1 and briefly described here. Alterations in size and number of protein domains with respect to physiological conditions, cell cycle stage, and/or cellular differentiation have been observed (Ma et al., 1998; Stenoien et al., 2001; Nielsen et al., 2002; Zaidi et al., 2003). Intranuclear informatics exploits this variability in domain size and number, to elucidate changes between different biological conditions (Fig. 1). Another prominent feature of nuclear organization is the non-random localization of chromosome territories and protein domains (Noordmans et al., 1998; Cremer and Cremer, 2001; Shiels et al., 2001; Kozubek et al., 2002; Tanabe et al., 2002). Our approach employs first-order nearest-neighbor statistics, commonly used in ecological studies (Clark and Evans, 1954; Sinclair, 1985), to characterize the spatial randomness of nuclear microenvironments (Fig. 1). Finally, the radial position of regulatory machinery for replication and transcription is functionally interrelated to the location of chromosomal territories as well as chromatin structure (Ma et al., 1998; Cook, 1999; Cremer and Cremer, 2001; Tumbar and Belmont, 2001; Kozubek et al., 2002). Intranuclear informatics establishes the placement of regulatory foci within the context of nuclear morphology (Fig. 1). Based on these biological observations, our approach describes and defines intranuclear organization utilizing 25 parameters, evaluated from digital fluorescence microscopic images. We have developed an image-processing and statistical algorithm to acquire measurements and compute the parameters from any number of images. The resulting data are then analyzed to quantitatively address specific biological questions using statistical approaches such as factor analysis and multivariate clustering techniques. In summary, intranuclear informatics incorporates the principal features of intranuclear organization to provide a vehicle for quantitatively defining nuclear structure-function interrelationships.
Intranuclear informatics reveals that the post-mitotic restoration of Runx subnuclear domain organization is functionally conserved
The hematopoietic and osteogenic Runx transcription factors are involved in tissue-specific gene expression and support cell differentiation (Tracey and Speck, 2000; Komori, 2002; Lutterbach and Hiebert, 2000; Lian and Stein, 2003). In the interphase nucleus Runx proteins are associated with the nuclear matrix and are organized into punctate domains (Zaidi et al., 2001; Zeng et al., 1997). These nuclear microenvironments spatially coincide with sites of active transcription and colocalize with several coregulatory proteins (Thomas et al., 2001; Javed et al., 2000; Lian and Stein, 2003; Harrington et al., 2002; Zaidi et al., 2002; Westendorf et al., 2002; Kundu et al., 2002; Zaidi et al., 2004). These observations suggest a direct link between the activity of Runx proteins and their spatiotemporal organization within the nucleus. We have recently demonstrated that Runx1 and Runx2 protein domains persist during mitosis, and undergo spatial and temporal reorganization resulting in equal partitioning into progeny nuclei (Zaidi et al., 2003). These mitotic alterations reflect natural perturbations in both nuclear structure and function and serve as a biological template for understanding Runx domain organization. Together, the dynamic distribution of Runx proteins provides a model for quantitative and comparative analysis of the subnuclear organization of regulatory proteins.
Here we have applied intranuclear informatics to understand the spatial organization of endogenous Runx2 domains in the interphase nucleus as well as following mitosis. Immunofluorescence microscopy confirms that the protein is distributed in punctate subnuclear domains (Fig. 2). We analyzed and compared 25 parameters of subnuclear organization in interphase nuclei and in both telophase nuclei. Our quantitative results show that most parameters are similar in interphase and telophase nuclei for Runx2. As expected telophase nuclei are significantly smaller than interphase. There are the same number of domains in progeny telophase nuclei but higher numbers in interphase nuclei. This observation is consistent with the mitotic partitioning of Runx proteins (Zaidi et al., 2003) (Fig. 2). We further find that Runx2 domains exhibit a non-random organization with spatial order. We conclude that the post-mitotic restoration of Runx subnuclear organization is functionally conserved in progeny cells.
Intranuclear informatics establishes molecular determinants for the spatial domain organization of Runx transcription factors
A viable candidate for elucidating the underlying requirements for Runx domain organization is the nuclear matrix targeting signal (NMTS). The NMTS is a conserved and unique Runx protein motif that is necessary and sufficient for directing the protein to matrix-associated intranuclear sites (Zaidi et al., 2001; Zeng et al., 1997). Biochemical, cellular and in vivo genetic approaches have established the requirement of the NMTS and associated functions in Runx control of cell differentiation and tissue-specific development (Choi et al., 2001; Yergeau et al., 1997). Importantly, mutations in Runx proteins that alter subnuclear targeting are associated with skeletal disease and leukemia (McNeil et al., 1999; Choi et al., 2001; Barseguian et al., 2002; Zhang et al., 2000).
Our experimental strategy combines mutagenesis, microscopy, and intranuclear informatics to understand the contribution of the NMTS to Runx domain organization. We examined wild-type Runx2, a C-terminal deletion (Runx2-ΔC) that lacks the NMTS, as well as four NMTS point mutations, using immunofluorescence microscopy. These mutants exhibit varying degrees of compromised intranuclear targeting and selective alterations in physical and functional protein-protein interactions (Zaidi et al., 2002; Zaidi et al., 2004), (our unpublished observations). Our intranuclear informatics analysis was performed on deconvoluted images (n=330) from nuclei of cells expressing these proteins. All of the Runx proteins localize to punctate domains within the nucleus (Fig. 3). Initial evaluation of subnuclear organization data reveals that there are significant differences in 17 of 25 parameters, as identified by ANOVA. By contrast, parameters that reflect variation in nearest-neighbor distances are not significantly altered by the mutations. These results are schematically demonstrated by a color representation of standardized data (Fig. 3). Collectively, our observations indicate that there are indeed alterations in the spatial domain organization of Runx proteins as a consequence of mutations in the NMTS.
Our analysis reveals that R398A and Y407A mutants share most of their properties with the wild-type protein; except for a reduction in the domain radial positioning for the Y407A mutant protein, and an increase in the domain size variability for the R398A mutant protein. Significant alterations in subnuclear organization were observed for the Y428A, Y433A and ΔC mutations. It has been proposed that nuclear microenvironments represent the steady state local accumulation of proteins resulting from dynamic molecular interactions providing threshold concentrations of regulatory factors for combinatorial control (Stein et al., 2000a; Stein et al., 2000b; Misteli, 2001; Stein et al., 2003). Consistent with this concept, the Y428A mutant, which has a significant reduction in the mean and variability in domain size (Fig. 3), functionally abrogates interactions between Smad and Runx, thereby blocking integration of TGFβ signals at matrix-associated subnuclear sites (Zaidi et al., 2002). Furthermore, impairment of Src/YAP signaling by the Y433A mutation correlates with a decrease in average domain size (Fig. 3). Similarly, deletion of the C-terminus of Runx2 abolishes interactions with several known co-regulators and results in reduced domain size and variability (Hanai et al., 1999; Javed et al., 2000; Westendorf et al., 2002; Thomas et al., 2001) (Fig. 3). Hence, the reduction in Runx domain size is likely to be a consequence of abrogated and/or altered protein-protein interactions. We further find that the NMTS may contribute to the spatial distribution of domains within the nucleus. This is demonstrated by a reduced variability in domain nearest-neighbor distances for Y428A, Y433A and the ΔC protein. In addition, the Runx2-ΔC protein, which has abrogated subnuclear targeting, exhibits a significant increase in mean domain nearest-neighbor distances as well as in domain density. We conclude that NMTS mediated intranuclear targeting is a functional determinant for the characteristic spatially ordered distribution of Runx domains.
Intranuclear informatics selectively discriminates between the subnuclear organization of wild-type and mutant Runx protein domains
To identify on a broader level the biological features of subnuclear organization that are predominantly influenced by the NMTS mutations we used factor analysis, a multivariate analytical tool for grouping related parameters (`factors'). Three factors that capture a large proportion of the biological variability and are readily interpretable describe the domain size (factor A), the domain packing (factor B) and the domain spatial randomness (factor C) (Table 1). We evaluated the subnuclear distribution of each protein by calculating `factor scores' and generated star-plots to compare wild-type Runx2 with each of the NMTS mutants (see Fig. 4 legend and Materials and Methods for details). Differences are evident in all three factors. Our analysis of these changes reveals that NMTS mutations have selective effects on Runx subnuclear organization (Fig. 4). Based upon the observed differences we can categorize the proteins into two groups. One group contains wild-type Runx2, R398A and Y407A, which exhibit similar spatial randomness and domain packing. The second group contains the remaining mutants with similar effects on domain packing, but selective effects on size and spatial randomness. Whereas Y428A and Y433A mutants display similar changes in spatial randomness, domain size alterations are common between the Y428A mutant and the Runx2-ΔC protein. Of all the mutants, the Runx-ΔC protein has the most prominent effect on the three factors collectively. Notably, this mutant protein exhibits compromised subnuclear targeting, fails to promote osteoblast differentiation, and has been linked to the human disease cleidocranial dysplasia (CCD) (Choi et al., 2001; Zhang et al., 2000). Taken together, our analysis selectively distinguishes between wild-type Runx2 and NMTS mutant proteins based upon the three factors of subnuclear organization.
Intranuclear informatics quantitatively bridges the spatial organization of protein domains with regulatory determinants of biological control
We have demonstrated that mutations in the NMTS have selective and specific effects on the architectural signature of Runx proteins (Figs 3 and 4). Consequently, it is important to comprehensively assimilate all the data to establish the overall degree of domain organizational similarity between wild-type and mutant proteins. Here we utilized hierarchical cluster analysis to group each protein on the basis of the 25 parameters that describe and define their subnuclear organization (Fig. 5). The dissimilarity between the subnuclear organization of wild-type and the functionally compromised Runx2-ΔC mutant is evident by the presence of two distinct clusters. This mutation removes the entire C-terminus including the subnuclear targeting signal and associated functions; homozygosity for the Runx2-ΔC allele results in embryonic lethality (Choi et al., 2001). We find that the subnuclear organization of the Y433A and Y428A mutants is similar to that of the Runx2-ΔC mutant. This observation is in agreement with our factor analysis and is consistent with the evidence that these mutant proteins are functionally compromised and are incompetent for integrating physiological signals, which include BMP/TGFβ and Src/YAP signaling (Zaidi et al., 2001; Zaidi et al., 2004). To provide further insight into the Runx nuclear structure-function relationships, we directly compare the hierarchical cluster arrangement and the intranuclear targeting competency of each protein, as well as their contribution to development and disease. This analysis reveals a link between Runx subnuclear domain organization and biological function. We conclude that the architectural organization of Runx transcription factors within the nucleus is fundamental to their tissue specific regulatory function.
Knowledge of the biochemical and genetic components of gene regulation, replication and repair far exceeds our understanding of the integration of these processes within the context of nuclear architecture (Stein et al., 2000a; Stein et al., 2000b; Lemon and Tjian, 2000; Dundr and Misteli, 2001; Iborra and Cook, 2002; Spector, 2003; Stein et al., 2003). Here we have established a bioinformatics approach to describe and define organization of protein domains within the nucleus. Intranuclear informatics provides the quantitative platform to capture the relevant parameters of subnuclear organization and relate them to the fundamental requirements for biological control. Using this approach, we have demonstrated that the focal subnuclear organization of Runx proteins is conserved in progeny cells. Our strategy has enabled us to discriminate between functional and non-functional Runx proteins based upon their domain organization within the nucleus. Furthermore, we have identified an architectural signature of Runx transcription factors that is coupled with fidelity of intranuclear targeting. In a broader context, intranuclear informatics can be applied to analyze subtle alterations in any spatially organized nuclear microenvironments under normal and pathological conditions.
We thank Brian Wiener of Worcester Polytechnic Institute and Jitesh Pratap of our laboratory for helpful comments in preparing this manuscript. We thank Elizabeth Bronstein and Judy Rask for editorial assistance. Studies reported were in part supported by grants from the National Institutes of Health (PO1CA82834, PO1AR48818, AR39588), as well as grant 5P30 DK32520 from the National Institute of Diabetes and Digestive and Kidney Diseases. The contents of this manuscript are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health.