Repeat expansion diseases are a large group of human genetic disorders caused by expansion of a specific short tandem repeat tract. Expansion in somatic cells affects age of onset and disease severity in some of these disorders. However, alleles in DNA derived from blood, a commonly used source of DNA, usually show much less expansion than disease-relevant cells in the central nervous system in both humans and mouse models. Here we examined the extent of expansion in different DNA sources from mouse models of the fragile X-related disorders, Huntington's disease, spinocerebellar ataxia type 1 and spinocerebellar ataxia type 2. We found that DNA isolated from stool is a much better indicator of somatic expansion than DNA from blood. As stool is a sensitive and noninvasive source of DNA, it can be useful for studies of factors affecting the risk of expansion, or the monitoring of treatments aimed at reducing expansion in preclinical trials, as it would allow expansions to be examined longitudinally in the same animal and allow significant changes in expansion to be observed much earlier than is possible with other DNA sources.

More than 40 genetic disorders are caused by expansion of a single short tandem repeat (STR) tract. These disorders are known as repeat expansion diseases (REDs), and include the fragile X-related disorders (FXDs) caused by expanded CGG/GCC repeats, and a large number of degenerative disorders caused by expanded CAG/CTG repeats such as Huntington's disease (HD), myotonic dystrophy type 1 (DM1) and many spinocerebellar ataxias (SCAs). The length of the repeat tract that is inherited is a major determinant of the age of onset of many REDs. However, these STRs tend to increase in repeat number both on intergenerational transmission and somatically. The extent of expansion increases with time but varies with cell type, with some cell types acquiring extremely large expansions in a relatively short period of time, and others remaining relatively stable for many years. Interest in somatic instability in the CAG/CTG repeat expansion diseases has been driven in part by studies showing that the tendency to expand somatically is associated with an earlier age at disease onset and/or more severe symptoms in HD (Swami et al., 2009; GeM_HD Consortium, 2019) and DM1 (Morales et al., 2020). In addition, genome-wide association studies (GWAS) have demonstrated that variants in mismatch repair (MMR) proteins and the structure-dependent endonuclease Fanconi-associated nuclease 1 (FAN1) are associated with pronounced variation in the age of disease onset in HD mutation carriers (GeM_HD Consortium, 2015; Hong et al., 2021). Similar results have been observed in several forms of SCA caused by CAG/CTG repeat expansions (Bettencourt et al., 2016). Similarly, loss of MSH2, an important MMR protein, has been shown to modify the timing of early disease onset in a mouse model of HD (Wheeler et al., 2003). Since mutations in MMR genes are associated with variations in the extent of somatic expansion in both patients (GeM_HD Consortium, 2019; Bettencourt et al., 2016; Flower et al., 2019; Kim et al., 2020) and mouse models (reviewed in Zhao et al., 2021a; Wheeler and Dion, 2021), this adds weight to the idea that somatic expansion is an important modifier of disease risk and/or severity. This has led to interest in pharmacological strategies that are focused on reducing somatic instability, a strategy that may be therapeutically useful for many of these diseases (Wiggins and Feigin, 2021).

Blood is generally used for measuring the repeat size in patients. However, blood cells show less somatic instability than disease-relevant cells, such as striatal neurons from both patients and mouse models of different repeat expansion diseases (Telenius et al., 1994; Takano et al., 1996; Pinto et al., 2020; Zhao et al., 2019; Kacher et al., 2021). A source of peripheral DNA that shows more extensive expansions would expedite testing of genetic factors affecting expansion risk and would be useful for monitoring therapeutic efforts to reduce expansions, at least in preclinical models. Previously we showed that expansion in the DNA derived from the small intestine is more extensive than in DNA from other tissues in a FXD mouse model (Zhao et al., 2018), and is sensitive to mutations in genetic factors like FAN1 that are important modifiers of expansion risk in patient cohorts and in mice (GeM_HD Consortium, 2015, 2019; Kim et al., 2020; Bettencourt et al., 2016; Ciosi et al., 2019; Mergener et al., 2020; Zhao et al., 2018, 2021b). Since the murine intestinal epithelium turns over every 3-4 days (Hughes et al., 1958; Walker and Leblond, 1958), large numbers of exfoliated cells from the intestine accumulate in stool. This makes mouse stool a useful source of genomic DNA for genotyping (Broome et al., 1999; Kalippke et al., 2009). Most of the exfoliated cells found in stool are epithelial cells from the colon (Iyengar et al., 1991). If the colonic epithelium is as prone to expansion as the cells of the small intestine, large expansions should also be detectable in stool samples. Here we show that the colon does show extensive expansions in mouse models of four different repeat expansion diseases, and these expansions are mirrored in the DNA isolated from stool.

Expansion in colon and stool DNA is higher than DNA from most other sources in an FXD mouse model

As with many other comparable studies, we used mice with larger repeat numbers than those present in the inherited allele in some of the repeat expansion diseases. However, the available evidence suggests that the same genetic factors affect the expansion of both longer and shorter repeats (reviewed in Zhao et al., 2019), and by using animals with larger repeat numbers, significant expansions can be observed within an experimentally reasonable time frame.

We collected DNA samples from FXD mice with ∼163 inherited repeats at 3 months of age, including from the colon and stool, as well as from urine and sperm. We then compared the extent of repeat expansion using the expansion index (EI) metric (Miller et al., 2020). The EI of tail samples taken at 3 weeks of age was used as the baseline. As shown in Fig. 1, DNA from the central nervous system (CNS), liver and tail showed a ∼2-fold increase in the EI at 3 months of age. The EI in testes and sperm was somewhat higher. The similarity in the EI in these two DNA sources is consistent with our previous demonstration that most expansion in the testis is confined to the gametes (Zhao and Usdin, 2018b). The small intestine, distal colon and stool samples showed the highest levels of expansion, with the colon and stool showing very similar expansion profiles. This similarity is consistent with the fact that epithelial cells are the most common cell in the postnatal colon, and are the cells most likely to be sloughed off into the intestinal lumen and thus present in stool. In contrast, as can be seen in Fig. 1B, blood and urine samples showed little or no significant change in the EI relative to tail DNA taken at 3 weeks. Most DNA isolated from urine has been shown to be derived from the epithelial cells from the kidneys and bladder (Abedini et al., 2021). Thus, not all epithelial cells were equally expansion prone. Since the yield of DNA from mouse urine was generally poor, we did not test this source of DNA further.

The correlation with the EI in striatum was much stronger for stool and sperm than for blood (Fig. 1C). Furthermore, as can be seen in Fig. 1D, although expansions in the striatum, stool and blood increased significantly with time, the amount of expansion in blood showed relatively little change between 3 and 12 months. Thus, sperm and stool samples were very sensitive indicators of expansion. However, stool has the advantage that not only was it found to be a slightly more sensitive indicator of expansion than sperm, but it can also be used to examine expansion in both males and females.

Expansion detected in stool DNA can be seen at an early age and increases with age and inherited repeat size in an FXD mouse model

To examine the extent of repeat expansion in the DNA from mouse stool, we collected fresh stool samples from FXD mice at different ages. Expansions were evident even in 1-month-old mice with 158+ repeats, with the repeat tract in stool DNA already being three to four repeats larger than it is in tail DNA, a DNA source that showed only modest expansions (Fig. 2A). Furthermore, even at this age, a modest effect of inherited allele size on the rate of repeat addition could be seen in mice with 170 repeats showing a gain of four repeats, compared to a gain of three repeats for mice with 158 and 163 repeats. As can be seen in Fig. 2B, even mice with only 146 repeats gained three repeats in DNA from stool by 2 months of age, and alleles detected in stool DNA continued to gain repeats over a 6-month period at a relatively consistent rate, resulting in the gain of 13 repeats relative to the tail DNA isolated from tail samples taken at 3 weeks of age. This corresponded to an average increase of two repeats per month, which increased to ∼3.5 repeats per month in mice with an inherited allele with 170 repeats (Fig. 2C).

Expansion in stool DNA reflects the effect of FAN1 on somatic expansion in an FXD mouse model

FAN1 has been identified as a genetic modifier of disease progression in a number of repeat expansion diseases (GeM_HD Consortium, 2015, 2019; Kim et al., 2020; Bettencourt et al., 2016; Ciosi et al., 2019; Mergener et al., 2020), and we previously showed that FAN1 protects against repeat expansion in an FXD mouse model (Zhao and Usdin, 2018a). Furthermore, a D963A point mutation in the nuclease domain of FAN1 nuclease was found to result in a significant increase in expansion in the striatum (Zhao et al., 2021b). Here, we show that the same mutation resulted in a significant increase in the EI in stool even in heterozygotes (Fig. 2D). In contrast, expansion in blood and sperm samples were similar in both wild-type (WT) and Fan1 mutant mice. Thus, stool DNA reflects the effect of an important genetic modifier of expansion risk in disease-relevant cells, whereas blood DNA does not.

Expansion in stool DNA is also a good indicator of somatic expansion in CAG/CTG repeat expansion diseases

To test whether stool DNA is also a good indicator of somatic expansion in other repeat expansion disease models, we measured the extent of expansion in stool DNA from a mouse model of HD. As in the FXD mice, expansions were low in heart samples. However, as can be seen in Fig. 3, the striatum and liver were the most expansion-prone tissues in this model, followed by the kidneys and cortex. In contrast, the small intestine and distal colon had a lower EI compared with that of the FXD mouse. However, the EI in stool was similar to other tissues in the HD mouse and higher than either blood or sperm (Fig. 3B). Significant correlations were seen between the EI in the striatum and the EI in both stool and blood (Fig. 3C). However, as can be seen in Fig. 3D, the EI in stool showed a larger increase with age than that seen in blood. Thus, stool was also a better indicator of somatic expansion in HD mice than blood. On average, the expansion rate in the DNA isolated from stool of HD mice with ∼112 inherited repeats was about 0.4 repeats per month (Fig. 3E).

We also examined expansion in two other CAG/CTG repeat expansion diseases, SCA1 and SCA2. As can be seen in Fig. 4, for most DNA sources, the extent of expansion in SCA1 mice was similar to that seen in HD mice, with the exception of stool, for which the expansion was significantly higher. The EI in stool was similar to that in the striatum, liver, small intestine and distal colon, and higher than in other tissues including blood and sperm. Expansion in stool DNA from SCA1 mice with ∼173 inherited repeats increased with age at the rate of ∼1.5 repeats per month. In SCA2 mice, the overall extent of expansion was much higher than that seen in FXD, HD and SCA1 mice. As can be seen in Fig. 5, although the EI in stool was lower than the striatum and liver, it was still much higher than in blood and sperm. The expansion rate in stool DNA from SCA2 mice with ∼155 inherited repeats was ∼2.5 repeats per month. The high expansion in stool DNA in both SCA1 and SCA2 mouse models suggests that expansion in DNA isolated from stool is also a sensitive indicator of somatic expansion in the SCA1 and SCA2 mouse models.

We have shown that repeat expansion can be detected in the DNA isolated from stool samples of mouse models of four different repeat expansion diseases. In the FXD (Fig. 1), SCA1 (Fig. 4) and SCA2 (Fig. 5) mouse models, the extent of expansion in mouse stool samples was higher than in blood, and comparable to or greater than expansion in the striatum. Although expansion in stool DNA in the HD mouse model was lower than that in the striatum, it was still higher than that in blood, and shows a better correlation with the extent of expansion in the striatum (Fig. 3). The extent of expansion in stool DNA is comparable to that seen in the distal colon, which is consistent with the fact that most of the host DNA isolated from stool is derived from this part of the digestive system. While colonic epithelial cells are rapidly dividing and neurons are post-mitotic, the available evidence from mouse models suggests that the genetic factors involved in generating somatic expansions are similar in both dividing and non-dividing cells (reviewed in Zhao et al., 2021a), and thus that the expansion mechanisms might also be similar. The correlation seen between the extent of expansion in stool samples and that seen in the striatum (Fig. 1C, Fig. 3C) suggests that the extent of expansion in stool DNA is a good indicator of the extent of expansion in the brain. However, since expansion is not seen in the DNA derived from urine, in which most host DNA is epithelial in origin, not all epithelial sources are equally prone to expansion.

Expansion in DNA isolated from stool is apparent at an early age, and is sensitive to inherited repeat size (Fig. 2). Importantly, expansion in stool DNA also reflects the effect of an important genetic modifier of expansion risk in the FXD mouse model (Fig. 2D). We previously showed that heterozygosity for a D963A point mutation resulted in significantly more expansions in the striatum, cerebellum and liver, but not in other tissues (Zhao et al., 2021b). As can be seen in Fig. 2D, a significant increase in the extent of expansion can also been seen in the DNA from the stool of these animals.

The propensity of any given cell type to expand likely reflects, at least in part, the balance between the levels of factors that promote expansion and those that protect against it. Notably, many organs show a similar extent of expansion in different RED mouse models. For example, heart shows little or no expansion in all the models, whereas the striatum and liver always show higher levels of expansion. This similarity would be consistent with the idea that the same genetic factors affect expansion in different mouse models. However, there is some discordance between the extent of expansion in some organs in different models. For example, the small intestine and distal colon show high levels of expansion in FXD, SCA1 and SCA2 mouse models, but not in the HD mouse model. The kidneys show more expansion in HD and SCA2 mouse models than either SCA1 or FXD mouse models, and the extent of expansion in sperm and testes is high in the FXD model but not in others. These differences might reflect differences in the level of transcription of the affected gene or some other effect of the flanking sequences.

Notably in all four mouse models, expansion in the CNS is highest in the striatum, even though the pathology of the SCAs primarily involves the cerebellum. However, the relatively low level of expansion in the cerebellum mirrors what is seen in SCA1 and SCA2 patients (Pinto et al., 2020; Chong et al., 1995; Lopes-Cendes et al., 1996; Zühlke et al., 1997; Hashida et al., 1997; Matsuura et al., 1999). Thus, the cells that accumulate the largest expansions are not necessarily those that are the most vulnerable to the downstream consequences of expansion. This parallels the observation that tissues expressing the highest levels of the pathogenic protein responsible for these diseases are not always the sites of greatest pathology either (Sharp et al., 1995). It might be that some cells are particularly sensitive to the toxic effects of the mutant protein. In those cells, the addition of a small number of repeats could have a significant effect.

HD is the only disease in the group in which gastrointestinal (GI) dysfunction is a major symptom. Signs of this dysfunction can appear early, before evidence of CNS neurodegeneration is apparent (Kobal et al., 2018; Wood et al., 2008; Andrich et al., 2009). Similar symptoms are seen in mouse models of HD where they are associated with a decrease in the length of the colon (Stan et al., 2020), the mucosal thickness and villus length (van der Burg et al., 2011). While the mutant protein that is responsible for HD pathology is widely expressed in cells of the GI tract, whether dysfunction is due to GI-cell-autonomous effects is unknown. Regardless, there are many other reasons why stool might be a useful source of DNA. First, stool collection is noninvasive and quick, causing minimum stress to the animals, and each stool pellet provides enough mouse DNA for ten or more PCR assays. Second, expansions in mouse stool DNA are more extensive than in blood DNA and are more sensitive to age and repeat size. This allows expansions to be more rapidly detected in stool samples from younger animals, even those with smaller repeat numbers. It also reduces the time needed to see meaningful differences in the extent of expansion in mice with different genotypes or mice that receive different potential expansion-modifying treatments. Third, at least in the case of one important known genetic modifier of expansion risk, namely FAN1, expansion in stool DNA mirrors what is seen in the brain. Finally, the simple, rapid and noninvasive collection of stool samples allows repeat length changes to be easily and frequently monitored in the same animal over time. Thus, the use of stool DNA should expedite studies on the expansion mechanism and experimental approaches to limit these expansions. Other readily accessible sources of DNA like hair follicles might also be worth testing. However, because of low yields (Picazo and García-Olmo, 2015) and contamination risk (Cinelli et al., 2007), these may be less than ideal. Furthermore, since hair follicles from different parts of the body are comprised of cells with different embryonic origins that have different gene expression profiles (reviewed in Driskell et al., 2011), they might also show differences in the extent of expansion.

There is some evidence to suggest that the propensity of different human tissues to expand is similar to that seen in these mouse models (Pinto et al., 2020; Chong et al., 1995; Lopes-Cendes et al., 1996; Zühlke et al., 1997; Hashida et al., 1997; Matsuura et al., 1999). In addition to the higher levels of expansion in the striatum and lower levels of expansion in the cerebellum of SCA1 and SCA2 patients mentioned above, HD patients show similarly elevated levels of expansion in the striatum, as well as in tissues like the liver that show high levels of expansion in mouse models (Pinto et al., 2020; Kennedy et al., 2003; Swami et al., 2009; Telenius et al., 1994; De Rooij et al., 1995). This raises the possibility that human stool samples could also show more extensive expansions than in blood. Although more work is required to properly evaluate the clinical use of somatic expansion as a measure of disease progression/onset risk, our results might have important clinical implications as stool might be more useful than blood for the assessment of somatic expansion risk. Stool might also be useful for monitoring the efficacy of clinical trials of therapies aimed at reducing somatic expansion. Furthermore, the similarity we have seen in the cell-type specificity of expansion in the four disease models suggests that stool could be a useful peripheral source of DNA for monitoring expansions in patients with other repeat expansion diseases and/or their mouse models.

Reagents

All reagents were from Sigma-Aldrich (St Louis, MO, USA) unless otherwise specified. Primers were from Life Technologies (Grand Island, NY, USA). Capillary electrophoresis of fluorescently labeled PCR products was carried out by the Roy J Carver Biotechnology Center, University of Illinois (Urbana, IL, USA).

Mouse generation, breeding and maintenance

The generation of FXD and Fan1 D963A mice was described previously (Entezam et al., 2007; Zhao et al., 2021b). These mice were maintained at the National Institutes of Health (NIH) in a manner consistent with the Guide for the Care and Use of Laboratory Animals (NIH publication no. 85-23, revised 1996) and in accordance with the guidelines of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Animal Care and Use Committee, who approved this research (ASP-K021-LMCB). The generation of the HD (Wheeler et al., 1999), SCA1 (Watase et al., 2002) and SCA2 (Sen et al., 2019) mouse models was described previously. HD mice, SCA1 mice (provided by Huda Zoghbi, Howard Hughes Medical Institute, Baylor College of Medicine, Houston, TX, USA) and SCA2 mice (provided by Georg Auburger, Goethe University Medical School, Frankfurt, Germany) were maintained at Western Washington University (WWU) in a manner consistent with protocols approved by the WWU institutional animal care and use committee. All mice are on a C57BL/6J background.

Mouse stool and urine sample collection

For stool collection, mice were moved to a clean cage with a mat. Three to five pieces of fresh stool were collected in a 1.5 ml tube and transferred to dry ice. Stool samples were kept at −80°C until further processing. Mouse urine was collected in one of two ways. The first involved holding the animal over a 1.5 ml collection tube while lightly stroking its belly, and the second involved placing the mouse in a clean, dry, empty cage covered with a plastic wrap until it urinated. The urine was then aspirated with a pipette and transfered to the collection tube. These steps were repeated until at least 300 μl of urine was collected. Urine samples were kept at 4°C for up to 24 h before processing, or at −80°C until further processing.

DNA isolation

DNA from the tails of 3-week-old mice was extracted for genotyping using the KAPA Mouse Genotyping Kit (KAPA Biosystems, Wilmington, MA, USA). A 5 cm region of the jejunum starting 10 cm downstream of the stomach was used as the small intestine sample. A 5 cm region of the colon upstream of the anus was used as the distal colon sample. DNA from tissue samples was isolated using a Maxwell®16 Mouse Tail DNA Purification Kit (Promega, Madison, WI, USA) according to the manufacturer's instructions. DNA from blood was isolated using a Maxwell®16 Blood DNA Purification Kit (Promega, Madison, WI, USA) according to the manufacturer's instructions. Sperm collection and DNA preparation were as previously described (Zhao and Usdin, 2018b). DNA from stool was isolated using a Norgen Stool DNA Isolation Kit (Norgen Biotek, Thorold, Ontario, Canada) according to the manufacturer's instructions. A single mouse stool pellet weighed an average of 30.8 mg and yielded ∼5.5 µg of DNA. This was sufficient for >10 repeat PCR assays (see below). DNA from urine was isolated using a Norgen Urine DNA Isolation Micro Kit (Norgen Biotek, Thorold, Ontario, Canada) according to the manufacturer's instructions.

Genotyping and analysis of repeat number

Repeat size analysis of the Fmr1, Atxn1, Atxn2 and Htt alleles in the FXD, SCA1, SCA2 and HD mice, respectively, was carried out using a fluorescent PCR assay with fluorescein amidite (FAM)-labeled primer pairs. The primers FAM-labeled FraxM4 (FAM-5′-CTTGAGGCCCAGCCGCCGTCGGCC-3′) and FraxM5 (5′-CGGGGGGCGTGCGGTAACGGCCCAA-3′) were used for the Fmr1 allele (Entezam et al., 2007), the primers FAM-labeled 8930 (FAM-5′-CAGACGCCGGGACACAAG-3′) and 8931 (5′-ATCATCGTCTGATGGGGATG-3′) were used for the Atxn1 allele (Watase et al., 2002), the primers FAM-labeled SCA2Ex1-Fwd5 (FAM-5′-CCCCGCCCGGCGTGCGAGCCGGTGTAT-3′) and SCA2Ex1-Rev2 (5′-CGGGCTTGCGGCCAGTGG-3′) were used for the Atxn2 allele (Sen et al., 2019), and the primers FAM-labeled CAG1 (FAM-5′-ATGAAGGCCTTCGAGTCCCTCAAGTCCTTC-3′) and HU3 (5′-GGCGGCTGAGGAAGCTGAGGA-3′) were used for the Htt allele (Lee et al., 2010). The amount of DNA template in the PCR mix varied for different samples. For blood, sperm and tissue samples, 100 ng of DNA was used as a template. For stool samples, 200-400 ng of DNA was used as a template, as DNA isolated from stool contains a large amount of microbial DNA. For urine samples, 10-50 ng of DNA was used as a template, as the DNA yield varied between urine samples from different animals. The PCR mix for the Fmr1, Atxn1 and Atxn2 alleles contained 2 μl DNA template, 1× KAPA2G Fast HotStart Genotyping Mix (KAPA Biosystems, Wilmington, MA, USA), 2.4 M betaine, 2% DMSO and 0.5 μM each of the primers. PCR parameters for the Fmr1 allele were: 95°C for 10 min; 35 cycles of 95°C for 30 s, 65°C for 30 s and 72°C for 90 s; followed by incubation at 72°C for 10 min. PCR parameters for the Atxn1 allele were: 95°C for 10 min; 40 cycles of 95°C for 30 s, 58°C for 30 s and 72°C for 90 s; followed by incubation at 72°C for 10 min. PCR parameters for the Atxn2 allele were: 95°C for 10 min; 40 cycles of 95°C for 40 s, 60°C for 40 s and 72°C for 90 s; followed by incubation at 72°C for 10 min. The PCR mix for the Htt allele contained 2 μl DNA template, 1× KAPA2G Fast HotStart Genotyping Mix (KAPA Biosystems, Wilmington, MA, USA), 1.2 M betaine, 1% DMSO and 0.5 μM each of the primers. Touchdown PCR was used to amplify the Htt allele with the following parameters: 95°C for 10 min; 10 cycles of 95°C for 30 s, 72°C with −1°C/cycle for 30 s and 72°C for 90 s; 28 cycles of 95°C for 30 s, 63°C for 30 s and 72°C for 90 s; followed by incubation at 72°C for 10 min. The PCR products were resolved by capillary electrophoresis on an ABI Genetic Analyzer. The resultant fsa file was then displayed using a previously described custom R script (Hayward et al., 2016) that is available upon request.

The number of repeats in the modal allele found in tail samples collected from 3-week-old mice was used as an indicator of the number of original inherited repeats for all samples collected from that animal. For tissues showing a unimodal distribution of allele sizes, the difference between the repeat number present in the modal allele and the repeat number of the modal allele in tail DNA from 3-week-old mice was used as a measure of the extent of expansion. For tissues with a bimodal distribution of alleles, the extent of expansion was calculated by subtracting the repeat number in tail DNA from 3-week-old mice from the number of repeats of the modal allele in the larger of the two allele populations. The expansion rate was calculated based on data from an average of at least two different time points between 2 to 12 months from the same animal. We also quantified somatic expansions using the EI as a metric (Miller et al., 2020). The EI of tail samples taken from 3-week-old mice was used as the baseline.

Statistical analyses

Statistical analyses were performed using GraphPad Prism 9.3. For comparisons of EI in different samples to EI in stool or tail DNA taken at 3 weeks, statistical significance was assessed using either a mixed-effects model when not all organs were available for all animals, or a repeated measures (RM) one-way ANOVA, both with Geisser–Greenhouse correction and Dunnett's correction for multiple comparisons. For comparisons of EI in samples with different ages or different genotypes, statistical significance was assessed using the RM two-way ANOVA with Geisser-Greenhouse correction and Tukey's correction for multiple comparisons. For comparisons of EI in stool and blood at different ages, statistical significance was assessed using paired (stool versus blood) or unpaired (6 months versus 14 months) two-tailed t-test with Holm–Sidak's correction for multiple comparisons. Correlation between EI of different tissues was assessed using linear regression.

The authors would like to thank Professors Georg Auburger and Huda Zoghbi for kindly providing us with the SCA2 and SCA1 mice, respectively. We also would like to acknowledge all the hard work done by the staff who take care of our mice and without whom this work would not have been possible.

Author contributions

Conceptualization: X.Z., K.U.; Methodology: X.Z.; Validation: X.Z.; Formal analysis: X.Z., D.A.J.; Investigation: X.Z., C.M., S.R.C., D.A.J., E.A.; Resources: J.B.C., K.U.; Data curation: X.Z., C.M., S.R.C., D.A.J., E.A.; Writing - original draft: X.Z., K.U.; Writing - review & editing: X.Z., C.M., S.R.C., D.A.J., J.B.C., K.U.; Visualization: X.Z.; Supervision: X.Z., J.B.C., K.U.; Project administration: X.Z.; Funding acquisition: J.B.C., K.U.

Funding

This work was made possible by funding from the Intramural Program of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) to K.U. (DK057808). Work in the Carroll lab was supported by funds provided by the Behavioral Neuroscience Program at Western Washington University. Open access funding provided by NIDDK. Deposited in PMC for immediate release.

Abedini
,
A.
,
Zhu
,
Y. O.
,
Chatterjee
,
S.
,
Halasz
,
G.
,
Devalaraja-Narashimha
,
K.
,
Shrestha
,
R.
,
Balzer
,
M. S.
,
Park
,
J.
,
Zhou
,
T.
,
Ma
,
Z.
et al. 
(
2021
).
Urinary single-cell profiling captures the cellular diversity of the kidney
.
J. Am. Soc. Nephrol.
32
,
614
-
627
.
Andrich
,
J. E.
,
Wobben
,
M.
,
Klotz
,
P.
,
Goetze
,
O.
and
Saft
,
C.
(
2009
).
Upper gastrointestinal findings in Huntington's disease: patients suffer but do not complain
.
J. Neural. Transm.
116
,
1607
-
1611
.
Bettencourt
,
C.
,
Hensman-Moss
,
D.
,
Flower
,
M.
,
Wiethoff
,
S.
,
Brice
,
A.
,
Goizet
,
C.
,
Stevanin
,
G.
,
Koutsis
,
G.
,
Karadima
,
G.
,
Panas
,
M.
et al. 
(
2016
).
DNA repair pathways underlie a common genetic mechanism modulating onset in polyglutamine diseases
.
Ann. Neurol.
79
,
983
-
990
.
Broome
,
R. L.
,
Feng
,
L.
,
Zhou
,
Q.
,
Smith
,
A.
,
Hahn
,
N.
,
Matsui
,
S. M.
and
Omary
,
M. B.
(
1999
).
Non-invasive transgenic mouse genotyping using stool analysis
.
FEBS Lett.
462
,
159
-
160
.
Chong
,
S. S.
,
McCall
,
A. E.
,
Cota
,
J.
,
Subramony
,
S. H.
,
Orr
,
H. T.
,
Hughes
,
M. R.
and
Zoghbi
,
H. Y.
(
1995
).
Gametic and somatic tissue-specific heterogeneity of the expanded SCA1 CAG repeat in spinocerebellar ataxia type 1
.
Nat. Genet.
10
,
344
-
350
.
Cinelli
,
P.
,
Rettich
,
A.
,
Seifert
,
B.
,
Bürki
,
K.
and
Arras
,
M.
(
2007
).
Comparative analysis and physiological impact of different tissue biopsy methodologies used for the genotyping of laboratory mice
.
Lab. Anim.
41
,
174
-
184
.
Ciosi
,
M.
,
Maxwell
,
A.
,
Cumming
,
S. A.
,
Hensman Moss
,
D. J.
,
Alshammari
,
A. M.
,
Flower
,
M. D.
,
Durr
,
A.
,
Leavitt
,
B. R.
,
Roos
,
R. A. C.
,
Holmans
,
P.
,
TRACK-HD team, Enroll-HD team
et al.  (
2019
).
A genetic association study of glutamine-encoding DNA sequence structures, somatic CAG expansion, and DNA repair gene variants, with Huntington disease clinical outcomes
.
EBioMedicine
48
,
568
-
580
.
De Rooij
,
K. E.
,
De Koning Gans
,
P. A. M.
,
Roos
,
R. A. C.
,
Van Ommen
,
G.-J. B.
and
Den Dunnen
,
J. T.
(
1995
).
Somatic expansion of the (CAG) n repeat in Huntington disease brains
.
Hum. Genet.
95
,
270
-
274
.
Driskell
,
R. R.
,
Clavel
,
C.
,
Rendl
,
M.
and
Watt
,
F. M.
(
2011
).
Hair follicle dermal papilla cells at a glance
.
J. Cell Sci.
124
,
1179
-
1182
.
Entezam
,
A.
,
Biacsi
,
R.
,
Orrison
,
B.
,
Saha
,
T.
,
Hoffman
,
G. E.
,
Grabczyk
,
E.
,
Nussbaum
,
R. L.
and
Usdin
,
K.
(
2007
).
Regional FMRP deficits and large repeat expansions into the full mutation range in a new Fragile X premutation mouse model
.
Gene
395
,
125
-
134
.
Flower
,
M.
,
Lomeikaite
,
V.
,
Ciosi
,
M.
,
Cumming
,
S.
,
Morales
,
F.
,
Lo
,
K.
,
Hensman Moss
,
D.
,
Jones
,
L.
,
Holmans
,
P.
,
Monckton
,
D. G.
et al. 
(
2019
).
MSH3 modifies somatic instability and disease severity in Huntington's and myotonic dystrophy type 1
.
Brain
142
,
1876
-
1886
.
Genetic Modifiers of Huntington's Disease (GeM_HD) Consortium.
(
2015
).
Identification of genetic factors that modify clinical onset of Huntington's disease
.
Cell
162
,
516
-
526
.
Genetic Modifiers of Huntington's Disease (GeM_HD) Consortium.
(
2019
).
CAG repeat not polyglutamine length determines timing of Huntington's disease onset
.
Cell
178
,
887
-
900.e14
.
Hashida
,
H.
,
Goto
,
J.
,
Kurisaki
,
H.
,
Mizusawa
,
H.
and
Kanazawa
,
I.
(
1997
).
Brain regional differences in the expansion of a CAG repeat in the spinocerebellar ataxias: dentatorubral-pallidoluysian atrophy, Machado-Joseph disease, and spinocerebellar ataxia type 1
.
Ann. Neurol.
41
,
505
-
511
.
Hayward
,
B. E.
,
Zhou
,
Y.
,
Kumari
,
D.
and
Usdin
,
K.
(
2016
).
A set of assays for the comprehensive analysis of FMR1 alleles in the Fragile X-related disorders
.
J. Mol. Diagn.
18
,
762
-
774
.
Hong
,
E. P.
,
MacDonald
,
M. E.
,
Wheeler
,
V. C.
,
Jones
,
L.
,
Holmans
,
P.
,
Orth
,
M.
,
Monckton
,
D. G.
,
Long
,
J. D.
,
Kwak
,
S.
,
Gusella
,
J. F.
et al. 
(
2021
).
Huntington's disease pathogenesis: two sequential components
.
J. Huntingtons Dis.
10
,
35
-
51
.
Hughes
,
W. L.
,
Bond
,
V. P.
,
Brecher
,
G.
,
Cronkite
,
E. P.
,
Painter
,
R. B.
,
Quastler
,
H.
and
Sherman
,
F. G.
(
1958
).
Cellular proliferation in the mouse as revealed by autoradiography with tritiated thymidine
.
Proc. Natl. Acad. Sci. USA
44
,
476
-
483
.
Iyengar
,
V.
,
Albaugh
,
G. P.
,
Lohani
,
A.
and
Nair
,
P. P.
(
1991
).
Human stools as a source of viable colonic epithelial cells
.
FASEB J.
5
,
2856
-
2859
.
Kacher
,
R.
,
Lejeune
,
F.-X.
,
Noël
,
S.
,
Cazeneuve
,
C.
,
Brice
,
A.
,
Humbert
,
S.
and
Durr
,
A.
(
2021
).
Propensity for somatic expansion increases over the course of life in Huntington disease
.
Elife
10
,
e64674
.
Kalippke
,
K.
,
Werwitzke
,
S.
,
von Hornung
,
M.
,
Mischke
,
R.
,
Ganser
,
A.
and
Tiede
,
A.
(
2009
).
DNA analysis from stool samples: a fast and reliable method avoiding invasive sampling methods in mouse models of bleeding disorders
.
Lab. Anim.
43
,
390
-
393
.
Kennedy
,
L.
,
Evans
,
E.
,
Chen
,
C. M.
,
Craven
,
L.
,
Detloff
,
P. J.
,
Ennis
,
M.
and
Shelbourne
,
P. F.
(
2003
).
Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis
.
Hum. Mol. Genet.
12
,
3359
-
3367
.
Kim
,
K. H.
,
Hong
,
E. P.
,
Shin
,
J. W.
,
Chao
,
M. J.
,
Loupe
,
J.
,
Gillis
,
T.
,
Mysore
,
J. S.
,
Holmans
,
P.
,
Jones
,
L.
,
Orth
,
M.
et al. 
(
2020
).
Genetic and functional analyses point to FAN1 as the source of multiple Huntington disease modifier effects
.
Am. J. Hum. Genet.
107
,
96
-
110
.
Kobal
,
J.
,
Matej
,
K.
,
Koželj
,
M.
and
Podnar
,
S.
(
2018
).
Anorectal dysfunction in presymptomatic mutation carriers and patients with Huntington's disease
.
J. Huntingtons Dis.
7
,
259
-
267
.
Lee
,
J.-M.
,
Zhang
,
J.
,
Su
,
A. I.
,
Walker
,
J. R.
,
Wiltshire
,
T.
,
Kang
,
K.
,
Dragileva
,
E.
,
Gillis
,
T.
,
Lopez
,
E. T.
,
Boily
,
M.-J.
et al. 
(
2010
).
A novel approach to investigate tissue-specific trinucleotide repeat instability
.
BMC Syst. Biol.
4
,
29
.
Lopes-Cendes
,
I.
,
Maciel
,
P.
,
Kish
,
S.
,
Gaspar
,
C.
,
Robitaille
,
Y.
,
Clark
,
H. B.
,
Koeppen
,
A. H.
,
Nance
,
M.
,
Schut
,
L.
,
Silveira
,
I.
et al. 
(
1996
).
Somatic mosaicism in the central nervous system in spinocerebellar ataxia type 1 and Machado-Joseph disease
.
Ann. Neurol.
40
,
199
-
206
.
Matsuura
,
T.
,
Sasaki
,
H.
,
Yabe
,
I.
,
Hamada
,
K.
,
Hamada
,
T.
,
Shitara
,
M.
and
Tashiro
,
K.
(
1999
).
Mosaicism of unstable CAG repeats in the brain of spinocerebellar ataxia type 2
.
J. Neurol.
246
,
835
-
839
.
Mergener
,
R.
,
Furtado
,
G. V.
,
de Mattos
,
E. P.
,
Leotti
,
V. B.
,
Jardim
,
L. B.
and
Saraiva-Pereira
,
M. L.
(
2020
).
Variation in DNA repair system gene as an additional modifier of age at onset in spinocerebellar Ataxia type 3/Machado-Joseph disease
.
Neuromolecular Med.
22
,
133
-
138
.
Miller
,
C. J.
,
Kim
,
G.-Y.
,
Zhao
,
X.
and
Usdin
,
K.
(
2020
).
All three mammalian MutL complexes are required for repeat expansion in a mouse cell model of the Fragile X-related disorders
.
PLoS Genet.
16
,
e1008902
.
Morales
,
F.
,
Vásquez
,
M.
,
Corrales
,
E.
,
Vindas-Smith
,
R.
,
Santamaría-Ulloa
,
C.
,
Zhang
,
B.
,
Sirito
,
M.
,
Estecio
,
M. R.
,
Krahe
,
R.
and
Monckton
,
D. G.
(
2020
).
Longitudinal increases in somatic mosaicism of the expanded CTG repeat in myotonic dystrophy type 1 are associated with variation in age-at-onset
.
Hum. Mol. Genet.
29
,
2496
-
2507
.
Picazo
,
M. G.
and
García-Olmo
,
D. C.
(
2015
).
DNA from tissues of young mice is optimal for genotyping
.
Electron. J. Biotechnol.
18
,
83
-
87
.
Pinto
,
M.
,
Arning
,
R.
,
Giordano
,
L.
,
Razghandi
,
J. V.
,
Andrew
,
P.
,
Gillis
,
M. A.
,
Correia
,
T.
,
Mysore
,
K.
,
Grote Urtubey
,
J. S.
,
Parwez
,
D. M.
et al. 
(
2020
).
Patterns of CAG repeat instability in the central nervous system and periphery in Huntington's disease and in spinocerebellar ataxia type 1
.
Hum. Mol. Genet.
29
,
2551
-
2567
.
Sen
,
N.-E.
,
Canet-Pons
,
J.
,
Halbach
,
M. V.
,
Arsovic
,
A.
,
Pilatus
,
U.
,
Chae
,
W.-H.
,
Kaya
,
Z.-E.
,
Seidel
,
K.
,
Rollmann
,
E.
,
Mittelbronn
,
M.
et al. 
(
2019
).
Generation of an Atxn2-CAG100 knock-in mouse reveals N-acetylaspartate production deficit due to early Nat8l dysregulation
.
Neurobiol. Dis.
132
,
104559
.
Sharp
,
A. H.
,
Loev
,
S. J.
,
Schilling
,
G.
,
Li
,
S. H.
,
Li
,
X. J.
,
Bao
,
J.
,
Wagster
,
M. V.
,
Kotzuk
,
J. A.
,
Steiner
,
J. P.
,
Lo
,
A.
et al. 
(
1995
).
Widespread expression of Huntington's disease gene (IT15) protein product
.
Neuron
14
,
1065
-
1074
.
Stan
,
T. L.
,
Soylu-Kucharz
,
R.
,
Burleigh
,
S.
,
Prykhodko
,
O.
,
Cao
,
L.
,
Franke
,
N.
,
Sjogren
,
M.
,
Haikal
,
C.
,
Hållenius
,
F.
and
Björkqvist
,
M.
(
2020
).
Increased intestinal permeability and gut dysbiosis in the R6/2 mouse model of Huntington's disease
.
Sci. Rep.
10
,
18270
.
Swami
,
M.
,
Hendricks
,
A. E.
,
Gillis
,
T.
,
Massood
,
T.
,
Mysore
,
J.
,
Myers
,
R. H.
and
Wheeler
,
V. C.
(
2009
).
Somatic expansion of the Huntington's disease CAG repeat in the brain is associated with an earlier age of disease onset
.
Hum. Mol. Genet.
18
,
3039
-
3047
.
Takano
,
H.
,
Onodera
,
O.
,
Takahashi
,
H.
,
Igarashi
,
S.
,
Yamada
,
M.
,
Oyake
,
M.
,
Ikeuchi
,
T.
,
Koide
,
R.
,
Tanaka
,
H.
,
Iwabuchi
,
K.
et al. 
(
1996
).
Somatic mosaicism of expanded CAG repeats in brains of patients with dentatorubral-pallidoluysian atrophy: cellular population-dependent dynamics of mitotic instability
.
Am. J. Hum. Genet.
58
,
1212
-
1222
. Available at: .
Telenius
,
H.
,
Kremer
,
B.
,
Goldberg
,
Y. P.
,
Theilmann
,
J.
,
Andrew
,
S. E.
,
Zeisler
,
J.
,
Adam
,
S.
,
Greenberg
,
C.
,
Ives
,
E. J.
,
Clarke
,
L. A.
et al. 
(
1994
).
Somatic and gonadal mosaicism of the Huntington disease gene CAG repeat in brain and sperm
.
Nat. Genet.
6
,
409
-
414
.
van der Burg
,
J. M.
,
Winqvist
,
A.
,
Aziz
,
N. A.
,
Maat-Schieman
,
M. L.
,
Roos
,
R. A.
,
Bates
,
G. P.
,
Brundin
,
P.
,
Björkqvist
,
M.
and
Wierup
,
N.
(
2011
).
Gastrointestinal dysfunction contributes to weight loss in Huntington's disease mice
.
Neurobiol. Dis.
44
,
1
-
8
.
Walker
,
B. E.
and
Leblond
,
C. P.
(
1958
).
Sites of nucleic acid synthesis in the mouse visualized by radioautography after administration of C14-labelled adenine and thymidine
.
Exp. Cell Res.
14
,
510
-
531
.
Watase
,
K.
,
Weeber
,
E. J.
,
Xu
,
B.
,
Antalffy
,
B.
,
Yuva-Paylor
,
L.
,
Hashimoto
,
K.
,
Kano
,
M.
,
Atkinson
,
R.
,
Sun
,
Y.
,
Armstrong
,
D. L.
et al. 
(
2002
).
A long CAG repeat in the mouse Sca1 locus replicates SCA1 features and reveals the impact of protein solubility on selective neurodegeneration
.
Neuron
34
,
905
-
919
.
Wheeler
,
V. C.
and
Dion
,
V.
(
2021
).
Modifiers of CAG/CTG repeat instability: insights from mammalian models
.
J. Huntingtons Dis.
10
,
123
-
148
.
Wheeler
,
V. C.
,
Auerbach
,
W.
,
White
,
J. K.
,
Srinidhi
,
J.
,
Auerbach
,
A.
,
Ryan
,
A.
,
Duyao
,
M. P.
,
Vrbanac
,
V.
,
Weaver
,
M.
,
Gusella
,
J. F.
et al. 
(
1999
).
Length-dependent gametic CAG repeat instability in the Huntington's disease knock-in mouse
.
Hum. Mol. Genet.
8
,
115
-
122
.
Wheeler
,
V. C.
,
Lebel
,
L. A.
,
Vrbanac
,
V.
,
Teed
,
A.
,
te Riele
,
H.
and
MacDonald
,
M. E.
(
2003
).
Mismatch repair gene Msh2 modifies the timing of early disease in HdhQ111 striatum
.
Hum. Mol. Genet.
12
,
273
-
281
.
Wiggins
,
R.
and
Feigin
,
A.
(
2021
).
Emerging therapeutics in Huntington's disease
.
Expert Opin. Emerg. Drugs
26
,
295
-
302
.
Wood
,
N. I.
,
Goodman
,
A. O.
,
van der Burg
,
J. M.
,
Gazeau
,
V.
,
Brundin
,
P.
,
Bjorkqvist
,
M.
,
Petersén
,
Å.
,
Tabrizi
,
S. J.
,
Barker
,
R. A.
and
Morton
,
A. J.
(
2008
).
Increased thirst and drinking in Huntington's disease and the R6/2 mouse
.
Brain Res. Bull.
76
,
70
-
79
.
Zhao
,
X.-N.
and
Usdin
,
K.
(
2018a
).
FAN1 protects against repeat expansions in a Fragile X mouse model
.
DNA Repair
69
,
1
-
5
.
Zhao
,
X.-N.
and
Usdin
,
K.
(
2018b
).
Timing of expansion of Fragile X premutation alleles during intergenerational transmission in a mouse model of the Fragile X-related disorders
.
Front Genet.
9
,
314
.
Zhao
,
X.
,
Zhang
,
Y.
,
Wilkins
,
K.
,
Edelmann
,
W.
and
Usdin
,
K.
(
2018
).
MutLγ promotes repeat expansion in a Fragile X mouse model while EXO1 is protective
.
PLoS Genet.
14
,
e1007719
.
Zhao
,
X.
,
Gazy
,
I.
,
Hayward
,
B.
,
Pintado
,
E.
,
Hwang
,
Y. H.
,
Tassone
,
F.
and
Usdin
,
K.
(
2019
).
Repeat instability in the Fragile X-related disorders: lessons from a mouse model
.
Brain Sci.
9
,
52
.
Zhao
,
X.
,
Kumari
,
D.
,
Miller
,
C. J.
,
Kim
,
G.-Y.
,
Hayward
,
B.
,
Vitalo
,
A. G.
,
Pinto
,
R. M.
and
Usdin
,
K.
(
2021a
).
Modifiers of somatic repeat instability in mouse models of Friedreich Ataxia and the Fragile X-Related disorders: implications for the mechanism of somatic expansion in Huntington's disease
.
J. Huntingtons Dis.
10
,
149
-
163
.
Zhao
,
X.
,
Lu
,
H.
and
Usdin
,
K.
(
2021b
).
FAN1's protection against CGG repeat expansion requires its nuclease activity and is FANCD2-independent
.
Nucleic Acids Res.
49
,
11643
-
11652
.
Zühlke
,
C.
,
Hellenbroich
,
Y.
,
Schaaff
,
F.
,
Gehlken
,
U.
,
Wessel
,
K.
,
Schubert
,
T.
,
Cervos-Navarro
,
J.
,
Pickartz
,
H.
and
Schwinger
,
E.
(
1997
).
CAG repeat analyses in frozen and formalin-fixed tissues following primer extension preamplification for evaluation of mitotic instability of expanded SCA1 alleles
.
Hum. Genet.
100
,
339
-
344
.

Competing interests

J.B.C. is on the scientific advisory board, and has shares in Triplet Therapeutics, which is a company focused on modulating somatic instability in HD and other diseases. Other authors declare no competing or financial interests.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.