SUMMARY
Chronic increases in circulating corticosterone (CORT) generally suppress immune function, but it is not known whether evolved increases necessarily have similar adverse effects. Moreover, the evolution of immune function might be constrained by the sharing of signaling molecules, such as CORT, across numerous physiological systems. Laboratory house mice (Mus domesticus Linnaeus) from four replicate lines selectively bred for high voluntary wheel running (HR lines) generally had baseline circulating CORT approximately twofold higher than in four non-selected control (C) lines. To test whether elevated baseline CORT suppresses the inflammatory response in HR mice, we injected females with lipopolysaccharide (LPS). All mice injected with LPS exhibited classic signs of an inflammatory response, including sickness behavior, loss of body mass, reduced locomotor activity (i.e. voluntary wheel running), enlarged spleens and livers, elevated hematocrit and elevated inflammatory cytokines. However, as compared with C mice, the inflammatory response was not suppressed in HR mice. Our results, and those of a previous study, suggest that selective breeding for high voluntary exercise has not altered immune function. They also suggest that the effects of evolved differences in baseline CORT levels may differ greatly from effects of environmental factors (often viewed as ‘stressors’) that alter baseline CORT during an individual’s lifetime. In particular, evolved increases in circulating levels of ‘stress hormones’ are not necessarily associated with detrimental suppression of the inflammatory response, presumably as a result of correlated evolution of other physiological systems (counter-measures). Our results have important implications for the interpretation of elevated stress hormones and of immune indicators in natural populations.
INTRODUCTION
The ability to respond to infectious agents can significantly affect Darwinian fitness. Although comparative, ecological and evolutionary physiologists have long been interested in the importance of physiological variation in nature, only recently has a serious appreciation for the ecological significance of immune function developed (Lochmiller and Deerenberg, 2000; Ricklefs and Wikelski, 2002; Zuk and Stoehr, 2002; Lee, 2006). Particular interest has focused on the question of whether immune function trades off with other aspects of physiology, and whether immune function mediates trade-offs among other physiological functions (Lochmiller and Deerenberg, 2000; Ricklefs and Wikelski, 2002; Tieleman et al., 2005; Martin et al., 2006; Martin et al., 2008).
In life-history theory, much effort has been dedicated to examining potential trade-offs among major components of life history and also among lower-level traits, including aspects of physiological function, that contribute to survival and reproduction (Stearns, 1992; Clobert et al., 1998; Roff, 2002). Energy is often the currency used to explore these trade-offs (Pough, 1980; Sibly and Calow, 1986; Bennett, 1987; Gittleman and Thompson, 1988; Stearns, 1989), but other mechanisms such as variation in signaling molecules that affect numerous traits might also play a key role in mediating trade-offs (Stearns, 1989; Zera and Harshman, 2001; Ricklefs and Wikelski, 2002; Robinson et al., 2010). Such trade-offs seem particularly plausible in relation to endocrine variation because various hormones (e.g. corticosterone, CORT) influence immune function and many other aspects of physiology and behavior, and might ultimately be involved in the regulation of suites of life-history traits (Harvey et al., 1984; Dallman et al., 1995; Sapolsky et al., 2000; Ricklefs and Wikelski, 2002).
CORT, primarily associated with the stress response, is also involved in the regulation of energy metabolism at rest and during physical activity [see the following and references therein (Dallman et al., 1995; Coleman et al., 1998; Girard and Garland, 2002)] and in the regulation of the immune response (Harvey et al., 1984; Sapolsky et al., 2000). High levels of circulating CORT promote the release of stored energy and alter immune function (Sapolsky et al., 2000; Jacobson, 2005). When elevated for a short time, CORT is thought to contribute to immediate survival of stressful situations by causing increased energy availability. Acute increases in CORT also lead to redistribution of immune resources and enhancement of some measures of immune function (Martin, 2009). However, chronically high CORT levels produce a potential trade-off because CORT likely suppresses other essential activities, including immune function, growth and reproduction (Raberg et al., 1998; Moberg, 2000; Sapolsky et al., 2000; Wingfield and Kitaysky, 2002).
Additionally, CORT facilitates the return of physiological systems to homeostasis after an inflammatory response. Inflammation is a component of the innate immune system and a general response to microbes. Inflammation is physiologically costly as a result of both increased rates of energy use and potential damage to healthy cells (Klasing and Leshchinsky, 1999). As an immunosuppressor, CORT mediates some negative effects by regulating the strength of the inflammatory response (Munck and Náray-Fejes-Tóth, 1992; Baumann and Gauldie, 1994). Within an individual animal, both acute and chronic increases in CORT suppress immune function (Wingfield et al., 1998; Sapolsky et al., 2000; McEwen and Wingfield, 2003; Korte et al., 2005). However, evolutionary changes have led to differences in baseline levels of CORT among populations and among species (e.g. Abbott et al., 2003; Goymann and Wingfield, 2004; Romero, 2004; Bonier et al., 2009; Cockrem et al., 2009), and it is unclear whether or how such differences affect immune responses, particularly inflammatory responses. To test whether evolutionarily elevated baseline CORT suppresses immune function, we studied laboratory house mice (Mus domesticus Linnaeus) from lines that had been artificially selected for 55 generations for high voluntary wheel running (HR lines). Mice from the four replicate HR lines run 2.5–3.0 times more revolutions per day than those from four non-selected control (C) lines when given access to wheels (Garland et al., 2011a). In addition, these HR mice evolved baseline levels of circulating CORT that are approximately twice those of C mice (Malisch et al., 2007; Malisch et al., 2008). [Moreover, during nightly wheel running the HR mice experience exercise-associated elevations in CORT greater than those observed in C mice (Girard and Garland, 2002).] The elevated baseline CORT levels in HR mice potentially increase energy availability during exercise, and hence could be an adaptation (Malisch et al., 2007; Malisch et al., 2008), but the high CORT could also result in a chronic suppression of the inflammatory response (Malisch et al., 2009b), which could be viewed as a cost or trade-off.
In the present study, we challenged HR and C mice with lipopolysaccharide (LPS) from Escherichia coli. LPS is an endotoxin found on the cell wall of gram-negative bacteria. LPS imitates a bacterial infection and triggers an inflammatory response and all its associated costs in terms of energy and nutrients. However, LPS itself does not actually cause any sickness; thus, any changes in physiology are purely the result of the immune response (Elin and Wolff, 1976). Hence, by using a LPS inoculation we would be able to identify differences in inflammatory response between the HR and C lines.
The inflammatory response is a suite of physiological and behavioral responses that include decreased body mass, increased sickness behavior (e.g. decreased activity, piloerect fur, panting; Table 1), increased hematocrit, increased inflammatory cytokine levels, and increased spleen and liver masses (Bauss et al., 1987; Hart, 1988; Gabay and Kushner, 1999). Consequently, we measured the inflammatory response in multiple ways that would most broadly represent the suite of characteristics of the inflammatory response.
We hypothesized that HR mice would exhibit a reduced inflammatory response because they have twice as much circulating CORT under baseline conditions. If the HR mice did exhibit such a reduction in inflammatory response, then changes in indicators of an inflammatory response would be expected to be lower than in C mice. Thus, when injected with LPS, HR mice should exhibit a smaller reduction in body mass, less sickness behavior, a smaller hematocrit reduction, lower circulating levels of cytokines, and smaller increases in spleen and liver mass than C mice. We quantified sickness behavior in two complementary ways: an observational behavioral assay (Table 1) and subsequent voluntary wheel running. We also quantified CORT levels in a later generation of mice to ensure that they were still higher in HR than in C mice, as expected from previous studies (Girard and Garland, 2002; Malisch et al., 2007; Malisch et al., 2008).
MATERIALS AND METHODS
Study animals
The mice used in the immune portion of this study were randomly sampled from generation 55 of an ongoing selection experiment for increased voluntary wheel running. A full description of the selection experiment is provided elsewhere (Swallow et al., 1998), and we give only a brief summary here. Overviews of the characteristics of HR mice can be found in previous publications (Rhodes et al., 2005; Garland et al., 2011a; Garland et al., 2011b).
Outbred Hsd:ICR house mice were used as the founder population (Swallow et al., 1998). Mice were assigned to one of four control lines (no selection) or one of four lines bred for high voluntary wheel running (HR mice). The selection criterion was the number of revolutions run on days 5 and 6 of a 6 day period during which mice had access to running wheels (Wahman-type activity wheels, 1.12 m circumference) at ∼7–9 weeks of age. For the selected HR lines, the male and female from each family that ran the most were chosen as breeders. For control lines, a random male and female from each family were chosen as breeders. Within lines, male and female pairings were random with the restriction that siblings were not mated to each other. In each generation, 10 pairs of mice (families) were used to propagate the next generation of each line.
For this study, 18 adult females from each of the eight lines were randomly chosen at weaning (21 days of age). However, in one control line (lab designation line 1) only 12 females were available because of low birth rates in that generation. We used only females in this experiment to avoid possible effects of sex and also because females run more than males and have higher circulating CORT levels (Malisch et al., 2007). On average, mice were 10 weeks old at the start of the procedures; however, some line 1 mice were only 6 weeks old because of breeding difficulties in the prior generation. At all times, mice were maintained on a 12 h light:12 h dark cycle with lights on at 07:00 h, and they were provided with food (Harlan Teklad Rodent Diets 8604, Madison, WI, USA) and water ad libitum (except during behavioral observations, detailed below). Beginning at weaning, 2–4 mice were housed per cage. All procedures were approved by the Institutional Animal Care and Use Committee of the University of California Riverside, which is accredited by the Association for Assessment and Accreditation of Laboratory Animal Care (AUP A-20080018), and complied with US laws.
Experimental design
Mice from each line were equally divided among three treatments: LPS, sham and baseline. LPS mice received an i.p. injection of 60 μg of LPS (from Escherichia coli 0111:B4, product no. L4391, Sigma-Aldrich, St Louis, MO, USA) diluted in 0.9% sodium chloride solution (product no. S8776, Sigma-Aldrich) in a total volume of 0.1 ml. A dose of 60 μg was determined by a pilot study. At this dose, all injected individuals showed sickness behavior (Table 1) (Hart, 1988), but no mortality. Dose was not adjusted for body mass because the pilot study indicated mice representing the typical range of body masses for each line would all exhibit sickness behavior when given a dose of 60 μg. Sham mice were injected with 0.1 ml of 0.9% sodium chloride. Baseline mice received no injection. Six individuals from each line were assigned to each treatment, except for line 1. As only 12 individuals from line 1 were available, each treatment had 4 mice from line 1. Mice were randomly divided among 6 batches of up to 25 mice each, regardless of line or treatment, because behavior (details below) could be measured for a maximum of 25 mice at a time. Batches were stratified by age to minimize any potential effects of age.
General procedure
For each batch, mice were housed individually beginning the night before the experiment started. On the morning of the first experimental day, mice were weighed. LPS mice and sham mice received the appropriate injection at ∼08:00 h. Whether LPS or sham mice were injected first was randomized. Individual cages were then placed in a predetermined, random order for behavioral observations, which started at 08:30 h. Each individual was observed 15 times and behavioral observations were conducted at 15 s intervals, so that each individual was observed either every 6 min or every 6 min and 15 s (depending on the size of the batch). Hence, for batch sizes of 25 mice all observations were completed within 1 h 34 min and batches of 24 were completed within 1 h 30 min. For the last batch of 20 mice, the observer waited between observation rounds so the timing between observations of each individual mouse was 6 min (similar to timing for batches of 24 mice). A single observer (C.J.D.), who was blind to the experimental treatment, performed all observations for all batches. All behaviors performed by a mouse during the time of observation were recorded. Point values for each behavior were determined during a pilot study that used an open-ended behavioral catalog and are described in Table 1. Mice injected with LPS are typically lethargic (Hart, 1988), so negative values were assigned to indicate ‘active’ behaviors whereas positive values indicate ‘sickness’ behavior. LPS also causes piloerection and panting (Hart, 1988), so these behaviors were assigned positive values. Behavior values were summed for the 15 observation points to yield one overall behavior score. A higher behavior score indicates a sicker mouse. Mice did not have access to food or water during the behavioral experiment, but bedding was present.
At ∼10:00 h, mice were anesthetized with isoflurane and blood samples (two samples of 70 μl) were taken retro-orbitally using heparinized microhematocrit tubes (Hoff, 2000). Blood samples were taken approximately 2 h after injection to correspond with expected peak cytokine production (Zanetti et al., 1992). Mice were then weighed and put into standard housing cages with access to Wahman-type activity wheels (see Swallow et al., 1998) for 48 h (13:00 h on injection day to 13:00 h 2 days later). Computers recorded the number of revolutions in 1 min intervals for each wheel. Data were downloaded every 24 h and wheels were checked to ensure freedom of movement. Wheel data collected are described below (see ‘Statistics’).
After 48 h, mice were removed from the wheels and weighed. A second set of blood samples (two samples of 70 μl) was taken as above. Mice were then killed with CO2, and spleen and liver were removed by dissection. Livers were weighed and dried at approximately 58°C for 48 h or until constant mass. Spleens were weighed and then flash frozen in liquid nitrogen and stored at –80°C for later analysis. Previous work found that two HR lines have small triceps surae complexes (mini-muscles) (Garland et al., 2002). Line 3 is fixed for mini-muscles and line 6 is polymorphic for mini-muscles. Because the mini-muscle phenotype is associated with other morphological and physiological traits (e.g. Garland et al., 2002; Houle-Leroy et al., 2003; Syme et al., 2005; Rezende, 2006), triceps surae were also dissected from line 6 mice. Triceps surae were then weighed, dried to a constant mass at approximately 58°C, and then weighed again to determine which individuals had the mini-muscle phenotype by inspection of graphs versus body mass.
Blood analysis
Immediately after collection, blood samples were centrifuged and plasma was frozen for further analysis. Mean hematocrit was calculated for each of the two sampling periods (before and after access to the running wheels). Wang et al. showed that hematocrit is elevated 2 and 3 h after mice are injected with LPS, but then returns to baseline levels by 4 h after injection (Wang et al., 1991). We calculated the difference between the mean hematocrit before and after mice were placed on wheels as a measure of recovery from the LPS injection.
Blood plasma from the pre-running sampling period was analyzed for four inflammatory cytokines: tumor necrosis factor-alpha (TNF-α), interleukin-6 (IL-6), interleukin-1β (IL-1β) and granulocyte-macrophage colony-stimulating factor (GM-CSF). TNF-α, IL-6 and IL-1β are upstream regulators of the inflammatory response (Beutler and Cerami, 1989; Baumann and Gauldie, 1994; Beutler, 2004). TNF-α and IL-1β are pro-inflammatory cytokines, whereas the anti-inflammatory properties of IL-6 actually help regulate the inflammatory response (Baumann and Gauldie, 1994; Xing et al., 1998; Gabay and Kushner, 1999). We measured all three cytokines because there are numerous, redundant pathways for mounting an inflammatory response (e.g. Fantuzzi et al., 1996; Gabay and Kushner, 1999), and it is possible that C and HR mice have evolved to utilize alternative cytokine pathways. GM-CSF up-regulates leukocyte proliferation in red bone marrow, a component of the inflammatory response (Gasson, 1991). Plasma was processed using a Mouse Inflammatory Four-Plex Antibody Bead Kit (Invitrogen Corporation, Camarillo, CA, USA). All protocols recommended by the kit manufacturer were followed, except we used 25 μl of assay dilutes and samples instead of the recommended 50 μl. Single samples were tested and compared with a standard curve for quantification. Cytokines were quantified in pg ml–1 using Luminex multiplex technology (Luminex 200, Luminex Corporation, Austin, TX, USA).
CORT
Males from generation 60 were used. As per usual procedures, mice were weaned at 21 days of age, toe-clipped for identification, and housed randomly in same-sex groups of four. Mice were 6–9 weeks old at the time of blood sampling. The light cycle was maintained at 12 h light:12 h dark (lights on at 07:00 h) and food and water were given ad libitum.
One male mouse from each of 10 families per line was sampled for this study (total N=80). As part of a separate study of the effects of restraint stress (see Malisch et al., 2007), these animals were sampled during the routine selection procedure, with the addition of a seventh day of wheel access. Following the sixth day of wheel access, half of the mice were removed from their cage, subjected to a 40 min restraint stressor (see Malisch et al., 2007), then returned to their cage with wheel access. The other half were left undisturbed.
Blood samples were taken from all 80 mice (20 HR restrained, 20 HR not restrained, 20 C restrained and 20 C not restrained) ∼24 h post-restraint (between 14:30 h and 16:15 h). Blood was acquired from the retro-orbital sinus (Hoff, 2000) under isofluorane sedation (Malisch et al., 2009b). Precautions were taken to minimize stress to the mice before and during the bleeding process, and blood was acquired within 3 min of removal from the cage.
Blood samples were centrifuged (13,300 r.p.m.) at 4°C, then plasma was decanted and maintained at –80°C until assays. Total plasma CORT concentration was determined in duplicate with a commercially available kit following previous protocols (Malisch et al., 2008).
Statistics
Mixed-model nested ANOVA were used to assess variation in the innate immune responses. Hereafter, ‘linetype’ will indicate differences between HR and C mice and ‘innate treatment’ will indicate differences among baseline, sham and LPS treatments. The test of our main hypothesis is the interaction between linetype and innate treatment. For example, if HR mice had suppressed inflammatory responses, then we would expect TNF-α to be higher in C mice injected with LPS than in HR mice injected with LPS. However, we might not expect a difference in TNF-α in baseline or sham-injected mice.
The general model for each mixed model included linetype, innate treatment, the interaction of linetype × innate treatment, and a dummy variable indicating mini-muscle as fixed effects. In the general model, experimental batch, line nested within linetype and the interaction innate treatment × line nested within linetype were specified as random effects. We investigated all individuals with standardized residuals greater than |3| from all analyses and tried to determine why the residual was so large. Based on this analysis, we excluded two mice from all analyses: one mouse from the baseline treatment was excluded because of abnormally low wheel running. One mouse from the sham treatment was excluded because it had an extremely large change in mass indicating that an incorrect datum was recorded. Details of the model used for each response variable are presented below. Mixed models were run in SAS version 9.2 64-bit (SAS Institute Inc., Cary, NC, USA) using Proc Mixed.
Sickness behavior
Higher sickness behavior scores indicated sicker mice. Behavior score was modeled using the general model without alteration. We also investigated whether our behavior score could be predicted from the level of circulating cytokines TNF-α, IL-6, IL-1β and GM-CSF. We ran a separate model for each cytokine, by adding cytokine level as a predictor covariate to the general mixed model. Finally, we ran a principal components analysis (PCA) on the cytokine values and added the first principal component as a predictor covariate to the general mixed model if a linear combination describing the overall profile of circulating cytokines predicted sickness behavior.
Body mass
Body mass was measured twice, 2 h after injection (initial body mass) and after 48 h of wheel running (final body mass); both were used as response variables. The residuals for initial and final body mass we not normally distributed, so we log-transformed these data. Change in body mass, defined as initial body mass minus final body mass, was also calculated. Because change in body mass could be affected by both initial body mass and wheel running, we tested statistical models that included both covariates. They did not reveal anything different about linetype, innate treatment or the interaction between linetype and innate treatment, so we only present results from the general model.
Hematocrit
We measured hematocrit twice, 2 h after injection and after 48 h of wheel running. We also analyzed the change in hematocrit, defined as hematocrit after wheel running minus hematocrit 2 h after injection. These response variables were analyzed with the unaltered general model.
Organ masses
The statistical models for dry liver and wet spleen included the covariate log-transformed final body mass. The residuals for raw dry liver mass and wet spleen mass were not normally distributed; thus, we log transformed these data.
Cytokines
Raw and log-transformed TNF-α, IL-6, IL-1β and GM-CSF (pg ml–1) exhibited large differences in variance among the innate treatments (i.e. baseline, sham and LPS treatments); thus, these data were rank transformed to equalize variance (Conover and Iman, 1981; Potvin and Roff, 1993). We removed one sham treatment mouse from all of the cytokine analyses because it had an extremely high IL-1β measurement indicating an anomaly in the cytokine measurements. We also performed a PCA on the cytokine data and used the first principal component as a response variable to determine whether the overall cytokine profile was different between the HR and C mice. The principal component scores had unequal variance between the HR and C mice, and so we rank transformed these values for analysis. We used two complementary approaches to test for cytokine differences between the HR and C mice. First, we used the mixed model approach described above. Four mice in the IL-1β data set and six mice in the IL-6 data set had standardized residuals greater than |3|. These mice were all from the LPS treatment and represent variation in the cytokine so we left these individuals in the analysis. Second, we modeled each innate treatment (LPS, sham or baseline) separately. In selection treatments, the line is the unit of replication, not the individual (Henderson, 1989; Garland and Rose, 2009); thus, selection experiments often have relatively low power to detect differences between selection treatments. In addition, ANOVA may have relatively low power to detect interaction effects (Wahlsten, 1990). Therefore, with this second analysis, we attempted to increase our power by testing for differences between C and HR mice within each innate treatment. For these analyses we also used mixed models, but the fixed effects were linetype and a dummy variable indicating mini-muscle; the random effects were experimental batch and line nested within linetype. The results for the second statistical method were similar for all of the cytokines, so we present only results for rank TNF-α to represent the second method.
Wheel running
We analyzed all four measures of wheel running for two time periods: day 1 (first 24 h) alone, day 2 (second 24 h) alone. Distance run was calculated as the number of revolutions each mouse ran on the wheel. Distance run data were square-root transformed for both days to normalize the residuals. Running duration was the number of 1 min intervals during which an individual tallied at least one revolution, and data from both days were square-root transformed to normalized residuals. Mean speed (revolutions min–1) was calculated as the total number of revolutions divided by the number of active intervals. Mean speed data for both days were square-root transformed to normalize residuals. Maximum speed was calculated as the maximum number of revolutions in a 1 min interval, and was not transformed for either day. Before mice were put on wheels, we determined how much the wheel rotated after acceleration to a constant velocity. This measure of freeness was square-root transformed to normalize its distribution, and then used as a covariate in all the analyses of wheel running.
CORT levels
The effect of linetype, restraint and their interaction were tested with a mixed model in SAS Proc Mixed, each with 1 and 6 d.f. Covariates of age, time of day and bleed delay time were included.
RESULTS
As expected, sickness behavior (Fig. 1) was greater in LPS mice than in sham and baseline mice (Table 1; F2,12=47.3, P<0.001), but did not significantly differ between HR and C mice (supplementary material Table S1; F1,6=3.23, P=0.122). Unexpectedly, sickness behavior was greater in mice with the mini-muscle phenotype than in those with normal muscles (supplementary material Fig. S1A, Table S1; F1,106=14.6, P<0.001). TNF-α (F1,104=1.18, P=0.280) and the other cytokine measurements (including the principal component scores indicating overall cytokine profile; results not shown) were not significant predictors of sickness behavior at the level of individual variation in the ANCOVA models.
Initial body mass did not significantly differ among LPS, sham and baseline mice (F2,12=0.36, P=0.707, Fig. 2A). However, body mass after wheel running was lower in LPS-treated mice than in sham and baseline mice (F2,12=13.7, P<0.001, Fig. 2B). Thus, LPS-treated mice lost more mass than sham and baseline mice during the 48 h between mass measurements (F2,12=48.7, P<0.001, Fig. 2C). Initial body mass, final body mass and change in body mass did not differ significantly between HR and C mice or between mice with and without mini-muscles (supplementary material Table S1).
Two hours after injection, LPS-treated mice had higher hematocrit than sham or baseline mice (F2,12=35.1, P<0.001, Fig. 3A). After wheel running, there was no difference in hematocrit among these innate treatment groups (F2,12=0.74, P=0.497, Fig. 3B). Thus, between the first and second measurements, hematocrit decreased more for the LPS-treated mice than for the sham and baseline mice (F2,12=26.8, P<0.001, Fig. 3C). None of the hematocrit variables were significantly different between HR and C mice or between mice with and without mini-muscles (supplementary material Table S1).
The log body mass was a strong predictor of both log dry liver mass (β=1.27±0.09; F1,105=208.2, P=<0.001) and log wet spleen mass (β=1.27±0.13; F1,105=96.0, P<0.001). However, log dry liver mass and log wet spleen mass were greater in LPS mice (F2,12=22.24, P<0.001, Fig. 4A; F2,12=25.8, P<0.001, Fig. 4B, respectively). On average, mini-muscle mice had heavier livers (log dry mass, F1,105=13.12, P<0.001, supplementary material Fig. S1B) and spleens (log wet mass, F1,105=8.14, P=0.005, supplementary material Fig. S1C) than those with normal muscles. Neither organ mass differed significantly between HR and C mice (supplementary material Table S1).
LPS-treated mice had significantly higher cytokine levels than other mice (rank TNF-α: F2,12=170.3, P<0.001; rank GM-CSF: F2,12=75.0, P<0.001; rank IL-1β: F2,12=158.0, P<0.001; rank IL-6: F2,12=301.4, P<0.001, Fig. 5). Cytokine levels did not differ between HR and C mice or between mice with and without mini-muscles (supplementary material Table S1). When rank TNF-α of LPS, sham and baseline mice were examined separately, there was no significant difference between HR and C mice or between mice with and without mini-muscles (supplementary material Table S2). Results for all of the other cytokines were similar to those for rank TNF-α (not shown). The overall cytokine profile (i.e. the first principal component from a PCA of all of the cytokine data) had a quantitatively similar pattern to each individual cytokine (results not shown).
As shown in Fig. 6, mice from the HR lines ran significantly farther than those from control lines, irrespective of innate treatment, mainly because the former ran at higher speeds (see supplementary material Table S1 for statistical results). The interaction between innate treatment and linetype was significant for running duration on day 1 (F2,12=4.87, P=0.028), Aside from this, the interaction term was never statistically significant for any response variable (supplementary material Table S1).
HR mice (least squares mean ± s.e.m., 36.5±2.93 ng ml–1) had statistically higher plasma CORT concentrations (2-tailed, P=0.0158) than C mice (22.6±2.86 ng ml–1), with no statistical effect of restraint (2-tailed, P=0.1451), no interaction between linetype and restraint (P=0.2742), and no significant effect of any covariate (all 2-tailed, P>0.17).
DISCUSSION
As expected, mice injected with LPS exhibited classic signs of an inflammatory response, including sickness behavior, weight loss, increased hematocrit (2 h after injection), heavier livers and spleens, increased cytokine levels and decreased activity (voluntary wheel-running activity) (supplementary material Table S1; Figs 1, 2, 3, 4, 5, 6). Sham-treated and non-manipulated ‘baseline’ mice showed no statistical differences from each other (results not shown). Hence, changes in physiology and behavior in the LPS-injected mice were attributable to an inflammatory response caused by the LPS, not simply the process of injection.
CORT has anti-inflammatory properties and can inhibit the synthesis and release of IL-1β, IL-6, TNF-α and GM-CSF (Munck and Náray-Fejes-Tóth, 1992; Sapolsky et al., 2000). Because HR mice have a baseline circulating CORT concentration approximately twice that of C mice (Malisch et al., 2007), we hypothesized that HR mice would have a reduced inflammatory response. The key statistic to test our hypothesis is the interaction between linetype and innate treatment. Sham and baseline mice were not immune challenged and differences between HR and C mice for these treatments were not expected. In contrast, given their increased CORT levels, we predicted that in response to injection with LPS, HR mice would have a weaker inflammatory response than C mice. Consequently, we predicted differences between HR and C mice only in the LPS-injected group. If these predictions were supported by the data, then a statistically significant interaction between linetype and innate treatment was expected.
Of the 21 response variables, one (running duration on day 1) had a significant (P<0.05) interaction and three (body mass, log-transformed wet spleen mass and wheel running on day 1) had marginally significant (P<0.1) interactions (supplementary material Table S1). These results could be interpreted as suggesting that our experiment lacked sufficient statistical power to detect a difference in inflammatory responses of HR and C mice. However, we think this is unlikely given the clearly non-significant interaction terms for the other 17 response variables, particularly the cytokine results (Fig. 5). Because they are upstream regulators of inflammatory response, the cytokine data are our best indicator of the strength of that response (Beutler and Cerami, 1989; Baumann and Gauldie, 1994; Beutler, 2004), and the interaction term is not significant for these data (supplementary material Table S1). Furthermore, no difference existed between the C and HR mice when each innate treatment (baseline, sham and LPS) was investigated individually (supplementary material Table S2). Finally, type 1 error due to the large number of response variables could explain the single significant interaction we found for running duration on day 1. Thus, we conclude that inflammatory response does not significantly differ between HR and C mice.
The immune system has many redundant pathways for achieving the same end result (Suffredini et al., 1999; Parkin and Cohen, 2001), so it is possible that HR and C mice could have utilized different cytokine cascades for mounting an inflammatory response. However, we did not observe any differences in cytokine values between HR and C mice (supplementary material Tables S1, S2).
Our results are consistent with a previous study showing that, overall, HR and C mice did not differ in another measure of immune function – their ability to clear a parasitic nematode from the gut (Malisch et al., 2009b). Thus, HR mice do not have an impaired ability to clear micro- or macro-parasites. However, the immune system is complicated and multifaceted, with both innate and adaptive components, and its response depends on the specific type of infectious agent or parasite (Medzhitov, 2007). Further, one aspect of the immune system might be impaired while other aspects are enhanced to maintain overall immune function; therefore, it is important to study numerous aspects of the immune system (Norris and Evans, 2000; Matson et al., 2006). In this experiment, we tested only the difference in inflammatory response, a component of innate immune function. In contrast, Malisch et al. (Malisch et al., 2009b) implicitly tested both innate and adaptive responses, as both are required to clear nematodes from the gut (Maizels and Yazdanbakhsh, 2003; Anthony et al., 2007). Differences in adaptive immunity would result in differences between HR and C mice in their ability to clear parasites, but no difference in inflammatory response – i.e. we would expect a significant difference between HR and C mice for the study by Malisch et al. (Malisch et al., 2009b) but not for the present study. However, without testing adaptive immune function directly, we cannot rule out this explanation, so adaptive immune function should be tested directly in future studies of these mice.
When considering the results of Malisch et al. (Malisch et al., 2009b) with our present results, neither study suggests that evolved differences in baseline CORT lead to suppression of the immune response in HR mice as had been hypothesized. Instead, they suggest two potential scenarios regarding how selective breeding for voluntary wheel running has affected immunity in HR mice.
First, CORT levels may not have been elevated in the particular sample of HR mice we studied from generation 55, but we consider this explanation unlikely. The difference in baseline CORT levels between the HR and C mice was first detected in generation 21 (Girard and Garland, 2002), and it was reconfirmed in generations 35 and 39 (Malisch et al., 2007; Malisch et al., 2008), as well as in generation 60 (this experiment).
Second, the effects of evolved differences in baseline CORT levels may be very different from the effects caused by environmental factors that alter baseline CORT concentrations during the course of an individual’s life. The difference in circulating CORT levels between HR and C represents a correlated response to artificial selection on running behavior (Girard and Garland, 2002; Malisch et al., 2007; Malisch et al., 2008). Because the endocrine system does not evolve in isolation from other organ systems, the relationships among CORT, voluntary exercise and immune function may be different in mice bred for high voluntary exercise versus those responding to a shorter-term stressor. For example, CORT is often released when a system’s homeostasis is altered and/or when energy demands increase, such as during exercise (Chrousos and Gold, 1992; Stranahan et al., 2008). CORT released during exercise triggers a response that restores the energy balance (Girard and Garland, 2002; Droste et al., 2003; Stranahan et al., 2008). However, artificial selection for increased exercise may pose a threat to homeostasis because of the co-evolved increase in CORT level. If the new baseline level of CORT was high enough to initiate downstream responses to stress, then HR mice would exhibit chronic stress. In response, the set point for the level of circulating CORT required to trigger a stress response might have evolved in kind to a higher level. Under this scenario, the twofold increase in circulating CORT seen in HR mice would not cause a stress response and would not suppress immune function. In summary, our results suggest that there may be a fundamental difference in the downstream consequences of high CORT when the increase is evolved versus caused by short-term stress. Our results are also consistent with the hypothesis developed for testosterone that evolved differences in hormone levels will not always lead to negative pleiotropic effects (Hau, 2007). Thus, patterns regarding how hormones alter immune function might be more complicated than originally hypothesized (Fuxjager et al., 2011).
Mini-muscle mice
Mice with mini-muscles exhibited greater sickness behavior (supplementary material Fig. S1A) and had larger spleens and livers (Fig. 2B,C) than those with normal triceps surae muscles, suggesting an elevated inflammatory response in the former. However, the cytokine data conflict with that interpretation because levels did not differ between mini-muscle and normal mice (supplementary material Table S1).
One possible explanation is that the differences in sickness behavior and organ masses of mini-muscle mice versus others might be a consequence of the overall phenotype of the mini-muscle individuals. Mini-muscle mice differ phenotypically in many ways from mice with normal muscles (Garland et al., 2002; Swallow et al., 2005; Meek et al., 2009). For example, they have larger mass-corrected spleens and livers than normal muscle mice under baseline conditions (Garland et al., 2002; Swallow et al., 2005; Meek et al., 2009). Thus, if the inflammatory response causes an absolute increase in liver and spleen masses, then mini-muscle mice will still have larger mass-corrected organs than normal muscle mice and appear to be more sick based on organ mass alone.
However, the same logic cannot apply to sickness behavior. Sickness behavior includes many measures of activity (see body position and activities data in Table 1), and lower sickness scores are associated with higher physical activity. Previous studies have shown that mini-muscle mice sometimes run farther on wheels than normal mice (Syme et al., 2005; Hannon et al., 2008) and that HR mice in general have higher home-cage activity than C mice when housed without wheels (Malisch et al., 2009a). If the same correlation is true for mini-muscle mice, then running farther might be correlated with higher general activity in mini-muscle mice. Thus, if mini-muscle mice decrease their activity the same absolute amount as mice with normal triceps surae, they will still be more active and their higher activity could result in lower sickness behavior scores. In fact, mini-muscle mice had higher sickness behavior scores than those with normal muscles (supplementary material Fig. S1A). Although mini-muscle mice sometimes run farther on running wheels than those with normal muscles (Syme et al., 2005; Hannon et al., 2008), this may not translate into greater activity in home cages. Thus, variation in home cage activity cannot easily explain the difference in sickness behavior scores between mini-muscle and normal-muscle mice.
Alternatively, mini-muscle mice might differ from normal mice in the density or activity of cytokine receptors, or of other molecules associated with the inflammatory response. For example, mice deficient in TNF-α receptor and mice with mutations in Toll-like receptor 4 are less sensitive to the lethal effects of large doses of LPS (Pfeffer et al., 1993; Qureshi et al., 1999). Thus, it is also possible that the receptors for inflammatory cytokines are less abundant in mini-muscle mice and they exhibit increased sickness behavior even though they do not have elevated cytokine levels.
CONCLUSIONS
When comparing circulating levels of ‘stress hormones’ among species and among individuals, researchers commonly assume that higher levels have greater downstream effects, such as reducing immune function (see Martin et al., 2006). Our study and that of Malisch et al. (Malisch et al., 2009b) indicate that an observation of elevated baseline circulating CORT levels is not sufficient to make accurate predictions about the suppression of immune function. Selective breeding for voluntary wheel running in mice apparently has not altered the inflammatory response, despite the more than twofold higher circulating CORT levels in HR mice. Thus, evolved increases in baseline levels of CORT may not have the same negative immunosuppressive effects as the elevation of CORT caused by stress within an individual’s lifetime. These results have important implications for comparative studies examining hormonal differences among species and understanding the consequences of variation in circulating CORT levels that occur during an individual’s lifetime versus across generations on an evolutionary timescale.
Acknowledgements
We thank K. Hunter and V. Lombardi for their help with immune techniques and lab space. We also thank members of the Garland lab for their help with the mouse colony.
FOOTNOTES
FUNDING
This study was supported by the National Science Foundation [grant nos IOB-0543429 and IOS-1121273 to T.G.; IOS-0344994 to J.H.], and a grant from the Ecology, Evolution, and Conservation Biology Program at University of Nevada, Reno to C.J.D. H.S. was partially supported by a Chancellor’s Postdoctoral Fellowship from University of California, Riverside.