ABSTRACT
Body temperature (Tb) affects animal function through its influence on rates of biochemical and biophysical reactions, the molecular structures of proteins and tissues, and, ultimately, organismal performance. Despite its importance in driving physiological processes, there are few data on how much variation in Tb exists within populations of organisms, and whether this variation consistently differs among individuals over time (i.e. repeatability of a trait). Here, using thermal radio-frequency identification implants, we quantified the repeatability of Tb, both in the context of a fixed average environment (∼21°C) and across ambient temperatures (6–31°C), in a free-living population of tree swallows (Tachycineta bicolor, n=16). By experimentally trimming the ventral plumage of a subset of female swallows (n=8), we also asked whether the repeatability of Tb is influenced by the capacity to dissipate body heat. We found that both female and male tree swallow Tb was repeatable at 21°C (R=0.89–92), but female Tb was less repeatable than male Tb across ambient temperature (Rfemale=0.10, Rmale=0.58), which may be due to differences in parental investment. Trimmed birds had on average lower Tb than control birds (by ∼0.5°C), but the repeatability of female Tb did not differ as a function of heat dissipation capacity. This suggests that trimmed individuals adjusted their Tb to account for the effects of heat loss on Tb. Our study provides a first critical step toward understanding whether Tb is responsive to natural selection, and for predicting how animal populations will respond to climatic warming.
INTRODUCTION
Reaction norms can be used to simultaneously quantify both the level of phenotypic flexibility across an environmental gradient (i.e. reaction norm slope) and the phenotypic expression of a trait in the population-average environment (i.e. reaction norm intercept). Reaction norm components of several flexible physiological traits have been quantified, many of which are related to energy metabolism (e.g. basal metabolic rate, glucocorticoids: reviewed in Biro and Stamps, 2010) and immunity (e.g. Schreier and Grindstaff, 2020; Tieleman et al., 2010). However, in endotherms, individual variation in the phenotypic flexibility of body temperature (Tb) has been understudied by evolutionary physiologists, despite its importance in shaping organismal performance (Angilletta et al., 2010). Tb exhibits substantial plasticity in endotherms (Boyles et al., 2013), as evidenced by adjustments in Tb in response to different environmental challenges (e.g. food availability, thermal stress), activity level, daily fluctuations in rhythmicity (e.g. circadian and ultradian rhythms) and regional heterothermy within individuals as a consequence of heat flow across the body (Angilletta et al., 2010). Yet, studies investigating variation in Tb (and other thermoregulatory traits) have primarily focused on differences among species and populations (e.g. Noakes and McKechnie, 2019; Noakes and McKechnie, 2020; Smit et al., 2013), or among groups within populations (e.g. age and sex: Cai et al., 2016; Gagnon and Kenny, 2012; Andreasson et al., 2020a,b) rather than among individuals within a single population (e.g. Pessato et al., 2020; Bozinovic, 2007; Møller, 2010).
Evidence from laboratory studies, both on wild-caught and lab-raised individuals, indicates that among-individual variation in Tb exists at the individual level, and may be related to differences in metabolic traits that vary across environments (Briga and Verhulst, 2017). For instance, in birds, individuals with higher basal metabolic rate (a trait with reported genetic basis: Rønning, et al., 2007) also maintain higher Tb during exposure to temperatures below thermoneutrality (Briga and Verhulst, 2017; Stager et al., 2020), and in mice, individual consistency in Tb variation predicts torpor use (Nespolo et al., 2003; Boratyński et al., 2019). Further, variation among individuals in resting and basal metabolic rate has been widely reported in laboratory and free-ranging studies (e.g. Boratyński et al., 2017; Broggi et al., 2009; Hayes et al., 1998; Nespolo and Franco, 2007; Rønning et al., 2005; White et al., 2013). Finally, genetic variation in Tb has been reported in both domestic and laboratory species, including chickens (h2 range, 0.10–0.20: Li et al., 2020; van Goor et al., 2015), dairy cattle (h2=0.17: Dikmen et al., 2012), sows (h2=0.35: Gourdine et al., 2017) and mice (h2=0.36: Connolly and Lynch, 1981).
While there is evidence demonstrating consistent individual differences in thermoregulation in laboratory and domestic animals, far fewer studies have formally examined among-individual variation in Tb in free-ranging species (e.g. Bozinovic, 2007; Dammhahn et al., 2017; Møller, 2010; Nespolo et al., 2003, 2010; Pessato et al., 2020). Free-living animals often experience multiple environmental challenges simultaneously, and the degree of individual consistency and plasticity in Tb might be adjusted differently depending on the environmental context, physiological condition and life-history strategy of the individual (e.g. Cornelius et al., 2017). For example, metabolic rate in free-living animals has been reported to be less repeatable than in laboratory animals, which may be a consequence of how animals differ in their response to the same conditions over time (Auer et al., 2016). Further, exposure to certain environmental conditions during development, such as high environmental temperatures, may induce epigenetic changes in aspects of thermoregulatory physiology (see Ruuskanen et al., 2021, for a review), such as heat shock protein regulation (Kisliouk et al., 2017) and morphology (Burness et al., 2013; see Nord and Giroud, 2020, for a review). Given, that wild animals experience fluctuating environmental conditions both intra- and inter-annually, there may be greater variation in the magnitude and type of epigenetic and parental effects experienced by wild versus laboratory animals.
To understand the degree to which Tb consistently varies among free-living birds, we examined the repeatability of Tb in tree swallows, Tachycineta bicolor, across differing ambient temperatures (Ta). We asked: (1) do birds consistently differ from each other in their average Tb (i.e. is Tb repeatable at the reaction norm intercept)?; and (2) do birds consistently differ from each other in their Tb response across ambient temperature Ta (i.e. is the reaction norm slope repeatable)? Here, we used previously published data from our research group (Tapper et al., 2020a,b) to examine repeatability in swallows.
Because inter-specific variation exists among thermoregulatory traits (Pessato et al., 2020; Stager et al., 2020), we predicted that in both female and male swallows, Tb would be repeatable, both in terms of the average trait expression (i.e. intercept) and how individuals vary Tb across Ta (i.e. slope). However, because female tree swallows have a brood patch, whereas males do not, there may be differences in heat dissipation capacity between the sexes (Nord and Nilsson, 2019). Therefore, we may expect males to forage closer to their maximally allowable Tb relative to females, which could lead to differences in repeatability between the sexes. This could occur if, for example, the upper threshold on Tb does not vary between individuals for biochemical reasons. Alternatively, if both sexes reduce their activity level at high temperatures, meaning that hyperthermia may be a limiting factor, there may be similar repeatability in Tb between male and female swallows.
In our previous studies (Tapper et al., 2020a,b), we experimentally removed the ventral feathers from a sample of breeding females to increase heat dissipation capacity (i.e. trimmed treatment). Although testing for a treatment effect on Tb repeatability was not an explicit goal of this study, previously we found that control birds had higher Tb than trimmed birds (by ∼0.5°C) across temperature (6–31°C) (Tapper et al., 2020b), suggesting that differences in heat dissipation capacity could lead to differences in the repeatability of Tb. Following the same reasoning as above, control and trimmed birds may differ in Tb repeatability if control birds forage relatively closer to their maximally allowable Tb, and if the upper threshold on Tb does not vary between individuals. Alternatively, Tb repeatability may be similar between experimental treatments if individuals tend to adjust their activity level to avoid overheating (Tapper et al., 2020b). In this case, the effects of heat dissipation capacity on Tb should be outweighed by behavioural (or physiological) differences among individuals that directly relate to individual control of Tb.
MATERIALS AND METHODS
Field methods
Our dataset is derived from Tapper et al. (2020b), in which a detailed description of our study sites, field methods and reproductive parameter definitions can be found. However, given that environmental variables such as wind speed, relative humidity and precipitation could affect individual variation in thermoregulation, we present these summary statistics for our study sites here (Table 1). We performed this study in May–July 2018, on two nest-box breeding populations of tree swallows, Tachycineta bicolor (Vieillot 1808), in Peterborough, ON, Canada. We captured female tree swallows during incubation (day 7–10, n=16) and, when possible, their respective male partners during early nestling provisioning (day 4–6, n=10). Upon capture, we implanted individuals with a thermal-sensitive passive integrated transponder (PIT) tag (accuracy ±0.5°C, Bio-Therm13, Biomark, Boise, IA, USA) subcutaneously into the nape of the neck to remotely monitor Tb. PIT tags can shift position post-implantation and if such shifts occurred in our study, this could change among-individual differences in Tb (i.e. increase or decrease repeatability estimates). This will depend on whether the tag was measuring Tb from different body regions within the same individual at different times. However, several studies have implanted PIT tags in the nape and reported minimal issues with respect to tag movement (e.g. Nord et al., 2013, 2016; Oswald et al., 2018), and thus variation in subcutaneous Tb from PIT tag placement seems unlikely. Similarly, in the unlikely event that a tag was placed too close to the skin surface, estimates of Tb could be underestimated as a result of the influence of environmental temperature. Finally, repeatability could be affected by differences in individual tag accuracy, but these differences are likely to be small, as reported in other studies (e.g. 0.2°C: O'Connor et al., 2021). We read PIT tags with Biomark HPR Plus readers (with the minimum time between consecutive reads of the same tag set to 10 s: Tapper et al., 2020b), which we connected to a loop antenna and positioned so that they encircled the nest box entrance.
We recaptured individual female swallows during early nestling provisioning (range: day 1–2 post-hatch) and performed our experimental manipulation (experimental trimming versus handling) to allow for increased heat loss. We assigned females to each treatment by flipping a coin between ‘pairs’ of birds, such that the first female captured received one treatment, while the second female captured received the alternative treatment. We repeated this process for each new dyad. In the case of individuals that had been experimentally manipulated in the previous year, they were typically assigned to the opposite treatment from that which they had experienced previously. Therefore, while we assigned pairs of females to each treatment based on a coin flip, this was not always possible, and on occasion we had to selectively alternate between treatment types. For each female, we either trimmed the contour and downy feathers covering the brood patch to expose the bare skin underneath (∼7% of the surface area of the bird; trimmed treatment) or ascribed them a control condition where they were handled but released with their feathers intact. All research was approved by the Trent University Animal Care Committee, in accordance with the Canadian Council on Animal Care (AUP no. 24747).
Data compilation, organization and statistical analyses
For data compilation and statistical analyses, we used R (version 4.0, R Core Team 2020; http://www.R-project.org/). To maximize our sample size (i.e. assess the repeatability of Tb under a broad range of Ta), we included all Tb measurements made between 2 and 16 days post-hatch, and between 05:00 h and 21:00 h (when swallows are diurnally active). We attempted to record individuals for 24 h at least 3 times throughout the nestling period, once between the periods of day 2–5, day 6–10 and day 11–16. While most individuals had balanced data across the nestling period (Fig. S1A), the Tb data of some individuals covered a wider range of Ta (6–31°C) than others (Fig. S1B). To control for differences in the number of Tb measurements among birds, and to minimize bias in our estimates of individual Tb variance, we calculated the weighted average hourly Tb (°C) for each bird. To do this, we used maximum likelihood to approximate the distribution of the number of reads within a single nest visit (gamma distribution, shape=0.56, scale=37.58) and the number of visits within an hour (normalized between 0 and 1, beta distribution, α=1.02, β=4.63). Approximating the distributions helped to smooth the probabilities and therefore avoid giving a certain number of reads (within a visit) or visits (within an hour) too much or too little weight. The mean number of reads per bird across females and males was 4984 (range 1559–15,917) and 1247 (range 393–3427), respectively. The mean number of visits per bird across females and males was 626 (range 198–1855) and 544 (range 193–1460), respectively. As in Tapper et al. (2020b), we excluded data from one bird that had abnormally low Tb (∼2°C) between 05:00 h and 13:00 h on one day relative to its Tb between 05:00 h and 13:00 h on all other days (i.e. 13% of all observations for that bird).
To test whether Tb was repeatable in our tree swallows, we used Bayesian hierarchical mixed effects models, constructed using the package ‘brms’ (version 2.14, Bürkner, 2017). First, we quantified the repeatability and phenotypic flexibility of Tb of each sex, based on control bird data only. While removing the trimmed birds reduced our sample size from 29 (nfemale=16, nmale=13) to 16 individuals (nfemale=8, nmale=8), any differences in variance between treatments may have been confounded with sex differences on Tb if trimmed birds were included in the analysis. Thus, removing trimmed individuals increases the interpretability of our results.
In our model, we used a ‘heterogeneous error structure’, partitioning the different variance components (i.e. intercept, slope, intercept–slope co-variance, and residual error terms), into female (n=8) and male (n=8) groups, respectively. Because estimates of variance in reaction norm components (i.e. intercept and slope) differ depending on whether potential confounding factors are controlled for in the analysis, we included several covariates (see below) in our model to calculate the adjusted repeatability (Nakagawa and Schielzeth, 2010). We also mean-centred and standardized all covariates to two standard deviation units (Gelman, 2008). Our response variable was therefore mean hourly Tb, and our fixed predictors were treatment, Ta, lay date, brood size, nestling provisioning rate, and a ‘sex×Ta’ interaction. We note that in our preceding study on the effects of treatment on Tb (Tapper et al., 2020b), we modelled Tb as a polynomial function of Ta. Here, we chose to model Tb as a linear function of Ta to increase interpretability of the reaction norm components (both intercepts and slopes) and to avoid overfitting our models (as we had insufficient data to properly model non-linear relationships per bird). We also ran our models with wind speed and relative humidity included as fixed effects (we did not have hourly precipitation data), but the results, and consequently our conclusions, remained similar to the model with these factors excluded. Therefore, we excluded these variables from the final analysis to reduce the number of model terms in our analysis. To determine the degree of phenotypic flexibility in Tb in response to changes in Ta, we also included random slopes for individual identity by Ta. To compare the degree of phenotypic flexibility in Tb between treatments, we ran a Levene's test on the slope estimates from the model. Lastly, we interpreted differences between sexes in the mean repeatability intercept (Rintercept) and slope (Rslope) using Bayes factors (K), as derived from one-way hypothesis tests (calculated with the Savage–Dickey density ratio method) in ‘brms’. To keep our tests conservative, we asked whether the sexes differed from each other in Rintercept and Rslope by a repeatability of >0.1, or 10% (e.g. Rintercept trimmed−Rintercept control>0.1), meaning that we considered a difference in R≤0.1 as practically equivalent to 0 (see Kruschke, 2018, for defining regions of practical equivalence). Support for a difference in repeatability would be given by K>1, with larger K values indicating stronger support and smaller K values weaker support (see Jeffreys, 1961, and Lee and Wagenmakers, 2013, for a guideline on interpretation of Bayes factors). While the choice of 0.1 is arbitrary, repeatability estimates from other avian physiological traits (e.g. basal metabolic rate, glucocorticoid levels) have been reported to cluster between 0.1 and 0.4 (Auer et al., 2016; Schoenemann and Bonier, 2018), and we therefore based our boundary on the low end of the repeatability estimates reported in the literature. Our priors for the intercept repeatabilities were beta distributed (α=12, β=3) for both sexes, and for female and male slope repeatabilities they were beta distributed (female, α=1, β=4; male, α=4, β=3).
Next, we quantified the repeatability and phenotypic flexibility of Tb in females (ncontrol=8, ntrimmed=8) to determine whether heat dissipation capacity (i.e. experimental trimming treatment) affected among-individual variation in Tb. Our second model was structured similarly to our first, except that population-level effects of sex and sex×Ta were replaced with the population-level effects of treatment and treatment×Ta. Additionally, instead of partitioning variance into female and male groups, we partitioned the variance components into control and trimmed groups, respectively. As above, we compared the phenotypic flexibility in Tb between treatments by running a Levene's test on the slope estimates from the model. We also tested for differences in the mean repeatability intercept and slope, with a statistical difference between groups in repeatability as >0.1. For both treatments, priors were beta distributed for the intercept (α=12, β=3) and slope (α=1, β=4).
For both the ‘sex’ and ‘treatment’ model, we used informative prior distributions for the population intercept (i.e. Tb, °C), Ta and feeding rate coefficients, which were based on model estimates in Tapper et al. (2020b). We used gamma distributions for the population intercepts (shape=84, scale=2; producing a peak density of ∼42) and Ta coefficients (shape=1.2, scale=0.75; producing a peak density of ∼0.3) to fix Tb, Ta and feeding rate estimates above 0. For feeding rate and treatment, we used normal distributions (mean=0 and s.d.=1) and for the random effect terms, we used gamma distributions (shape=1.75, scale=0.75; yielding a peak density of ∼0.5). For the remaining model parameters (i.e. population-level fixed effects and residual error terms), we used the ‘brms’ default non-informative priors, which follow Student's t distributions (with degrees of freedom=3, location=0, scale factor=2.5). We corrected for autocorrelation between adjacent Tb measurements by adding an AR1 correlation structure (ρ=0.64). We ran models for 10,000 iterations, discarding the first 1000 iterations as burn-in, and sampling every 10 iterations. for all model parameters fell between 0.99 and 1.01, and the effective sample size was greater than 2000 for all parameters. Autocorrelation was low among consecutive thinned observations (r<0.10 in all models). We viewed trace plots for fixed and random effects to ensure appropriate sampling of the posterior distribution, and we ran three chains of each model to check for model convergence using the Gelman–Rubin diagnostic (Gelman and Rubin, 1992). We report our model estimates as the mean of the posterior distributions ±95% highest density credible intervals (CI).
RESULTS
Repeatability of Tb by sex and Ta reaction norms
The grand mean Tb averaged across all control female and male swallows (n=16) was 42.1°C at the intercept (Ta=0, or ∼21°C), with Tb ranging from 41.07 to 43.8°C (Fig. 1A). On average, the mean Tb of female and male swallows increased with Ta (β=0.38, 95% CI [0.24–0.51]) and feeding rate (β=0.11 [0.07–0.15]), and there was strong evidence that females were ∼0.8°C warmer than males (i.e. sex; Table 2; 97% posterior probability distribution<0), as indicated by the strength of the coefficients, associated credible intervals, and the percentage of posterior probability distribution that fell below 0. However, there was no evidence that the sexes differed in mean Tb as a function of Ta, at the population level (i.e. sex×Ta, β=−0.03 [−0.30–0.24]).
Repeatability of average Tb was high, but similar, for the two sexes (i.e. Rintercept; Table 3), and hypothesis tests confirmed no evidence for a difference between female and male Tb repeatability (β=−0.07 [−0.18, 0.05], K=0.18). However, there was substantial evidence that females and males differed in their mean slope repeatability (Table 3, Rslope,female=0.14, Rslope,male=0.58; Fig. 2), as indicated by the Bayes factor and posterior probability (hypothesis test, β=0.34 [−0.07, 0.68], K=11.8, posterior probability=92%). Additionally, the degree of phenotypic flexibility (i.e. variance in reaction norm slopes) varied between sexes (Levene's test, F1,14=7.63, P=0.02). Taken together, these results provide strong evidence that males displayed greater among-individual variation in their Tb across Ta compared with females. Relatedly, there was a negative relationship between intercepts and slopes in males (Fig. 2), but no evidence for a relationship in females (i.e. Vcov; Table 3). For males, this means that individuals with lower average Tb tended to be more responsive to changes in Ta compared with individuals with a higher average Tb (i.e. negative covariance).
Repeatability of Tb by treatment and Ta reaction norms
The grand mean Tb averaged across all female swallows (n=16) was 42.1°C at Ta≈21°C, but individuals differed from each other by as much as 2.49°C (Fig. 1B; range 40.80–43.29°C). The mean Tb of female swallows increased with Ta (β=0.36 [0.23–0.47]) and feeding rate (β=0.14 [0.10–0.18]), with some evidence that Tb was lower in trimmed compared with control birds (i.e. treatment, β=−0.54 [−1.43, 0.33]; 88% posterior probability distribution<0) (Table 4). Further, while control and trimmed birds maintained relatively similar Tb across Ta (i.e. treatment×Ta, β=−0.10 [−0.29, 0.11]; Fig. 3), 85% of the posterior probability distribution was <0, suggesting that trimmed birds had, on average, shallower slopes across Ta than control birds.
Repeatability of mean Tb in control and trimmed females was high (Rintercept control=0.89 [0.77, 0.99]; Rintercept trimmed=0.79 [0.59, 0.96]; Table 5), but there was no evidence for differences in repeatability between the treatments (hypothesis test, β=−0.01 [−0.19, 0.20], K=0.88). There was low repeatability in individual slopes in both treatments (Rslope control=0.12 [0.00, 0.41]; Rslope trimmed=0.22 [0.00, 0.55]; Table 5), indicating that individuals did not consistently maintain differences in Tb across Ta, irrespective of treatment. Additionally, we found no evidence for a difference in treatment slope repeatability (hypothesis test, β=−0.20 [−0.56, 0.15], K=0.16). However, the degree of phenotypic flexibility (i.e. variance in reaction norm slope) varied between treatments (Levene's test, F1,14=12.38, P=0.003).
We found no support for a relationship between individual identity intercepts and slopes (Vcov control=−0.05 [−0.97, 0.88]; Vcov trimmed=−0.24 [−1.00, 0.59], Table 5), in both treatments, meaning that an individual's average Tb and the plasticity of Tb (across changes in Ta) were not related.
DISCUSSION
Average Tb is repeatable for both sexes, but only males show repeatable reaction norms
We found strong evidence that average Tb is a repeatable trait in female and male tree swallows, meaning that in both sexes, individuals consistently differed from each other in their mean daytime Tb. We also found evidence that Tb was flexible across Ta, and this flexibility was more repeatable in males than in females. The reason for the greater repeatability in the male reaction norm slope is unclear but may be related to sex-specific differences in life-history strategies. While the feeding rate, and variation in feeding rate, was similar between males and females in our study (mean±s.d. 10.4±2.4 females, 10.9±2.7 males), males may be more variable in their overall contribution to parental care than females, which may be reflected in other parental care metrics not measured here (e.g. food load: du Plessis et al., 2012; nest cleaning). Male tree swallows are also known to exhibit polygyny (Dunn and Robertson, 1992), which may affect the time spent attending to their primary nest. Finally, males may be more variable in their body condition, or other aspects of morphology that affect insulation (and consequently Tb), such as feather density or length.
We also found that males with lower average Tb tended to have sharper increases in Tb with increasing Ta, compared with males with higher average Tb. One possible explanation is that males with high Tb may be unable (or unwilling) to raise their Tb further because of costs associated heat stress (i.e. heat dissipation limitation hypothesis: Speakman and Król, 2010). This would be consistent with our previous findings showing that limits to sustained energy expenditure in tree swallows may be a consequence of heat dissipation ability at high temperatures (Tapper et al., 2020b).
That we did not see this same effect in females, despite their higher average Tb than males at the population level, may be due to experimental error; for example, if the PIT tag of the female with the highest Tb (Fig. 1A, ID 8F) was implanted in deeper tissue (muscle) than in the other birds, it could raise the population-average Tb in females. While excluding this individual from the analysis reduced the magnitude of the difference between the sexes' average Tb (sex coefficient: 0.85 included, 0.60 excluded), it does not change our conclusions, given that most of the posterior distribution still excluded 0 (96% <0 included, 90% <0 excluded). A biological reason could be that because males lack a brood patch, they are more limited in their ability to dissipate heat relative to females. Therefore, as work rate increases, Tb may rise quicker in males than in females. Alternatively, females and males may differ in their tolerance of thermally induced cellular damage (reviewed in Hansen, 2009; Iossa, 2019; Pérez et al., 2008; Romero-Haro et al., 2016; but see Walsh et al., 2019) and, consequently, females may be less restricted in their ability to raise Tb. This, however, remains speculative in light of research showing no sex-specific differences in oxidative stress in small passerine birds (Costantini et al., 2006, 2007). A final explanation may be that male and female Tb are differentially constrained by other physiological traits. Regardless of the reason, differences in trait correlation between the sexes may imply historical differences in the strength of selection acting on Tb, because selective pressures tend to constrain trait variation within populations across time (Baker et al., 2018; Bell and Sih, 2007; DiRienzo et al., 2016).
Repeatability of female Tb does not differ as a function of relative heat loss
Experimentally manipulating heat loss did not influence the repeatability of average Tb (i.e. reaction norm intercept, 21°C) or Tb across Ta (i.e. reaction norm slope), although it did influence the degree of phenotypic flexibility in Tb, as indicated by different variance in slopes between treatments. That there was greater variance in the Tb slope estimates among trimmed individuals relative to control individuals, but no difference in slope repeatability, suggests that the greater variance in slope steepness observed among trimmed birds is likely due to differences in how individuals respond to environmental factors. For example, sources of variation likely to cause within-individual differences, such as variable environmental conditions (e.g. solar radiation, wind speed, insect abundance), daily fluctuations in internal state (e.g. hormonal levels), short-term behavioural adjustments (e.g. trailing legs during flight, fluffing feathers) and sources of experimental bias (e.g. measurement error, missing data), may have influenced the Tb slope response more in trimmed compared with control birds. Alternatively, it is possible that we did not detect a difference in slope repeatability between treatments because of small sample sizes and a lack of statistical power.
While there were no differences in the repeatability of Tb between treatments, control females were warmer than trimmed females in average Tb, and their average increase in Tb across Ta (i.e. black line, Fig. 3) was also steeper than in trimmed females. Because of similar repeatability in Tb between treatments, the population-level differences seen in this study are likely to be a consequence of individuals within each group responding consistently to the effect of treatment on Tb. However, it is possible that we may have seen greater inter-individual variation in response to the treatment had we trimmed a larger surface area (i.e. increased the area of heat transfer), because of the difficulty in regulating Tb with increased surface area exposure.
Despite observing low slope repeatability in both groups, there were consistent differences in the way individuals responded to changes in Ta. For instance, some individuals increased their Tb more steeply in response to increasing Ta than others (Fig. 3), suggesting there are different strategies of regulating Tb (Tapper et al., 2020b), or differences in the tolerance of higher Ta (Noakes and McKechnie, 2019). Additionally, the flexibility in the Tb by Ta response could have been influenced by variation in individual energy expenditure, which could arise because of differences in food availability, or the interaction between food availability and Ta (Briga and Verhulst, 2017). Regardless of the reason, such phenotypic flexibility within populations opens the possibility that inter-individual differences in thermoregulatory traits are heritable, and that phenotypic flexibility in Tb may itself be heritable. However, we recognize that even heritability does not necessarily imply trait responsiveness to natural selection (Vatka et al., 2020), and that a variety of phenotypically plastic traits have been reported to be non-responsive to selection by Ta (Arnold et al., 2019).
We note that the repeatability estimates reported here may be not representative of the among-individual variation in swallow core Tb, given that our metric of Tb was peripheral to the core, and consequently is likely to be influenced by both changes in environmental conditions and heat flow across the body, at least to some degree (Nord et al., 2016; Andreasson et al., 2020b). Future studies should consider simultaneously measuring individual variation in both deep and peripheral Tb, remotely, to determine the degree of variability that arises from recording Tb closer to the skin surface. Additionally, it remains to be seen whether among-individual differences in the flexibility of Tb that we report are heritable, and ultimately adaptive.
Acknowledgements
We would like to thank A. Schubert, A. Dain and J. Baici for help with data collection; A. Gerson for loaning RFID equipment; J. Robertson for advice on code and statistics; and S. Morin for edits. We also thank S. Verhulst and an anonymous referee for many comments that improved the manuscript.
Footnotes
Author contributions
Conceptualization: S.T., J.J.N., G.B.; Methodology: S.T., J.J.N., G.B.; Formal analysis: S.T.; Resources: J.J.N., G.B.; Data curation: S.T., G.B.; Writing - original draft: S.T.; Writing - review & editing: S.T., J.J.N., G.B.; Supervision: J.J.N., G.B.; Funding acquisition: J.J.N., G.B.
Funding
This research was supported by funds from the Natural Sciences and Engineering Research Council of Canada (NSERC) (RGPIN-04158-2014) and an internal research grant provided by Trent University. S.T. was supported in part by an Ontario Graduate Scholarship.
Data availability
Data are available in Dryad (Tapper, 2021): bcc2fqzbd.
References
Competing interests
The authors declare no competing or financial interests.