Acoustic allometry occurs when features of animal vocalisations can be predicted from body size measurements. Despite this being considered the norm, allometry sometimes breaks, resulting in species sounding smaller or larger than expected for their size. A recent hypothesis suggests that allometry-breaking mammals cluster into two groups: those with anatomical adaptations to their vocal tracts and those capable of learning new sounds (vocal learners). Here, we tested which mechanism is used to escape from acoustic allometry by probing vocal tract allometry in a proven mammalian vocal learner, the harbour seal (Phoca vitulina). We tested whether vocal tract structures and body size scale allometrically in 68 young individuals. We found that both body length and body mass accurately predict vocal tract length and one tracheal dimension. Independently, body length predicts vocal fold length while body mass predicts a second tracheal dimension. All vocal tract measures are larger in weaners than in pups and some structures are sexually dimorphic within age classes. We conclude that harbour seals do comply with anatomical allometric constraints. However, allometry between body size and vocal fold length seems to emerge after puppyhood, suggesting that ontogeny may modulate the anatomy–learning distinction previously hypothesised as clear cut. We suggest that seals, and perhaps other species producing signals that deviate from those expected from their vocal tract dimensions, may break allometry without morphological adaptations. In seals, and potentially other vocal learning mammals, advanced neural control over vocal organs may be the main mechanism for breaking acoustic allometry.
In many species, acoustic signals help mediate social interactions such as competition for mates and territory, and parent–offspring recognition (Bradbury and Vehrencamp, 1998; Martin et al., 2017). Signals can encode information about the caller's biology which can be readily deciphered by the receiver, including age (Reby and McComb, 2003; Charlton et al., 2009), sex (Vignal and Kelley, 2007; Charlton et al., 2009), body size (Fitch, 1997; Charlton et al., 2009, 2011; Garcia et al., 2016), hormone levels (Koren and Geffen, 2009) and physical condition (Wyman et al., 2008; Koren and Geffen, 2009).
In particular, body size often shapes mammalian sounds by constraining the geometry of the vocal tract (Fitch, 2000; Reby and McComb, 2003). Acoustic cues relating to the body size of the caller can inform the receiver about the caller's competitive ability and reproductive success (Poole, 1999; Reby and McComb, 2003; Kuester et al., 1995; Pfefferle and Fischer, 2006). For example, in primates and carnivores, there is an inverse relationship between body size and call frequency parameters, where larger animals produce calls with lower frequencies, i.e. have a ‘deeper’ voice (Bowling et al., 2017). This relationship between acoustical call features and body size, where one accurately reflects the other, is known as ‘acoustic allometry’ (Taylor and Reby, 2010; Fitch, 1997). Here, signalling is considered honest when the acoustic parameters of observed vocalisations accurately reflect an individual's body size (Zahavi, 1977; Fitch and Hauser, 2003). Deviations from allometry can generate dishonest signals, with animals sounding unexpectedly small or large for their body size (Garcia and Ravignani, 2020). Dishonest signals may be produced when an animal (1) shows a lack of allometric scaling between its vocal tract and its body size, or (2) shows enhanced control over its vocal organs, which allows it to learn new vocalisations or modify existing vocalisations – an ability known as ‘vocal learning’ (Janik and Slater, 1997; Lattenkamp and Vernes, 2018). Recent work indeed showed that, given a cross-species regression between sounds produced and body size, outlier species seem to cluster either well below the regression line (those with anatomical adaptations) or markedly above it (the vocal learners). This led to a ‘morphology versus learning hypothesis’ (Garcia and Ravignani, 2020; Ravignani and Garcia, 2022): dishonest signals in mammals may arise either from anatomical adaptations or from vocal learning capacities. This prediction has the potential to identify new vocal learners or species with unexpected vocal tract morphology. Vocal learners should therefore be able to violate acoustic allometry while possessing a vocal tract that scales allometrically with the rest of their body. For the first time, we tested this prediction, asking whether vocal tract allometry is present in a vocal learning species that is known to violate acoustic allometry.
Harbour seals (Phoca vitulina) are vocal learners that escape acoustic allometry by producing sounds with different frequencies to those expected from their body size, allowing them to transmit dishonest body size information. Indeed, they stand out as outliers in cross-species allometric regressions between body mass and frequency parameters (Ravignani and Garcia, 2022; see Fig. 1). Moreover, previous studies have shown that harbour seals can actively modulate the call frequencies they produce based on auditory experience. In one special case, a human-raised harbour seal, named Hoover, was found to be capable of mimicking human speech sounds (Ralls et al., 1985). In a more recent study on harbour seal pups, young animals were found to be capable of lowering their fundamental frequency (f0) in the presence of background noise (Torres Borda et al., 2021). Do the environmental noise conditions in which vocalisations are produced have a stronger influence on f0 values than body size? To address this, we complemented acoustic data from Torres Borda and colleagues (2021) with body mass information and reanalysed it to show that acoustic allometric relationships do indeed break down in this species as a result of the large vocal plasticity observed within individuals (see Fig. 2 and detailed explanations in the Supplementary Materials and Methods). These re-analyses indicate that, also within species, individual harbour seals may sound bigger or smaller than predicted by body size. Seals can therefore escape the constraints of acoustic allometry, both across and within species.
Harbour seals are particularly vocal during the first few weeks following birth (Perry and Renouf, 1988). Pups produce individually distinctive mother attraction calls (Renouf, 1984) which vary with age, sex and body length (Khan et al., 2006; Sauvé et al., 2015). After weaning, however, these calls disappear entirely from their vocal repertoire, with most vocalisations ceasing aside from occasional clicks and growls (Renouf, 1984). During adulthood, female harbour seals remain almost entirely vocally inactive (van Parijs and Kovacs, 2002), but males start vocalising again, producing underwater calls during the mating season (Hanggi and Schusterman, 1994). The large variation in vocal repertoire observed across individuals, sexes and age classes makes harbour seals ideal candidates to test the morphology versus learning hypothesis, i.e. whether a vocal learning mammal does indeed escape acoustic allometry via learning instead of via anatomical adaptations.
Most mammalian vocalisations are described using the source–filter theory of vocal production. Within this framework, vocal signals are initially produced by a source and are then filtered by the vocal tract before being emitted into the environment (Fant, 1970). In mammals, the source of sound production consists of the vocal folds in the larynx, and the filter is composed of the cavities making up the upper vocal tract (Fant, 1970) (see Fig. 3). The vocal folds are shelves of tissue lying across the airway that attach ventrally and laterally to the thyroid cartilage and dorsally to the arytenoid cartilage (see Fig. 4A). When vocalising, the air expelled from the lungs rushes between the vocal folds, causing them to vibrate and produce sound (Elemans et al., 2015). The sound then continues to propagate along the upper vocal tract and is modified by its geometry (i.e. filtered) before being emitted as vocalisation. The source–filter framework highlights which vocal tract structures determine specific features present in acoustic signals. The rate of vibration of the vocal folds determines the f0 and the cavities of the vocal tract determine formant frequencies (Taylor and Reby, 2010). Measurements of these vocal tract structures can thus be used to estimate specific acoustic features of vocalisations.
Bioacoustics studies often investigate allometric relationships between acoustic signal features and body size, without consideration of the underlying allometric scaling between body size and vocal anatomy. Most mammals show allometry between body size and upper vocal tract length because the upper vocal tract is constrained by bony structures (Fitch, 1997, 2000; Fitch and Giedd, 1999; Plotsky et al., 2013; Garcia et al., 2016). However, allometry between body size and the size of the vocal folds is less common: the larynx is surrounded by cartilaginous structures and is thus less constrained, suggesting that vocal fold length can be decoupled from overall body size, as found in non-human primates (Fitch and Hauser, 1995; Fitch, 1997; Garcia et al., 2017). In mammals, formants, the acoustic proxy of vocal tract length, are thus often a stronger body size predictor than f0, the acoustic proxy of vocal fold length (Fitch, 1997; Garcia et al., 2016).
Within the larger framework of the hypothesis above, this study tested for allometric relationships between body size and vocal anatomy measurements in young harbour seals and how these relationships vary with sex and age. Preliminary work found that harbour seal body length correlates with upper vocal tract length and tracheal diameter, but not with vocal fold length (Ravignani et al., 2017). Here, we aimed to expand on these findings by using a larger sample size (353% increase), adding refined anatomical measurements, and comparing different age classes (to test for developmental effects). Based on previous literature, we expected to find allometry between body size and vocal tract structures that are surrounded – and hence constrained – by bony structures, such as vocal tract length. However, based on harbour seal vocal learning abilities (Ralls et al., 1985; Torres Borda et al., 2021; Janik and Slater, 1997), we expected their vocal flexibility to offer favourable grounds to find deviations from body size allometry for vocal tract components surrounded by cartilage, such as the trachea and vocal fold length.
MATERIALS AND METHODS
Larynges were collected during necropsies on 68 young harbour seals, Phoca vitulina Linnaeus 1758 (35 males, 33 females). Fifty-two samples came from seals that stranded on the Dutch coastline, the rest from animals found on the German coastline (Schleswig-Holstein). Forty-two animals died in captivity at Sealcentre Pieterburen, Pieterburen, The Netherlands, either naturally during rehabilitation despite intensive care or by means of euthanasia because of the presence of severe clinical signs without any indication of recovery. Euthanasia was performed by trained veterinarians, after sedation, with pentobarbital sodium (100 mg kg−1) using the method described in Greer and colleagues (2001). The other 26 animals died in the wild, either naturally or were mercy killed by trained hunters because they showed severe signs of illness (see Table S1). No animals were euthanised or mercy killed for the purpose of this study.
At the time of death, the seals studied were aged between 9 days and 12 months (median 6 months). The age of new-born individuals was estimated in number of days by expert seal veterinarians based on the condition of the umbilical cord or the umbilicus. Older individuals with a closed umbilicus were assigned June as their birth month, which is consistent with the majority of harbour seal births in the Wadden Sea (Osinga et al., 2012; Reijnders et al., 2010). Animals aged 1 month or younger were classified as pups, while those between 1 and 12 months of age were classified as weaners, making age a binary variable. Of the 68 individuals included in this study, 14 (8 males, 6 females) were classified as pups and 54 (26 males, 28 females) were classified as weaners. A Fisher's exact test showed no significant association between age and sex (χ2=0.765, P>0.05), suggesting our sample was balanced between sexes and ages.
Sample treatment and measurements
Post-mortem examinations were performed by veterinarians who all trained at Sealcentre Pieterburen and thereby used the same necropsy protocol (Pugliares et al., 2007). Dutch seals were examined at Sealcentre Pieterburen and German seals were necropsied at the Institute for Terrestrial and Aquatic Wildlife Research (ITAW), Büsum, Germany. Necropsies were performed on either cooled or defrosted carcasses. Body mass, body length and axillary girth were all measured prior to the start of the necropsy. Body length was measured from the tip of the nose to the end of the tail in a non-curvilinear fashion, while the animal was in supine position, and axillary girth was measured as the body circumference directly caudal to the front flippers. The vocal apparatus including the upper vocal tract, the larynx and part of the trachea was then removed and immediately frozen at −20°C. All samples were in a similar condition (i.e. none presented signs of decomposition), comparable to pinniped vocal tracts in Schneider (1962) and Ravignani and colleagues (2017).
Prior to measurement, samples were thawed in a refrigerator at 8°C and each larynx was cut medially to produce two hemi-vocal tracts. The measurements taken on these hemi-vocal tracts (see Fig. 4B,C) include vocal tract length (VTL), vocal fold length (VFL), vocal fold thickness (VFT), and tracheal measurements in the form of subglottic-tracheal dorsoventral distance (STDV; called subglottic-tracheal anterior–posterior distance, STAP, in Roers et al., 2009) using a calliper to an accuracy of ±0.01 mm. Although the vocal tract can be divided into lower (below larynx) and upper (above larynx) sections, formants (the resonant frequencies which often encode information about body size) are only determined by the upper vocal tract (Lester and LaGasse, 2008). VTL will henceforth refer to the length of the upper vocal tract. VTL was measured as the linear distance from the caudal end of the epiglottis to the rostral end of the tongue muscle while the tongue was kept straight. VFL was measured as the distance from the ventral attachment of the vocal fold on the thyroid cartilage to the dorsal attachment of the vocal fold on the arytenoid cartilage. VFT was measured as the distance between the anterior and posterior sides of the vocal folds. The first STDV was measured as the distance between the cricothyroid ligament and the caudal end of the arytenoid. The second STDV was measured as the diameter of the first tracheal ring. All measurements were performed independently by two raters (K.d.R. and A.R.), different from the veterinarians who performed the dissections. For both raters, VTL, VFL and VFT were measured 4 times, twice for each hemi-vocal tract, and STDVs were taken twice, once for each hemi-larynx, because the start and end measuring points were composed of cartilage (as opposed to soft tissue) and, hence, we assumed that the inter-rater reliability for STDVs would be higher than for other measurements.
Statistical analyses were performed in RStudio version 1.1.463 (R version 4.0.4). First, for both raters, the medians for VTL, VFL and VFT were computed from all values reported for every right and left hemi-larynx. Second, using the medians from the first step, the median values for all measurements including STDV1 and STDV2 were computed for each larynx. This provided, for each larynx and rater, five measurements: VTL, VFL, VFT, STDV1 and STDV2. The inter-rater reliability for VTL, VFL, VFT, STDV1 and STDV2 was evaluated using Pearson's correlations. Finally, the overall median values between raters were computed for all measurements. Using these new values, Spearman's correlations between body size and vocal anatomy measurements were then calculated (see Table 1). For each measurement, normality was assessed using the Shapiro–Wilk test and homogeneity of variance was assessed using an F-test. If both assumptions were met, a two-tailed independent samples t-test was computed to check for age and sex differences. When variables were not normally distributed, but samples had equal variance, a Mann–Whitney U-test was performed to assess group differences instead.
Predictive modelling was done using generalised linear models (GLMs) with the stats package (http://www.R-project.org/). A series of models were produced for all anatomical measurements with high inter-rater reliability (r>0.70; Salkind, 2010, p. 627). For every response variable, the full model included the fixed effects body length, body mass, girth, sex, age and the interaction effects of sex with all body size predictors, age with all body size predictors and the interaction of age and sex. The reduced model was then obtained through stepwise regression based on Akaike information criterion (AIC) values. An analysis of variance (ANOVA) test was performed to ensure that the reduced model was not performing significantly worse than the full one. Variance inflation factor (VIF) scores were calculated for all predictors included in the reduced models using the car package (Fox and Weisberg, 2019). Multicollinearity was considered problematic for subsequent model selection if VIF scores were greater than 5 (Akinwande et al., 2015). For all selected models, deviance explained was calculated from the model output (1−residual deviance/null deviance) and expressed as a percentage. Plots displaying the predicted effects of every predictor retained in the final models were produced to assess their relationship with the response variable. Diagnostic residual plots were used to verify the model assumptions. Independence of residuals was tested using a Durbin–Watson test (Fox and Weisberg, 2019). Normality of residuals was assessed visually by plotting model fit against the observed data. Homoscedasticity (i.e. constant variance) of residuals was also assessed visually using quantile–quantile plots. Finally, influential data points were assessed by calculating Cook's distance.
Inter-rater reliability for VTL, VFL, VFT and both STDVs was evaluated using Pearson correlations. VTL (r=0.94), VFL (r=0.88), STDV1 (r=0.97) and STDV2 (r=0.93) showed high inter-rater reliability. VFT (r=0.59) showed lower inter-rater reliability and was consequently excluded from further analysis. All correlations were significant at P<0.001.
All Spearman correlations between body size and vocal anatomy measurements showed positive relationships and significance at the 0.05 level (see Table 1). There were high correlations between body mass and body length (rs=0.70), and between body mass and girth (rs=0.86). Other notable correlations included those between VTL and VFL (rs=0.72), VTL and STDV1 (rs=0.70), VFL and STDV1 (rs=0.82), and VFL and STDV2 (rs=0.76). Spearman correlations for pups and weaners can be found in Table S2.
All anatomical measurements were non-normally distributed but showed equal variances across age and sex groups. A Mann–Whitney U-test was used to test for group differences as only the assumption for homogeneity of variance was satisfied. All anatomical measurements were significantly larger in weaners than in pups (P<0.001; see Table 2 and Fig. 5). No significant sex differences were found when considering pups and weaners together (P>0.05). When considering pups alone, both the normality and homoscedasticity assumptions were met. A two-tailed independent samples t-test found significant sex differences for VTL (t=−3.42, P<0.05; see Fig. 6A). Male pups (mean±s.d. VTL 86.0±2.9 mm) had a larger mean VTL than females (79.8±3.7 mm). When considering weaners alone, variables showed non-normal distribution, but equal variances. A series of Mann–Whitney U-tests found that only STDV1 was significantly different across sexes (U=218, P<0.05; see Fig. 6B). Weaned males (25.1±1.5 mm) had a wider mean STDV1 compared with weaned females (24.2±1.2 mm).
A reduced GLM, obtained by stepwise regression based on AIC values, was produced for every vocal tract measurement with high inter-rater reliability, including VTL, VFL, STDV1 and STDV2. All VIF scores were lower than 5 suggesting that multicollinearity was not problematic in the selected models. All model assumptions were satisfied. Moreover, ANOVA testing indicated that the reduced models did not perform significantly worse than the full models (P>0.90). GLM results showed that most vocal tract dimensions were best explained by body length, body mass, age and sex (see Table 3). Girth was not retained as a predictor term in any of the selected models. For each model, the predictor estimates with their confidence intervals can be found in Table S3 and plots of the predicted effects can be found in Figs S1–S3. Significant interaction effects are shown in Fig. 7.
This study reports on the allometric relationships between body size and vocal tract dimensions in harbour seals. It shows that body length accurately predicts VTL, VFL and STDV1, and body mass predicts VTL and both tracheal measurements (STDV1 and 2). We also find age and sex to be important predictors for the size of vocal tract structures. This is evidenced by significant differences in measurements between age classes and significant sexual differences within age classes.
Previous work showed that upper vocal tract (i.e. filter) dimensions in mammals are predicted by body size measurements (Fitch, 1997, 2000; Fitch and Giedd, 1999; Plotsky et al., 2013; Garcia et al., 2016; Ravignani et al., 2017) and our results provide additional evidence to support such allometry. Although most studies have used body length as a proxy for body size, we found that body mass can also be used to predict VTL in harbour seals. In the first years of life, harbour seals show a linear growth rate for both body length (Hauksson, 2006) and mass (Markussen et al., 1989), suggesting that VTL may develop in a similar fashion during this period. Acoustic proxies for the filter could thus provide a good estimation of a harbour seal's size. In mammals, formant frequencies and formant spacing can be predicted from VTL and vice versa (see Reby and McComb, 2003). Other acoustic proxies include energy quartiles, the frequency of amplitude peaks, and the ratios between these amplitudes (Sauvé et al., 2015). These parameters also encode individual signatures, suggesting that acoustic individuality may partially be an allometric by-product (Ravignani et al., 2017). Harbour seals have the vocal tract predispositions to produce vocalisations that accurately reflect body size whilst also sharing individual-specific information, suggesting that learning does not need to be invoked to explain individuality.
Across mammals, source-related features such as f0 can sometimes predict body size despite showing weaker allometric scaling than filter-related features (Reby and McComb, 2003; Charlton et al., 2011; Pfefferle et al., 2007; Charlton and Reby, 2016); it was unclear whether this holds for harbour seals (Ravignani et al., 2017; Bowling et al., 2017). Our findings indicate that VFL, which may be used to determine f0, can be predicted by body size in harbour seals. Moreover, Sauvé and colleagues (2015) reported a decrease in f0 with an increase in body length of harbour seal pups. Taken together, this suggests that a harbour seal's f0 can be predicted from vocal anatomy. Previous evidence against allometric scaling for VFL could be explained by low statistical power or lack of testing for age effects on vocal tract measurements (Ravignani et al., 2017). It is indeed notable that age is included in both interactions that were retained in the selected VFL model. Our results, including both pups and weaners, show that allometric scaling between body size and VFL only emerges after weaning, suggesting that VFL may not be constrained in harbour seal pups (see Fig. 7, bottom). This begs the following question: how would escaping acoustic allometry for source-related features be beneficial for pups? Broadcasting honest body size information may be detrimental to harbour seal pups as they are significantly more likely to be displaced by larger conspecifics during agonistic interactions (Neumann, 1999). However, pups may be able to benefit from lowering the f0 (Torres Borda et al., 2021) of their calls to create an impression of size exaggeration. However, pups may also benefit from increasing the f0 of their calls to convey distress to the mother (Briefer, 2012). Future playback studies could and should contrast these hypotheses.
Several phocid species use the trachea for sound production (Bryden and Felts, 1974), but this could be a by-product of adaptive modifications to the respiratory tract required for diving (Kooyman and Andersen, 1969; Tyack and Miller, 2002). Our results support the correlation between tracheal diameter and body length found by Ravignani and colleagues (2017), but also provide evidence that tracheal dimensions can be predicted by body mass. Previous literature found that the trachea may potentially convey body size information if its size influences acoustic call features (Ravignani et al., 2017). In humans, a wider tracheal diameter partially predicts turbulence (i.e. unsteady air movements) for large airflows (van den Berg et al., 1957). Applying the same logic to other mammals, larger seals would have wider tracheal dimensions which, in turn, would make vocalisations noisier. This could explain, for instance, why the harmonics-to-noise ratio decreases as harbour seals get older (de Reus, 2017). Future work on sound production in this species could test this prediction using sound–anatomy correlations and excised larynx set-ups. Moreover, playback experiments could test whether adding noise to vocalisations alters interactive behaviour to determine whether the harmonics-to-noise ratio may encode body size information. Understanding whether and how the trachea is involved in sound production will thus require further research.
As expected, all anatomical measurements were larger for weaners than they were for pups. In the study by Ravignani and colleagues (2017), animals up to 108 days old were classified as pups. However, in the wild, the lactation period for harbour seals ranges from 23 to 42 days, after which the pups are weaned (Renouf, 2012). Hence, for the sake of simplicity, we considered animals up to 1 month old to be pups and animals older than 1 month to be weaners. Through this categorical classification, we were able to identify how allometric trends develop over the harbour seal's early life. At the time of data collection, we had very few larynges from subadults and adults, leading us to not include these data points in our analysis to avoid potential problems caused by small sample size. Future research including larynges from subadults and adults will further extend our knowledge of how vocal allometry develops in harbour seals.
There were no sexual differences when considering the sample size as a whole, but significant sexual differences existed within age classes. These differences might be attributed to differing levels of steroid hormones acting on the laryngeal structures in males and females (Aufdemorte et al., 1983; Sauvé et al., 2015). In some mammals, sex hormones affect the structural development of the larynx and the viscoelastic properties of the vocal fold tissue (Fitch and Giedd, 1999; Beckford et al., 1985). At puberty, in these animals, the male larynx descends in the vocal tract causing an elongation of the upper vocal tract, allowing males to convey an exaggerated impression of size (Fitch and Giedd, 1999; Fitch and Reby, 2001). In harbour seal pups of similar body size, males have larger VTLs than females, suggesting that laryngeal descent in males possibly occurs early in life. Once weaned, however, females show a clear increase in VTL whereas it remains relatively constant in males (see Fig. 7, top), suggesting that VTL differences across sexes may become less pronounced over time. In mammalian males, sex hormone action also causes a rapid increase in cartilage size, leading to an enlarged larynx and an increase in the vibrating portion of the vocal folds (Fitch and Hauser, 2003). This could explain why, in weaners, STDV1 is larger in males than in females. Nevertheless, these findings are somewhat surprising as young harbour seals normally show little sexual dimorphism (Le Boeuf, 1991). In particular, there is a lack of evidence for sexual differences regarding birth mass and growth rates among harbour seal pups (Bowen et al., 1994). In our sample, there were no significant body size differences between sexes (P>0.05); however, male pups were slightly larger than female pups in body length (males 81.6±4.4 cm, females 77.5±4.5 cm), which could partially explain the VTL differences observed in this age class. Male (9.8±1.58 kg) and female (9.8±1.6 kg) pups did not differ in body mass, but it is important to note that the sampled animals were sick and/or in poor condition; hence, body mass values are not representative of healthy individuals and should be interpreted with caution. In short, based on these observed differences in vocal anatomy across sexes, formants are expected to differ in pups and harmonics-to-noise ratio is expected to differ in weaners. The anatomical structures that determine these acoustic features both show strong allometric scaling; hence, these parameters may provide distinct body size cues across age classes, potentially facilitating the discrimination of male and female conspecific calls. Future research should investigate how sex hormones affect the elastic properties of harbour seal laryngeal tissues. Hormone levels can be measured by taking blood samples from healthy male and female seals at different developmental stages, and results can be combined with magnetic resonance imaging (MRI) mapping of laryngeal tissue elasticity.
The high inter-rater reliability observed for VTL, VFL and both STDVs demonstrates that these quantities can be measured and replicated easily, making them reliable landmarks for vocal tract measurements. However, tissue properties such as the viscoelasticity of certain vocal tract structures, like the vocal folds, are significant obstacles to getting accurate measurements. Indeed, raters struggled to produce precise data for VFT. Future research in the field of pinniped vocal anatomy would benefit from improved measuring techniques using 2D pictures, radiography, MRI and computed tomography scans as this would enable more accurate measurements for structures that are difficult to handle. Finally, future similar studies should include measurements of another vocal tract structure: the corniculate cartilage. Although widely absent in terrestrial carnivores, harbour seals have rather large corniculate cartilages that help close the trachea together with the epiglottis (Adams et al., 2020). These cartilages are located close to the vocal folds and are possibly innervated by the same nerves and controlled by the same muscles. It may be possible that these cartilages play a role in sound production by, for example, lowering the f0 by adding weight to the vocal folds. Taken together, these suggestions will provide a more precise and detailed picture of the harbour seal's vocal anatomy.
Observed species-specific vocalisations are determined by both a species' vocal anatomy and their capacity for vocal learning (Garcia and Ravignani, 2020; Ravignani and Garcia, 2022). The vocal anatomy generates vocal predispositions by imposing biomechanical constraints, whereas neural processes determine the degree of control species have over their vocal organs (Garcia and Manser, 2020). Particularly, vocal learners, such as the harbour seal, are capable of actively modulating sounds, suggesting that they are less constrained by anatomy and have a refined capacity for vocal motor control. Unfortunately, the relative contribution of the two sound production mechanisms is unclear. In this study, we tested a hypothesis trying to segregate anatomical versus learning mechanisms (Garcia and Ravignani, 2020; Ravignani and Garcia, 2022). As shown here, by testing for allometric relationships between body size and vocal tract structures, one can start to disentangle the respective contributions of vocal anatomy and vocal motor control in shaping acoustic signals. We found that harbour seals are mechanistically constrained by their vocal anatomy, and have large vocal flexibility (Ralls et al., 1985; Torres Borda et al., 2021) which may result in the production of dishonest signals, thus pointing towards extensive volitional control over their vocalisations. In brief, we provide support for the morphology versus learning hypothesis, showing however that this relationship may be mediated by ontogeny.
In summary, we provide evidence of allometry between body size and vocal tract measurements in harbour seals. Body length is a strong predictor for VTL, VFL and STDV1, and body mass is a strong predictor for VTL and both tracheal measurements (STDVs). Age and sex are also important in predicting the dimensions of these anatomical structures. Taken together, the combined findings demonstrate that harbour seal vocal tracts do indeed scale with body size, although allometry between VTL and body size may only emerge after weaning. One could now make inferences about the vocal predispositions of harbour seals (e.g. f0, formants), based on either their body size or the size of their vocal tract. However, to accurately predict f0, further studies are needed in harbour seals to determine the range of stress they apply to their vocal folds while vocalising and to infer the tissue density of their vocal folds (Titze et al., 1989). Once such predictions are made, comparing them with data obtained from observed natural vocalisations would shed light on the range of vocal flexibility resulting from their extensive vocal motor control. Although formant spacing could be predicted from vocal tract length (Titze, 1994), it is still difficult to consistently extract formants from harbour seal vocalisations, meaning that predictions cannot currently be compared with observed vocalisations. Finally, a critical next step to directly relate acoustic features to sound production structures is to connect harbour seal vocal anatomy measurements to the vocalisations they produce while alive. Integrating such results with investigations of call function will eventually inform on which vocal structures are responsible for generating the individual- and species-specific information encoded in harbour seal vocalisations.
We are grateful to Letty Stupers, Rebecca Andreini and Maria Jose Robles for help in gathering and preparing samples for data collection. We thank Aline Joustra for her illustrations of seal vocal tracts. We also greatly appreciate the extensive comments provided by Taylor Hersh, Laura Verga and Yannick Jadoul. We would also like to thank all members and volunteers of Sealcentre Pieterburen for their continued support in this research.
Conceptualization: K.d.R., S.G., M.G., A.R.-G., A.S.-C., A.R.; Methodology: K.d.R., S.G., A.R.-G., A.S.-C., A.R.; Software: S.G.; Validation: K.d.R., M.G., A.R.-G., A.S.-C., A.R.; Formal analysis: K.d.R., D.C., A.L.; Investigation: K.d.R., D.C., A.L., S.G.; Resources: S.G., A.R.-G., A.S.-C., A.R.; Data curation: K.d.R.; Writing - original draft: K.d.R., D.C., A.L., A.R.; Writing - review & editing: K.d.R., S.G., M.G., A.R.-G., A.S.-C., A.R.; Visualization: K.d.R., D.C., A.L.; Supervision: A.R.; Project administration: K.d.R.; Funding acquisition: A.R..
This work was supported by a Max-Planck-Gesellschaft (Max Planck Research Group, MPRG) and a Fonds Wetenschappelijk Onderzoek (FWO) Pegasus Marie Curie fellowship 12N5517N awarded to A.R. K.d.R. was supported by the Fonds Wetenschappelijk Onderzoek project ‘Interactive vocal rhythms’ G034720N awarded to Bart de Boer. The German Ministry of Energy, Agriculture, the Environment, Nature and Digitalization (MELUND) funded the marine mammal stranding scheme and the necropsies in Schleswig-Holstein. The Sealcentre Pieterburen funded the seal stranding scheme and the necropsies at the Sealcentre Pieterburen. M.G. was supported by a research grant from the Ethologische Gesellschaft e.V., as well as a postdoctoral research grant from the Forschungskredit of the University of Zurich [grant no. FK-21-121]. Open Access funding provided by Max Planck Society, Max Planck Digital Library. Deposited in PMC for immediate release.
The authors declare no competing or financial interests.