Skip to main content

Calls during agonistic interactions vary with arousal and raise audience attention in ravens



Acoustic properties of vocalizations can vary with the internal state of the caller, and may serve as reliable indicators for a caller’s emotional state, for example to prevent conflicts. Thus, individuals may associate distinct characteristics in acoustic signals of conspecifics with specific social contexts, and adjust their behaviour accordingly to prevent escalation of conflicts. Common ravens (Corvus corax) crowd-forage with individuals of different age classes, sex, and rank, assemble at feeding sites, and engage in agonistic interactions of varying intensity. Attacked individuals frequently utter defensive calls in order to appease the aggressor. Here, we investigated if acoustic properties of defensive calls change with varying levels of aggression, and if bystanders respond to these changes.


Individuals were more likely to utter defensive calls when the attack involved contact aggression, and when the attacker was higher in rank than the victim. Defensive calls produced during intense conflicts were longer and uttered at higher rates, and showed higher fundamental frequency- and amplitude-related measures than calls uttered during low-intensity aggression, indicating arousal-based changes in defensive calls. Playback experiments showed that ravens were more likely to react in response to defensive calls with higher fundamental frequency by orientating towards the speakers as compared to original calls and calls manipulated in duration.


Arousal-based changes are encoded in acoustic parameters of defensive calls in attacked ravens, and bystanders in the audience pay attention to the degree of arousal in attacked conspecifics. Our findings imply that common ravens can regulate conflicts with conspecifics by means of vocalizations, and are able to gather social knowledge from conspecific calls.


The acoustic structure of vocalizations is modulated by various factors. External stimuli may influence an individuals’ physiological state, which in turn induce changes in the structure of their vocalizations. In social animals, a conspecifics’ behaviour can represent external stimuli that could change the motivational state of a signaller. The emotional state of an individual influences acoustic properties of its vocalizations, with sounds becoming more harsh and lower in frequency when hostility and fear increase [1]. A recent framework proposed a two-dimensional approach to investigate emotional states in animals [2]. Instead of defining basic discrete emotions (e.g. fear, happiness, see [1]) the underlying emotions are measured along two axes. These axes consist of arousal, the physiological activation via the nervous system, and valence, the value of a certain emotion that ranges from very negative to highly positive [2]. Combining the acoustic properties of sounds with their underlying motivation gives insights into the emotional basis of communication during social interactions. The study of different emotional states based on acoustic measures requires a thorough understanding of the mechanisms of sound production and the effects of physiological processes on vocal production [3]. Vocalizations in humans are produced by the vibrating tissue (the source), and then shaped by the vocal tract (the filter) [4]. Although this concept was developed on human speech, it was successfully generalized to mammal vocal production [5, 6] and perception [7,8,9].

Recent studies suggest that although the vocal apparatus of mammals differs morphologically from the sound-producing organ of birds, the concept of the source-filter theory can still be applied to avian species both from a production [10,11,12,13,14,15,16,17,18] and a perception side [19, 20]. Hence, source- and filter-related acoustic features known to vary with arousal in mammals (e.g. fundamental and formant frequencies, amplitude, call duration; [21,22,23]) should cause comparable changes in acoustic parameters also in birds [18]. These acoustic parameters may thus serve as reliable indicators of a caller’s emotional state in general, and may help to manage social interactions with conspecifics and prevent escalation of conflicts. As communication usually occurs in a network of several animals in signalling and receiving range of each other [24], the emotional state of a caller may influence the behaviour of several individuals, addressees in direct interactions and bystanders alike. Consequently, studies should also take into account whether bystanders respond to arousal-based differences in acoustic signals, and thus are capable of inferring the emotional state of the caller.

Common ravens are opportunistic scavengers and gather at large ephemeral food sources such as carcasses [25], where they engage in agonistic interactions of varying intensity with conspecifics [26]. The intensity of an attack can be divided into attacks with and without physical contact: during fights and forced retreats, the aggressor attacks the victim with its beaks and claws, while the victim either fights back, or retreats [27]. During approach-retreat interactions (hereafter ‘retreats’) and submissive displays, the victim is displaced without physical contact. Yet, during submissive displays the aggressor shows self-assertive displays, with erected feathers above the eyes (‘feather-ears’) and the flanks, and the victim signals subordination through a retracted neck and a depressed plumage [27]. Independent of the level of aggression, the victims may utter defensive calls. Ravens were shown to establish a dominance hierarchy that is structured by age, sex, and bonding status: adult birds usually outrank younger ones, males outrank females, and birds with bonding partners outrank singletons [28].

Ravens have a large vocal repertoire [29, 30], including species-typical and individually learned calls. Among the former, many call types are well-studied with respect to call production and function (e.g. food-associated calls: [31,32,33,34] and territorial calls: [35, 36]), while comparably little is known about defensive calls. Defensive calls have been described as highly variable in duration, and are uttered as single calls or sequences of several calls when retreating from dominant conspecifics [37, 38]. As only victims call when retreating from aggressors, it seems that defensive calls function to signal distress and subordination, or ‘appeasement’ [39]. The experienced emotions during attacks are almost certainly negative for the victims; however, the level of arousal may vary with the intensity of the aggression and the perceived threat, and therefore should be reflected in the acoustic structure of defensive calls. In mammals, the most prominent changes in vocalizations relate to call duration, call rate, amplitude, and fundamental frequency, with calls becoming longer, higher in rate, louder and harsher with increasing arousal [23].

We here investigated defensive calls of individually marked free-ranging ravens in the Austrian Alps during agonistic encounters of varying intensity in the context of foraging. We first identified agonistic interactions and analyzed whether in addition to the intensity of the attack the opponents’ rank and relatedness influenced calling occurrences. We expected that the propensity to call and the number of calls emitted would vary with the level of aggression, i.e. calling would be more likely and more calls would be uttered when the conflict was more severe. In addition, the propensity to call and the number of calls uttered may vary inversely with fighting ability, whereupon we would expect calling propensity and the number of calls to be higher in low-ranking individuals. Finally, we expected conflicts to occur predominantly between unrelated individuals, as kin were shown to support each other during agonistic interactions [40]. We then analyzed the acoustic structure of defensive calls with special emphasis on acoustic parameters found to relate to arousal in mammals. We expected to find variation in accordance with those shown in mammals [23], e.g. longer and less tonal defensive calls with increasing attack intensity and opponents’ rank disparity.

Defensive calls raise the attention of bystanders [39, 41]. Victims of aggression were shown to receive social support from bystanders that are lower in rank than themselves, that supported them in previous conflicts, and from kin as well as bonding partners [28, 40]. It remains unknown whether calling increases the probability of receiving support, and which acoustic features of defensive calls bystanders pay attention to. Thus, we selected two parameters that showed significant variation in victims’ defensive calls according to the intensity of the attack, and manipulated these parameters experimentally. Using playback experiments, we tested receivers’ abilities to discriminate between natural and manipulated defensive calls. We hypothesized that higher proportions of bystanders would look towards the speaker when playing back calls that simulated increased arousal.


Data collection

Dyadic agonistic interactions were observed ad libitum [42] from August 2010 to July 2012 at the enclosures of wild boars, bears and wolves during morning feedings (0700–0900 a.m.) at the Cumberland Gamepark in Grünau im Almtal, Upper Austria (47°51′ N, 13°57′ O). The gamepark is built into a naturalistic landscape along the river Alm. Free-ranging ravens gather during morning feedings to snatch food from zoo animals, and are well habituated to the presence of human observers at those enclosures.

In the course of an ongoing monitoring project, ravens have been trapped and marked individually using coloured leg rings and metal rings from the German ringing station. The age class (juvenile, subadult, and adult) was determined by the coloration of the inner beak, which is pink in juveniles below 1 year, pinkish with dark speckles in 2 to 3 year olds, and turns completely black in adult birds aged older than 3 years [43]. At the start of the study, 130 ravens had been marked already. Another 74 ravens were marked in the course of the study, totalling 204 marked ravens. As non-breeder ravens are vagrant, the number and identity of birds present at the feedings varied daily and seasonally. An average of 22.97 ± 8.5 (SD) marked ravens were present during daily feedings in the study period (N = 516 days). At the onset of each feeding, observers were positioned next to the outer fence of the enclosures and delivered the food to the zoo animals, which prompted the ravens to land inside the enclosure and start foraging. Data was recorded using binoculars and voice recorders. In addition, all foraging ravens were video-taped using a digital camera (Canon HF-11 HD camcorder). From the videos, we coded the identity of both opponents and whether the victim produced defensive calls for each dyadic agonistic interaction between marked individuals. In addition, we coded the occurrence of an intervention by a third party, and whether the third party supported the victim, or the aggressor. From August 2011 to July 2012, sound recordings were conducted in addition to behavioural observations using a Sennheiser ME67 directional microphone (frequency response: 40–20,000 Hz) on a K6 Module connected to a Marantz recording device (Marantz PMD-670). Recordings were conducted at distances of 3–10 m with a sampling rate of 48 kHz and a 16-bit amplitude resolution.

Dominance hierarchy

Dominance indices were calculated on 942 agonistic interactions using SOCPROG 2.6 with MATLAB R2015a [44]. Modified David’s scores that account for unbalanced interaction rates [45] were extracted of each individual and normalized to obtain scores ranging from 0 to 1. Age class and sex are closely linked to dominance in ravens [28], and also in our data, adult birds outranked subadults and juveniles (Kruskal-Wallis test: H = 23.777, df = 2, p < 0.001), and males had a higher rank than females (Mann-Whitney U test: U = 391.0, p < 0.001). Thus, only rank differences (rank aggressor - rank receiver) were used in subsequent analyses.

Factors influencing calling propensity and the number of calls uttered

Generalized Linear Mixed Models (GLMMs) were calculated on 865 agonistic interactions involving 83 marked individuals (468 dyads) using the lme4 package [46] in R [47]. Calling (yes/no) was used as binomial response variable with a logit link function. The full model included the factors level of aggression (fight, forced retreat, retreat, and submission), rank difference of opponents, and kinship of opponents based on DNA analysis (full-sibling/parent-offspring, half-sibling, unrelated; detailed descriptions are provided in the Additional file 1). As random factor the identities of the opponents was entered to account for repeated interactions between opponents. To analyze the number of calls uttered during an agonistic interaction bout, a total of 135 bouts were analyzed with a GLMM using a Poisson distribution and a log link function. The identities of the opponents were used as a random factor. Level of aggression, rank difference of opponents, kinship of opponents, two-way interactions between level of aggression and rank difference and level of aggression and kinship were used as fixed factors. Variance inflation factors were calculated beforehand for all fixed factors in the model to ensure that no collinear parameters were entered in the models [48]. To rank the models, the difference in AICc (ΔAICc) was calculated by subtracting the lowest AICc from all others. As measures of strength of evidence for each model, relative likelihood (exp (−0.5/ΔAICc)) and Akaike weight (relative likelihood/sum of all relative likelihoods) were computed [49]. The models with the highest support were selected based on ΔAICc values (ΔAICc 2). As several models had high support, models were averaged using the MuMIn package [50] in R [47]. Post hoc pairwise comparisons were conducted using the multcomp package [51] in R [47], which accounted for multiple comparisons. The averaged models are shown in Table 1, the full model selection is presented in the Additional file 1: Table S1.

Table 1 Results of averaged models (all models with AICc value ≤2) on the propensity to call, and the number of call per interaction bout, with estimated means (EM), adjusted standard errors (SE), z values, and lower and upper confidence intervals (CI)

Sound analysis

A total of 377 defensive calls of 30 individuals were analyzed with a custom-built script in Praat [52]. The detailed routine is provided in the Additional file 2. Parameters measured were call duration (s), harmonicity (dB), amplitude measures: mean (dB), minimum (dB), relative time of minimum (%), maximum (dB), relative time of maximum (%), amplitude variation over time (dB/s); measures of the fundamental frequency (fo): mean (Hz), minimum (Hz), relative time of minimum (%), maximum (Hz), relative time of maximum (%), range (Hz), start (Hz), end (Hz), and sum of variation (sum of all fo changes); jitter; inflex (number of fo changes/s); and tonality (relative duration of tonal parts).

To reduce the amount of acoustic variables, a Principle Component Analysis (PCA) was conducted. Call duration loaded on a single component in the PCA (cp. Table S2 in the Additional file 1) and did not group with other acoustic measures, and thus was excluded from the analysis. PCA was recalculated without call duration, and three Principle Components (PCs) with eigenvalues greater than 1.0 were extracted which explained 90.27% of the total variance (Table 2). The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.708, indicating that the data was suitable for PCA. The first extracted PC included fo-related variables (mean, minimum, maximum, start and end fo) and explained 51.62% of the variance (hereafter termed fo component). PC2 was comprised of amplitude-related variables (mean, minimum, maximum) and explained 27.59% of the variance (hereafter termed amplitude component). PC3 grouped the variables tonality and jitter, adding 12.19% to the total variance (hereafter termed jitter and tonality component). Regression scores of the three PCs were extracted. Call duration was analyzed separately using original measured values in seconds instead of regression scores.

Table 2 Component matrix with loadings of the PCA

Linear Mixed Models (LMMs) were calculated for the regression scores of each PC and call duration with a gaussian distribution and an identity link function with the lme4 package [46] in R [47]. As opponents were sampled multiple times, and as victims uttered several calls per interaction bout, a random factor was entered which nested consecutive calls of each interaction bout within the aggressor-victim dyad. The full models included the fixed factors level of aggression, rank difference of opponents, and kinship of opponents. In addition, two-way interactions between level of aggression and rank difference and level of aggression and kinship were added to the full model. All fixed factors in the model were tested for multicollinearity [48]. All models were ranked using relative likelihood and Akaike weights as described above, and the models with the highest support are shown in Table 3. Post hoc pairwise comparisons were done using the multcomp package [51] in R [47] to account for multiple comparisons. The full model selection table is presented in Table S3 in the Additional file 1.

Table 3 Model selection table for models with the highest support (Δi2) investigating calling occurrences, the number of calls per interaction bout, the three PCs, call duration, and the responses to playbacks of defensive calls manipulated in fo and call duration

Individual discrimination was tested with a permutated discriminant function analysis (pDFA; [53]) in R [47]. A crossed pDFA with 1000 permutations was calculated on a fully balanced set of 115 calls of 23 individuals (5 calls per individual) using the three PC scores and call duration.

Playback experiment and analysis

Eight defensive calls of four male and four female adult ravens with known identity were selected with little background noise and no overlapping calls of other birds. All calls were similar in duration (mean ± SD: 0.187 s ± 0.024) and mean fo (mean ± SD: 447.89 Hz ± 17.33). Calls were adjusted to the lowest sound pressure level using Sound Booth for Mac to assure that all calls had the same sound pressure level. Duration and fo manipulations were conducted in Praat [52]. Each call was shortened and lengthened by 50%, and fo was shifted up and down by 100 Hz. The routine used in Praat is described in the Additional file 1. We designed a playback experiment to test responses, defined as head turns towards the speaker, of free-ranging ravens to defensive calls manipulated in frequency and duration. We conducted 8 sessions to test responses to duration manipulations, and 8 sessions to test responses to frequency manipulations. In each session, we played three calls, the original, unmanipulated defensive call, and two calls either manipulated in duration (shorter and longer) or in fo (shifted up and down by 100 Hz) in randomized order. Sessions testing duration and fo were alternated. The minimum interval between two played back calls in a session was 2 min, and the minimum interval between two sessions was 1 week. Playbacks were conducted during morning feedings. Thirty minutes prior to the feeding, a battery-powered loudspeaker (Roadboy 65, LD Systems, frequency response: 80–15,000 Hz) was placed approximately 3 m from the fence of the wild boar enclosure, and concealed with a camouflage net. When feeding started, the food was provided to the wild boars, causing the ravens to descend and to start scrounging food. Playback stimuli were presented approximately 10 min after the start of the feeding using an iPod nano (6th generation, The iPod was connected to the speaker via a radio transmitter-receiver system (Sennheiser EW 112-p G3-A Band, 516e558 MHz), allowing the playback to be conducted without a visible connection of the experimenter to the speaker. Each session was videotaped using a HD digital camera (Canon HF-11 HD camcorder) on a tripod, which allowed us to precisely measure the responses. The number of birds present and the number of birds responding by turning their head towards the speaker was scored from the videos. Additionally, we scored the number of defensive calls that were uttered within 1 min prior to the playbacks.

Responses of ravens to the playbacks of defensive calls were analyzed with a logistic regression model in R [47]. Separate models were calculated for sessions testing responses to duration and session testing responses to fo using a quasibinomial distribution and a logit link function to account for overdispersion. As response variable, we used a vector that was created from the number of responding birds (successes) and the number of birds that did not respond (failures) to account for varying numbers of birds present in different sessions (mean number of birds present ± SD = 8.66 ± 4.91 individuals). Manipulation type (original, fo shifted up, fo shifted down or original, shorter, longer), the sex of the bird used as stimulus, and the number of defensive calls per minute prior to the playback were used as fixed factors in the full model. To rank the models, quasiAICc values (QAICc) were calculated by dividing the residual deviance (−2 log-likelihood) with the overdispersion parameter of the full model [54]. From this, ΔQAICc, relative likelihood (exp (−0.5/ΔQAICc)) and quasi Akaike weights were computed (Table S4 in the Additional file 1).


Patterns of agonistic interactions

Out of 865 observed agonistic interactions between marked individuals, the majority were initiated by adult ravens and directed towards other adults or subadults. While males were targeting both sexes, females tended to focus on other females (see Table S5 in the Additional file 1). Subadult birds showed a similar pattern, but initiated less conflicts, and juveniles hardly initiated agonistic interactions at all. In 68.9% of all agonistic interactions the opponents were unrelated, 24.3% occurred between half-siblings, and only 6.8% of the dyads were between full-siblings. Interventions in agonistic interactions between individually marked individuals were observed 63 times; in 44 instances the third party supported the aggressor, targeting the victim, and the victim received support in 19 cases.

Factors influencing calling propensity and the number of calls uttered

Ravens uttered defensive calls in 51.9% of all agonistic interactions (cp. Table S5). Victims tended to receive support from a third party more often when calling (14 out of 19 cases with calling: Chi-squared (1) = 3.37, p-value = 0.067). Defensive calls had an average duration of 0.140 ± 0.05 s (SD) and were strongly time-frequency modulated (for an example of two defensive calls see Fig. 1, for descriptive measures see Table S6 in the Additional file 1).

Fig. 1
figure 1

Example of two defensive calls. Spectrogram settings: FFT method, Gaussian window shape, window length = 0.01 s, time steps = 700, frequency steps = 250, dynamic range = 70 dB

The level of aggression (reflecting the intensity of a conflict) affected the birds’ propensity to call. The victims produced defensive calls in 75.8% of all fights and 65.4% of all forced retreats. Submissive displays were accompanied by defensive calls in 60.0% of the cases, and (low-intensity) approach-retreat interactions triggered calls in only 12.6% of the cases. The averaged model identified the level of aggression as the most important factor (relative importance: 1.0), and the rank difference of opponents as the second factor (relative importance: 0.39; Table 1). This indicates that dominance relationships were, aside from the level of aggression, a key factor to understand why victims produced defensive calls. Pairwise comparisons on the averaged model showed that the proportion of calling was significantly lower during approach-retreat interactions as compared to fights, forced retreats, and submissive displays (Fig. 2).

Fig. 2
figure 2

Estimated mean proportion of defensive call occurrences for different levels of aggression. Whiskers represent 1.5XIQR, bold lines denote the median, and circles show outliers. Asterisk indicate p0.001

The number of calls per interaction bout was also influenced by the level of aggression (relative importance: 1.0) and the rank difference of opponents (relative importance: 0.27; Table 1). The highest number of calls per interaction bout was found during fights (Fig. 3). Fewer calls were uttered during submissive displays, and the lowest number of calls were found for forced retreats and approach-retreat interactions. Victims uttered higher numbers of calls per interactions bout when opponents had higher rank differences; i.e. when the victims were very low-ranking, and the aggressors were high-ranking individuals.

Fig. 3
figure 3

Estimated mean number of defensive calls per interaction bout for different levels of aggression. Whiskers represent 1.5XIQR, bold lines show the median, and circles indicate outliers. Asterisk indicate p0.001 (***) and p0.05 (*)

Acoustic structure

The level of aggression had a strong effect on the fo component (F = 4.59, df = 3, p = 0.004), the amplitude component (F = 6.12, df = 3, p < 0.001), and call duration (F = 3.51, df = 3, p = 0.027; Table 3). In all these parameters, highest values were found for defensive call uttered during fights (Fig. 4, Table 4). This supports our hypothesis that these acoustic parameters indicate arousal-based changes in defensive calls. Values decreased gradually for forced retreats and retreats and were lowest during submissive displays.

Fig. 4
figure 4

Estimated mean values for the fo component (a), the amplitude component (b), and call duration (C) with regard to the level of aggression that elicited the defensive calls. Values were derived from the model with the highest support (Table 4). Bold lines indicate the median. Whiskers show 1.5XIQR, and circles denote outliers. Asterisk show adjusted p values corrected for repeated testing and indicate p0.001 (***) and p0.01 (**)

Table 4 Results of the models with the highest support investigating the three PCs and call duration, showing estimated means (EM), adjusted standard errors (SE), t values, and lower and upper confidence intervals (CI) of all coefficients

In addition, rank difference of opponents affected variation in the fo component (F = 2.17, df = 1, p = 0.14), the tonality and jitter component (F = 9.57, df = 1, p = 0.002), and call duration (F = 0.13, df = 1, p = 0.15). While no clear pattern could be observed for rank difference and the fo component and call duration, the tonality and jitter component showed a negative relationship with rank difference: scores decreased, i.e. calls became harsher as rank disparity increased.

The model with the highest support to explain variation in the fo component and call duration further included the two-way interaction between the level of aggression and rank difference (fo component: F = 4.68, df = 3, p = 0.004; call duration: F = 8.79, df = 3, p < 0.001; cp Table 3). Both the fo component and call duration showed the same pattern: fo scores increased and calls became longer as rank disparity increased for submissive displays.

The pDFA failed to discriminate individuals based on the acoustic structure of their defensive calls. None of the cross-validated calls could be classified correctly.

Playback experiment

When playing back calls that varied in fo, the model with the highest support included the factor manipulation type (F = 4.44, df = 2, p = 0.025; Table 3). Higher proportions of responses were found when fo was increased compared to unmanipulated calls and calls with lower fo; the latter two treatments showed no difference in the proportions of responses (Fig. 5). When testing differences in proportions of responses to the manipulation of call duration, neither of the factors remained in the model with the highest support, and responses did not differ between the original and the manipulated calls (see Fig. 5).

Fig. 5
figure 5

Proportion of responding birds with respect to natural playback stimuli (unmanipulated) and stimuli manipulated in call duration (white boxes) and fo (grey boxes). Bold lines indicate the median, and circles the outliers. Whiskers represent 1.5XIQR and asterisk indicate p < 0.05 (*)


We here show for the first time that the arousal-based variation in acoustic features previously found in mammals [23] can be found also in birds, and that bystanders are attentive to these experimentally induced changes in conspecific defensive calls. Ravens’ defensive calls showed higher measured of acoustic parameters related to fo and amplitude during more intense conflicts. Moreover, bystander ravens were highly attentive to defensive calls with increased fo in the playback experiment.

Calling propensity and number of calls uttered

Victims were most likely to utter defensive calls during intense conflicts with contact aggression such as fights and forced retreats, and during submissive displays that are accompanied by self-assertive displays of the aggressors, and reflect a harassment of the victim. Likewise, the number of calls uttered by the victims were highest during fights and submissive displays. Our findings thus support the hypothesis that high arousal during conflicts may have induced higher calling rates in victims. Similar links were shown between the propensity of calling and call rates and increased arousal in the context of predation for rhesus macaques (Macaca mulatta): individuals were less likely to produce alarm calls when treated with an inhibitor of glucocorticoid, and did so at lower rates [55]. Non-invasive studies showed similar results for yellow-bellied marmots (Marmota flaviventris), where individuals with higher glucocorticoid levels, and thus higher levels of physiological arousal, were more likely to emit alarm calls in dangerous situations [56]. A recent study revealed that common marmosets (Callithrix jacchus) were more likely to produce contact calls with higher arousal, which was measured by heart rate [57].

Defensive calling in ravens was also more likely whenever the rank difference of opponents was high. In ravens, dominance rank is strongly influenced by sex and age class [28, 58]. Thus, a large disparity between victims’ and aggressors’ rank may induce higher arousal, and result in higher calling propensity and higher call rates. Previous findings from captive ravens showed that kin had more valuable relationships than unrelated individuals [59]. Our analysis also showed that genetically related individuals rarely engaged in aggressive interactions with each other, and the factor kinship was not included in the models with the highest support.

Acoustic structure

The fo component increased during agonistic interactions with physical aggression, which indicates that arousal could have influenced the increase of fo measures as well. The same was found for the amplitude component, which combined amplitude-related measures of ravens’ defensive calls. These results are in line with previous studies reporting an arousal-based increase in fo and relative amplitude in mammals (reviewed in [23]), and in a bird [60]. Likewise, with increased arousal, call duration was reported to increase in some mammals [23], which was also the case for ravens’ defensive calls during high intensity aggression (e.g. fights). However, not only arousal, but also valence may impact on acoustic parameters of vocalizations [2]. Studies investigating valence in avian vocalizations are scarce, and results of studies in mammals are inconclusive, probably because valence is difficult to assess in non-human animals in general [23]. Yet, the duration and rates of vocalizations were shown to be shorter in positive situations [23]. The jitter and tonality component did not vary with the level of aggression. According to the motivational-structural rule, an increase in arousal is expected to influence tonality, or harmonic-to-noise ratio, with sounds becoming harsher, i.e. lower in tonality [1]. On the contrary, some vocalizations were reported to be less noisy or harsh with increased arousal in mammals [23]. The jitter and tonality component was, however, linked negatively with rank difference of opponents, as scores decreased as rank disparity increased. Calls thus became more harsh when the aggressor was very high-ranking and the victim very low in rank, indicating that a high rank disparity may induce a higher threat, and thus higher arousal.

During submissive displays, the intensity of the agonistic interaction and rank disparity shaped acoustic parameters at the same time: the fo component scores and call duration increased with rank disparity, indicating that submissive displays are perceived as highly arousing, possibly due to the simultaneous self-assertive displays of the high-ranking aggressors. Thus, defensive calls uttered during submission may signal subordination in order to appease the opponent and to prevent an escalation of the situation.

Defensive calls did not differ between individuals. A possible reason could be that victims may not need to communicate their identity because defensive calls are directed at the aggressor, and the aggressor already knows the identity of the victim prior to the attack. We suggest further studies to investigate whether ravens are effectively not able to recognize or discriminate individuals by their defensive calls.

Playback experiment

When investigating ravens’ responses to arousal-based changes in defensive calls, the proportion of responding birds, corrected for the total number of birds present, was highest for defensive calls with higher fo. This indicates that bystander ravens are attentive to the degree of arousal in attacked conspecifics. However, we did not find increased responses to defensive calls manipulated in duration. It is likely that the presentation of a single call did not elicit strong responses. Victims often uttered several defensive calls in a row during intense conflicts. As ravens only responded to call with increased fo, a possible conclusion is that single calls with moderate fo of any length do not raise the attention of bystanders because they do not sound highly urgent and aroused, and arousal is encoded in a high rate of calls with increased fo. This remains to be tested in future studies that explore responses to changes of other acoustic parameters independently [61, 62] or simultaneously. Another possible reason for low numbers of responding birds could have been the absence of visual cues (e.g. an ongoing conflict).


Our results show that agonistic interactions that induced high arousal and negative valence influenced the victims’ likelihood to call and the number of calls produced. Furthermore, the acoustic properties of defensive calls were affected by the intensity of the conflicts that induced calling. Variation in acoustic parameters related to fo, amplitude, call rate and duration approximate the most commonly varying source-related parameters in the study of vocal communication of emotions in mammals (reviewed in [23]). Our study shows that the same acoustic cues connote negative emotions also in ravens. Furthermore, we show that ravens are attentive to changes in acoustic properties of victims’ defensive calls. This finding implies that bystanders are sensitive to the degree of arousal in attacked birds, and that defensive calls may serve to regulate agonistic social interactions with conspecifics. Corvids’ social and cognitive skills are in many aspects comparable to those found in other highly social species. Their social organization characterized by high fission-fusion dynamics requires that members of subgroups constantly refresh their knowledge of others’ social relationships, which could have changes during prolonged fission periods [63]. One possibility to regain knowledge quickly is through eavesdropping on a variety of social signals. Our findings thus add to our understanding of the communicative value of acoustic signals.


  1. Morton ES. On the occurrence and significance of motivation-structural rules in some bird and mammal sounds. Am Nat. 1977;111:855–69.

    Article  Google Scholar 

  2. Mendl M, Burman OHP, Paul ES. An integrative and functional framework for the study of animal emotion and mood. Proc R Soc Lond B Biol Sci. 2010;277:2895–904.

    Article  Google Scholar 

  3. Taylor AM, Reby D. The contribution of source-filter theory to mammal vocal communication research. J Zool. 2009;280:221–36.

    Article  Google Scholar 

  4. Fant G. Acoustic theory of speech production. The Hague; 1960.

  5. Fitch WT, Hauser MD. Vocal production in nonhuman primates: acoustics, physiology, and functional constraints on “honest” advertisement. Am J Primatol. 1995;37:191–219.

    Article  Google Scholar 

  6. Owren MJ, Rendall D. Sound on the rebound: bringing form and function back to the forefront in understanding nonhuman primate vocal signaling. Evol Anthropol. 2001;10:58–71.

    Article  Google Scholar 

  7. Fitch WT, Fritz JB. Rhesus macaques spontaneously perceive formants in conspecific vocalizations. J Acoust Soc Am. 2006;120:2132–41.

    Article  PubMed  Google Scholar 

  8. Fitch WT. Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques. J Acoust Soc Am. 1997;102:1213–22.

    Article  CAS  PubMed  Google Scholar 

  9. Charlton BD, Ellis WAH, Larkin R, Fitch WT. Perception of size-related formant information in male koalas (Phascolarctos cinereus). Anim Cogn. 2012;15:999–1006.

  10. Beckers GJL, Suthers RA, ten Cate C. Pure-tone birdsong by resonance filtering of harmonic overtones. PNAS. 2003;100:7372–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Beckers GJL, Nelson BS, Suthers RA. Vocal-tract filtering by lingual articulation in a parrot. Curr Biol. 2004;14:1592–7.

    Article  CAS  PubMed  Google Scholar 

  12. Ohms VR, Snelderwaard PC, ten Cate C, Beckers GJL. Vocal tract articulation in zebra finches. PLoS One. 2010;5:e11923.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Nowicki S. Vocal tract resonances in oscine bird sound production: evidence from birdsongs in a helium atmosphere. Nature. 1987;325:53–5.

    Article  CAS  PubMed  Google Scholar 

  14. Hoese WJ, Podos J, Boetticher NC, Nowicki S. Vocal tract function in birdsong production: experimental manipulation of beak movements. J Exp Biol. 2000;203:1845–55.

    CAS  PubMed  Google Scholar 

  15. Riede T, Suthers RA, Fletcher NH, Blevins WE. Songbirds tune their vocal tract to the fundamental frequency of their song. PNAS. 2006;103:5543–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Patterson DK. A comparative study of human and parrot phonation: acoustic and articulatory correlates of vowels. J Acoust Soc Am. 1994;96:634–48.

    Article  CAS  PubMed  Google Scholar 

  17. Elemans CPH. The singer and the song: the neuromechanics of avian sound production. Curr Opin Neurobiol. 2014;28:172–8.

    Article  CAS  PubMed  Google Scholar 

  18. Elemans CPH, Rasmussen JH, Herbst CT, Düring DN, Zollinger SA, Brumm H, et al. Universal mechanisms of sound production and control in birds and mammals. Nat Commun. 2015;6:8978.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Fitch WT, Kelley JP. Perception of vocal tract resonances by whooping Cranes Grus americana. Ethology. 2000;106:559–74.

  20. Dooling RJ, Best CT, Brown SD. Discrimination of synthetic full-formant and sinewave /ra–la/ continua by budgerigars (Melopsittacus undulatus) and zebra finches (Taeniopygia guttata). J Acoust Soc Am. 1995;97:1839–46.

    Article  CAS  PubMed  Google Scholar 

  21. Scherer KR. Vocal affect expression: a review and a model for future research. Psychol Bull. 1986;99:143–65.

    Article  CAS  PubMed  Google Scholar 

  22. Scherer KR. Vocal communication of emotion: a review of research paradigms. Speech Commun. 2003;40:227–56.

    Article  Google Scholar 

  23. Briefer EF. Vocal expression of emotions in mammals: mechanisms of production and evidence. J Zool. 2012;288:1–20.

    Article  Google Scholar 

  24. McGregor PK. In: PK MG, editor. Animal communication networks: Cambridge University Press; 2005.

  25. Ratcliffe D. The raven. London: T & AD Poyser LTD; 1997.

    Google Scholar 

  26. Braun A, Walsdorff T, Fraser ON, Bugnyar T. Socialized sub-groups in a temporary stable raven flock? J Ornithol. 2012;153:97–104.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Goodwin D. Crows of the world. 1st ed. London: British Museum (Natural History); 1976.

  28. Braun A, Bugnyar T. Social bonds and rank acquisition in raven nonbreeder aggregations. Anim Behav. 2012;84:1507–15.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Heinrich B. Ravens in winter. New York: Summit Books; 1989.

    Google Scholar 

  30. Enggist-Dueblin P, Pfister U. Cultural transmission of vocalizations in ravens, Corvus corax. Anim Behav. 2002;64:831–41.

    Article  Google Scholar 

  31. Bugnyar T, Kijne M, Kotrschal K. Food calling in ravens: are yells referential signals? Anim Behav. 2001;61:949–58.

    Article  Google Scholar 

  32. Szipl G, Boeckle M, Wascher CAF, Spreafico M, Bugnyar T. With whom to dine? Ravens' responses to food-associated calls depend on individual characteristics of the caller. Anim Behav. 2015;99:33–42.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Boeckle M, Szipl G, Bugnyar T. Who wants food? Individual characteristics in raven yells. Anim Behav. 2012;84:1123–30.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Heinrich B, Marzluff JM. Do common ravens yell because they want to attract others? Behav Ecol Sociobiol. 1991;28:13–21.

    Article  Google Scholar 

  35. Boeckle M, Bugnyar T. Long-term memory for affiliates in ravens. Curr Biol. 2012;22:801–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Reber SA, Boeckle M, Szipl G, Janisch J, Bugnyar T, Fitch WT. Territorial raven pairs are sensitive to structural changes in simulated acoustic displays of conspecifics. Anim Behav. 2016;116:153–62.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Pfister U. Zur Morphologie, Ontogenese und Funktion der Rufe von Kolkraben. Bern: University of Bern; 1988.

    Google Scholar 

  38. Gwinner E. Untersuchungen über das Ausdrucks- und Sozialverhalten des Kolkraben (Corvus corax corax L.). Z Tierpsychol. 1964;21:657–748.

    Article  Google Scholar 

  39. Heinrich B, Marzluff JM, Marzluff CS. Common ravens are attracted by appeasement calls of food discoverers when attacked. Auk. 1993;110:247–54.

    Google Scholar 

  40. Fraser ON, Bugnyar T. Reciprocity of agonistic support in ravens. Anim Behav. 2012;83:171–7.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Massen JJM, Pašukonis A, Schmidt J, Bugnyar T. Ravens notice dominance reversals among conspecifics within and outside their social group. Nat Commun. 2014;5:3679.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Altmann J. Observational study of behavior: sampling methods. Behaviour. 1974;49:227–67.

    Article  CAS  PubMed  Google Scholar 

  43. Heinrich B, Marzluff JM. Age and mouth color in common ravens. Condor. 1992;94:549–50.

    Article  Google Scholar 

  44. Whitehead H. SOCPROG Programs: analysing animal social structures. Behav Ecol Sociobiol. 2009;63:765–78.

    Article  Google Scholar 

  45. de Vries H, Stevens JMG, Vervaecke H. Measuring and testing the steepness of dominance hierarchies. Anim Behav. 2006;71:585–92.

    Article  Google Scholar 

  46. Bates D, Maechler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67:1–48.

    Article  Google Scholar 

  47. R Core Team. R: A language and environment for statistical computing. 3rd ed. Vienna: R Foundation for Statistical Computing; 2017.

  48. Zuur A, Ieno EN, Walker N, Saveliev AA, Smith GM. Mixed effects models and extensions in ecology with R. In: Gail M, Krickeberg K, Samet JM, Tsiatis A, Wong W, editors. Statistics for biology and health. New York: Springer; 2009. p. 261–94.

    Google Scholar 

  49. Burnham KP, Anderson DR, Huyvaert KP. AIC model selection and multimodel inference in behavioral ecology: some background, observations, and comparisons. Behav Ecol Sociobiol. 2011;65:23–35.

    Article  Google Scholar 

  50. Bartoń K. MuMIn: multi-model inference. R package. R package; 2009. Available from:

    Google Scholar 

  51. Hothorn T, Bretz F, Westfall P. Simultaneous inference in general parametric models. Biom J. 2008;50:346–63.

    Article  PubMed  Google Scholar 

  52. Boersma P, Weenink D. Praat: doing phonetics by computer. 5 ed. 2016. Available from: Retrieved 24 May 2015.

  53. Mundry R, Sommer C. Discriminant function analysis with nonindependent data: consequences and an alternative. Anim Behav. 2007;74:965–76.

    Article  Google Scholar 

  54. Burnham KP, Anderson DR. Model selection and multimodel inference: Springer; 2012.

  55. Bercovitch FB, Hauser MD, Jones JH. The endocrine stress response and alarm vocalizations in rhesus macaques. Anim Behav. 1995;49:1703–6.

    Article  Google Scholar 

  56. Blumstein DT, Patton ML, Saltzman W. Faecal glucocorticoid metabolites and alarm calling in free-living yellow-bellied marmots. Biol Lett. 2006;2:29–32.

    Article  CAS  PubMed  Google Scholar 

  57. Borjon JI, Takahashi DY, Cordero Cervantes D, Ghazanfar AA. Arousal dynamics drive vocal production in marmoset monkeys. J Neurophysiol. 2016:116;753–64.

  58. Heinrich B. Dominance and weight changes in the common raven Corvus corax. Anim Behav. 1994;48:1463–5.

    Article  Google Scholar 

  59. Fraser ON, Bugnyar T. The quality of social relationships in ravens. Anim Behav. 2010;79:927–33.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Szipl G, Boeckle M, Werner SAB, Kotrschal K. Mate recognition and expression of affective state in Croop calls of northern bald ibis (Geronticus eremita). PLoS ONE. Public Library of Science; 2014;9:e88265.

  61. Pitcher BJ, Briefer EF, McElligott AG. Intrasexual selection drives sensitivity to pitch, formants and duration in the competitive calls of fallow bucks. Evol Biol. 2015;15:1–13.

  62. Charlton BD, Zhihe Z, Snyder RJ. Giant pandas perceive and attend to formant frequency variation in male bleats. Anim Behav. 2010;79:1221–7.

    Article  Google Scholar 

  63. Aureli F, Schaffner CM, Boesch C, Bearder SK, Call J, Chapman CA, et al. Fission-fusion dynamics: new research frameworks. Curr Anthropol. 2008;49:627–54.

    Google Scholar 

  64. ABS A. Guidelines for the treatment of animals in behavioural research and teaching. Anim Behav. 2016;111:I–IX.

Download references


We are grateful to Kurt Kotrschal and Tecumseh Fitch for scientific advice, our colleagues at Konrad Lorenz Forschungsstelle (KLF) for helping with catching and marking ravens, the ‘Verein der Förderer KLF’ for permanent support, the Cumberland Wildpark and the animal keepers for logistical support, and two anonymous reviewers for valuable comments on the manuscript.


The study received financial support from the Austrian Science Fund (FWF) projects Y-366-B17 and W-1234-G17 to T.B. and project T699-B24 to E.R.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations



TB conceived the framework program. GS and MS conducted the behavioural observations, GS recorded and analyzed the sounds, conducted the playback study, and analyzed the data. ER and GS analyzed the genetic data. GS, ER and TB wrote the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Georgine Szipl.

Ethics declarations

Ethics approval

Trapping and marking of free-ranging ravens as well as taking blood samples for sexing and kinship analysis was performed under the license from the Austrian Government (BMWF-66.006/0010–11/10b/2009). All experimental procedures complied with the Austrian Animal Experiments Act (§ 2, Federal Law Gazette No. 114/2012) and adhere to the latest ASAB/ABS [64] guidelines for the treatment of animals in behavioural research and teaching.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Detailed methods on kinship analysis and sound preparation for playback experiments, and Tables S1-S6. (PDF 188 kb)

Additional file 2:

Praat script used to analyze defensive calls in common ravens. (TXT 4.30 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Szipl, G., Ringler, E., Spreafico, M. et al. Calls during agonistic interactions vary with arousal and raise audience attention in ravens. Front Zool 14, 57 (2017).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: