Abstract
This study investigated the effects of anxiety on nonverbal aspects of speech using data collected in the framework of a large study of social phobia treatment. The speech of social phobics (N = 71) was recorded during an anxiogenic public speaking task both before and after treatment. The speech samples were analyzed with respect to various acoustic parameters related to pitch, loudness, voice quality, and temporal aspects of speech. The samples were further content-masked by low-pass filtering (which obscures the linguistic content of the speech but preserves nonverbal affective cues) and subjected to listening tests. Results showed that a decrease in experienced state anxiety after treatment was accompanied by corresponding decreases in (a) several acoustic parameters (i.e., mean and maximum voice pitch, high-frequency components in the energy spectrum, and proportion of silent pauses), and (b) listeners’ perceived level of nervousness. Both speakers’ self-ratings of state anxiety and listeners’ ratings of perceived nervousness were further correlated with similar acoustic parameters. The results complement earlier studies on vocal affect expression which have been conducted on posed, rather than authentic, emotional speech.
Acknowledgments
This research was supported by the Swedish Research Council and the Ryoichi Sasakawa Young Leaders Fellowship Fund (SYLFF) through grants to the first author.
Cite this article
Laukka, P., Linnman, C., Åhs, F. et al. In a Nervous Voice: Acoustic Analysis and Perception of Anxiety in Social Phobics’ Speech. J Nonverbal Behav 32, 195–214 (2008). https://doi.org/10.1007/s10919-008-0055-9
DOI: https://doi.org/10.1007/s10919-008-0055-9