Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 May;141(5):3070.
doi: 10.1121/1.4982246.

Prosodic exaggeration within infant-directed speech: Consequences for vowel learnability

Affiliations

Prosodic exaggeration within infant-directed speech: Consequences for vowel learnability

Frans Adriaans et al. J Acoust Soc Am. 2017 May.

Abstract

Perceptual experiments with infants show that they adapt their perception of speech sounds toward the categories of the native language. How do infants learn these categories? For the most part, acoustic analyses of natural infant-directed speech have suggested that phonetic categories are not presented to learners as separable clusters of sounds in acoustic space. As a step toward explaining how infants begin to solve this problem, the current study proposes that the exaggerated prosody characteristic of infant-directed speech may highlight for infants certain speech-sound tokens that collectively form more readily identifiable categories. A database is presented, containing vowel measurements in a large sample of natural American English infant-directed speech. Analyses of the vowel space show that prosodic exaggeration in infant-directed speech has the potential to support distributional vowel learning by providing the learner with a subset of "high-quality" tokens that infants might attend to preferentially. Categorization models trained on prosodically exaggerated tokens outperformed models that were trained on tokens that were not exaggerated. Though focusing on more prominent, exaggerated tokens does not provide a solution to the categorization problem, it would make it easier to solve.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Vowel distributions in IDS (based on data from Swingley, 2009).
FIG. 2.
FIG. 2.
Point vowels for each speaker (f1, d1, w1) in focused (focus) and unfocused (no focus) position.
FIG. 3.
FIG. 3.
Point vowels pooled across speakers. Formants were normalized to z-scores before averaging.
FIG. 4.
FIG. 4.
The entire set of monophthongs. The center of the vowel space is indicated as “+.” Formants were normalized to z-scores before averaging across speakers.
FIG. 5.
FIG. 5.
Distributions of ALL TOKENS (left), NO FOCUS tokens (middle), and FOCUS tokens (right) for the /i/-/ɑ/-/u/ data set (top) and the /i/-/ɪ/-/ɛ/ data set (bottom).
FIG. 6.
FIG. 6.
Acoustic correlates of prosodic focus (from top to bottom: pitch, pitch movement, duration).

Similar articles

Cited by

References

    1. Benders, T. (2013). “ Mommy is only happy! Dutch mothers' realisation of speech sounds in infant-directed speech expresses emotion, not didactic intent,” Infant Behav. Develop. 36, 847–862.10.1016/j.infbeh.2013.09.001 - DOI - PubMed
    1. Boersma, P. , Escudero, P. , and Hayes, R. (2003). “ Learning abstract phonological from auditory phonetic categories: An integrated model for the acquisition of language-specific sound categories,” in Proceedings of the 15th International Congress of Phonetic Sciences, Barcelona, Spain, pp. 1013–1016.
    1. Boersma, P. , and Weenink, D. (2012). “ Praat: Doing phonetics by computer” (Computer program).
    1. Bradlow, A. R. , Torretta, G. M. , and Pisoni, D. B. (1996). “ Intelligibility of normal speech I: Global and fine-grained acoustic-phonetic talker characteristics,” Speech Commun. 20, 255–272.10.1016/S0167-6393(96)00063-5 - DOI - PMC - PubMed
    1. Brent, M. R. , and Siskind, J. M. (2001). “ The role of exposure to isolated words in early vocabulary development,” Cognition 81, B33–B44.10.1016/S0010-0277(01)00122-6 - DOI - PubMed

Publication types