Elsevier

NeuroImage

Volume 46, Issue 1, 15 May 2009, Pages 226-240
NeuroImage

Neural signatures of phonetic learning in adulthood: A magnetoencephalography study

https://doi.org/10.1016/j.neuroimage.2009.01.028Get rights and content

Abstract

The present study used magnetoencephalography (MEG) to examine perceptual learning of American English /r/ and /l/ categories by Japanese adults who had limited English exposure. A training software program was developed based on the principles of infant phonetic learning, featuring systematic acoustic exaggeration, multi-talker variability, visible articulation, and adaptive listening. The program was designed to help Japanese listeners utilize an acoustic dimension relevant for phonemic categorization of /r–l/ in English. Although training did not produce native-like phonetic boundary along the /r–l/ synthetic continuum in the second language learners, success was seen in highly significant identification improvement over twelve training sessions and transfer of learning to novel stimuli. Consistent with behavioral results, pre–post MEG measures showed not only enhanced neural sensitivity to the /r–l/ distinction in the left-hemisphere mismatch field (MMF) response but also bilateral decreases in equivalent current dipole (ECD) cluster and duration measures for stimulus coding in the inferior parietal region. The learning-induced increases in neural sensitivity and efficiency were also found in distributed source analysis using Minimum Current Estimates (MCE). Furthermore, the pre–post changes exhibited significant brain-behavior correlations between speech discrimination scores and MMF amplitudes as well as between the behavioral scores and ECD measures of neural efficiency. Together, the data provide corroborating evidence that substantial neural plasticity for second-language learning in adulthood can be induced with adaptive and enriched linguistic exposure. Like the MMF, the ECD cluster and duration measures are sensitive neural markers of phonetic learning.

Introduction

A fundamental question in cognitive neuroscience is the degree of neural plasticity as a function of age and experience. Classic studies and arguments on the putative “critical” or “sensitive” period for language acquisition highlight the superiority of learning a second language prior to puberty, and data support both maturation and experience as mechanistic explanations for the effect (Flege et al., 1999, Hernandez and Li, 2007, Johnson and Newport, 1989, Kuhl et al., 2008, Lenneberg, 1967, Mayberry and Lock, 2003). In the phonetic domain, there is clear evidence that early language learning does not involve a permanent loss of perceptual sensitivity to all the nonnative distinctions (Best et al., 2001, Werker and Tees, 2005). Furthermore, adults' perception of nonnative speech can be improved by using a variety of short-term intensive training methods (Akahane-Yamada et al., 1997, Bradlow et al., 1999, Hazan et al., 2006, Iverson et al., 2005, Jamieson and Morosan, 1986, Logan et al., 1991, McCandliss et al., 2002, Pruitt et al., 2006, Strange and Dittmann, 1984, Tremblay et al., 1997, Wang et al., 2003, Zhang et al., 2000). These training studies, among others, have not only provided important empirical data for reevaluating the “critical period” hypothesis but also revealed key factors that facilitate second language learning independent of age. However, as epitomized by the classic problem of the /r–l/ phonemic contrast for adult Japanese speakers, neither intensive training nor prolonged naturalistic exposure has led to native-like mastery (Callan et al., 2003, Iverson et al., 2005, McCandliss et al., 2002, Takagi, 2002, Takagi and Mann, 1995). The experiential mechanisms that enhance or limit neural plasticity in adulthood are not well understood.

In our language acquisition model, adults' difficulty with nonnative languages stems from an early strong neural commitment to the statistical and spectral patterns in the language input during infancy (Kuhl et al., 2008). The effects of native language neural commitment (NLNC) are self-reinforcing and bidirectional — it enhances the detection of higher-order linguistic patterns, such as words, that utilize learned phonetic patterns, while at the same time hindering the detection of non-conforming patterns contained in foreign languages, as shown behaviorally (Iverson et al., 2003) and neurally (Zhang et al., 2005). We further theorize that second language acquisition in adulthood can be improved by manipulating the language input to incorporate the basic principles underlying infants' acquisition of the sound patterns of their native language (Kuhl et al., 2001, Zhang et al., 2005).

To address the underlying mechanisms of brain plasticity for phonetic learning in adulthood, we designed a training software program in a preliminary single-subject MEG study to test its success (Zhang et al., 2000). The program incorporated features that were motivated by studies of infant-directed speech (IDS) or “motherese” (Burnham et al., 2002, Fernald and Kuhl, 1987, Kuhl et al., 1997, Liu et al., 2003), including adaptive signal enhancement, visible articulation cues, a large stimulus set with high variability, and self-initiated selection. The preliminary results suggested that rapid improvement could be achieved on the difficult nonnative phonemic contrast. Approximately 12 h of training for the Japanese adult subject showed an overall 22% improvement in identification accuracy with remarkable transfer of learning — there was a 27% improvement in recognizing the /r–l/ tokens by untrained voices. The training effect was also shown in enhanced neural sensitivity for the /r–l/ distinction, particularly in the left auditory cortex. Compared with other /r–l/ training studies with equivalent amounts of behavioral improvement, transfer of learning and sustained effect tested six months after training (e.g., Bradlow et al., 1999, Callan et al., 2003), our program reduced the total training hours by over 70%. As the preliminary results were based on a single subject, more subjects needed to be tested in order to evaluate the training methodology and investigate the neural mechanisms that reflect phonetic learning at both the individual and group levels.

There are two main objectives in the present training study: (a) to test the efficacy of our IDS-motivated training program in adults' learning of second language phonetic categories, and (b) to examine two hypothetical neural markers of learning in terms of brain-behavior correlates: neural sensitivity, as measured by the mismatch field response for phonetic discrimination (Näätänen et al., 1997), and neural efficiency, as measured by the focal degree and duration of brain activation during phonetic perception in terms of equivalent current dipole (ECD) clusters (Zhang et al., 2005). Previous neurophysiological studies have shown strong evidence of learning-induced enhancement in neural sensitivity to support phonetic categorization in adults as well as in children (Cheour et al., 1998, Imaizumi et al., 1999, Kraus et al., 1995, Menning et al., 2002, Näätänen et al., 1997, Nenonen et al., 2005, Rivera-Gaxiola et al., 2000, Tremblay et al., 1997, Winkler, 1999, Zhang et al., 2000). There is also evidence for learning-induced shift toward left hemisphere dominance in terms of enhanced neural sensitivity for linguistic processing (see Näätänen et al., 2007 for a review.). In line with the neural efficiency idea, fMRI studies have reported more focal activation for learned auditory stimuli particularly in native speakers or more advanced learners (Callan et al., 2004, Guenther et al., 2004, Wang et al., 2003). Cross-language MEG data have additionally indicated a shorter duration of bilateral activation for native speech processing in specific brain regions — the superior temporal and inferior parietal cortices (Zhang et al., 2005).

The central question of our study is whether substantial behavioral improvement in second language phonetic learning can be achieved in adulthood and simultaneously reflected by the spatiotemporal markers of neural sensitivity and neural efficiency, resulting in native-like perception and native-like brain activation patterns for learning the difficult speech contrasts in a second language. To cross-validate the brain activation patterns shown by the ECD cluster analysis approach and investigate the relationship between neural sensitivity and efficiency, we also employ distributed source analysis using minimum current estimates with fundamentally different assumptions about the source activity (Uutela et al., 1999, Zhang et al., 2005). We predict that our IDS-motivated training program would help circumvent interference from neural networks that have been shaped by native language experience, yielding significant brain-behavior correlations in both domains of sensitivity and efficiency.

Section snippets

Subjects

A pretest–intervention–posttest design was implemented to assess initial capability and the training effects. Nine right-handed Japanese college students (6 males and 3 females) participated in the study (21–23 in age). Subjects were volunteers under informed consent. They were recruited after screening for hearing, handedness, and language background. The subjects had no history of speech/hearing disorders, and all showed clear N1m responses to a 1000 Hz tone. All had received nine years of

Behavioral effects on trained and untrained natural speech stimuli

The 12 training sessions produced a rapid and highly significant improvement of 21.6% in the trainees, an increase from 60.1% to 81.7% [two-tailed t-test, p < 0.0001] (Fig. 2a). Neither of the two subjects who did not receive training showed comparable changes (− 1.3% for KO and + 2.8% for KI) (Fig. 2b), and they were both statistical outliers against the trainee group [one-tailed Z-test, p < 0.00001]. The average improvements were 19.5% for the CV tokens and 23.7% for the VCV tokens (Fig. 2d),

Successes and limitations of the IDS-motivated training program

The behavioral data provided strong evidence of success for our training program. Japanese listeners achieved a 21.6% improvement in identifying naturally spoken /r–l/ syllables in 12 training sessions, which generalized to the untrained voices, vowel contexts and synthetic prototype stimuli (Fig. 2). This is remarkable given that many Japanese adults continue to misperceive and mispronounce the English /r–l/ sounds after months of laboratory training or years of residence in the US (Takagi,

Acknowledgments

Funding was provided by NTT Communication Science Laboratories (Nippon Telegraph and Telephone Corporation), the University of Washington's NSF Science of Learning Center (LIFE), and the National Institute of Health, and the University of Washington's Institute for Learning and Brain Sciences. Manuscript preparation was supported in part by a visiting scholarship from Tokyo Denki University, a University of Minnesota Faculty Summer Research Fellowship, and the Grant-in-Aid of Research, Artistry

References (114)

  • GottfriedT.L.

    Effects of consonant context on the perception of French vowels

    J. Phon.

    (1984)
  • GrummichP. et al.

    Combining fMRI and MEG increases the reliability of presurgical language localization: a clinical study on the difference between and congruence of both modalities

    Neuroimage

    (2006)
  • HariR. et al.

    Timing of human cortical functions during cognition: role of MEG

    Trends Cogn. Sci.

    (2000)
  • HickokG. et al.

    Towards a functional neuroanatomy of speech perception

    Trends Cogn. Sci.

    (2000)
  • IversonP. et al.

    A perceptual interference account of acquisition difficulties for non-native phonemes

    Cognition

    (2003)
  • JensenO. et al.

    A new method to identify multiple sources of oscillatory activity from magnetoencephalographic data

    Neuroimage

    (2002)
  • JohnsonJ. et al.

    Critical period effects in second language learning: the influence of maturation state on the acquisition of English as a second language

    Cogn. Psychol.

    (1989)
  • KassubekJ. et al.

    Changes in cortical activation during mirror reading before and after training: an fMRI study of procedural learning

    Cogn. Brain Res.

    (2001)
  • KomssiS. et al.

    EEG minimum-norm estimation compared with MEG dipole fitting in the localization of somatosensory sources at S1

    Clin. Neurophysiol.

    (2004)
  • LinF.H. et al.

    Assessing and improving the spatial accuracy in MEG source localization by depth-weighted minimum-norm estimates

    Neuroimage

    (2006)
  • MaestuF. et al.

    Spanish language mapping using MEG: a validation study

    Neuroimage

    (2002)
  • MayberryR. et al.

    The long-lasting advantage of learning sign language in childhood: another look at the critical period for language acquisition

    J. Mem. Lang.

    (1991)
  • MayberryR.I. et al.

    Age constraints on first versus second language acquisition: evidence for linguistic plasticity and epigenesis

    Brain Lang.

    (2003)
  • MayeJ. et al.

    Infant sensitivity to distributional information can affect phonetic discrimination

    Cognition

    (2002)
  • MoulinesE. et al.

    Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

    Speech Commun.

    (1990)
  • NäätänenR. et al.

    The mismatch negativity (MMN) in basic research of central auditory processing: a review

    Clin. Neurophysiol.

    (2007)
  • NenonenS. et al.

    Speech–sound duration processing in a second language is specific to phonetic categories

    Brain Lang.

    (2005)
  • PapanicolaouA.C. et al.

    Differential brain activation patterns during perception of voice and tone onset time series: a MEG study

    Neuroimage

    (2003)
  • PoldrackR.A.

    Imaging brain plasticity: conceptual and methodological issues

    Neuroimage

    (2000)
  • PulvermüllerF. et al.

    Spatiotemporal dynamics of neural language processing: an MEG study using minimum-norm current estimates

    Neuroimage

    (2003)
  • Rivera-GaxiolaM. et al.

    Electrophysiological correlates of cross-linguistic speech perception in native English speakers

    Behav. Brain Res.

    (2000)
  • SchiltzC. et al.

    Neuronal mechanisms of perceptual learning: changes in human brain activity with training in orientation discrimination

    Neuroimage

    (1999)
  • Sebastián-GallésN. et al.

    Online processing of native and non-native phonemic contrasts in early bilinguals

    Cognition

    (1999)
  • StenbackaL. et al.

    Comparison of minimum current estimate and dipole modeling in the analysis of simulated activity in the human visual cortices

    Neuroimage

    (2002)
  • TervaniemiM. et al.

    Lateralization of auditory-cortex functions

    Brains Res. Rev.

    (2003)
  • Akahane-YamadaR. et al.

    Effects of audio-visual training on the identification of English /r/ and /l/ by Japanese speakers

    J Acoust. Soc. Am.

    (1997)
  • AkamatsuT.

    Japanese Phonetics: Theory and Practice

    (1997)
  • BestC.T. et al.

    Discrimination of non-native consonant contrasts varying in perceptual assimilation to the listeners' native phonological system

    J. Acoust. Soc. Am.

    (2001)
  • BoschL. et al.

    First and second language vowel perception in early bilinguals

    Eur. J. Cogn. Psychol.

    (2000)
  • BradlowA.R. et al.

    Training Japanese listeners to identify English /r/ and /l/: long-term retention of learning in perception and production

    Percept. Psychophys.

    (1999)
  • BreierJ.I. et al.

    Lateralization of activity associated with language function using magnetoencephalography: a reliability study

    J. Clin. Neurophysiol.

    (2000)
  • BurnhamD. et al.

    What's new pussycat: on talking to animals and babies

    Science

    (2002)
  • CheourM. et al.

    Development of language-specific phoneme representations in the infant brain

    Nat. Neurosci.

    (1998)
  • DavisB.L. et al.

    Phonetic variability in baby talk and development of vowel categories

  • de BoerB. et al.

    Investigating the role of infant-directed speech with a computer model

    Acoust. Res. Lett. Online

    (2003)
  • EnglundK. et al.

    Changes in infant directed speech in the first six months

    Infant Child Dev.

    (2006)
  • EscuderoP. et al.

    Bridging the gap between L2 speech perception research and phonological theory

    Stud. Second Lang. Acquis.

    (2004)
  • FlegeJ.

    Second language speech learning: theory, findings, and problems

  • FrancisA.L. et al.

    Selective attention and the acquisition of new phonetic categories

    J. Exp. Psychol. Hum. Percept. Perform.

    (2002)
  • GaabN. et al.

    Neural correlates of rapid spectrotemporal processing in musicians and nonmusicians

    Ann. N.Y. Acad. Sci.

    (2005)
  • Cited by (102)

    • Is talker variability a critical component of effective phonetic training for nonnative speech?

      2021, Journal of Phonetics
      Citation Excerpt :

      These patterns show compelling evidence that learning-induced changes of attention to acoustic cues to establish robust categories are responsible for the generalization effects (Francis et al., 2000; Holt & Lotto, 2006). Our findings support the predictions from the Attention-To-Dimension (A2D) models in which the process of phonetic learning is regarded via the simultaneous operation of two mechanisms of selective attention: enhancement of attention and inhibition of attention (Francis & Nusbaum, 2002; Goldstone, 1994, 1998; Nosofsky, 1986; Ylinen et al., 2010; Zhang et al., 2009). More specifically, enhancement of attention to a given dimension would “stretch” the perceptual distance between tokens along this dimension, thus increasing differentiability of the tokens.

    View all citing articles on Scopus
    View full text