Neural signatures of phonetic learning in adulthood: A magnetoencephalography study
Introduction
A fundamental question in cognitive neuroscience is the degree of neural plasticity as a function of age and experience. Classic studies and arguments on the putative “critical” or “sensitive” period for language acquisition highlight the superiority of learning a second language prior to puberty, and data support both maturation and experience as mechanistic explanations for the effect (Flege et al., 1999, Hernandez and Li, 2007, Johnson and Newport, 1989, Kuhl et al., 2008, Lenneberg, 1967, Mayberry and Lock, 2003). In the phonetic domain, there is clear evidence that early language learning does not involve a permanent loss of perceptual sensitivity to all the nonnative distinctions (Best et al., 2001, Werker and Tees, 2005). Furthermore, adults' perception of nonnative speech can be improved by using a variety of short-term intensive training methods (Akahane-Yamada et al., 1997, Bradlow et al., 1999, Hazan et al., 2006, Iverson et al., 2005, Jamieson and Morosan, 1986, Logan et al., 1991, McCandliss et al., 2002, Pruitt et al., 2006, Strange and Dittmann, 1984, Tremblay et al., 1997, Wang et al., 2003, Zhang et al., 2000). These training studies, among others, have not only provided important empirical data for reevaluating the “critical period” hypothesis but also revealed key factors that facilitate second language learning independent of age. However, as epitomized by the classic problem of the /r–l/ phonemic contrast for adult Japanese speakers, neither intensive training nor prolonged naturalistic exposure has led to native-like mastery (Callan et al., 2003, Iverson et al., 2005, McCandliss et al., 2002, Takagi, 2002, Takagi and Mann, 1995). The experiential mechanisms that enhance or limit neural plasticity in adulthood are not well understood.
In our language acquisition model, adults' difficulty with nonnative languages stems from an early strong neural commitment to the statistical and spectral patterns in the language input during infancy (Kuhl et al., 2008). The effects of native language neural commitment (NLNC) are self-reinforcing and bidirectional — it enhances the detection of higher-order linguistic patterns, such as words, that utilize learned phonetic patterns, while at the same time hindering the detection of non-conforming patterns contained in foreign languages, as shown behaviorally (Iverson et al., 2003) and neurally (Zhang et al., 2005). We further theorize that second language acquisition in adulthood can be improved by manipulating the language input to incorporate the basic principles underlying infants' acquisition of the sound patterns of their native language (Kuhl et al., 2001, Zhang et al., 2005).
To address the underlying mechanisms of brain plasticity for phonetic learning in adulthood, we designed a training software program in a preliminary single-subject MEG study to test its success (Zhang et al., 2000). The program incorporated features that were motivated by studies of infant-directed speech (IDS) or “motherese” (Burnham et al., 2002, Fernald and Kuhl, 1987, Kuhl et al., 1997, Liu et al., 2003), including adaptive signal enhancement, visible articulation cues, a large stimulus set with high variability, and self-initiated selection. The preliminary results suggested that rapid improvement could be achieved on the difficult nonnative phonemic contrast. Approximately 12 h of training for the Japanese adult subject showed an overall 22% improvement in identification accuracy with remarkable transfer of learning — there was a 27% improvement in recognizing the /r–l/ tokens by untrained voices. The training effect was also shown in enhanced neural sensitivity for the /r–l/ distinction, particularly in the left auditory cortex. Compared with other /r–l/ training studies with equivalent amounts of behavioral improvement, transfer of learning and sustained effect tested six months after training (e.g., Bradlow et al., 1999, Callan et al., 2003), our program reduced the total training hours by over 70%. As the preliminary results were based on a single subject, more subjects needed to be tested in order to evaluate the training methodology and investigate the neural mechanisms that reflect phonetic learning at both the individual and group levels.
There are two main objectives in the present training study: (a) to test the efficacy of our IDS-motivated training program in adults' learning of second language phonetic categories, and (b) to examine two hypothetical neural markers of learning in terms of brain-behavior correlates: neural sensitivity, as measured by the mismatch field response for phonetic discrimination (Näätänen et al., 1997), and neural efficiency, as measured by the focal degree and duration of brain activation during phonetic perception in terms of equivalent current dipole (ECD) clusters (Zhang et al., 2005). Previous neurophysiological studies have shown strong evidence of learning-induced enhancement in neural sensitivity to support phonetic categorization in adults as well as in children (Cheour et al., 1998, Imaizumi et al., 1999, Kraus et al., 1995, Menning et al., 2002, Näätänen et al., 1997, Nenonen et al., 2005, Rivera-Gaxiola et al., 2000, Tremblay et al., 1997, Winkler, 1999, Zhang et al., 2000). There is also evidence for learning-induced shift toward left hemisphere dominance in terms of enhanced neural sensitivity for linguistic processing (see Näätänen et al., 2007 for a review.). In line with the neural efficiency idea, fMRI studies have reported more focal activation for learned auditory stimuli particularly in native speakers or more advanced learners (Callan et al., 2004, Guenther et al., 2004, Wang et al., 2003). Cross-language MEG data have additionally indicated a shorter duration of bilateral activation for native speech processing in specific brain regions — the superior temporal and inferior parietal cortices (Zhang et al., 2005).
The central question of our study is whether substantial behavioral improvement in second language phonetic learning can be achieved in adulthood and simultaneously reflected by the spatiotemporal markers of neural sensitivity and neural efficiency, resulting in native-like perception and native-like brain activation patterns for learning the difficult speech contrasts in a second language. To cross-validate the brain activation patterns shown by the ECD cluster analysis approach and investigate the relationship between neural sensitivity and efficiency, we also employ distributed source analysis using minimum current estimates with fundamentally different assumptions about the source activity (Uutela et al., 1999, Zhang et al., 2005). We predict that our IDS-motivated training program would help circumvent interference from neural networks that have been shaped by native language experience, yielding significant brain-behavior correlations in both domains of sensitivity and efficiency.
Section snippets
Subjects
A pretest–intervention–posttest design was implemented to assess initial capability and the training effects. Nine right-handed Japanese college students (6 males and 3 females) participated in the study (21–23 in age). Subjects were volunteers under informed consent. They were recruited after screening for hearing, handedness, and language background. The subjects had no history of speech/hearing disorders, and all showed clear N1m responses to a 1000 Hz tone. All had received nine years of
Behavioral effects on trained and untrained natural speech stimuli
The 12 training sessions produced a rapid and highly significant improvement of 21.6% in the trainees, an increase from 60.1% to 81.7% [two-tailed t-test, p < 0.0001] (Fig. 2a). Neither of the two subjects who did not receive training showed comparable changes (− 1.3% for KO and + 2.8% for KI) (Fig. 2b), and they were both statistical outliers against the trainee group [one-tailed Z-test, p < 0.00001]. The average improvements were 19.5% for the CV tokens and 23.7% for the VCV tokens (Fig. 2d),
Successes and limitations of the IDS-motivated training program
The behavioral data provided strong evidence of success for our training program. Japanese listeners achieved a 21.6% improvement in identifying naturally spoken /r–l/ syllables in 12 training sessions, which generalized to the untrained voices, vowel contexts and synthetic prototype stimuli (Fig. 2). This is remarkable given that many Japanese adults continue to misperceive and mispronounce the English /r–l/ sounds after months of laboratory training or years of residence in the US (Takagi,
Acknowledgments
Funding was provided by NTT Communication Science Laboratories (Nippon Telegraph and Telephone Corporation), the University of Washington's NSF Science of Learning Center (LIFE), and the National Institute of Health, and the University of Washington's Institute for Learning and Brain Sciences. Manuscript preparation was supported in part by a visiting scholarship from Tokyo Denki University, a University of Minnesota Faculty Summer Research Fellowship, and the Grant-in-Aid of Research, Artistry
References (114)
- et al.
Effects of language-specific phonological and phonetic factors on cross-language perception of approximants
J. Phon.
(1992) - et al.
A comparison of functional MRI and magnetoencephalography for receptive language mapping
J. Neurosci. Methods
(2007) - et al.
Phonetic perceptual identification by native- and second-language speakers differentially activates brain regions involved with acoustic phonetic processing and those involved with articulatory–auditory/orosensory internal models
Neuroimage
(2004) - et al.
Learning-induced neural plasticity associated with improved identification performance after training of a difficult second-language phonetic contrast
Neuroimage
(2003) - et al.
Acoustic determinants of infant preference for motherese speech
Infant Behav. Dev.
(1987) - et al.
Age constraints on second-language acquisition
J. Mem. Lang.
(1999) - et al.
An fMRI-constrained MEG source analysis with procedures for dividing and grouping activation
Neuroimage
(2002) - et al.
Anatomical correlates of learning novel speech sounds
Neuron
(2002) - et al.
Learning new sounds of speech: reallocation of neural substrates
Neuroimage
(2004) - et al.
Attention modulates hemispheric differences in functional connectivity: evidence from MEG recordings
Neuroimage
(2006)
Effects of consonant context on the perception of French vowels
J. Phon.
Combining fMRI and MEG increases the reliability of presurgical language localization: a clinical study on the difference between and congruence of both modalities
Neuroimage
Timing of human cortical functions during cognition: role of MEG
Trends Cogn. Sci.
Towards a functional neuroanatomy of speech perception
Trends Cogn. Sci.
A perceptual interference account of acquisition difficulties for non-native phonemes
Cognition
A new method to identify multiple sources of oscillatory activity from magnetoencephalographic data
Neuroimage
Critical period effects in second language learning: the influence of maturation state on the acquisition of English as a second language
Cogn. Psychol.
Changes in cortical activation during mirror reading before and after training: an fMRI study of procedural learning
Cogn. Brain Res.
EEG minimum-norm estimation compared with MEG dipole fitting in the localization of somatosensory sources at S1
Clin. Neurophysiol.
Assessing and improving the spatial accuracy in MEG source localization by depth-weighted minimum-norm estimates
Neuroimage
Spanish language mapping using MEG: a validation study
Neuroimage
The long-lasting advantage of learning sign language in childhood: another look at the critical period for language acquisition
J. Mem. Lang.
Age constraints on first versus second language acquisition: evidence for linguistic plasticity and epigenesis
Brain Lang.
Infant sensitivity to distributional information can affect phonetic discrimination
Cognition
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
Speech Commun.
The mismatch negativity (MMN) in basic research of central auditory processing: a review
Clin. Neurophysiol.
Speech–sound duration processing in a second language is specific to phonetic categories
Brain Lang.
Differential brain activation patterns during perception of voice and tone onset time series: a MEG study
Neuroimage
Imaging brain plasticity: conceptual and methodological issues
Neuroimage
Spatiotemporal dynamics of neural language processing: an MEG study using minimum-norm current estimates
Neuroimage
Electrophysiological correlates of cross-linguistic speech perception in native English speakers
Behav. Brain Res.
Neuronal mechanisms of perceptual learning: changes in human brain activity with training in orientation discrimination
Neuroimage
Online processing of native and non-native phonemic contrasts in early bilinguals
Cognition
Comparison of minimum current estimate and dipole modeling in the analysis of simulated activity in the human visual cortices
Neuroimage
Lateralization of auditory-cortex functions
Brains Res. Rev.
Effects of audio-visual training on the identification of English /r/ and /l/ by Japanese speakers
J Acoust. Soc. Am.
Japanese Phonetics: Theory and Practice
Discrimination of non-native consonant contrasts varying in perceptual assimilation to the listeners' native phonological system
J. Acoust. Soc. Am.
First and second language vowel perception in early bilinguals
Eur. J. Cogn. Psychol.
Training Japanese listeners to identify English /r/ and /l/: long-term retention of learning in perception and production
Percept. Psychophys.
Lateralization of activity associated with language function using magnetoencephalography: a reliability study
J. Clin. Neurophysiol.
What's new pussycat: on talking to animals and babies
Science
Development of language-specific phoneme representations in the infant brain
Nat. Neurosci.
Phonetic variability in baby talk and development of vowel categories
Investigating the role of infant-directed speech with a computer model
Acoust. Res. Lett. Online
Changes in infant directed speech in the first six months
Infant Child Dev.
Bridging the gap between L2 speech perception research and phonological theory
Stud. Second Lang. Acquis.
Second language speech learning: theory, findings, and problems
Selective attention and the acquisition of new phonetic categories
J. Exp. Psychol. Hum. Percept. Perform.
Neural correlates of rapid spectrotemporal processing in musicians and nonmusicians
Ann. N.Y. Acad. Sci.
Cited by (102)
Foreign language learning and the mismatch negativity (MMN): A longitudinal ERP study
2022, Neuroimage: ReportsIs talker variability a critical component of effective phonetic training for nonnative speech?
2021, Journal of PhoneticsCitation Excerpt :These patterns show compelling evidence that learning-induced changes of attention to acoustic cues to establish robust categories are responsible for the generalization effects (Francis et al., 2000; Holt & Lotto, 2006). Our findings support the predictions from the Attention-To-Dimension (A2D) models in which the process of phonetic learning is regarded via the simultaneous operation of two mechanisms of selective attention: enhancement of attention and inhibition of attention (Francis & Nusbaum, 2002; Goldstone, 1994, 1998; Nosofsky, 1986; Ylinen et al., 2010; Zhang et al., 2009). More specifically, enhancement of attention to a given dimension would “stretch” the perceptual distance between tokens along this dimension, thus increasing differentiability of the tokens.