Main

We scanned 14 right-handed normal subjects using event-related fMRI while they listened to jokes. Thirty semantic jokes of the form, “What do engineers use for birth control?. . . Their personalities,” and thirty phonological jokes or puns of the form, “Why did the golfer wear two sets of pants? . . . He got a hole in one,” were presented to subjects in a random order. In corresponding baseline conditions, identical setup lines were used, but the punch lines were modified to be merely descriptive. (For example, “What do engineers use for birth control?. . . The pill.” “Why did the golfer wear two sets of pants? . . . It was a very cold day.”) Items were pre-recorded, balanced across conditions for the number of phonemes and temporal duration, and auditorily presented to subjects in random order (Fig. 1). While in the scanner, subjects were requested to make judgments (recorded by a key press) as to whether or not they found each item amusing. After the scan, subjects reviewed the jokes and rated them for `funniness' on a scale of 1 to 5. The study was approved by the Joint National Hospital for Neurology and Neurosurgery/Institute of Neurology Ethics Committee. Data were analyzed using statistical parametric mapping (SPM 99)1 using a random effects model.

Figure 1: Subjects were presented with semantic and phonological jokes while undergoing fMRI scans.
figure 1

The stimuli were prerecorded as voice files, edited to precise temporal lengths and presented in a random order, in a time-locked single-event design. The setup line began at time 0 and had a duration ranging from 1340 to 3230 ms (mean duration, 2420 ms). A mean pause of 1580 ms gave subjects time to consider the setup line. At 4000 ms, the punch line (or descriptive answer) was delivered. It lasted 480 to 3250 ms (mean, 1540 ms). Thereafter, subjects had a variable amount of time to respond. In all trials, subjects were asked to provide a subjective judgment (by pressing a button on a keypad) of whether they found the item amusing or not. Subjects were instructed to refrain from laughter to minimize head movement. A 2T Siemens VISION system (Siemens, Erlangen, Germany) was used to acquire 48, 1.8-mm thick (with 1.2 mm gap), T2*-weighted echoplanar images (64 × 64, 3 × 3 mm pixels; TE, 40 ms) of the whole brain sensitive to blood oxygenation level dependent (BOLD) contrast. Data was recorded in one 18.1 min session of 120 events using a TR of 4.1 s and random jitter of 1 TR. The BOLD signal was sampled as a hrf at the midpoint between the presentation of the punch line and the subjects' motor response.

Subjects took longer (t29 = 7.18, p < 0.0001) to respond to jokes (mean ± s.d., 2824 ± 259 ms) after the presentation of the punch line than to non-jokes (2550 ± 220 ms). There was no difference in response times to jokes as a function of funniness ratings. While being scanned, subjects rated 62 ± 16% of jokes as `amusing,' with no significant difference between the ratings of semantic and phonological jokes.

To isolate mechanisms underlying juxtaposition of mental sets, we examined the main effect of jokes ((semantic jokes + phonological jokes) – (semantic baseline + phonological baseline)). This comparison revealed activation in left posterior middle temporal gyrus (BA 21/37), left posterior inferior temporal gyrus (BA 37), right posterior middle temporal gyrus (BA 21) and left inferior frontal gyrus (BA 44, 45). However, an analysis of simple main effects and direct comparisons of semantic and phonological joke conditions revealed that juxtaposition of semantic and phonological mental sets require different networks. The comparison of semantic jokes with the corresponding non-joke baseline revealed activation in left posterior middle temporal gyrus (BA 21/37), left posterior inferior temporal gyrus (BA 37) and right posterior middle temporal gyrus (BA 21; Fig. 2a). The comparison of puns with their corresponding non-joke baselines revealed activation in left posterior inferior temporal gyrus (BA 37) and left inferior frontal gyrus (BA 44, 45; Fig. 2b). The direct comparison of semantic and phonological jokes (masked by the main effect of jokes; (semantic + phonological jokes) – (baselines)) revealed an anatomical dissociation. Semantic jokes compared to phonological jokes (semantic jokes – phonological jokes) recruited the right posterior middle temporal gyrus (BA 21/37) and left posterior inferior temporal gyrus (BA 20/37) (Fig. 2c). The reverse comparison (phonological jokes – semantic jokes) revealed activation of left insula/precentral gyrus (Fig. 2d).

Figure 2: An anatomical dissociation occurred in the cognitive processing of semantic and phonological jokes, whereas a common region underlay affective processing.
figure 2

(a) Processing of semantic jokes activated left posterior middle temporal gyrus (BA 21/37; −60, −54, 0; Z = 4.65), left posterior inferior temporal gyrus (–63, −51, −9; Z = 4.81), right posterior middle temporal gyrus (BA 21; 63, −48, 0; Z = 3.21) and cerebellum (36, −63, −30; Z = 4.81 and −36, −54, −33; Z = 4.68). (b) Processing of phonological jokes activated left posterior inferior temporal gyrus (BA 37; −60, −54, −12; Z = 5.34 and −48, −57, −21; Z = 5.08), and left inferior frontal gyrus (BA 44, 45; −51, 15, 15; Z = 5.32 and −54, 36, 6; Z = 4.93). (c) A direct comparison of semantic jokes with phonological jokes revealed activation in right posterior middle temporal gyrus (BA 21/37; 66, −45, 0; Z = 5.01 and 60, −48, 6; Z = 4.84) and left posterior inferior temporal gyrus (BA 20/37; −63, −39, −15; Z = 5.19). (d) The reverse comparison revealed activation of left inferior precentral gyrus/insula (BA 44/4; −48, 0, 6; Z = 4.71). (e) Activation in MVPFC (0, 45, −12; Z = 4.67) and bilateral cerebellum (27, −72, −27; 15, −93, −36; −48, −72, −33; data not shown) was common to both types of humor. Only activity in the MVPFC (3, 48, −12; Z = 4.73) correlated with subjects' post-scan funniness ratings of jokes. (f) Condition-specific parameter estimates show that the MVPFC responded very similarly to both types of funny and not funny jokes. A, semantic funny; B, semantic not funny; C, pun funny; D, pun not funny.

The activation of posterior temporal lobe in semantic jokes—a region involved in semantic processing of language—highlights the importance of differential semantic processing in the joke punch line over and above the non-joke item. The involvement of the right temporal lobe is consistent with several studies that have implicated it in processing and maintaining alternative, less probable word meanings, and in integrative processes requiring global coherence2,3,4,5. These processes are necessary for semantic juxtapositions of meaning required to `get' the punch line. Similarly, activation of left inferior prefrontal cortex and insula—regions involved in processing of speech sounds6,7—highlights the importance of differential phonological processing in the pun punch line over and above the non-pun descriptive line.

To isolate affective components of humor, independent of cognitive juxtaposition of semantic and phonological mental sets, we compared activations associated with jokes subjects reported as funny with activations associated with jokes reported as not funny (on a subject by subject, joke by joke basis). This comparison controlled for cognitive processing associated with juxtaposition of the punch line—because this needed to occur in both cases—and highlighted differences due to the pleasurable affect that was associated with funny jokes, and absent in non-funny jokes. Subsequently, we undertook a conjunction analysis, masked by the main effect of humor (at p = 0.05), of activations associated with semantic and phonological funny jokes (conjunction (subjectively funny semantic jokes – subjectively non-funny semantic jokes), (subjectively funny phonological jokes – subjectively non-funny phonological jokes)) to highlight regions activated in common across these two categories. This analysis revealed significant activation in medial ventral prefrontal cortex (MVPFC; BA 10/11) alone and bilateral cerebellum (Fig. 2e). Furthermore, the activation in MVPFC covaried with the subjects' post-scan ratings of joke funniness (p < 0.05, corrected). The activation in MVPFC during joke appreciation, and its covariation with subjective funniness ratings, was striking in that it involved a region that represents and controls reward-related behaviors8,9,10.

Few lesion studies of humor appreciation exist11,12,13. A study involving frontal lobe patients implicated the right dorsolateral prefrontal cortex in humor appreciation11. However, the neuroanatomical evidence indicated that most impaired patients also had damage to medial prefrontal cortex, a finding consistent with our results. Other patient studies have implicated the right hemisphere more generally, and have used semantic jokes12. Our results suggest that what these studies may be capturing is the right temporal lobe's involvement in the integrative semantic processes requiring global coherence, necessary for juxtaposition of mental sets, rather than the affective component of amusement5,14,15.

In summary, we present evidence for differential systems underlying the cognitive and affective processing of humor. Furthermore, the cognitive processing associated with the juxtaposition of phonological and semantic mental sets is type specific, whereas a common, modality-independent network underwrites the affective component of humor. Semantic juxtaposition uses a bilateral temporal lobe network, whereas phonological juxtaposition uses a left hemisphere network centered around speech production regions. The affective appreciation of humor involves access to a central reward system in MVPFC.