Musical dissonance confirms subcortical pitch extraction

Consonance and dissonance of musical chords: neural correlates in auditory cortex of monkeys and humans

Fishman YI, Volkov IO, Noh MD, Garell PC, Bakken H, Arezzo JC, Howard MA, Steinschneider M

J Neurophysiol 2001 Dec 86:2761-88

Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York 10461, USA.

Abstract:

Some musical chords sound pleasant, or consonant, while others sound unpleasant, or dissonant. Helmholtz's psychoacoustic theory of consonance and dissonance attributes the perception of dissonance to the sensation of "beats" and "roughness" caused by interactions in the auditory periphery between adjacent partials of complex tones comprising a musical chord. Conversely, consonance is characterized by the relative absence of beats and roughness. Physiological studies in monkeys suggest that roughness may be represented in primary auditory cortex (A1) by oscillatory neuronal ensemble responses phase-locked to the amplitude-modulated temporal envelope of complex sounds. However, it remains unknown whether phase-locked responses also underlie the representation of dissonance in auditory cortex. In the present study, responses evoked by musical chords with varying degrees of consonance and dissonance were recorded in A1 of awake macaques and evaluated using auditory-evoked potential (AEP), multiunit activity (MUA), and current-source density (CSD) techniques. In parallel studies, intracranial AEPs evoked by the same musical chords were recorded directly from the auditory cortex of two human subjects undergoing surgical evaluation for medically intractable epilepsy. Chords were composed of two simultaneous harmonic complex tones. The magnitude of oscillatory phase-locked activity in A1 of the monkey correlates with the perceived dissonance of the musical chords. Responses evoked by dissonant chords, such as minor and major seconds, display oscillations phase-locked to the predicted difference frequencies, whereas responses evoked by consonant chords, such as octaves and perfect fifths, display little or no phase-locked activity. AEPs recorded in Heschl's gyrus display strikingly similar oscillatory patterns to those observed in monkey A1, with dissonant chords eliciting greater phase-locked activity than consonant chords. In contrast to recordings in Heschl's gyrus, AEPs recorded in the planum temporale do not display significant phase-locked activity, suggesting functional differentiation of auditory cortical regions in humans. These findings support the relevance of synchronous phase-locked neural ensemble activity in A1 for the physiological representation of sensory dissonance in humans and highlight the merits of complementary monkey/human studies in the investigation of neural substrates underlying auditory perception.
(Bold text emphasis by Martin Braun)

Comment:

Phase-locking to the beat frequencies of dissonant musical chords, e.g., C4 = 256 Hz plus C#4 = 269.7 Hz resulting in beat of 13.7 Hz, in the compound activity of large neuron populations of the primary auditory cortex is an interesting new finding. It remains to be seen if this effect has a causal relationship to the perception of roughness or unpleasantness when hearing dissonant chords. A more plausible location for the neuronal origin of dissonance perception may be the auditory midbrain, where the frequencies of dissonance beats are represented in a much more complete way and, more importantly, even in the firing activity of single neurons (Sinex et al., 2002). Further, the auditory midbrain is situated very close to major reward-related centers in the upper brainstem, which may respond specifically to temporal coherence in consonant chords and to "roughness" beats in dissonant chords.

The most interesting detail in Fishman's study may be that neuronal beats related to the pitch frequencies (f0) of two tones are in most cases much stronger than the neuronal beats related to the frequencies of single harmonics of the tones, despite identical sound levels for fundamentals and single harmonics of the tones (Figs. 3-7, 10, and 11). Most compelling are the results shown in Figs. 3 and 4, where the chords of "minor second" and "major second" produce by far the strongest phase-locking to the beat between the pitch frequencies, even though these frequencies are clearly below the frequency response range of the recording site and two resp. three octaves below the best-frequency of the recording site. Because the input to the recording sites came from subcortical auditory centers, we have to conclude that the pitch frequencies had already been extracted and amplified below the auditory cortex. The present findings therefore elegantly support earlier evidence that the pitch of speech vowels and musical tones is extracted in the auditory midbrain. The new results actually exclude the alternative assumption that pitch may first be extracted in the auditory cortex. (Comment Martin Braun)

NOM Home