Fundamentals of Sound |
|
Perceptual attributes of
acoustic waves: Pitch
Definition - Ranges - JND
Pitch:
Sonic (i.e. perceptual) attribute of sound waves, related mainly
to frequency.
Discussions on pitch usually revolve around music. However, pitch and pitch contours are quite significant in speech and, in some languages, pitch inflections carry specific semantic meaning. The frequency range of hearing extends from ~20Hz to ~20,000Hz (or 20kHz). These values constitute the low and high absolute thresholds of frequency perception, respectively. Listen to a sine signal sweeping through this range. [This will test your listening equipment more than your hearing].
| |
JND for Pitch: ~0.3-1% of frequency, depending on
register (i.e. on frequency region - see the figure, below).
|
|
|
Listen to pairs of successive tones ranging from 440-441Hz to
440-448Hz. |
|
|
|
|
Pitch of Pure Tones
Music Theory Primer - Tuning Perception
Pitch & Frequency / Intensity / Duration
|
Music Theory Primer - Tuning Perception (Use this virtual piano) |
|
Interval: perceived pitch
distance between two tones (whether pure/simple or complex).
C4-C4 C4-C#4 C4-D4 C4-D#4 C4-E4 C4-F4 C4-F#4 C4-G4 C4-G#4 C4-A4 C4-A#4 C4-B4 C4-C5
|
|
Pitch & Frequency / Intensity / Duration |
|
Pitch & Frequency |
|
For simple/pure tones, pitch closely relates to frequency. Similarly to SIL [Sound Intensity Level] and loudness, frequency and pitch relate logarithmically: addition in perception (pitch) corresponds to multiplication in the physical variable (frequency).
For a detailed overview of the frequency / pitch / music notation relationship see Chapter 2 in Newman, 2023 (source). |
|
|
|
Pitch & Intensity |
|
|
|
The pitch of pure tones also depends on
intensity (see the figures to the left). In
general, increasing the intensity of pure tones:
In addition, the introduction of a high-intensity "interference" tone will change the perceived pitch of existing low-intensity tones, even if the frequency of the low-intensity tones remains unchanged.
In other words, the high
intensity tone pushes the pitch of low intensity tones
away from it, assuming frequency separations beyond one
critical bandwidth.
The figure to the left
offers another, simplified illustration of the average
dependence of the pitch of pure tones on intensity. |
|
Pitch & Duration |
|
|
Pitch also depends on
duration. A tone must last more than a minimum
amount of time (~10-60ms/3-100cycles, depending on
frequency and intensity) in order to sound more than a
'click' and convey a clear sense of pitch (see below).
|
|
Pitch of Complex Tones
Pitch & Spectrum - Analytic &
Synthetic Listening
Pitch Theories
The dependence of the pitch of periodic complex tones (i.e. of complex tones with harmonic spectra) on frequency, intensity, duration, and the introduction of intense 'interference' tones is qualitatively similar to that of pure tones but more complex, due to the variety of frequencies involved. |
|
Pitch & Spectrum |
|
Pitch of Complex Tones with Harmonic Spectra Reminder: Periodic/harmonic complex tones are usually perceived as a single unit, with their multiple spectral components merging into a single tone perception rather than being perceived as a set of individual pure tones. |
|
Pitch of Complex Tones with Inharmonic Spectra
|
|
Analytic & Synthetic Listening
|
|
PITCH
THEORIES |
||
|
Place (Tonotopic) Theory
|
||
|
Temporal (Periodicity/Frequency) Theory
|
||
|
Hybrid Theories
|
.
The Octave - Multidimensionality of Pitch: Pitch Height & Pitch Chroma
|
The Octave (use this virtual piano to help you with the concepts in this section) |
|
|
|
|
|
Multidimensionality of Pitch (pitch spiral) Pitch is multidimensional with at least three
dimensions involving
The Western chromatic scale breaks the octave down
into 12 different pitch chromas. The pitch chroma
dimensions represent a circularity in pitch perception.
The perceptual circularity of pitch is explored in Shepard-tone scales/slides (after 20th century American cognitive scientist and author, Roger Shepard) that present the paradox of a continuously ascending (or descending) pitch. Listen to two pitch spiral examples (Houtsma et al., 1987). Shepard scales/slides are the auditory analog of the continuously ascending/descending staircases, explored conceptually by English mathematecian, Sir Roger Penrose and artistically by 20th century Dutch graphic artist, M.C. Escher (see the images, below). |
|
![]() Relativity (Escher, 1953) Ascending-Decending (Escher, 1960) | |
| Short video on pitch and a/v related audio illusions | |
|
[OPTIONAL SECTIONS] |
|
| Place Theory of Pitch - Drawbacks | |
|
The Pitch-Shift Effects The first
pitch shift effect: The second
pitch shift effect: Vassilakis (1998) showed that these two effects are not distinct but alternative manifestations of a single phenomenon. In a follow-up work, he argued that the pitch shift effect reflects our perceptual system's handling of the interaction between the phase and group velocities of the inharmonic tone complexes used in pitch shift experiments (Vassilakis, 1998b).
|
|
|
|
Unresolved Upper Harmonics
The place (or tonotopic) theory of pitch
requires energy in the lowest 5-8 components
(depending on fundamental frequency) in order to
produce a clear pitch sensation. The problem is that clear virtual pitch sensations persist even when the remaining components in a spectrum are not resolvable (see the 'unresolved upper harmonics' in the image to the left). |
|
The above observations led to the first attempt to a temporal theory of pitch, called the "Residue" theory of pitch, developed first by Dutch physicist, J.F. Schouten (1938). It states that pitch is determined by the temporal interaction, at a neural level, of the unresolved, 'residual' upper harmonics in a spectrum. This theory is challenged by the experimentally determined frequency region most salient to pitch (~400-1500Hz).
|
|
|
Temporal Coding: Phase Locking and Rectification The first systematic temporal
(periodicity) theory of pitch was proposed by
German scientist A. Seebeck (1843), |
|
|
|
Neural response (neural firing) follows (or appears to be locked to) the positive peaks in the stimulus, firing only when the stereocilia are sheared in one direction.
This results in the neural signals of sinusoidal inputs The process of phase locking is closely related to hearing's "temporal coding theory" of encoding frequency information:
Since neurons are not fast enough to encode high frequencies, more than one neuron must be involved in the process. Each neuron fires at some of the peak portions of an incoming signal and, after adding the outputs of all neurons, the signal is represented to the brain in a manner similar to that shown at the bottom graph (left). |
|
Perception of pitch relations - Unit of pitch Mel: Pitch-height unit and scale devised by American psychologist S.S. Stevens (1937, 1940). It is based on 'twice as high' perceptual judgments. 'Twice as high' amounts to a larger musical interval at high registers than at low registers. Reference: 1000 mels = pitch of 1000Hz presented 40dB above threshold. The Mel unit of pitch height is analogous to the Sone unit of loudness. The derivation of the Mel scale has been criticized for flawed methodology and is not in use. |
![]() |
|
Further Resources _ Concise and systematic
historical review of pitch theories
(Alain de Cheveigné, IRCAM, Paris, France -
source). _
Pitch Perception
presentation (A.J. Oxenham, Harvard - MIT -
source)
|
|
Key to the 3-tone
listening example:
DD (the tones have frequencies 12025Hz, 12000Hz, and
11975Hz, in this order)
Loyola Marymount University - School of Film & Television