Fundamentals of Sound |
THE EAR AS A TRANSDUCER - Introduction
/ Outer & Middle Ear
(source) |
Transducers: Devices that convert
|
The ear is generally thought of as a sensor (e.g. a microphone) but it actually is
a bidirectional
transducer
|
|
TRANSDUCTION PROCESS IN THE OUTER AND MIDDLE EAR |
|
OUTER EAR Overall Functions
Main Parts & Functions
|
|
Simplified graph of the ear: in anatomical context (left) and magnified & sectioned (right) |
|
Schematic:
Longitudinal waves reaching the eardrum
(enlarge - animation) |
|
Overall Functions
Main Parts & Functions
The air cavity of the middle ear can connect to the outside via the Eustachian tube, a passage that, when open, functions as an air pressure equalizer between the outer and middle ears (i.e. equalizes the pressure at the two sides of the eardrum). If the air pressure is not equalized, the eardrum's sensitivity drops and its response becomes non-flat (i.e. different sensitivity at different frequencies). |
|
(a) (b) (c) (d) (a)
Eardrum motion
& (b)
Ossicle motion animations
|
TRANSDUCTION PROCESS IN THE INNER EAR
Overall Functions
Main (hearing-related) Parts & Functions The hearing portion of the inner ear consists of the Cochlea, a small, snail-like structure (~9mm in diameter and ~5mm in height) that is split into three liquid-filled compartments (see the pictures, below):
|
|
|
←---------------|
Top & Middle: two simplified
schematics of a
stretched-out cochlea; |
Above Left: Microphotographs of two intact cochleae
and of a dissected one. |
|
Schematic cross-sections of the cochleae cochlea Illustrations of the three key compartments or scalas (scala: stairs) and the main transduction element: the Organ of Corti. |
Schematic close-up of the Organ of Corti
The Organ of Corti (OofC) is bounded |
Transduction Process in the Organ of Corti - Summary The BM (green line on the OofC cross-section animation, below) performs a spectral analysis by resonating at different places for different frequencies: high frequencies near the base - low frequencies near the apex.
As the BM moves, it pushes cells that lay above it (red pillars in the
animation, called hair cells) up
against the TM (gray structure). As already noted, excess fluid vibrations, not absorbed by the BM, reach the scala tympani portion of the cochlea via the helicotrema (passage from scala vestibuli into scala tympani at the apex (far end) of the cochlea), and exit the cochlea through the the round window. |
|
Inner-Ear Innervation | |
Healthy (top) & damaged (below) hair cells Exposure to high intensity sounds can result in
We will return to Hearing Loss and Conservation during the "Loudness" module. |
• ~30,000 auditory nerve fibers
(neurons) are linked to auditory hair cells
• ~ 5-10% of the nerve fibers are linked to the outer hair cells, whose main function is to compress the level of the incoming signals by increasing response to low level signals and reducing response to high level signals.
• Nerve Fiber's Spontaneous Activity:
Firing activity of a single nerve fiber(neuron) in the absence
of a stimulus.
|
Videos outlining the auditory transduction process |
|||
Additional Videos @ https://www.interactive-biology.com/physiologyvideos/ (videos 036-040) |
|||
|
THE BASILAR MEMBRANE
Auditory Interference & Masking
Other
Nonlinearities
CRITICAL BAND vs CRITICAL BANDWIDTH Critical band: Term introduced by Fletcher in the 1940s to refer to the frequency bandwidth of the, then loosely defined, auditory filter. Since von Békésy’s studies (1930s-1960s), the term refers literally to the specific portion of the basilar membrane that goes into vibration in resonance with an incoming sine wave. Its length is determined by the elastic properties of the membrane and, at middle frequencies, has an average value of ~1mm.
Critical bandwidth: the frequency
difference (~ 1/3 of an octave) corresponding to the
physical length of the critical band .
|
||
If the frequency difference between two simultaneous sine waves with comparable levels is within the critical bandwidth, the ear will not be able to resolve the two frequencies and the waves will interact in specific and musically important ways:
|
||
As already mentioned, critical bandwidth may therefore be defined as the minimum frequency separation in Hz. between two simultaneous sine waves necessary for beats/roughness to disappear and for the resulting tones to sound clearly apart. Both, the beating and roughness sensations are perceptual attributes of amplitude fluctuation resulting from sound wave interference (discussed previously). Psycho-physiologically, the beating and roughness
sensations are linked to |
As the interval between two tones
of comparable levels decreases, their respective
disturbances on the basilar membrane (critical bands) increasingly
overlap, resulting in the sensations of roughness and beating |
The first 12 components of C3, shown as black circles
on a stretched music-notation 'stave'. |
Beating/Roughness Resources to review: 1) Read this description of the relationship among beating, roughness, spectral distribution, and critical bands. 2) Listen to a comparison between the roughness and beating sensations. As the lower-pitched tone in the interval (i.e. note-pair) rises and the frequency difference between the tones gradually narrows from D5-Eb5 to Eb5-Eb5, the roughness sensation gradually gives way to the beating sensation. 3)
Watch this clip,
presenting two simultaneous sine tones of equal amplitudes. One is fixed at
1000Hz; the other can sweep between 600Hz and 1300Hz. When the sweep tone is
within ~1-20Hz from 1000Hz, we have beating. For larger frequency
differences between the sweep and the fixed tones we get various degrees of
roughness. |
Because the critical bandwidth
(i.e. the frequency response range on the BM for a single frequency input) is
rather broad (~1/3 octave at middle frequencies), tones separated in frequency by less than the critical bandwidth
|
|
Simultaneous Masking Term describing the ability of one tone or band of noise (masker) to cover or raise the audibility threshold of a second tone (signal). When two tones (or a band of noise and a tone) close in frequency are
presented simultaneously, one (masker) may mask or “cover” the other
(signal) depending on their level difference: the more intense tone may
mask the less intense tone. As the response of the BM around the characteristic (resonant) frequency is
asymmetrical (larger above than below characteristic frequency), the
resulting masking curves are also asymmetrical, with the lower frequency
tone (or noise band) in a pair being more likely to mask (i.e. being a
more efficient masker than) the higher frequency tone. Simultaneous masking may be due to a) “Swamping” at the BM and/or nerve fibers b) “Suppression” |
As the level of a signal increases, the BM response
increases in magnitude and width (inverted v-shaped lines,
above). This means that stronger signals are able to cover
increasingly stronger simultaneous signals and increasingly
further removed in frequency. |
Forward and Backward Masking (Temporal Masking)
During Forward Masking, a signal masks a tone that comes 0-200ms after it. Forward masking effects are slight and do not produce the broad masking effects of simultaneous masking.
Forward masking increases with masker duration (from 0 to 50ms) and with
masker level. It occurs most effectively for
masker-signal delays ~20-30ms and does not occur for delays >200ms.
During Backward Masking, a signal masks a tone that came
0-50ms before it. Backwards masking depends on the signals used and may be due to higher level cognitive processing. |
|
Review the Masking Slides presented in class. Listen to two audio examples of noise bursts masking gaps in a
steady
or
frequency modulated tone . |
|
Distortion
Suppression |
|
Otoacoustic emissions The ear acts not only as a microphone, receiving sound, but also as a speaker, emitting a series of tones referred to as otoacoustic emissions (OAEs). OAEs are classified depending on the context
of emission (i.e. spontaneous vs. evoked and, if evoked, by what). Watch this video describing an application of OAEs to hearing assessment and headset tuning. Here is another relevant product. |
What Types of Hair Cells and Nerve Fibers Populate the Inner Ear? |
|
Motion of the Organ of Corti and inner hair-cell action (Wada Laboratory, Japan - site temporarily down) | |
| |
(a) (b) (c) (d) (a)
Basilar membrane vibration animations and (b)
in-vivo (alive) vs. in-vitro (dead) vibration
measurements
|
|
(a) (b) (c)
Outer hair-cell
(a)
motility,
(b)
explanation, and
(c)
amplification effect (Wada Laboratory,
Japan - site temporarily down) | |
BASILAR MEMBRANE ENCODING OF FREQUENCY & AMPLITUDE |
|
Temporal Coding: Neural response (neural firing) follows (or appears to be locked to) the positive peaks in the stimulus, firing only when the stereocilia are sheared in one direction. This results in the neural signals of sinusoidal inputs
The process of phase locking is closely related to hearing's "temporal coding theory" of encoding frequency. The inner hair cells release neurotransmitters only when the basilar membrane moves in one direction, towards the scala media. They therefore only respond to the positive portions of incoming signals, with their stereocilia opening their ion channels when bending against the TM, mostly at a signal’s positive peak. Temporal coding assumes that, thanks to phase locking, the auditory system encodes
periodic information through the firing rate of neurons.
This rate corresponds to the incoming signal’s period
(or some multiple of it) and, therefore frequency. |
|
Place Coding | |
| |
The above figure illustrates an alternative to the "temporal coding theory" of encoding frequency, referred to as "place coding theory."
As we've discussed, the basilar membrane (BM) responds
at different places for different frequencies, due to
its mass-stiffness gradient. In reality, the basilar membrane is a continuous array corresponding to much more numerous filters than those shown in the graph, above (24 in the Bark scale). High frequencies cause a response closer to the BM's base, while low frequencies cause a response closer to the apex. Frequency is then encoded by the IHCs that correspond to the resonating portion of the BM and share the same characteristic frequency with that portion. | |
|
The image to the left is a schematic diagram illustrating the signals sent to the brain when the basilar membrane is vibrating in response to a complex wave with many sine components. Each low-frequency component
sends individual signals since, as indicated above, the
frequency separation between the low-frequency components is larger
than the critical bandwidth. |
Overview Resources on Ear Anatomy and Function |
|
Dangerous Decibels from the
National Institute of Health website. Although the materials are designed for younger audiences (high school students), they present a good basic overview of our topic. |
|
Well-designed and concise overview of the anatomy and function of the ear, Part of NeurOreille's Journey Into the World of Hearing. | |
The Auditory System: Structure and Function, part of
the
Neuroscience e-Textbook site at the Department of Neurobiology
and Anatomy of the McGovern Medical School, University of Texas. This overview goes into more detail, especially in terms of the inner ear. |
|
Acoustics & Auditory Neuroscience Page - City University of Hong Kong (Table of Contents) |
|
Interactive Sensation Laboratory Exercises (ISLE) - Hanover College, OH (Chapters 10-13) |
|
Details on the nonlinearities of the inner ear - hearing module of a Psychoacoustics course |
|
Everything about hearing, including the optional sections, in an 11-minute video! |
Loyola Marymount University - School of Film & Television