Why You Hear What You Expect to Hear
Why Read This
What Makes This Article Worth Your Time
Summary
What This Article Is About
Linguist and cognitive scientist Karen Stollznow uses the familiar phenomenon of mondegreens — confidently misheard song lyrics such as “kiss this guy” instead of “kiss the sky” from Jimi Hendrix’s Purple Haze — to introduce a deeper truth about human hearing: speech perception is not passive or objective, it is constructive. Real-world speech is a continuous, overlapping stream of sounds that is frequently incomplete, distorted by noise, or acoustically ambiguous. Despite this, listeners comprehend language with remarkable speed and accuracy because the brain does not wait for perfect input — it actively reconstructs meaning from partial information by drawing on stored vocabulary, context, and expectation.
The article’s central scientific evidence is the Ganong effect, first described by psychologist William Ganong in 1980, which demonstrates that listeners are systematically biased toward hearing real words over nonsense sounds when input is ambiguous. This reflects a two-system process: bottom-up processing (raw sensory data from the ears) and top-down processing (knowledge and expectations from the brain). A closely related phenomenon, phonemic restoration, shows that listeners even perceive sounds that are physically absent when context strongly supports their presence. Together, these effects confirm that we do not simply hear what is said — we hear what we expect to hear.
Key Points
Main Takeaways
Hearing Is Constructive, Not Passive
The brain does not record speech like a microphone; it actively reconstructs meaning from incomplete, ambiguous signals using prior knowledge and contextual expectations.
Mondegreens Reveal a Deeper Truth
Confidently misheard lyrics and phrases are not simple errors — they demonstrate that every listener interprets speech through a lens of expectation and plausibility rather than pure acoustic data.
The Ganong Effect: Real Words Win
When a speech sound is acoustically ambiguous, listeners consistently perceive the version that forms a real word — proving that lexical knowledge shapes perception at the earliest stages of processing.
Two Systems Work Together
Bottom-up processing handles raw sensory input from the ears; top-down processing supplies knowledge and context. In ambiguous speech, the top-down system takes control, steering perception toward familiar meanings.
Phonemic Restoration: Hearing the Absent
Listeners perceive sounds that are physically replaced by noise or a cough as though they were present, if the surrounding context strongly implies what the missing sound should be.
A Feature, Not a Bug
The brain’s tendency to fill in gaps and resolve acoustic ambiguity using prior knowledge is an efficient adaptation — it allows us to understand speech in noisy rooms and poor phone connections with remarkable ease.
Master Reading Comprehension
Practice with 365 curated articles and 2,400+ questions across 9 RC types.
Article Analysis
Breaking Down the Elements
Main Idea
Perception Is Prediction, Not Recording
The article’s central claim is that human speech perception is an active, constructive process rather than a passive one. The brain does not simply receive and decode acoustic signals — it predicts, interprets, and fills in missing information using stored lexical knowledge and contextual expectations. This insight challenges a common assumption about hearing and has wide implications for understanding communication, misunderstanding, and the nature of perception itself.
Purpose
To Inform and Demystify
Stollznow’s purpose is to explain a well-established finding in cognitive science to a general readership, using relatable examples — misheard song lyrics and the telephone game — to make the science accessible and memorable. The article serves as a gentle introduction to concepts from psycholinguistics and cognitive psychology, and doubles as promotion for the author’s own book, Beyond Words: How We Learn, Use, and Lose Language, published by Cambridge University Press.
Structure
Hook → Problem → Evidence → Implication
The article opens with a memorable cultural hook (the misheard Hendrix lyric), uses it to identify a genuine scientific puzzle (how do we understand messy, ambiguous speech so easily?), introduces the Ganong effect as the central experimental evidence, then broadens to the related phenomenon of phonemic restoration before closing with the overarching philosophical point: perception is constructive. This is a textbook example of the “inverted pyramid with a twist” structure common in science journalism.
Tone
Accessible, Warm & Gently Scientific
The tone is consistently warm and accessible — the article is written for a Psychology Today readership expecting engaging, jargon-light prose. Stollznow explains technical distinctions (bottom-up vs. top-down processing) clearly and without condescension, using everyday examples like a noisy café and a poor phone call to ground abstract concepts. The overall effect is that of a knowledgeable friend explaining a fascinating finding over coffee rather than a dry academic summary.
Key Terms
Vocabulary from the Article
Click each card to reveal the definition
Build your vocabulary systematically
Each article in our course includes 8-12 vocabulary words with contextual usage.
Tough Words
Challenging Vocabulary
Tap each card to flip and see the definition
The perceptual phenomenon in which a listener hears a speech sound as present even when it has been physically replaced by a noise such as a cough or static, if the surrounding context makes the missing sound predictable.
“The Ganong effect is closely related to another phenomenon known as phonemic restoration. In these cases, listeners perceive missing sounds as present when they are replaced by noise.”
Relating to the words or vocabulary of a language, as distinct from its grammar or sound system; “lexical knowledge” means the brain’s stored mental dictionary of known words.
“The brain prioritizes meaning over acoustic precision, quickly resolving uncertainty by drawing on stored vocabulary and context.”
In its original, pure, and perfect condition; here used to describe ideally clear, undistorted speech — the kind we almost never encounter in real-world listening environments.
“In everyday environments, speech is rarely pristine. The brain prioritizes meaning over acoustic precision, quickly resolving uncertainty by drawing on stored vocabulary and context.”
Seeming reasonable or probable given what is already known; in the article, the brain chooses the most “plausible word match” — the real, familiar word that best fits the acoustic evidence and context.
“The brain is not just decoding sound, it’s choosing the most plausible word match.”
Supports or forms the basis for something; to say one phenomenon “underpins” another means it is the deeper mechanism that explains or makes the other possible.
“It’s also what underpins effects like the ‘telephone game,’ where a message becomes progressively reshaped as it is passed from person to person.”
In perceptual psychology, the act of settling ambiguity into a single clear interpretation; the brain “resolves” competing possibilities about what a sound is by selecting the most contextually likely option.
“The brain constantly balances sensory input with linguistic knowledge, resolving ambiguity in real time.”