The Producer's Bible
Articles
Techniques Reviews Comparisons Breakdowns Recreations Genres AI Music Music Business
Gear
DAWs Plugins Hardware
Site
About Sound Better → Join the newsletter
The Producer's Bible Published by MusicProductionWiki.com 2026 Edition

Frequency Masking

/ˈfriːkwənsi ˈmɑːskɪŋ/

Frequency Masking is a psychoacoustic phenomenon where a louder sound at one frequency reduces the audibility of quieter sounds at nearby frequencies. It is the primary cause of muddy, cluttered mixes and is countered through EQ carving, sidechain compression, and arrangement.

Hear The Difference
Dry vs Processed — Frequency Masking
🎵 Audio examples coming soon — check back shortly.
Dry Processed

01 Definition

Every producer has heard it — a mix that sounds great in solo but falls apart the moment everything plays together. Frequency masking is usually the culprit, and understanding it changes how you hear forever.

Frequency masking is a psychoacoustic phenomenon in which the presence of one sound — the masker — reduces or completely eliminates the perceived audibility of another sound — the maskee — when both occupy similar frequency regions at the same time. The effect arises not from a technical flaw in a recording chain but from a fundamental limitation of the human auditory system: the basilar membrane inside the cochlea responds to different frequencies at different physical locations, and when two tones excite overlapping regions simultaneously, the louder one suppresses the neural response to the quieter one. The result is that the quieter sound effectively disappears from conscious perception even though it is measurably present in the audio signal.

In practical mixing terms, frequency masking explains why a bass guitar and a kick drum can each sound powerful in isolation yet produce a muddy, undefined low-end when combined; why a rhythm guitar and a lead vocal fight for presence in the 2–5 kHz intelligibility range; and why a dense synth pad can swallow an entire lead melody despite both elements being at comparable levels on the fader. The phenomenon operates across the entire audible spectrum — roughly 20 Hz to 20 kHz — but its effects are most destructive in the low-mid range (200–800 Hz) where many instruments converge and where human hearing is least adept at resolving closely spaced pitches.

Researchers distinguish between two primary subtypes. Simultaneous masking occurs when masker and maskee are present at exactly the same moment — the dominant form encountered in dense mix textures. Temporal masking subdivides into forward masking, in which a loud sound suppresses audibility of a subsequent quieter sound for up to 200 milliseconds after the masker ceases, and backward masking, in which the brain's anticipatory processing means a loud upcoming sound can retroactively reduce the perceived loudness of a quieter sound that preceded it by up to 20 milliseconds. Temporal masking is less commonly discussed in mixing tutorials but is directly relevant to transient-heavy material: a snare hit can mask the tail of a hi-hat that immediately follows it, and a bass note's attack can suppress the decay of a preceding kick.

The frequency spread of masking is asymmetric. A masker is more effective at masking frequencies above it than frequencies below — a phenomenon called the upward spread of masking. A 200 Hz bass tone will mask elements at 300–600 Hz far more aggressively than it masks a 100 Hz sub-element. This has direct consequences for arrangement and EQ decisions: stacking a bass synth with significant 200–400 Hz content beneath a vocal that sits at 250–350 Hz will reliably bury the low-mid body of the vocal, even when the vocal fader is at a nominally healthy level. Experienced mixing engineers account for this asymmetry when deciding which element owns which frequency range.

Critically, frequency masking is not a problem to be solved once and forgotten — it is a continuous, dynamic interaction that changes with every note, every transient, and every arrangement decision. A mix that has adequate separation during a sparse verse can collapse into mud during a dense chorus as additional instruments compete for the same spectral real estate. This is why mix decisions made at low listening levels, or while soloing individual tracks, frequently fail to translate: the masking interactions that define perceived clarity only emerge at full arrangement density and at the playback levels at which the psychoacoustic thresholds become significant.

02 How It Works

The physiological basis of frequency masking lies in the mechanics of the cochlea. The basilar membrane — a tapered structure roughly 35 mm long coiled inside the inner ear — acts as a biological spectrum analyzer. High frequencies cause maximum displacement near the base of the membrane; low frequencies cause maximum displacement near the apex. When a complex sound enters the ear, different regions of the membrane vibrate maximally at different frequencies, and the associated hair cells convert those vibrations into neural signals. However, because the membrane is a continuous elastic structure rather than a set of discrete filters, a loud vibration at one point creates a traveling wave that disturbs neighboring regions. This mechanical cross-talk is the physical substrate of masking: the hair cells in the masked region are already partially activated by the masker's traveling wave, reducing their sensitivity to the weaker signal at that location.

Auditory neuroscientists model this behavior using the concept of auditory filters — effectively bandpass filters centered at each frequency whose bandwidth is described by the Equivalent Rectangular Bandwidth (ERB). At low frequencies the ERB is narrow (around 30–50 Hz wide at 200 Hz), but the filters widen proportionally with center frequency (roughly 25 Hz per kHz above 1 kHz). This means that two sounds separated by 50 Hz will compete far more in the low-mids than the same 50 Hz separation would cause competition in the high-mids. It also explains why sub-bass buildup is so persistent: the auditory filters are so narrow at 60–120 Hz that small variations in bass pitch can create dramatically different masking interactions with the kick fundamental. The practical implication is that low-frequency instruments need greater spectral separation — measured in ERBs, not raw Hz — to achieve the same perceptual distinctness as high-frequency instruments.

The masking threshold — the level at which a maskee becomes inaudible — rises steeply with masker level. At low masker levels the threshold elevation is modest; at high levels (above approximately 60 dB SPL) the threshold rises roughly in proportion to masker level and spreads upward in frequency more aggressively. This level-dependence is why mixes that seem acceptably clear at moderate monitoring levels fall apart at high volumes: at higher SPLs, louder elements in the mix raise the masking threshold for adjacent elements by a greater margin, causing previously audible details to vanish. It also explains why parallel processing — blending a compressed, level-controlled version of a signal with its dry counterpart — can improve perceived clarity without increasing peak levels: the dry transients punch through before masking thresholds are fully established.

Modern psychoacoustic models — including Moore and Glasberg's revised excitation-pattern model and the ANSI S3.4 loudness standard — formalize masking thresholds mathematically, and these models underpin the perceptual codecs used in MP3 and AAC encoding (the encoder deliberately discards signal content that falls below predicted masking thresholds). The same models are increasingly embedded in metering and analysis tools used in mixing: spectral clash analyzers such as iZotope's Masking Meter and FabFilter Pro-Q 3's spectrum analyzer in collision mode display real-time masking relationships between tracks, translating the psychoacoustic math into actionable visual information that a mixing engineer can act on without having to mentally compute ERB-weighted thresholds on the fly.

Understanding the mechanism reframes the entire goal of mixing: the job is not simply to set levels and apply tonal balance, but to manage the dynamic masking relationships between every element in the arrangement so that each contributes its intended perceptual role — attack, sustain, warmth, air, presence — without suppressing the corresponding qualities of neighboring elements. EQ, compression, sidechain routing, stereo placement, reverb pre-delay, and arrangement editing are all, at a psychoacoustic level, tools for controlling masking.

Frequency masking diagram showing how a loud bass masker raises the hearing threshold across the low-mid frequency range, suppressing the audibility of a quieter vocal element at 300 Hz. FREQUENCY MASKING — SIMULTANEOUS MASKING THRESHOLD ELEVATIONloudquietFREQUENCY (Hz) →601202505001k4k16kabsolute hearing thresholdMASKERBass / Kick~100 Hz+18 dBFSraised masking threshold(upward spread of masking)MASKEEVocal low-midbelow threshold= inaudibleLEGENDMasker energy + raised thresholdMaskee (below threshold)Absolute hearing threshold

Diagram — Frequency Masking: Frequency masking diagram showing how a loud bass masker raises the hearing threshold across the low-mid frequency range, suppressing the audibility of a quieter vocal element at 300 Hz.

03 The Parameters

Every frequency masking — hardware or plugin — operates on the same core parameters. Know these and you can work with any implementation.

MASKER LEVEL
Loudness of the dominant masking element

Masking threshold elevation is proportional to masker level: a 10 dB increase in masker level raises the masking threshold by approximately 10 dB across neighboring frequencies. In practical terms, a kick drum peaking at −6 dBFS will mask far more of the bass guitar's fundamental than a kick at −18 dBFS. Reducing the masker's level — or controlling its dynamic range with compression — is often more effective than boosting the maskee.

SPECTRAL DISTANCE
Frequency separation between masker and maskee

Masking is most severe when the masker and maskee frequencies are within one critical band (roughly one ERB) of each other. At 250 Hz this is approximately 30–40 Hz; at 2 kHz it widens to around 350 Hz. Increasing spectral distance via high-pass or low-pass filtering on competing elements — for example, rolling off a bass guitar at 200 Hz while boosting the kick at 60 Hz — creates perceptual space even without changing relative levels significantly.

TEMPORAL OVERLAP
Time duration of simultaneous frequency competition

Simultaneous masking only applies when both signals occupy the same frequency region at the same instant. Reducing temporal overlap — for example, using a fast sidechain compressor to duck the bass whenever the kick hits — converts a simultaneous masking problem into brief temporal gaps that the ear resolves as separate events. The minimum compressor release time needed to avoid audible pumping while achieving effective unmasking is typically 80–250 ms for kick-bass interactions.

STEREO POSITION
Lateral separation between competing elements

Binaural unmasking — the reduction of masking when masker and maskee originate from different spatial locations — can improve the perceptibility of the maskee by 3–15 dB depending on frequency and angular separation. Panning a rhythm guitar 30–40% left and a competing synth pad 30–40% right reduces their simultaneous masking interaction even if their frequency content overlaps significantly. Note that mono compatibility must be checked, as panned signals summed to mono lose binaural unmasking benefit.

MASKING ASYMMETRY (UPWARD SPREAD)
Directional bias of frequency masking toward higher frequencies

Masking spreads upward in frequency more aggressively than downward, with the asymmetry increasing at higher masker levels. A 100 Hz masker at 80 dB SPL can raise the threshold of a 400 Hz signal by 20–30 dB, while its effect below 80 Hz is minimal. This means bass-heavy elements threaten low-mid clarity far more than treble-heavy elements threaten bass clarity, which is why high-passing bass instruments aggressively — often at 100–160 Hz using a steep 24 dB/octave filter — is a standard mixing technique for protecting vocal and guitar body frequencies.

CRITICAL BAND DENSITY
Number of competing signals within a single auditory filter bandwidth

When three or more instruments simultaneously occupy the same critical band, masking becomes multiplicative rather than additive: each additional masker raises the threshold of every other element within the band. Dense arrangements with multiple synths, guitars, and vocals between 200 Hz and 1 kHz routinely create critical band saturation. Arrangement-level solutions — doubling tracks in higher octaves, replacing pad chords with single-note stabs, or muting inner voicings during peak density sections — are often more effective than corrective EQ at this stage.

04 Quick Reference Card

Session-ready starting points. Values are starting points — always verify with spectrum analysis in the context of the full arrangement at intended listening level.

ParameterGeneralDrumsVocalsBass / KeysBus / Master
High-pass cutoff (anti-mask)Depends on elementKick: 30–50 Hz / Snare: 90–120 Hz80–120 Hz (preserve chest tone)Bass: 30–50 Hz / Keys: 200–300 HzNo HPF on master bus
EQ cut in competing range3–6 dB, Q 1.0–2.0200–400 Hz notch on drumsCut 250–350 Hz if bass-heavyCut 800–1.2 kHz where vocal sitsSurgical only; ≤2 dB
Sidechain compressor threshold−20 to −12 dBFSKick triggers bass duck at −18 dBFSVocal triggers pad duck at −24 dBFSBass triggered by kick; ratio 3:1–6:1N/A
Sidechain attack / releaseAttack 1–5 ms / Release 80–200 msAttack 1 ms / Release 100–150 msAttack 5–10 ms / Release 200–400 msAttack 1–3 ms / Release 80–120 msN/A
Pan separation (binaural unmask)Competing elements ≥30% apartOverheads wide; toms 20–40% L/RVocal center; BVs 25–45% L/RBass center; keys 20–35% L/RCheck mono compatibility
Spectral gap target (EQ carving)≥1 ERB of separationKick peak 60 Hz; bass peak 100–150 HzVocal presence 3–5 kHz; cut pads thereSub ≤80 Hz; bass fundamental 80–160 HzMonitor with spectrum analyzer
Pre-delay (temporal unmask)20–80 ms on reverb returnsSnare verb pre-delay 20–40 msVocal verb pre-delay 30–80 msKeys reverb pre-delay 25–60 msBus reverb pre-delay 30–60 ms

Values are starting points — always verify with spectrum analysis in the context of the full arrangement at intended listening level.

05 History & Origin

The scientific investigation of auditory masking began in earnest in the late nineteenth century, but the framework that mixing engineers now rely on was largely established between the 1920s and 1950s at Bell Telephone Laboratories. Harvey Fletcher — who also pioneered the equal-loudness contours published with W.A. Munson in 1933 — conducted systematic studies of simultaneous masking throughout the 1930s, publishing foundational data on how pure-tone maskers raised the detection threshold for neighboring tones as a function of frequency separation and masker level. Fletcher's 1940 paper in the Journal of the Acoustical Society of America introduced the concept of the critical band, positing that the ear behaves as though it contains a bank of bandpass filters and that masking operates within these bands. This model, later refined by Eberhard Zwicker at the Technical University of Munich in the 1950s and 60s using the concept of Bark scaling, became the theoretical backbone of all subsequent psychoacoustic research relevant to audio engineering.

The transition from theoretical research to applied audio engineering practice accelerated in the 1970s as multitrack recording became the dominant production paradigm. With 16- and 24-track tape machines enabling dense simultaneous recordings, engineers at studios such as AIR London, Criteria Recording Studios in Miami, and Electric Lady Studios in New York began encountering masking problems at a scale impossible on earlier 4- and 8-track formats. Engineers like Tom Dowd — who mixed records for Aretha Franklin and the Allman Brothers — and Geoff Emerick at AIR developed empirical carving techniques: using narrow parametric EQ cuts (which became practical with the Neve 1073 and 1081 preamp/EQ modules introduced in the early 1970s) to create spectral windows for competing elements rather than simply boosting desired frequencies. The SSL 4000 series console, introduced in 1976 and adopted widely by the early 1980s, gave every channel an in-line parametric EQ, democratizing surgical frequency management across large sessions for the first time.

The concept of sidechain compression as an anti-masking tool emerged from broadcast engineering in the 1960s — the Teletronix LA-2A and UREI 1176, both widely used by the late 1960s, supported external sidechain inputs — but its application specifically to kick-bass masking relationships became a defining characteristic of dance music production in the late 1970s. Giorgio Moroder's productions for Donna Summer, particularly I Feel Love (1977), used precisely controlled low-end relationships that prefigured the explicit sidechain ducking that became ubiquitous in house and techno production after 1987. François Kevorkian, Larry Levan, and other New York dance music engineers refined kick-bass sidechain techniques throughout the early 1980s at venues including Paradise Garage, where the exceptional Levan-designed sound system made low-end masking audible in ways that studio monitors could not reveal.

The perceptual coding revolution of the 1990s brought masking mathematics into mainstream audio technology. The MPEG-1 Layer III standard — MP3 — deployed psychoacoustic masking models (based on Johnston's 1988 work at Bell Labs) to determine which signal components could be discarded without perceptible quality loss, encoding only information above the masking threshold. This forced millions of engineers to grapple with masking thresholds in a new context: material with dense simultaneous content encoded more efficiently because more was maskable, but certain timbral qualities lost in compression revealed pre-existing masking problems in the original mix. By 2000, plug-in developers including Waves, McDSP, and later iZotope began embedding spectrum visualization in EQ and dynamics processors, and the 2010s saw the release of purpose-built masking analysis tools — most notably FabFilter Pro-Q 3's inter-channel collision display (2018) and iZotope Neutron's Masking Meter — that translated psychoacoustic research directly into workflow-integrated visual feedback for the first time.

06 How Producers Use It

Kick and bass. The kick-bass relationship is the most consequential masking interaction in popular music production. The kick's fundamental — typically 50–80 Hz for a punchy pop/R&B kick, or 60–100 Hz for a four-on-the-floor house kick — directly competes with the bass guitar or bass synth fundamental, which commonly sits between 80–160 Hz. The standard solution combines three techniques: tuning the kick and bass to a complementary relationship (the kick fundamental and bass root a fourth or fifth apart reduces worst-case simultaneous masking); using a steep high-pass filter on the bass beginning at 40–60 Hz to clear the sub region for the kick; and routing the kick as a sidechain trigger for a compressor on the bass bus with a 3:1–6:1 ratio, 1–3 ms attack, and 80–150 ms release. The result is a momentary spectral gap in the bass every time the kick hits, converting a simultaneous masking problem into a controlled temporal sequence where kick and bass are each audible in their own time window.

Vocals and guitars / pads. The vocal intelligibility range (1–5 kHz) is the most competed-for spectral real estate in guitar-based rock and singer-songwriter production. Rhythm guitars, overdriven or not, carry significant energy between 2–4 kHz; synthesizer pads routinely span 200 Hz–8 kHz. Rather than boosting vocal presence — which raises the masker threshold for the guitar as well — experienced engineers make a reciprocal cut: a 3–5 dB dip centered at 2.5–3.5 kHz (Q = 1.5–2.0) on the rhythm guitar bus, simultaneously with a presence boost at the same frequency on the vocal, widens the gap between masker and maskee without increasing total spectral energy. On sessions with dense pad arrangements, a dynamic EQ or multiband sidechain triggered by the vocal can automatically cut the pad's 1–4 kHz region only when the vocal is present, eliminating masking during sung phrases while preserving full pad density during instrumental passages.

Drums in dense mixes. Snare clarity is frequently lost in productions with significant low-mid content from bass, guitars, and pad layers. The snare's crack lives at 200–250 Hz (body) and 3–6 kHz (snap), and both regions are susceptible to masking. The body of the snare is commonly masked by bass instrument overtones and guitar low-mids; a 2–4 dB cut at 200–350 Hz on competing instruments — applied selectively to the busses rather than individual tracks — clears the snare body without thinning any single element audibly. Parallel drum compression, routing the drum bus through a heavily compressed parallel channel (ratio 10:1, slow attack 30–50 ms, fast release 50–80 ms, blended at 20–40%), adds sustain to the drum transients that aids their penetration above the masking threshold of surrounding elements without adding peak level that would raise that threshold further.

Reverb and temporal masking. Reverb tails create a continuous masking floor that raises the threshold for everything that follows. Pre-delay — a brief gap between the dry signal and the onset of reverb — exploits the ear's precedence effect to improve source localization and simultaneously protects the attack of the subsequent dry notes from temporal backward masking by the reverb. Setting vocal reverb pre-delay to 30–60 ms means the dry attack of each new word arrives in a momentary window before the reverb tail of the previous word reaches its peak masking level. This technique was popularized by engineers including Hugh Padgham — whose work with Peter Gabriel and Phil Collins on Face Value (1981) made heavy use of gated reverb with controlled pre-delays — and remains a foundational approach in modern pop and R&B production.

AbletonUse EQ Eight on competing tracks with the spectrum analyzer in real-time mode; enable the sidechain input on Ableton's stock Compressor to create kick-bass ducking — route kick to a separate return, enable sidechain in the Compressor on the bass channel, and set Activator to trigger from that return. For spectral collision visualization, EQ Eight's 'Spectrum' display shows both tracks' frequency content when used in conjunction with the Spectrum device on a second track.
FL StudioParametric EQ 2 on each competing track with its spectrum analyzer active provides quick identification of collision zones. For sidechain anti-masking, use the Fruity Peak Controller linked to the kick channel's signal to automate volume reduction on the bass channel — set Peak Controller to 'volume sidechain' mode with fast attack and 100–200 ms release. Alternatively, route the kick to a Mixer send and use the Mixer's sidechain routing in the Fruity Compressor on the bass Mixer insert.
Logic ProChannel EQ includes a real-time spectrum analyzer that can be set to show the output of a different track via the analyzer's 'External Sidechain' option (Logic 10.7+), enabling direct visual masking comparison between two channels. For kick-bass sidechain, enable 'Side Chain' in the Logic Compressor on the bass channel, set the source to the kick channel, and use Fast attack (0.5–2 ms), 4:1 ratio, and medium release (100–150 ms). Smart Tempo and Articulation features do not affect masking — address it in the signal chain.
Pro ToolsPro Tools' stock EQ III seven-band EQ provides adequate carving for masking corrections; pair it with Avid's bundled iZotope Insight metering for visual confirmation of masking zones on competing tracks. For sidechain ducking, use the stock Dynamics III compressor with key input enabled — route the kick to a bus, assign that bus as the key input on the bass compressor, set ratio 4:1, attack 1 ms, release 100 ms. In Dolby Atmos sessions in Pro Tools, object-based panning adds a third dimension to binaural unmasking that should be exploited for competing mid-range elements.
ReaperReaEQ with its per-band spectrum display allows precise EQ carving while watching spectral collisions in real time. For masking analysis across tracks, use the JS: Analysis/gfxspectrograph plug-in on both competing channels simultaneously with the Master FX visible, or install the free ReaPlugs ReaFIR for per-track FFT-based frequency visualization. Sidechain compression in Reaper uses the track's 'Receive' routing to feed an additional channel pair into the compressor's sidechain input — route kick to bass track as channels 3/4, then set the compressor sidechain to channels 3/4.
The Producer's Briefing

Sound better by Friday.

One email a week. The techniques behind the terms — curated by working producers, not algorithms.

No spam · Unsubscribe anytime

07 In the Wild

Abstract knowledge becomes practical when you can hear it in music you know. These tracks demonstrate frequency masking used intentionally, at specific moments, for specific purposes.

Daft Punk — "Around the World" (1997)
0:00–0:30 · Produced by Daft Punk (Thomas Bangalter & Guy-Manuel de Homem-Christo)

The kick-bass relationship here is a textbook example of masking management in French house production. The four-on-the-floor kick sits with its fundamental at approximately 65 Hz, while the bass line — a repeating single-note Moog pattern — is tuned with its fundamental near 100 Hz and cut below 80 Hz. Listen on headphones: the kick lands first, occupying the sub, and the bass is heard distinctly in the low-mid immediately after each kick transient, demonstrating sidechain-style temporal separation even if achieved through arrangement and filtering rather than explicit ducking. No element masks the other because they occupy adjacent, non-overlapping spectral windows.

Kendrick Lamar — "HUMBLE." (2017)
0:00–0:45 · Produced by Mike Will Made-It

The production achieves extreme low-end clarity in a minimal arrangement by ruthlessly eliminating masking. The 808 sub — which slides between approximately 55 and 70 Hz — is the only element with significant sub content; the snare carries a sharp 200 Hz body crack and 5 kHz snap, with the region between cleared of competing material. Listen at 0:20 as the vocal enters: the kick and 808 duck perceptibly beneath the vocal's low-mid body, a textbook sidechain compression application. Mike Will Made-It's production consistently exploits wide spectral gaps between elements, making each frequency range 'owned' by a single source.

Fleetwood Mac — "The Chain" (1977)
3:46–5:00 · Produced by Fleetwood Mac and Ken Caillat

The iconic bass and drums outro section reveals how analog-era engineers managed masking through arrangement rather than processing. John McVie's bass plays in the 100–200 Hz register while Mick Fleetwood's kick drum sits at 55–70 Hz — separated by arrangement choice alone. The guitar power chord enters at 3:59 with significant mid-range content (800 Hz–3 kHz), yet the vocal remains absent, preventing vocal-guitar masking. Listen for how the bass line becomes perceptibly more prominent at 4:10 when the rhythm guitars drop out, demonstrating that removing maskers is often more effective than boosting the masked element.

Billie Eilish — "bad guy" (2019)
0:00–0:40 · Produced by Finneas O'Connell

A modern masterclass in anti-masking through arrangement minimalism. The sub bass occupies 40–80 Hz exclusively; Billie's vocal sits in the 200–800 Hz presence range with almost no competing harmonic content from other instruments during the verse. Finneas applies this principle intentionally: in public interviews he has described removing elements from the arrangement specifically when they competed with the vocal's frequency range. At 0:16 the 808-style bass drop lands entirely in the sub-bass register, below the vocal's content — the ear hears them simultaneously without masking because they occupy non-overlapping critical bands.

Dr. Dre ft. Snoop Dogg — "Nuthin' But a G Thang" (1992)
0:00–0:50 · Produced by Dr. Dre

Mixed by Vance Powell at Death Row Studios, this record demonstrates West Coast hip-hop's characteristic deep low-end separation. The kick sits at 50–60 Hz with a tight, punchy body; the bass line — a heavily processed P-Bass — occupies 100–200 Hz with a strong fundamental and limited sub content. The piano sample carries most of its energy above 400 Hz, leaving the low-mid region uncluttered. Snoop's vocal benefits from the near-complete absence of material between 250–800 Hz during the verses. Monitor in mono at low volume and notice that every element remains separately identifiable — a sign that masking has been effectively controlled.

Listen On Spotify
Daft Punk — Get Lucky
Kendrick Lamar — HUMBLE.
Billie Eilish — bad guy

08 Types & Variants

Simultaneous Masking
FabFilter Pro-Q 3 (collision mode) · iZotope Neutron 4 (Masking Meter)

The most common form in mixing: two sounds at similar frequencies present at the same time, with the louder suppressing the quieter. Addressed primarily through EQ carving, level balancing, and arrangement editing. FabFilter Pro-Q 3's inter-track spectrum display is the industry-standard tool for real-time visual identification of simultaneous masking collisions between any two tracks in a session.

Forward Temporal Masking
Universal Audio LA-2A · Empirical Labs Distressor

A loud sound raises the detection threshold for a quieter sound that follows it by up to 200 milliseconds. In mixing, reverb tails are the primary source of forward masking: the decay of a previous note masks the attack of the next. Controlled with reverb pre-delay (20–80 ms), decay time management, and compressor release settings on reverb returns to reduce tail buildup during dense passages.

Backward Temporal Masking
Dangerous Music Liaison (monitor controller for reference switching) · Any precision DAW clip editor

A loud upcoming sound can retroactively suppress audibility of a preceding quieter sound up to 20 milliseconds earlier, due to the auditory system's anticipatory processing window. Rare in practice but relevant in electronic music with precisely timed transients — a hard-clipped snare arriving immediately after a soft hi-hat can obscure the hi-hat's decay in ways that clip-level editing (nudging the hi-hat 15–20 ms earlier) can resolve.

Upward Spectral Masking
Neve 1073 (high-pass filtering) · API 550B (broad EQ shaping)

The directional bias of masking toward higher frequencies: a loud low-frequency masker suppresses sounds above it more than sounds below it, and this asymmetry increases with masker level. Directly responsible for bass-heavy mixes that bury vocals, guitars, and synths in the low-mid range. The primary corrective tool is aggressive high-passing of bass instruments — often at 100–160 Hz with 18–24 dB/octave slopes — to limit how far the upward masking spread extends into the critical vocal and guitar frequency zones.

Informational (Cognitive) Masking
Monitoring in mono (speaker placement) · Auratone 5C (midrange reference speaker)

A distinct but related phenomenon in which two sounds of similar timbre or rhythm compete for the listener's cognitive attention rather than causing basilar-membrane threshold elevation. Two similar-sounding guitar parts playing similar rhythms in the same frequency range create informational masking even when physical masking analysis shows adequate spectral separation. Addressed through timbral differentiation (different pickup types, capo positions, or amp settings), rhythmic offsetting (part A plays on beat, part B plays off-beat), or arrangement editing rather than EQ alone.

09 Common Mistakes

Interactive Tool
Frequency Map — Where Instruments Live
The audible spectrum runs 20 Hz to 20 kHz. Every instrument occupies a specific range. This is the foundation of every frequency masking decision.
20 Hz100 Hz 500 Hz2 kHz 8 kHz20 kHz

Frequency conflicts — two instruments in the same range at similar levels — are the root cause of muddy mixes.

10 Producers Also Look Up

11 Further Reading

These MPW articles put frequency masking into practice — specific techniques, real tools, and applied workflows.

12 Frequently Asked Questions

Frequency masking is when one loud sound makes a quieter sound at a nearby frequency hard or impossible to hear, even though both sounds are actually present in the audio. It happens because of how the human ear processes sound physically — the cochlea can't fully separate sounds that are too close in frequency when one is much louder than the other. In mixing, it's the main reason a bass guitar and kick drum can each sound great alone but muddy and undefined together.
The clearest sign is that an element sounds good when soloed but disappears or loses definition in the full mix. Other indicators include a muddy or dense low-mid range (200–800 Hz), a vocal that sounds present at low volume but buried at high volume, and bass that feels 'thick' rather than punchy. Use a spectrum analyzer — ideally one that shows two tracks simultaneously, like FabFilter Pro-Q 3 in collision mode — to visually identify frequency regions where two or more elements are peaking at similar levels.
The upward spread of masking refers to the fact that a loud sound masks frequencies above it more aggressively than frequencies below it. A 100 Hz bass fundamental will raise the masking threshold for 250–500 Hz elements far more than it will for 50–80 Hz elements. This matters practically because bass-heavy mixes routinely bury low-mid content — the body of a vocal, the warmth of a guitar, the weight of a piano — even when those elements appear at adequate levels on the fader. The corrective approach is to high-pass bass instruments aggressively, limiting how far the upward masking spread extends into the low-mid vocal range.
Simultaneous masking occurs when two sounds compete at the same moment in time — the most common masking problem in dense mix arrangements. Temporal masking is time-based: forward temporal masking means a loud sound raises the threshold for a quieter sound that follows it (by up to 200 ms), and backward temporal masking means a loud upcoming sound can suppress a quieter preceding sound (up to 20 ms before the loud sound arrives). In mixing, reverb tails are the main source of forward temporal masking, and controlling reverb pre-delay and decay time are the primary solutions.
Almost always cut the masker first. Boosting the masked element increases its absolute level, which simultaneously raises the absolute level of the masker and provides diminishing returns on perceptual clarity — you often end up in a level war that makes the whole mix louder without resolving the underlying spectral conflict. A 3–4 dB cut in the masker at the collision frequency creates a spectral window that the masked element fills without any level increase. Use reciprocal EQ: cut the masker at the maskee's key frequency, and optionally boost the maskee at the same point by a smaller amount.
Sidechain compression genuinely solves simultaneous masking by converting it into temporal masking — specifically, it creates a brief gap in the masker's level each time the maskee needs spectral space. When the kick hits, the bass ducks below the masking threshold for 80–150 ms, allowing the kick to be heard clearly; then the bass returns to full level between hits. This is a real perceptual improvement, not cosmetic, because the ear hears two distinct events in sequence rather than one masking the other. The trade-off is that the bass level fluctuates, which is audible at high ratios — the art is finding the minimum ratio and depth that resolves the masking without creating an obvious pumping artifact.
Perceptual audio codecs including MP3, AAC (used by Apple Music and Spotify), and Opus deliberately exploit masking by discarding signal content that falls below the predicted masking threshold — information the listener cannot hear anyway. Mixes with heavy simultaneous masking encode at lower bitrates because more content is classified as inaudible, but they also risk introducing more codec artifacts: if the codec's masking model slightly misjudges a threshold, previously masked content may reappear as noise post-decode. Mixing with adequate spectral separation — which reduces reliance on masking in the encoded signal — generally produces more consistent results across streaming platforms and codec settings.
Yes, and in many productions dynamic EQ is preferable because it operates surgically on specific frequency bands rather than the full-bandwidth level of the signal. A dynamic EQ band set to reduce 80–120 Hz on the bass channel by 4–6 dB only when the kick triggers it leaves the bass's mid-range and high-frequency content completely untouched, whereas a full-band compressor also reduces the upper harmonics that define the bass's timbre. Plug-ins such as FabFilter Pro-MB, Waves F6, and iZotope Neutron's dynamic EQ support external sidechain inputs and can be configured to apply frequency-specific ducking triggered by a kick drum signal. The result is more transparent than broadband sidechain compression, though it requires more precise setup.

Part of The Producer's Bible — Every term. Every technique. One place.
Published by MusicProductionWiki.com · The Reference Standard for Music Production