Loudness Matching

Loudness Matching is the practice of setting two audio signals to equal perceived loudness before comparison, ensuring that volume differences do not bias critical listening decisions during mixing, mastering, or plugin evaluation.

Hear The Difference

Dry vs Processed — Loudness Matching

🎵 Audio examples coming soon — check back shortly.

Dry Processed

01 Definition

Every time you bypass a plugin and think 'that sounds better,' there's a 70% chance you're just hearing louder — and louder always sounds better until it doesn't.

Loudness Matching is the disciplined practice of aligning the perceived loudness of two or more audio signals before making any comparative judgment — whether that comparison is between a processed and unprocessed version of a track, a mix against a commercial reference, or two competing plugin chains. The human auditory system is not a neutral measuring device. It interprets increased sound pressure as increased quality, clarity, and presence — a psychoacoustic bias so reliable it has been reproduced in dozens of controlled listening studies since the 1960s. Loudness Matching is the countermeasure: it removes volume as a variable so that tonal, dynamic, and spatial qualities can be evaluated on their own merits.

At its technical core, loudness matching involves measuring the loudness of a signal using a perceptually weighted scale — today almost universally LUFS (Loudness Units relative to Full Scale, per ITU-R BS.1770) — and then applying gain to bring competing signals within 0.5 dB or less of each other before listening. The LUFS scale integrates loudness over time using a frequency-weighted filter that approximates the ear's sensitivity curve, giving greater weight to the midrange frequencies (1–4 kHz) where human hearing is most acute and less weight to sub-bass energy that registers on meters but not as clearly in perception. Integrated LUFS measures the entire program duration, while Short-Term LUFS averages over three seconds and Momentary LUFS over 400 milliseconds — each window serving a different diagnostic purpose in a session.

The scope of loudness matching extends well beyond plugin A/B comparisons. It is the foundational discipline behind professional mix referencing, where a producer loads a commercially mastered track into a reference slot and gain-matches it to their working mix before drawing any conclusions about spectral balance or dynamic punch. It governs the calibration of loudness-normalizing streaming platforms — Spotify targets −14 LUFS integrated, Apple Music targets −16 LUFS, YouTube targets −14 LUFS — meaning that any master delivered above these targets will be turned down by the platform, and any master delivered below will be turned up, potentially exposing noise floor or dynamic artifacts that were inaudible at higher gain. Understanding loudness matching is therefore inseparable from understanding how music reaches listeners in 2026.

Importantly, loudness matching is not the same as gain staging, though the two disciplines share vocabulary and tools. Gain staging concerns the management of signal levels throughout a signal chain to prevent clipping, minimize noise floor accumulation, and keep processors operating within their optimal input ranges. Loudness matching is specifically a comparative and perceptual calibration — it answers the question 'are these two things the same volume?' rather than 'is this signal at the right level for the next processor?' A mix engineer may have impeccable gain staging and still make flawed plugin decisions because they never loudness-matched their comparisons. Both disciplines are necessary; they solve different problems.

The professional relevance of loudness matching has expanded significantly since streaming platforms adopted loudness normalization between 2013 and 2017. Before normalization, the dominant commercial strategy was the Loudness War: masters were limited and clipped to achieve the highest possible integrated loudness, because louder playback on radio or CD was perceived as more energetic and competitive. Normalization ended that arms race at the consumer playback level, but it also introduced a new error mode — producers who still optimize for peak loudness rather than perceived quality at normalized levels. A mix that measures −8 LUFS integrated will be turned down 6 dB on Spotify, often revealing pumping, distortion, and a flattened transient envelope that was masked by sheer level. Loudness matching, practiced rigorously throughout the production process, protects against exactly this outcome.

02 How It Works

The mechanical process of loudness matching begins with measurement. A LUFS meter — whether a dedicated plugin like Youlean Loudness Meter, iZotope's Insight, or a DAW's built-in loudness meter — is placed at the end of a signal chain and the signal is played in full or over a representative section. The integrated LUFS value is noted. The same process is applied to the reference signal. The difference between the two readings, in LU (Loudness Units, which are numerically equivalent to dB in this context), is the gain offset required to match them. If the mix reads −18 LUFS and the reference reads −14 LUFS, the mix must be gained up by 4 dB — or more practically, the reference must be gained down by 4 dB — before any comparative listening begins. The 0.5 LU matching tolerance is the professional standard; differences larger than 1 LU are perceptible to most trained listeners and will skew judgments.

The ITU-R BS.1770 algorithm that underpins LUFS measurement applies a two-stage filtering process to the audio before integration. The first stage is a high-shelf pre-filter that compensates for acoustic head-related effects, boosting high frequencies by approximately 4 dB above 1.5 kHz. The second stage is a 100 Hz high-pass filter that removes sub-bass energy from the loudness calculation, reflecting the fact that very low frequencies contribute little to perceived loudness at normal listening levels. The filtered signal is then squared (converting amplitude to power), summed across channels with a weighting factor of 1.41 for left and right surround channels in multichannel configurations, and integrated over time using a gating algorithm that ignores program pauses below −70 LUFS absolute and below −10 LU relative to the ungated loudness — ensuring that silence between phrases does not artificially deflate the measurement.

In practice, producers use two complementary matching strategies: static gain matching and dynamic gain matching. Static gain matching applies a fixed gain offset derived from integrated LUFS measurements — appropriate when comparing full mixes, mastered references, or complete program material of similar duration. Dynamic gain matching uses an automated gain rider or a loudness-matching plugin (such as Melda MAutoGain or the matching function in FabFilter Pro-L 2) to continuously adjust gain so that short-term or momentary loudness tracks between the two signals in real time — more appropriate when comparing instruments with different dynamic envelopes, such as a heavily compressed vocal against an uncompressed parallel version. Neither method is universally superior; the correct choice depends on the nature of the material and the duration of the comparison section.

Null testing is the most rigorous extension of loudness matching and is used to verify that a processing chain has introduced no artifacts beyond gain change. After loudness-matching two versions of a signal, the polarity of one is flipped and the two signals are summed. If the processing chain altered only gain, the null is complete and only silence (or digital black) remains. Any residual signal reveals non-linear processing — saturation, compression, EQ phase shifts, harmonic distortion — that would be impossible to detect by listening to either version alone. Null testing is a diagnostic tool, not a quality judgment: many highly desirable processors do not null, precisely because their character comes from non-linear behavior. But knowing what a processor actually does to a signal, rather than what it appears to do at different volumes, is foundational to informed mix decisions.

Closing the loop between measurement and perception requires understanding that LUFS, while the best available perceptual loudness metric for program material, is not perfectly correlated with perceived loudness in all contexts. Transient-heavy material — percussive records with high crest factors — can measure at a relatively low integrated LUFS while sounding energetically louder than a more compressed program at the same LUFS reading. The Perceived Loudness (PLoud) and EBU R128 frameworks address some of these edge cases, but no algorithm fully replaces the calibrated human ear. Loudness matching is therefore best practiced as a precision tool for eliminating the grossest source of perceptual bias — level difference — while remaining attentive to the ways in which measurement and perception can still diverge.

Diagram — Loudness Matching: Signal flow diagram showing how two audio signals are measured in LUFS, gain-offset applied, then compared at equal perceived loudness.

03 The Parameters

Every loudness matching — hardware or plugin — operates on the same core parameters. Know these and you can work with any implementation.

INTEGRATED LUFS

Full-program perceived loudness averaged over entire duration

Integrated LUFS (also written I-LUFS) measures the loudness of an entire audio file from start to finish using the BS.1770 gating algorithm, which excludes passages below −70 LUFS absolute. This is the primary metric for streaming platform normalization and the reference point for full-mix loudness matching. Professional masters for streaming are typically delivered between −14 and −9 LUFS integrated, with the platform applying normalization to its target.

SHORT-TERM LUFS

3-second sliding window loudness for section-level comparison

Short-Term LUFS averages loudness over a 3-second sliding window, making it responsive enough to track verse-to-chorus energy differences and section-level dynamic shaping. When loudness-matching a mix section against a reference section, Short-Term LUFS provides more relevant data than Integrated LUFS. A well-structured chorus might target −10 to −8 LUFS Short-Term on a modern pop record.

MOMENTARY LUFS

400 ms window for transient-accurate loudness reading

Momentary LUFS averages over 400 milliseconds, making it suitable for tracking the loudness of individual phrases, drum hits in context, or lead vocal lines. It is not used as a matching target directly but provides a ceiling-awareness metric — a momentary peak much higher than the integrated reading indicates a high crest factor (dynamic) signal. Most meters display all three windows simultaneously.

LU OFFSET (GAIN TRIM)

The dB gain difference applied to reach level parity

The LU Offset is the calculated gain adjustment — in Loudness Units, numerically identical to dB — required to bring a signal to the target loudness. Because LUFS values are negative numbers, a mix at −18 LUFS requires a +4 LU trim to match a reference at −14 LUFS. Professional practice holds a tolerance of ±0.5 LU; differences larger than 1 LU are reliably perceived by trained ears and will bias A/B decisions.

LOUDNESS RANGE (LRA)

Statistical spread of loudness values across a program

Loudness Range (LRA), defined in EBU Tech 3342, measures the statistical distribution of short-term loudness across a program — specifically the difference between the 10th and 95th percentile of short-term loudness values in LU. A highly compressed master may have an LRA of 2–4 LU; an orchestral recording may have an LRA of 10–16 LU. LRA informs whether matching via integrated LUFS alone is sufficient or whether dynamic-mode matching is required.

TRUE PEAK (dBTP)

Inter-sample peak ceiling for codec-safe delivery

True Peak measures the maximum reconstructed inter-sample peak level after D/A conversion, expressed in dBTP. Although not a loudness-matching parameter per se, it sets the ceiling within which loudness matching gain trims must remain. Streaming platforms recommend True Peak maxima of −1 dBTP (Spotify, Apple Music) to −2 dBTP (YouTube) to prevent clipping after AAC or MP3 encoding. Gain-matching a mix upward must always be checked against the True Peak ceiling.

04 Quick Reference Card

Session-ready starting points. Values assume a calibrated monitoring environment; adjust True Peak ceiling to −2 dBTP for YouTube or podcast delivery.

Parameter	General	Drums	Vocals	Bass / Keys	Bus / Master
Target Integrated LUFS	−14 to −16	−14 to −12	−16 to −18	−14 to −16	−14 (Spotify norm)
Match Tolerance (LU)	±0.5 LU	±0.5 LU	±0.3 LU	±0.5 LU	±0.1 LU
Measurement Window	Integrated	Short-Term (3s)	Short-Term (3s)	Integrated	Integrated
Typical LRA (LU)	6–9	4–7	7–11	5–8	5–9
True Peak Ceiling (dBTP)	−1.0	−1.0	−1.0	−1.0	−1.0 to −2.0
Reference Match Method	Static gain trim	Static or momentary	Short-term dynamic	Static gain trim	Integrated static
Pre-match listening level	79–83 dB SPL	79–83 dB SPL	75–79 dB SPL	79–83 dB SPL	83 dB SPL

Values assume a calibrated monitoring environment; adjust True Peak ceiling to −2 dBTP for YouTube or podcast delivery.

05 History & Origin

The psychoacoustic principle underlying loudness matching — that louder signals are perceived as higher quality — was formally documented by Harvey Fletcher and Wilden Munson at Bell Laboratories in 1933. Their equal-loudness contour research, published as 'Loudness, Its Definition, Measurement and Calculation' in the Journal of the Acoustical Society of America, demonstrated that perceived loudness is frequency-dependent and that the auditory system's sensitivity varies dramatically with sound pressure level. Although Fletcher and Munson were not developing a mixing methodology, their curves established the scientific foundation on which all subsequent loudness standards would be built. The contours were later refined by Robinson and Dadson in 1956 and codified as ISO 226 in 1987 and again in revised form in 2003.

The practical need for loudness matching in production contexts became acute during the rise of multitrack recording in the 1960s and 1970s, as recording engineers at facilities like Abbey Road, Electric Lady, and Record Plant began routinely comparing tape playback against console sources. Engineers including Tom Dowd, Geoff Emerick, and Eddie Kramer developed informal level-matching rituals — adjusting console faders to null out gain differences before evaluating EQ or effect settings — out of professional necessity rather than any formal standard. The introduction of VU meters, standardized at 0 VU = +4 dBu (professional line level) in the 1939 NAB standard, gave studios a shared reference point for approximate level alignment, though VU meters are RMS-weighted and do not reflect perceived loudness accurately for all program types.

The broadcast industry drove the first formal loudness standards. In Europe, the European Broadcasting Union developed the R68 recommendation in the 1990s, targeting −18 dBFS as an alignment level for digital broadcasting. In the United States, the CALM Act (Commercial Advertisement Loudness Mitigation Act) was signed into law in December 2010 and took effect in December 2012, requiring that television advertisements not exceed the loudness of adjacent program material. To enforce CALM, the ATSC A/85 standard adopted the ITU-R BS.1770 loudness algorithm, which had been published by the International Telecommunication Union in 2006. The EBU simultaneously developed R128, its own loudness normalization recommendation for European broadcasting, establishing −23 LUFS as the target for broadcast and introducing the concept of Loudness Range (LRA). These standards gave engineers, for the first time, a platform-agnostic and perceptually validated unit for loudness comparison.

Streaming platform adoption of loudness normalization between 2013 and 2017 transformed loudness matching from a broadcast-specific concern into a universal production discipline. SoundCloud introduced normalization in 2013; Spotify implemented ReplayGain-based normalization in 2013 and transitioned to a LUFS-based system targeting −14 LUFS integrated in 2017; Apple Music adopted −16 LUFS normalization at the launch of Apple Music in 2015; YouTube's Content ID system began normalizing to −14 LUFS for music content progressively from 2015 onward. By 2018, every major streaming platform had adopted loudness normalization, rendering the loudness war strategies of the CD era not merely aesthetically counterproductive but commercially self-defeating. The mastering community, led in part by engineers like Bob Katz — whose 2002 book 'Mastering Audio' had advocated loudness-referenced calibrated monitoring years before streaming normalization — and Ian Shepherd, whose 2012 'Dynamic Range Day' campaign highlighted the damage of hyper-limited masters, had already developed the vocabulary and methodology that producers needed for the streaming era.

06 How Producers Use It

In day-to-day mixing, loudness matching most commonly appears as a plugin bypass check. A producer inserts a compressor, EQ, or saturation plugin on a channel, processes the signal, and then wants to evaluate whether the plugin is actually improving the sound or merely making it louder. The correct workflow is to play a looped section through the plugin, note the Short-Term LUFS reading with the plugin active, then target the same reading with the plugin bypassed — adjusting the bypass-state gain (often via the plugin's output gain, a trim plugin after the processor, or the channel's pre-fader gain) until both states read within 0.5 LU. Only then is the bypass comparison valid. In practice, most producers use a dedicated utility — a gain plugin, a trim plugin, or a loudness-matched A/B tool like LEVELS by Mastering The Mix — in a dedicated comparison slot.

For full-mix referencing, the workflow scales to the master bus. A commercial reference track — selected for similarity of genre, tempo, and instrumentation — is imported into the session and routed to a reference track or reference plugin such as Reference 2 by Mastering The Mix or Tonal Balance Control 2 by iZotope. The reference is gain-trimmed until its integrated LUFS matches the mix's integrated LUFS, measured over a representative 30–60 second section that includes the loudest part of the arrangement (typically the second chorus). The producer then toggles between mix and reference in real time, listening for tonal balance differences, low-end weight, stereo width, and transient punch — none of which can be reliably evaluated without prior loudness matching. Many producers establish a session template with this reference chain pre-built, ready to activate at any stage of the mix.

Drums present a specific loudness-matching challenge because of their high crest factor — a well-mixed drum bus may have a momentary peak loudness 10–14 dB above its integrated loudness, while a heavily compressed pop mix may have a crest factor of only 4–6 dB. When comparing a more dynamic drum treatment against a compressed alternative, integrated LUFS matching may still leave the dynamic version perceptually louder during transient peaks even though the two versions measure identically integrated. For drums specifically, Short-Term LUFS matching over a one- or two-bar loop is more appropriate than integrated matching, and the 79–83 dB SPL monitoring level should be held constant between comparisons. Bob Katz's K-System calibration — which standardizes monitoring at 83 dB SPL with K-20, K-14, or K-12 scales mapped to the headroom above this reference — was developed specifically to keep drum-level decisions perceptually consistent across sessions.

Vocal loudness matching is particularly critical when evaluating parallel processing chains — a common workflow is to blend a dry vocal with a heavily processed (compressed, saturated, or pitch-shifted) parallel return, tuning the blend by ear. Without loudness-matching the parallel return to the dry signal before blending, producers consistently over-add the parallel — because the processed version, having been through additional gain stages, is typically louder. A properly loudness-matched parallel vocal blend often uses significantly less parallel return than an ear-only blend would suggest, resulting in a vocal that benefits from the character of the parallel chain without the density and mask of an over-blended parallel signal.

AbletonUse the built-in Utility plugin (set to 0 dB then adjust Gain) after the chain being tested, referencing the loudness reading in Ableton's built-in audio analysis or a third-party LUFS meter on a return track. The EXT input toggle on the Master can accept a reference track at matched gain for A/B via a send.

FL StudioRoute both the working mix and reference through the Mixer's Peak Controller-linked send to a Fruity Balance plugin with the Vol knob set to matched LUFS. FL's Parametric EQ 2 shows RMS but not LUFS — insert Youlean Loudness Meter (VST3) on each channel for accurate LUFS readings before matching.

Logic ProLogic's built-in Loudness Meter plugin (under Metering) reads Integrated, Short-Term, and Momentary LUFS directly on any channel strip. Use the Gain plugin from the Utility category as your trim control. The Loudness Meter's Integrated mode with the 30-second integration period is ideal for section-level reference matching.

Pro ToolsInsert a Trim plugin (AudioSuite > Other > Trim) or the stock Gain plugin as a pre-fader insert for gain offset. Reference Pro Tools' integrated loudness reading in the Master Fader's Meter Bridge set to LUFS mode (available in PT 2023.3+). Many Pro Tools engineers use iZotope Insight 2 or Avid's VENUE LoudnessExpert on the reference aux for precise matching.

ReaperReaper's JS: LUFS Meter plugin (included in the ReaPlugs suite) reads Short-Term and Integrated LUFS on any FX chain. Use ReaGain or the Volume Trim JS plugin for trim control. Reaper's flexible routing makes it straightforward to establish a dedicated reference track with pre-calculated gain offset built into the track's Item Properties gain field.

The Producer's Briefing

Sound better by Friday.

One email a week. The techniques behind the terms — curated by working producers, not algorithms.

No spam · Unsubscribe anytime

07 In the Wild

Abstract knowledge becomes practical when you can hear it in music you know. These tracks demonstrate loudness matching used intentionally, at specific moments, for specific purposes.

Billie Eilish — "bad guy" (2019)

0:00–0:45 · Produced by Finneas O'Connell

A textbook case of a commercially normalized master designed to sound competitive at −14 LUFS rather than at a louder pre-normalization target. The track measures approximately −14 LUFS integrated, placing it exactly at Spotify's normalization target — meaning it receives no gain reduction on the platform. Loudness-match it against a mid-2000s pop master and the contrast in headroom and transient preservation becomes immediately apparent: 'bad guy' sounds spacious, punchy, and three-dimensional at matched levels, while the older master sounds dense and congested. This is the correct outcome of post-normalization mastering philosophy in practice.

Daft Punk — "Get Lucky" (2013)

1:05–1:55 (verse into pre-chorus) · Produced by Daft Punk, mixed by Mick Guzauski

Mixed by Mick Guzauski, 'Get Lucky' has a Loudness Range (LRA) of approximately 7 LU — unusually wide for a commercial dance record of its era — and an integrated loudness around −11 LUFS. Loudness-match this track to a modern −14 LUFS mix and the low end reveals itself as remarkably controlled and articulate at normalized levels. The bass guitar and kick drum sit in separate frequency pockets without masking, a relationship that is only audible in a loudness-matched comparison; at the track's native louder playback level, the bass appears to dominate.

Kendrick Lamar — "HUMBLE." (2017)

0:00–0:32 (intro drop) · Produced by Mike WiLL Made-It

The drop at 0:08 is one of the most analyzed moments in modern hip-hop production. 'HUMBLE.' measures approximately −8 LUFS integrated — significantly above Spotify's −14 LUFS target, resulting in 6 dB of platform-applied attenuation. Loudness-match the track to −14 LUFS and the 808 sub in the drop loses considerable apparent weight, revealing that much of the perceived impact of the record at native playback level is volumetric rather than purely mix-structural. This is not a flaw — at normalized levels the record still hits hard — but it is a precise illustration of why loudness matching is necessary before drawing conclusions about bass decisions in a mix.

Taylor Swift — "Anti-Hero" (2022)

0:47–1:15 (verse 1 to pre-chorus) · Produced by Jack Antonoff

Mastered to approximately −9 to −10 LUFS integrated, 'Anti-Hero' receives significant platform attenuation on Spotify and Apple Music. Loudness-match it to −14 LUFS against a less limited contemporary record and the vocal presence remains remarkably consistent — Jack Antonoff's mix prioritizes vocal clarity over pure loudness, and the vocal holds its relative position in the mix even after gain reduction. The acoustic kick and snare transients, however, soften noticeably at normalized level, revealing that their impact is partly achieved by level rather than purely by transient design.

Listen On Spotify

Billie Eilish — bad guy

Daft Punk — Get Lucky

Kendrick Lamar — HUMBLE.

08 Types & Variants

Static Gain Matching

Any gain trim · VCA fader offset

A fixed gain offset is calculated from integrated LUFS measurements and applied to one signal before listening. This is the most common form of loudness matching, appropriate for full-program comparisons — mix versus reference, master A versus master B — where both signals cover a similar duration and dynamic range. The gain offset is applied once and held constant for the duration of the listening session.

Dynamic Loudness Matching

Melda MAutoGain · FabFilter Pro-L 2 (Match function)

An automated gain rider continuously adjusts the gain of one signal in real time so that its short-term loudness tracks the short-term loudness of the reference signal. Dynamic matching is appropriate when comparing signals with significantly different dynamic envelopes — for example, an uncompressed acoustic performance against a compressed mix version. It eliminates momentary loudness discrepancies that static matching cannot address, though it also masks the dynamic differences that may be the subject of the comparison.

Null-Test Matching

Polarity inversion switch · Digital summing

After static gain-matching two versions of a signal, one signal's polarity is inverted and the two are summed to digital zero. Any residual signal reveals the non-linear changes introduced by processing. While not a listening-based comparison, null testing is the most rigorous form of loudness-aware signal analysis and is used by plugin developers, mastering engineers, and researchers to characterize processor behavior beyond simple gain change.

Calibrated Monitor Matching

SPL Phonitor · Sonarworks SoundID Reference

Rather than matching signal levels in the DAW, the monitoring system itself is calibrated to a reference SPL — typically 79 or 83 dB SPL at the mix position — using a measurement microphone and correction software. This approach addresses room-induced loudness anomalies and ensures that the physical listening level remains consistent across sessions. Bob Katz's K-System is the most widely adopted framework for this approach, using K-20, K-14, and K-12 scales that place the reference loudness at defined positions below digital full scale.

Platform-Referenced Matching

Youlean Loudness Meter · Mastering The Mix LEVELS

The mix or master is gain-adjusted to preview what it will sound like after streaming platform normalization is applied. The producer sets a meter's target to the platform's normalization level (e.g., −14 LUFS for Spotify) and monitors the mix at that normalized level throughout production. This method prevents over-limiting by making the consequences of loudness-war mastering audible during the mix stage, before delivery.

09 Common Mistakes

✕
Bypassing plugins without gain compensation
The most common and consequential loudness matching error: pressing a plugin's bypass button while both processing and bypass states produce different output levels, then concluding that the active state sounds better. Because the active state is almost always louder (gain makeup in compressors, output level in EQs and saturators), the comparison is invalid. Fix this by measuring Short-Term LUFS in both states and trimming the louder state to match before listening.
✕
Matching against a reference without checking its LUFS first
Many producers import a commercial reference, set it to 'roughly the same level' by ear, and proceed to compare. Without measuring both signals and applying a precise gain offset, the comparison incorporates loudness bias. A reference master at −8 LUFS played against a mix at −14 LUFS will always appear tonally brighter, punchier, and more present — not because of EQ or compression decisions, but because of 6 dB of level advantage. Always measure, always trim.
✕
Using Peak meters instead of LUFS for matching
Peak meters (dBFS) measure instantaneous sample amplitude, not perceived loudness. Two signals with identical peak readings can have vastly different perceived loudness if they differ in dynamic range. A heavily compressed signal and a dynamic signal can both peak at −3 dBFS while differing by 8 LU in perceived loudness. Using peak meters for loudness matching produces unreliable results for any material other than sine waves. Always use a LUFS meter.
✕
Matching integrated LUFS for short or percussive signals
Integrated LUFS requires sufficient duration to produce a stable reading — for signals shorter than 10 seconds, the gating algorithm may produce unreliable results, and for percussive signals with very high crest factors, integrated LUFS underestimates perceived loudness relative to more compressed material. For short loops or drum comparison, use Short-Term LUFS over a 4–8 bar loop, ensuring the measurement has stabilized before noting the value.
✕
Ignoring monitoring level consistency
Loudness matching within the DAW is necessary but not sufficient if the monitoring level changes between comparisons. The Fletcher-Munson effect means that at lower monitoring levels, bass and high frequencies are perceived as proportionally quieter relative to the midrange — so a comparison made at 75 dB SPL will yield different tonal balance conclusions than one at 83 dB SPL, even with identical LUFS readings. Calibrate and fix your monitoring level before beginning any loudness-matched comparison session.
✕
Over-limiting a master to hit streaming targets, then matching down
Some producers see streaming targets of −14 LUFS as a ceiling to hit rather than a reference to match, heavily limiting their masters to precisely −14 LUFS integrated in the belief that hitting the target exactly is optimal. In fact, a master at −9 LUFS with excellent dynamics will be turned down 5 dB by the platform and will often sound better at normalized level than a master that was crushed to reach −14 LUFS. Loudness match your masters to the target during the mastering session to hear exactly what the platform will deliver.

Interactive Tool

Loudness Matching Calculator

Calculate gain reduction, makeup gain, and output level for any loudness matching setting.

Input Level (dBFS)

Threshold (dBFS)

Ratio (X:1)

Makeup Gain (dB)

11 Further Reading

These MPW articles put loudness matching into practice — specific techniques, real tools, and applied workflows.

→ What Is Lufs → Gain Staging Guide → How To Mix Reference → Loudness War Explained → Mastering For Streaming → Best Loudness Meter Plugins → How To Use A Limiter → Parallel Compression Guide

12 Frequently Asked Questions

What is loudness matching and why does it matter in mixing?+

Loudness matching is the practice of setting two audio signals to the same perceived volume — measured in LUFS — before making any comparative listening judgment. It matters because the human auditory system interprets louder sounds as higher quality, more present, and tonally brighter, even when no real quality difference exists. Without loudness matching, every plugin bypass, every reference comparison, and every parallel blend decision is contaminated by volume bias. It is the single most impactful workflow habit a producer can adopt for more objective mix decisions.

What is LUFS and how is it different from dB?+

LUFS (Loudness Units relative to Full Scale) is a perceptually weighted loudness measurement defined by the ITU-R BS.1770 standard. Unlike dB, which measures signal amplitude at a specific moment, LUFS integrates loudness over time using a frequency-weighted filter that mirrors the ear's sensitivity — giving more weight to midrange frequencies and filtering out sub-bass content that is felt more than heard. The result is a number that correlates much more closely with perceived loudness than peak dBFS does, making LUFS the correct unit for loudness matching.

How do I loudness match a reference track in my DAW?+

Import the reference track into your session on its own track or via a reference plugin. Place a LUFS meter on both the reference track and your mix bus. Play a representative 30–60 second section (including the loudest part) and note the integrated LUFS reading for each. Calculate the difference in LU. Apply that difference as a gain offset — using a Gain or Trim plugin — to the louder signal to bring both readings within 0.5 LU of each other. Now toggle between mix and reference at matched levels. Repeat the measurement every time you make significant changes to your mix.

What loudness target should I mix to for Spotify?+

Spotify normalizes playback to −14 LUFS integrated. This means any master above −14 LUFS integrated will be turned down by Spotify, and any master below −14 LUFS will be turned up. The practical implication is that you should loudness-match your mix and reference to −14 LUFS integrated during the mix and mastering process to hear exactly how they will sound on the platform. There is no advantage to mastering louder than −14 LUFS for Spotify delivery — the gain reduction will be applied regardless.

Can I loudness match by ear or do I need a meter?+

Experienced engineers can make rough loudness matches by ear — within about 1–2 LU for sustained program material — but this is not accurate enough for critical comparisons. The tolerance for meaningful loudness-matched A/B comparison is ±0.5 LU, which is below the threshold of reliable ear-based level matching for most listeners, especially under listening fatigue. A LUFS meter is inexpensive (free options like Youlean Loudness Meter exist) and takes seconds to use. There is no professional justification for relying on ear-based level matching when meters are universally available.

How is loudness matching different from gain staging?+

Gain staging is the management of signal levels throughout a signal chain to keep processors in their optimal operating range and prevent noise accumulation or clipping. It happens at every insert point in your mix, from input trim to output fader. Loudness matching is specifically a comparative calibration technique — it answers whether two signals are at the same perceived volume before you evaluate their relative quality. A mix can have perfect gain staging and still produce biased A/B comparisons without loudness matching, because the two disciplines address fundamentally different questions.

What is a null test and how does it relate to loudness matching?+

A null test is an extreme form of loudness-aware signal analysis in which two versions of a signal are gain-matched and then summed with one polarity-inverted. If the only difference between the two versions is gain, the sum will be silence (digital black). Any residual signal reveals the non-linear processing introduced — harmonic distortion, phase shift, dynamic artifacts. Null testing requires prior loudness matching to be valid; without it, the residual includes both the processing artifacts and the level difference, making it impossible to distinguish the two. Null tests are used by engineers to verify gain-only processes like linear-phase EQ or pure digital gain changes.

Does loudness matching affect the Loudness War and mastering decisions?+

Directly and significantly. The Loudness War — the practice of heavily limiting and clipping masters to achieve higher playback level on CD and radio — was made irrelevant at the consumer level by streaming platform normalization from 2013 onward. A hyper-limited master at −7 LUFS will be turned down 7 dB on Spotify, often sounding worse than a more dynamic master at −9 LUFS that receives only 5 dB of attenuation. Mastering engineers who loudness-match their masters to streaming targets during the mastering session — previewing exactly what normalization will do — produce commercially superior results compared to engineers still targeting maximum loudness. Loudness matching is the tool that makes this preview possible.

MixingLUFSMasteringGain StagingStreaming

Last Verified: May 14, 2026 · 2026 Edition · Part of The Producer's Bible

Part of The Producer's Bible — Every term. Every technique. One place.
Published by MusicProductionWiki.com · The Reference Standard for Music Production