Sound Design Basics: Synthesis Types, Techniques & How to Start
Sound design is the skill that separates producers who use presets from producers who build worlds. Every iconic sound in recorded music — the TB-303 acid bass, the DX7 electric piano, the Serum supersaws of modern EDM — was designed by someone who understood how synthesis works at a fundamental level. This guide explains every major synthesis type, the universal building blocks that apply across all of them, and how to start designing your own sounds from scratch.
Sound design uses synthesizers to generate and shape audio. The main synthesis types are subtractive (start with harmonics, filter them out), FM (modulate frequency to create complex timbres), wavetable (scan through waveform snapshots for evolving sounds), additive (build sounds from sine waves), and granular (manipulate tiny audio fragments). Start with subtractive synthesis — it is the most intuitive and foundational.
The Universal Building Blocks of Synthesis
Before diving into individual synthesis types, every synthesizer shares a set of core concepts. Master these and the transition between different synths and synthesis methods becomes straightforward.
Oscillators — The Sound Source
Oscillators generate the raw waveform. The four basic waveforms are: sine (pure tone, no harmonics), triangle (soft, flute-like, odd harmonics only), square (hollow, woody, odd harmonics), and sawtooth (bright, buzzy, all harmonics). The harmonic content of each waveform determines its timbre before any filtering happens. Most synths allow multiple oscillators running simultaneously, tuned in unison, octaves, or fifths.
Filters — Tone Shaping
Filters remove or boost specific frequency ranges. The low-pass filter (LPF) is the most common — it lets low frequencies through and cuts highs above the cutoff frequency. Resonance boosts frequencies just at the cutoff point, creating a characteristic peak. At extreme resonance settings the filter self-oscillates, producing a pure tone of its own. Filters are the primary tonal sculpting tool in subtractive synthesis.
Envelopes — Shape Over Time (ADSR)
Every sound needs a shape in time. The ADSR envelope defines this: Attack (how long to reach full level), Decay (how long to fall to the sustain level), Sustain (the held level while the key is pressed), Release (how long to fade after the key is released). Envelopes can control amplitude, filter cutoff, pitch, or any other parameter. The same ADSR principle applies across every synthesis type.
LFOs — Modulation and Movement
An LFO (Low Frequency Oscillator) oscillates below the audible range — typically 0.1Hz to 20Hz. Rather than producing audio, it modulates parameters over time. LFO on pitch = vibrato. LFO on amplitude = tremolo. LFO on filter cutoff = a rhythmic filter sweep. LFOs can be synced to the project tempo for rhythmic effects, or left free-running for organic, flowing movement.
Subtractive Synthesis — Start Here
Subtractive synthesis is the foundation of modern synthesizer design and the best place to begin learning sound design. The principle is simple: start with a harmonically rich oscillator (sawtooth or square), then use a filter to subtract harmonics and sculpt the tone.
Classic hardware synths built on this architecture — the Moog Minimoog, Roland Juno series, Sequential Circuits Prophet-5 — define the sound of recorded music from the 1970s to today. Virtually every modern software synth includes a subtractive synthesis mode.
How Subtractive Works in Practice
A typical subtractive patch starts with one or two sawtooth oscillators. One is detuned slightly above pitch and the other slightly below — this creates the familiar "fat" unison tone. The combined oscillators pass through a low-pass filter. Lowering the cutoff rolls off the high harmonics, making the sound warmer and darker. Adding resonance at the cutoff creates an emphasis that sounds nasal or aggressive. The filter cutoff is then modulated by an envelope — opening up on attack (bright) and closing on release (darker, fading) — giving the sound its characteristic feel.
Best Synths for Subtractive Synthesis
Native Instruments Massive is the reference for subtractive bass and lead design. Xfer Records Serum 2 covers subtractive alongside wavetable. For free options, Vital handles subtractive synthesis excellently and has a clear visual interface. Hardware: any vintage Moog clone or Roland SH-101 reissue.
FM Synthesis — Complex Timbres From Simple Operators
FM (Frequency Modulation) synthesis was developed by John Chowning at Stanford in the 1960s and commercialised by Yamaha in the DX7 (1983) — one of the best-selling synthesizers in history. The electric piano, bass, marimba, and bell sounds of the 1980s are almost all FM. Modern FM synthesis is experiencing a renaissance in plugins like Native Instruments FM8, Arturia DX7 V, and the Dexed freeware clone.
How FM Works
FM synthesis uses one oscillator — called the modulator — to modulate the frequency of another oscillator — the carrier. When the modulator's frequency is within the audible range and at high enough amplitude, it creates sidebands — additional frequency components around the carrier frequency. These sidebands are what give FM its distinctive harmonic complexity.
The key insight: unlike subtractive synthesis (which removes harmonics from a rich source), FM synthesis generates harmonics through modulation. The relationship between modulator and carrier frequency determines whether the resulting sidebands are harmonic (musically related to the carrier) or inharmonic (dissonant, metallic, bell-like).
The Operator System
In FM synthesis, oscillators are called operators. The DX7 had six operators. Each operator can act as a carrier (producing audible sound), a modulator (modulating another operator's frequency), or both. The routing of operators — which modulates which — is called an algorithm. The DX7 had 32 algorithms. Different algorithms produce fundamentally different timbral possibilities.
Simple algorithm: one modulator feeding one carrier. Result: a tone with sidebands above and below the carrier frequency. Complex algorithm: a chain of modulators feeding a carrier, or multiple carriers running in parallel. Result: extremely dense harmonic structures capable of anything from realistic piano to sci-fi sound effects.
The FM Ratio — Harmonic vs Inharmonic
The ratio between modulator frequency and carrier frequency is the most important parameter in FM synthesis. When this ratio is a whole number (1:1, 2:1, 3:1), the sidebands land on harmonic frequencies — the sound is tonally rich but still musical. When the ratio is fractional (1.41:1, 2.73:1), the sidebands are inharmonic — the sound becomes metallic, glassy, bell-like, or clangorous. This is how FM synthesis achieves the bell and electric piano timbres that are impossible to replicate with subtractive methods.
FM in Practice
The challenge of FM synthesis is that the interaction between ratio, modulation depth, and algorithm is non-linear — small parameter changes can produce dramatic timbral shifts. The approach is: set a ratio first (whole number for harmonic, fractional for metallic), then increase modulation depth slowly from zero while listening. At low depth, the sound is close to a sine wave. As depth increases, harmonics multiply. At extreme depth, the sound becomes noise-like or breaks into complex textures. The envelope on the modulator's amplitude is critical — it controls how the harmonic complexity changes over time.
Best Tools for FM Synthesis
Native Instruments FM8 remains the most accessible FM synthesizer with a visual algorithm editor. Dexed is free and models the DX7 exactly — excellent for learning classic FM sounds. Arturia DX7 V adds modern features to the classic architecture. In Ableton Live, Operator is a four-operator FM synth built directly into the DAW.
Wavetable Synthesis — Evolution and Movement
Wavetable synthesis was pioneered by Wolfgang Palm (PPG Wave, 1981) and refined into its modern form by Dave Hilowitz and the team behind Xfer Records Serum. It is now the dominant synthesis type in electronic music production — Serum's wavetable engine defines the sound of modern EDM, bass music, and commercial pop.
How Wavetable Works
A wavetable is a collection of single-cycle waveforms — individual snapshots of a complete waveform cycle. Each snapshot represents a different timbre. By sweeping the playback position through the wavetable (either statically or over time), the timbre of the sound morphs continuously. At position 0, you hear one waveform. At position 64, the timbre has transformed entirely. At position 127, it is a completely different sound.
The power of wavetable synthesis is in this morphing — an evolving, animated quality that no static oscillator can achieve. A wavetable pad can shift from a soft sine-like tone at the start of a note to a harmonically dense, buzzing texture as the note sustains, creating a living, breathing sound in a way that subtractive synthesis cannot match.
Wavetable Position Modulation
The wavetable position can be modulated by anything: an envelope, an LFO, velocity, MIDI CC, macro knobs — this modulation is where wavetable synthesis becomes expressive. Common approaches:
- Envelope on position: sweeps from one timbre at the attack through to a different timbre as the note decays. Classic evolving pad technique.
- LFO on position: creates a rhythmic or organic timbral wobble — the signature movement in many modern bass patches.
- Velocity on position: maps playing velocity to wavetable position — soft playing stays in the warm part of the table; hard playing drives into the bright, harmonically complex part. Extremely expressive for melodic leads.
Serum's Wavetable Editor
Serum 2 includes a built-in wavetable editor that allows importing any audio file and converting it to a wavetable, drawing custom waveforms frame-by-frame, applying spectral processing to individual frames, and morphing between frames. This means any sound — a human voice, a guitar chord, a field recording — can become a wavetable and be played as a synthesizer patch. This is how the distinctive "vocal chop" leads and "talking bass" sounds in modern bass music are created.
Unison and Detuning in Wavetable Synthesis
Wavetable synthesizers allow multiple instances of the same oscillator running simultaneously — unison — with slight pitch offsets between them. In Serum, stacking eight voices in unison with subtle detuning and panning creates the enormous, wide supersaw sounds that dominate commercial EDM. The width and density of the unison stack is one of the defining characteristics of modern synthesizer sound design.
Best Synths for Wavetable Synthesis
Xfer Records Serum 2 is the industry standard — if you learn wavetable synthesis in Serum, the knowledge transfers directly to any other wavetable synth. Vital (free) covers nearly all the same ground and is the recommended learning tool. Native Instruments Massive X, Arturia Pigments, and Roland Zenology all include excellent wavetable engines.
Additive Synthesis — Building Sounds From Sine Waves
Additive synthesis is the theoretical foundation of all synthesis types. Fourier's theorem states that any complex waveform can be represented as the sum of sine waves at different frequencies and amplitudes. Additive synthesis applies this directly — it builds sounds by combining individual sine wave partials at specific frequencies and amplitudes, each with its own envelope.
This gives enormous control — you can shape each individual harmonic independently. But this control comes at the cost of complexity. To recreate a realistic piano sound additively, you might need hundreds of partials each with their own individual envelope. This is computationally expensive and practically complex.
Modern additive synthesis tools — Camel Audio Alchemy (now built into Logic Pro), Kilohearts Phase Plant, and the spectral analysis + resynthesis modes in iZotope RX — have made additive synthesis accessible. These tools can analyse an audio file, decompose it into additive partials, and allow you to edit and transform those partials. The result is a powerful form of sound design where you can manipulate the spectral content of any sound at a granular level.
Granular Synthesis — Texture and Transformation
Granular synthesis fragments audio into tiny pieces — grains — typically 1–100 milliseconds long, and reconstructs or transforms sound from those grains. The technique was pioneered by Iannis Xenakis and Curtis Roads and has become a central tool in ambient, experimental, and cinematic sound design.
How Granular Works
A granular synthesizer takes a source audio file (or a live input) and scans through it, extracting grains at a defined scan position. Multiple overlapping grains play simultaneously — their density (grains per second), pitch, duration, and pan position are independently controllable. By freezing the scan position and playing the same grain repeatedly with slight randomisation, a granular synth can produce an infinite, evolving sustain from any audio moment — the characteristic "frozen time" texture of granular synthesis.
Key Granular Parameters
Position — where in the source audio the grains are extracted from. Modulating position creates the characteristic granular sweep effect. Grain size — smaller grains produce more abstract textures; larger grains preserve more recognisable audio content. Density — the number of grains per second. Low density creates a stuttering, rhythmic quality. High density creates smooth, continuous textures. Pitch randomisation — small pitch variations between grains add a cloud-like, diffuse quality to the texture.
Granular in Practice
Granular synthesis excels at creating pads from any audio source (including speech and field recordings), producing atmospheric textures, time-stretching audio without pitch shifting, creating glitchy or abstract sound design for film and game audio, and transforming simple sounds into complex, unrecognisable textures. Leading granular synthesis plugins include Native Instruments Kontakt's grain engine, Reason Studios Grain, Ableton Live's Granulator III, and Output Portal.
Choosing the Right Synthesis Type
| Sound You Want | Best Synthesis Type | Recommended Tool |
|---|---|---|
| Fat bass, classic leads, pads | Subtractive | Massive, Vital, Juno emulation |
| Electric piano, bells, metallic textures | FM | FM8, Dexed, Ableton Operator |
| Modern supersaws, evolving pads, vocal leads | Wavetable | Serum 2, Vital |
| Spectral transformations, unique timbres | Additive | Alchemy, Phase Plant |
| Atmospheric pads, frozen textures, experimental | Granular | Granulator III, Output Portal |
| Realistic instruments | Sampling / Physical Modelling | Kontakt, Pianoteq |
Exercises: Sound Design Fundamentals
🟢 Beginner — Design a Bass from Scratch
Open any subtractive synthesizer (Vital is free and recommended). Start with a sawtooth wave. Add a second oscillator, also sawtooth, detuned to −12 semitones (one octave down). Set the filter to low-pass, cutoff at 400Hz, resonance at 20%. Open the filter envelope: fast attack (0ms), medium decay (300ms), sustain at 30%, release at 200ms. Set the filter envelope amount to 50% so the filter opens briefly on attack and closes on decay. Play a low note (C1 or lower). This is the foundation of almost every synthesizer bass sound. From here, experiment: lower the cutoff for a darker, murkier bass; raise the resonance for a more nasal, mid-focused sound; increase the envelope amount for a more pronounced filter sweep. Every change you make is sound design.
🟡 Intermediate — FM Bell Sound
Open an FM synthesizer (Dexed is free, or use Ableton Operator). Set up a simple two-operator patch: one carrier, one modulator. Set the carrier frequency ratio to 1.0 and the modulator frequency ratio to 1.4. This inharmonic ratio (1.4:1) will produce the characteristic bell-like, slightly metallic timbre. Set the modulator envelope to fast attack, fast decay, no sustain — meaning the modulation is present only at the very beginning of the note. Set the carrier envelope to fast attack, long decay, no sustain, medium release. Increase the modulation depth (how strongly the modulator affects the carrier) slowly from zero. At low depths, the sound is pure and simple. As depth increases, the harmonics multiply and the bell character emerges. At very high depths, the sound becomes noisy. Find the sweet spot where the bell timbre is present without becoming harsh. This exercise teaches the relationship between FM ratio, modulation depth, and harmonic complexity.
🔴 Advanced — Wavetable Evolving Pad
Open Serum 2 or Vital. Load a complex wavetable — one with significant timbral variation across its range (any factory wavetable labelled "evolving" or "spectral" will work). Set the wavetable position to the beginning. In the modulation matrix, assign an envelope to wavetable position with a medium-to-long attack (3–5 seconds) and a slow release (4–6 seconds). Set the envelope amount so the position sweeps from 0 to about 70% of the table over the attack time. Set the amplifier envelope to a long attack (1–2 seconds), full sustain, and long release (3–5 seconds). Enable unison with four to eight voices, detune to taste, and set spread to widen the stereo image. Apply a high-pass filter to remove sub frequencies. Play a chord and hold it — the sound should morph through the wavetable as it sustains, starting in one timbre and arriving at a completely different character by the time the attack phase completes. This is the foundation of evolving wavetable pads that appear in ambient, cinematic, and modern electronic music production.
Frequently Asked Questions
What is sound design in music production?
Sound design is the process of creating, shaping, and manipulating audio from scratch using synthesizers, samplers, and effects. In music production it covers everything from designing a bass patch to building a full sound effects library.
What is the best synthesis type for beginners?
Subtractive synthesis is the best starting point. It uses simple oscillators filtered by a low-pass filter and shaped by an amplitude envelope. The signal flow is intuitive and the sonic results are immediate.
What is FM synthesis?
FM synthesis uses one oscillator (the modulator) to modulate the frequency of another (the carrier), creating sidebands — harmonic and inharmonic frequencies — producing complex, metallic, or glassy timbres impossible to achieve with subtractive synthesis.
What is wavetable synthesis?
Wavetable synthesis stores snapshots of single-cycle waveforms and scans through them over time or via modulation. By moving through a wavetable, the timbre morphs continuously — producing evolving, animated sounds that range from simple tones to complex textures.
What synth should I use to learn sound design?
Vital (free) is the best learning synth for most beginners — it covers subtractive and wavetable synthesis with a clear visual interface. Serum 2 is the industry standard for wavetable. FM8 or Dexed for FM synthesis.
What is an ADSR envelope?
ADSR stands for Attack, Decay, Sustain, Release — the four stages of a volume envelope. Attack is how long the sound takes to reach full volume. Decay is how quickly it falls to the Sustain level. Sustain is the held level. Release is how long the sound takes to fade after the key is released.
What is granular synthesis?
Granular synthesis breaks audio into tiny segments (grains, typically 1–100ms) and reconstructs or transforms sound from those grains. By manipulating grain size, density, pitch, and position, granular synthesis creates textures and pads impossible with any other method.
What is an LFO in synthesis?
An LFO (Low Frequency Oscillator) runs below the audible range and modulates other parameters — filter cutoff, pitch, amplitude — creating movement in a sound. Vibrato, tremolo, and filter sweeps are all LFO effects.
How do I make a bass sound in a synth?
Start with a sawtooth oscillator. Add a second oscillator tuned down one octave. Run through a low-pass filter with cutoff around 300–800Hz. Set a fast attack, fast decay, medium sustain envelope on the filter. Add saturation for grit. This is the foundation of most synthesizer bass patches.
What is additive synthesis?
Additive synthesis builds complex timbres by combining multiple sine waves at different frequencies and amplitudes. Since any waveform can be described as a sum of sine waves, additive synthesis can recreate any timbre — but requires significant processing power to manage many individual partials.
Practical Exercises
Build Your First Subtractive Synth Sound
Open your DAW and load a subtractive synthesizer (Serum, Sylenth1, or your DAW's built-in synth). Start with a sawtooth oscillator at default settings. Play a middle C note and listen to the bright, buzzy tone. Now slowly drag the filter cutoff knob down from fully open to fully closed while holding the note. Hear how the brightness disappears as you remove high frequencies. Set the cutoff halfway and add resonance gradually — notice the peak that forms around the cutoff point. Stop when you have a warm, rounded tone. Save this as "My First Sound." You've just shaped raw harmonics into a musical timbre using only an oscillator and filter.
Design a Dynamic Pad with ADSR Control
Load a subtractive synth and select a triangle oscillator. Create a pad sound by adjusting these ADSR values: Attack (0.5s), Decay (1s), Sustain (0.7), Release (2s). Play a chord and feel how the sound swells in, then settles. Now decide: do you want this pad to feel ethereal or punchy? If ethereal, increase Attack to 2s and Sustain to full volume. If punchy, set Attack to 10ms and Sustain to 0.5. Set your low-pass filter cutoff to 40% and add moderate resonance. Adjust the filter envelope to modulate the cutoff over time — this creates movement. Record 8 bars of your pad playing a simple chord progression. Listen back and refine the ADSR until it matches your intended character.
Create a Evolving Lead Synth with Modulation
Design a lead sound that transforms over time using multiple techniques. Start with two sawtooth oscillators detuned by 8 semitones for thickness. Set up an ADSR envelope with fast attack (20ms), short decay (300ms), full sustain, and medium release (800ms). Patch an LFO to modulate your filter cutoff at 0.5Hz — this creates slow, breathing movement. Now add a second modulation source: route your ADSR envelope to the filter cutoff with positive modulation so the filter opens during the note's attack, adding brightness when the note starts. Set resonance to 50% for character. Record a 16-bar lead melody with intentional note lengths (long notes to hear the LFO breathe, short stabs to hear the envelope snap). Export and compare against a reference lead in your favorite track. Adjust oscillator detuning, LFO rate, and envelope depths until your lead has comparable movement and character.
Frequently Asked Questions
Subtractive synthesis starts with harmonically rich waveforms and filters them to remove frequencies, while FM synthesis modulates one oscillator's frequency with another to create complex timbres from simpler sources. Subtractive is more intuitive for beginners, while FM excels at creating evolving, metallic, and bell-like sounds with fewer oscillators.
The low-pass filter is the primary tonal sculpting tool in subtractive synthesis because it removes or boosts high frequencies to shape timbre and character. By adjusting the cutoff frequency and resonance, you can transform a bright sawtooth wave into warm, mellow, or aggressive sounds—making it essential for most sound design workflows.
Each oscillator waveform has different harmonic content: sine waves are pure with no harmonics, triangles have soft odd harmonics, squares are hollow with odd harmonics, and sawtooths are bright with all harmonics. This harmonic foundation determines the sound's initial timbre before any filters or processing are applied.
ADSR stands for Attack, Decay, Sustain, and Release—the four stages that shape how a sound evolves over time after a key is pressed. Every synthesizer uses this principle because it controls how quickly a sound reaches full volume, returns to a held level, and fades out, making it fundamental to realistic and expressive sound design.
An LFO (Low Frequency Oscillator) modulates parameters like filter cutoff, pitch, or volume at sub-audible rates to create movement and expression in sounds. Unlike regular oscillators that generate audible tones, LFOs operate below hearing range and are primarily used as a modulation source for dynamic, evolving textures.
At extreme resonance settings, a filter self-oscillates and produces a pure tone at its own cutoff frequency, transforming it from a tone-shaping tool into a sound source itself. This technique is popular in electronic music for creating piercing leads and aggressive synth sounds.
Most synthesizers allow you to run multiple oscillators simultaneously, tuning them in unison (same pitch for thickness), octaves (for harmonic richness), or musical intervals like fifths. This layering approach adds complexity and harmonic content before filtering, giving you more material to shape into unique timbres.
Subtractive synthesis is the most intuitive starting point because its signal flow directly shows cause-and-effect: generate harmonics with an oscillator, shape them with a filter, control timing with an envelope. Understanding this foundational method makes transitioning to FM, wavetable, additive, and granular synthesis significantly easier.