Projects • July 4, 2026 • 4 min read

The Language of Sound, Season Two: the voices of the machine

Season Two goes inside the machine: eight episodes on how a synthesizer makes its voices, starting with Subtractive. Start with a bright sawtooth, carve away what you don't need. Warmer, darker, its own theme in F minor.

The Language of Sound, Season Two: the voices of the machine

Watch the season opener: Subtractive. Licensed CC BY 4.0. Season One is complete and live: the playlist.

A sculptor does not add marble. The figure is already in the block, and the work is taking away everything that is not the figure. The oldest trick a synthesizer knows works exactly the same way. That trick is where Season Two begins.

The idea

Season One followed one note all the way up. Waveform to instrument to arrangement to mix to master, and then the machine writing music on its own. Six episodes, a closed loop, finished. You can hand it to someone and say watch this in order and you will get it. The full Season One write-up is here.

Season Two goes somewhere else. Down into the engine room. Every episode is one way a machine makes a voice from nothing: eight methods, eight episodes, each one a complete lesson where the music is built from the very technique it teaches. Same show, same promise. Everything on screen is drawn by the real audio. But it is its own work, not a continuation, and it sounds and looks like one.

Episode one: Subtractive

Subtractive synthesis is the sculptor's approach to sound. You start with too much. A sawtooth wave, a buzzy wave that carries every overtone at once, raw and bright and almost ugly. Then you take a filter, which is just a gate that lets the low part of the sound through and holds the bright part back, the way a wall muffles the party next door. Carve away what you don't want, and what is left is the instrument.

The filter sweep is the star of the episode. You hear the raw saw, then you hear the filter move across it, opening and closing, and the harmonic wall on screen is carved by a curtain that follows the track's real brightness. Add resonance, a small boost right at the filter's edge, and the filter begins to sing as it moves. Give each note its own little sweep, a filter envelope, and the buzz becomes a voice. Oscillator, filter, envelope. That trinity is most of the synthesizer sounds you have ever heard.

The line the whole episode hangs on: you don't build the sound, you reveal it.

Season Two

Subtractive, start with everything and carve away. The saw, the filter, the envelope.
Additive, build a tone one harmonic at a time and watch the sum become a saw.
FM, frequency modulation, one wave wobbling another into bells, electric pianos, and metal.
Karplus-Strong, pluck a string out of nothing but noise and a delay. Harp, koto, guitar.
Granular, clouds of tiny grains, texture you can freeze in the air.
Wavetable, scanning a table of waves, and the slow breath that morphs them.
The Vocal Tract, formants and vowels, a choir with no words.
Game Boy, four little voices, where the whole engine started.

New episodes twice a week, same rhythm as Season One. Eight in all.

How it is made

Season One was the cool oscilloscope lab, cyan lines on deep ink, the wave drawn in time. Season Two is the warm engine room. Amber, magenta, ember on near-black, and the signature picture moves from time to frequency: spectrum bars, harmonic stacks, the spectrogram running warm along the bottom. That matches what the season teaches. Building a voice means building a spectrum, so you watch the spectrum get built.

The sound world is new too. Nothing reused from Season One. Warmer, darker, analog-leaning, with its own theme in F minor that every episode wears. Der Gouverneur, a deep Alpine philosopher voice, narrates the body of each episode. I host the open and the close, and we both go silent for the drops.

Same machinery as always. Animation in Python and PIL, audio in ChipForge, the numpy synthesizer that is also the subject of the course, narration through ElevenLabs, assembled with FFmpeg. A laptop, no GPU, no subscriptions.

License

Films: CC BY 4.0. Remix it, repost it, drop it into your own thing. Credit "Napkin Films / Organic Arts LLC" and link CC BY 4.0. Engine code (Napkin Films, ChipForge) is GPL-3.0-or-later. ElevenLabs voice audio is licensed content and is not redistributed.

napkin-films chipforge the-language-of-sound synthesis subtractive-synthesis sound-design educational made-on-a-laptop