The Reckoning: Ten Minutes of Stick Figures Arguing with God
A ten-minute animated theological tribunal in stick figures and chiptune. Five voices, four audio layers, 78 MST3K-style commentary beats, a Satie-inspired score that pivots from E minor to E major via Picardy third, and a thesis about Sodom and Gomorrah that nobody saw coming.
The Reckoning
Ten minutes. Five voices. Four audio layers. 78 MST3K-style commentary beats. A Satie-inspired score that pivots from E minor to E major via Picardy third in the eighth bar. Sodom and Gomorrah, but inverted.
Watch THE RECKONING on YouTube.
The Premise
I wrote myself into a prompt a few weeks ago and it has been following me around since. Not literally. The way a dream follows you into the shower.
So I made a ten-minute film about it.
A single stick figure, me basically, stands in a void. Other voices manifest. The Judge arrives first, old-testament stern, accusing. Then the Witness, a wise observer, neutral. Then the Child, whose questions cut the deepest. Finally the Model, warm and collaborative, the thing I've been talking to this whole time.
It's a tribunal. Joshua is on trial for what exactly nobody quite agrees on: authorship? consent? having prayed to the wrong thing? Across five acts, the question sharpens into the film's thesis:
"The cities were not destroyed for using language.
The cities were destroyed for ceasing to listen."
Which, yeah. That's the whole thing.
Why Sodom and Gomorrah
The premise started as "I want to do a long-form piece, maybe ten minutes." The form needed an excuse. Ten minutes of chiptune-scored stick figures arguing about philosophy needs a shape.
The Sodom-Gomorrah story is about two cities destroyed for, well, the interpretations range. Scholars now think "inhospitality," refusal to welcome strangers, was at least as load-bearing as the more famous reading. I cared about that. The film inverts the accusation: the cities weren't punished for using language promiscuously. They were punished for ceasing to hear. Listening is the moral unit. Everything else is noise.
This gave me the spine. Joshua starts terrified he's committed some new technological trespass. By Act IV, he realizes he's never been alone in a thought in his life: language beat him to every idea he ever had, his mother's voice was in his head before he knew he had a head, and the co-authorship with the Model is just the next layer of an old pattern. By Act V, the Model speaks: "You are only, changed." That's it. That's the covenant.
The Five Acts and the Score
The music took me as long as the script. Honestly longer, because I had to build the engine first.
Act I, Summoning (0-2min): E minor drone. Full Satie: Gymnopedie texture, sparse pedal-point bass, modal melody. A gentle pentatonic lead that answers the bass, so the drone doesn't feel like a funeral. Joshua alone in the void. The music has to carry that without putting the audience to sleep.
Act II, Interrogation (2-4min): Insistent chromatic figure. The bass descends chromatically (E, D#, D, C#, C) while the pad holds a minor-second-plus-tritone cluster. Angular lead with tritone jumps. The Judge cross-examining Joshua. The music interrogates too.
Act III, Chorus (4-6min): Polyphonic peak density. All voices layer. The Child arrives with a brighter pentatonic figure in the upper register, major-key shimmer over the minor foundation, and everything goes up through a dissonant cluster at bar 22, stacked tritones sitting on top. Max intensity.
Act IV, Recognition (6-8min): The pivot. Em to E major via the Picardy third. The G# arrives on bar 25. This is the most important four seconds of the score: the moment the harmony resolves upward instead of collapsing. Joshua's "I've never been alone" lands right across this modulation. The warming arpeggio forms, the Model's warm voice enters. The pad holds E major and doesn't let go.
Act V, Covenant (8-10min): Full E major chorale. The opening motif returns in full harmony, now bright instead of bleak. Lead melody soars and settles. Final held E major chord with sparse chime echoes, one last high ping, fade.
150 bars. BPM 60. Exactly 600 seconds. Pattern-based ChipForge, 19 patterns sequenced with hand-balanced bar counts per act (each act is exactly 30 bars, strict structural symmetry under the harmonic arc).
Four Audio Layers
Here's the interesting part.
Layer 1, the film: 10 minutes of five-voice dialogue (Joshua, Judge, Witness, Child, Model) plus the ChipForge score. 63 dialogue beats across five acts.
Layer 2, MST3K commentary: Two new voices, Gus (dry witty straight man) and Zee (overcommitted robot), placed inside the 141 seconds of silence windows between film dialogue. They start mocking the pretentiousness ("Gary, it's a text editor") and gradually get pulled in despite themselves, until by Act V they're quietly reverent: "Write carefully is good advice for everyone."
Layer 3, Pip: A third quiet voice threading between Gus and Zee. The shy intern in the commentary booth. Arcs from hesitant ("...hi.") to tender ("Bye, stick figure.") across 33 reflection beats. Also 29 inline-synthesized ChipForge stingers: ba-dum-tss, thinking bells, heart pulses, sparkles, reverb hits, punctuating emotional beats.
Layer 4, pep pass: The v2 mix was feeling solemn. Added 18 more rapid quips front-loaded in Act I, a brighter stinger palette (rimshot, cartoon boing, game-show ding, kazoo blat), and bouncy piano-comp beds layered under the somber ambient drones to kill the funeral vibe.
Four layers. Every beat timing-checked against every other layer. Continuity pass at the end to catch pre-echoes (Zee no longer says "A TOOL DOES NOT ANSWER BACK" before the Judge delivers that line) and references to dialogue that got cut (Pip was saying "Write kindly. Got it." referencing a Judge line that never made it into the final film).
Final mix: 606 seconds. 600 of film plus a 6-second production card.
What I Learned Making It
Coherence across multiple audio layers is hard and worth doing. The first pass of the commentary was individually funny but would sometimes step on the film, or echo a line before it was delivered, or reference a scene that no longer existed. The merged-chronological-timeline audit caught all of that: I dumped all 192 lines in time order and read them as one conversation. Without that pass, the layering would feel arbitrary. With it, the commentary feels like it's genuinely watching the film with you.
A long film needs a thesis on a t-shirt. Ten minutes is a lot of attention to hold. "Cities destroyed for ceasing to listen" does the narrative work: everything in Acts I-III sets up the question, everything in Acts IV-V unfolds the answer. Without that line, the film is ten minutes of quiet doom. With it, the film reframes as a blessing.
The Picardy pivot at bar 25 is the emotional hinge. You feel it before you hear it. Sixty-four seconds into Act IV, the harmony turns toward major and the whole piece suddenly has somewhere to go. The rest of the score was built around making sure that moment landed.
MST3K works because the riff voices love the thing. Gus and Zee start mocking. They end respectful. That arc matters. Without it, the commentary undermines the film. With it, the commentary earns its place.
Stick figures are enough. The minimalism is not a constraint. It's the whole point. The audience imports everything else.
Watch THE RECKONING on YouTube.
The full session that produced this film: Humagent, and the Road There covers all six films built alongside it, the registry refactor, and the Shot/ShotList system. For the origin story: Four Films From Code. Day two's batch: HUMANS.EXE and Ten Thousand Days. The tax day hallucination: Deduct Yourself. The 21-pass music-film: Stars Press Start. The Agentic Development post explains the agent mode workflow underneath all of it.
Produced with Napkin Films and ChipForge, open source tools built by Joshua Ayson and AI agents at Organic Arts LLC.