Paste your script
On the Script Import screen, paste any dialogue script. Expand the format guide if you want a quick reference. Tap `Review Script` and the parser runs.
flexVox turns a written script into a fully produced, multi-voice podcast on your iPhone or iPad. Paste dialogue, cast every speaker, generate speech, sound effects, and music, fix the line that did not land, then export audio and transcripts from the same project.
What flexVox is for
Most TTS tools start with one big text field and one voice. flexVox starts with a structured script and keeps every speaker, sound effect, and music cue tagged through the entire pipeline. The output sounds like a conversation because the input always knew it was one.
Each character maps to its own ElevenLabs voice with independent stability, similarity, style, and speed. The app handles the orchestration, batches dialogue calls for natural conversational flow, and never forces you to manage takes by hand.
Demo mode generates silent placeholder audio with realistic durations, so you can paste a script, parse it, assign voices, generate, edit, mix, and export — all without an ElevenLabs account. The only thing missing is the voices.
The free tier includes the full script-to-audio workflow for up to three projects. flexVox Studio adds unlimited projects, Auto-Cast, shows and series, background music, AI script writing, export presets, and transcript formats.
The workflow
A three-step progress bar keeps the project oriented while the detailed workflow takes you from import to review, voice casting, generation, post-production, playback, and export.
On the Script Import screen, paste any dialogue script. Expand the format guide if you want a quick reference. Tap `Review Script` and the parser runs.
Turns needing review are highlighted with confidence indicators. Tap one to confirm or reassign. Batch-assign unreviewed turns or merge duplicate speakers from the toolbar.
Tap Auto-Cast All for AI-assisted casting, or open a casting session to search, preview, and assign voices speaker by speaker. Expand per-speaker settings to dial in stability, similarity, style, and speed. Quick Preview streams a sample before a full run.
A progress ring shows percentage, current turn, and estimated time remaining. Cancel any time without losing what's already generated. If something fails, `Resume` picks up from where it stopped.
Play each turn individually. Swipe to regenerate the ones that need work, compare variants side by side, adjust per-turn pauses, and exclude any segments that shouldn't ship.
All active, non-excluded turns mix into a single M4A. Listen with follow-along script highlighting, then export using platform presets and share audio, transcript, or caption files via the iOS share sheet.
Who it's for
Solo podcasters who want recurring segments to sound like a conversation. Writers who want to hear a script before pitching it. Audio dramatists and educators who need scenes, cues, voices, and revisions in one place.
For solo podcasters
You write the dialogue. flexVox hands it to a different voice every time the speaker changes — and lets you fix one bad line without re-recording the episode.
For audio dramatists & screenwriters
Hear your dialogue performed by distinct voices before you pitch it. Swap a character's voice in seconds, re-read a scene with different intent, and share a rough draft with collaborators.
Educators producing dialogue-based lessons sit somewhere in the middle. flexVox handles script structure, cast, cues, revision, playback, and export in one native workflow.
Features
Each one earns its place by removing a step the desktop workflow used to demand. The full list is on the features page; here are the ones people notice first.
The parser detects speakers, SFX, music cues, scene markers, and chapter markers across colon (`HOST:`), bracket (`[Host]`), parenthesis (`(Host)`), standalone-name, `[SCENE:]`, `[CHAPTER:]`, `[ACT:]`, and `[PART:]` formats.
It reads the format you already write in, not the other way around.
Assign voices to every unvoiced speaker in one tap. flexVox analyzes each speaker's role, generates a search query from an AI archetype suggestion, and selects the best-match voice while preserving manual assignments.
Start with a plausible cast, then change only what needs changing.
Add a background music layer that spans the whole episode. Describe the mood and style; flexVox generates a track sized to the total dialogue duration and gives the project its own volume control.
A soundtrack that fits the episode instead of the other way around.
With no API key configured, a mock TTS service returns silent WAV audio with realistic durations. Every screen — import, review, voices, generate, edit, mix, export — works end to end.
Learn the entire workflow before spending a dollar.
Swipe a turn or use its context menu to regenerate it. The new take is saved as an additional variant — your previous take is never overwritten.
Fix one line. Leave everything else exactly where it was.
The playback screen shows the script with timestamps, speaker badges, active-turn highlighting, auto-scroll, tap-to-seek, and word-level highlighting when alignment data is available.
Find a mispronunciation by reading along, not scrubbing blindly.
Create ongoing productions with persistent cast, format, tone, narrator mode, episode numbering, and reusable audio identity.
Define the show once. Start each episode with the bones already in place.
Generate scripts in-app with OpenAI or Claude. Configure format, tone, speaker count, scenes, chapters, expression tags, SFX, music, and provider before generation.
Go from premise to production-ready script without leaving the project.
Studio unlocks unlimited projects, Shows and Series, Auto-Cast, dialogue generation mode, background music, underlay and auto-ducking, export presets, AI writing, Sound Library, pronunciation dictionary, templates, and pacing reports.
The upgrade is for scale and polish, not for making the app usable.
Where flexVox fits
flexVox does one thing well: turn a multi-speaker script into produced audio through a guided workflow. If you need waveform editing, beat matching, or multi-track mixing, a desktop DAW is the right tool. If you need a single voice reading a single block of text, almost anything will do.
| What it does | flexVox | Generic TTS apps | Desktop DAWs | Web AI audio tools |
|---|---|---|---|---|
| Multi-speaker in one project | Yes — each character maps to its own voice | Usually single-voice | No built-in TTS | Varies |
| Script parsing | Automatic, with confidence scoring | Manual text entry | N/A | Varies |
| SFX & music generation | Inline tags generate audio automatically | Not available | Manual import | Some offer SFX separately |
| Post-production editing | Per-turn regen, variants, pause control | Not available | Full waveform editing (complex) | Limited |
| Platform | Native iOS — iPhone & iPad | Mixed (web, desktop, mobile) | Desktop or iPad | Browser-based |
| Offline exploration | Demo mode runs the full workflow offline | Usually requires login | Fully offline | Requires internet |
| Price model | Free tier + optional Studio subscription; BYO ElevenLabs key | Subscription or per-character | Free / one-time purchase | Subscription |
flexVox depends on ElevenLabs for voice generation. Audio quality and available voices are determined by that service. The app adds value through script intelligence, workflow structure, post-production controls, and a native mobile experience — not by training its own voice models.
Where the app is, right now
The current docs cover the full 1.0 product surface: script import and review, scene and chapter support, Auto-Cast, voice tuning, speech, SFX, music, background tracks, underlay mixing, follow-along playback, export presets, transcripts, shows and series, AI writing, and flexVox Studio.
Pricing: free to start with up to three projects. flexVox Studio is a monthly or yearly StoreKit 2 subscription for unlimited projects and advanced production features.
Three projects, full script workflow, voice browsing, BYO-key generation, basic post-production, follow-along playback, Podcast export, and demo mode.
Unlimited projects, Auto-Cast, Shows and Series, AI writing, Sound Library, background music, underlay, export presets, and transcripts.
No analytics, no tracking SDKs, no server-side data collection. Scripts, audio, and projects stay on device.
Questions we get a lot
An iOS app that turns multi-speaker scripts into produced podcast audio. Paste a dialogue script, assign AI voices to each character, generate speech and sound effects, then mix and export the result — all on iPhone or iPad.
For real audio generation, yes — flexVox brings your own ElevenLabs API key (a free tier is available at elevenlabs.io). Demo mode works with no account, generating silent placeholder audio so you can explore every feature first.
Script import, parsing, and review work offline. Audio generation requires an internet connection to reach the ElevenLabs API. Demo mode works fully offline.
Yes. Swipe a turn in post-production or use its context menu. The new take is saved as a variant — your previous take isn't overwritten.
The final mix is exported as an M4A (AAC) file. The export sheet includes platform-specific loudness presets and can also export transcripts or captions as SRT, VTT, JSON, or plain text.
A show, also called a production, is a container for related episodes. It stores a persistent cast bible with character profiles and voice assignments, a default format and tone, intro / outro or transition sounds, and an episode template.
Your ElevenLabs API key is stored in the iOS Keychain — the same secure storage iOS uses for passwords. It is never written to a plain file or sent anywhere other than the ElevenLabs API.
Send a note — tell us what you're trying to make, what stopped you last time, and whether you're a podcaster, a writer, or something we haven't named. We'll add you to the list and reply to anything that's not "thanks."
No waitlist form. No tracking. Bring your own ElevenLabs key; we never see your scripts or your audio.
Also from the studio
A native Mac media wall with a mean little grin.
Open flexGrid →
Fast workout logging for solo lifters.
Open flexRep →
Audio levels on your screen edge.
Open flexMeter →
Every format in. Clean Markdown out.
Open flexDoc →
Pull up a chair. Bluesky, at reading speed.
Open Lanai →
Messy spreadsheet in. Print-ready dashboard out.
Open flexStats →