1.0 · pre-release · iOS 26.4 · BYO ElevenLabs key

Paste your script.
Hear your cast perform.

flexVox turns a written script into a fully produced, multi-voice podcast on your iPhone or iPad. Paste dialogue, cast every speaker, generate speech, sound effects, and music, fix the line that did not land, then export audio and transcripts from the same project.

Ask about the beta See the workflow →

Native iPhone & iPad· Keychain-secured key· Offline demo mode· Studio subscription

Episode 12 — Lighthouse Production

Step 2 / 3

HOST

Welcome back to Lighthouse. Tonight, a transmission from the keeper.

SFX

[SFX: foghorn rolling across the bay (3s)]

KEEPER

If you can hear me — the light is still on. I haven't seen another ship in seventy-two days.

MUSIC

[Music: low strings, unresolved (8s)]

Mixing · 14 turns · 2 voices · 1 SFX

02:48

Native iOS

SwiftUI, SwiftData, Keychain. No web views, no Electron, no cross-platform compromises.

BYO API key

Bring your own ElevenLabs key. Stored in the iOS Keychain. The app never proxies it.

Demo mode

Walk the entire workflow offline with silent placeholder audio — no account, no commitment.

Free to start

Three projects and the complete script-to-audio workflow. Studio unlocks scale and pro production tools.

No telemetry

No analytics SDKs, no tracking, no server-side data collection. Scripts and audio stay on device.

What flexVox is for

Three things this app refuses to be sloppy about.

Script-first, not text-box-first

Most TTS tools start with one big text field and one voice. flexVox starts with a structured script and keeps every speaker, sound effect, and music cue tagged through the entire pipeline. The output sounds like a conversation because the input always knew it was one.

Multi-voice in a single project

Each character maps to its own ElevenLabs voice with independent stability, similarity, style, and speed. The app handles the orchestration, batches dialogue calls for natural conversational flow, and never forces you to manage takes by hand.

Try the whole workflow before you pay anyone

Demo mode generates silent placeholder audio with realistic durations, so you can paste a script, parse it, assign voices, generate, edit, mix, and export — all without an ElevenLabs account. The only thing missing is the voices.

Free to start, Studio when the show gets serious

The free tier includes the full script-to-audio workflow for up to three projects. flexVox Studio adds unlimited projects, Auto-Cast, shows and series, background music, AI script writing, export presets, and transcript formats.

The workflow

Script. Production. Export.

A three-step progress bar keeps the project oriented while the detailed workflow takes you from import to review, voice casting, generation, post-production, playback, and export.

01 Step 1

Paste your script

On the Script Import screen, paste any dialogue script. Expand the format guide if you want a quick reference. Tap `Review Script` and the parser runs.

02 Step 2

Review speakers

Turns needing review are highlighted with confidence indicators. Tap one to confirm or reassign. Batch-assign unreviewed turns or merge duplicate speakers from the toolbar.

03 Step 3

Assign voices

Tap Auto-Cast All for AI-assisted casting, or open a casting session to search, preview, and assign voices speaker by speaker. Expand per-speaker settings to dial in stability, similarity, style, and speed. Quick Preview streams a sample before a full run.

04 Step 4

Generate audio

A progress ring shows percentage, current turn, and estimated time remaining. Cancel any time without losing what's already generated. If something fails, `Resume` picks up from where it stopped.

05 Step 5

Edit in post-production

Play each turn individually. Swipe to regenerate the ones that need work, compare variants side by side, adjust per-turn pauses, and exclude any segments that shouldn't ship.

06 Step 6

Mix & export

All active, non-excluded turns mix into a single M4A. Listen with follow-along script highlighting, then export using platform presets and share audio, transcript, or caption files via the iOS share sheet.

Read the full walkthrough →

Who it's for

Creators who need a cast, not a studio.

Solo podcasters who want recurring segments to sound like a conversation. Writers who want to hear a script before pitching it. Audio dramatists and educators who need scenes, cues, voices, and revisions in one place.

01 / 02

For solo podcasters

Sound like a cast when you're recording alone.

You write the dialogue. flexVox hands it to a different voice every time the speaker changes — and lets you fix one bad line without re-recording the episode.

Paste a script in any of four common formats
Use scenes, chapters, SFX, music, and background music cues
Distinct AI voice per character, with per-speaker tuning
Shows and Series keep cast, format, and audio identity across episodes
Regenerate a single line, keep every other take untouched
Export platform-normalized audio plus transcript and caption formats

02 / 02

For audio dramatists & screenwriters

A table read your script can have on a Tuesday night.

Hear your dialogue performed by distinct voices before you pitch it. Swap a character's voice in seconds, re-read a scene with different intent, and share a rough draft with collaborators.

Confidence-scored speaker detection across multiple script formats
Auto-Cast and AI suggestions help find a plausible cast fast
Variant comparison: keep the take you like, delete the rest
Per-turn pause control, underlay mixing, and auto-ducking
Pronunciation rules per project (alias or IPA / Arpabet)

Educators producing dialogue-based lessons sit somewhere in the middle. flexVox handles script structure, cast, cues, revision, playback, and export in one native workflow.

Features

Eight of the 73 that ship in the pre-release.

Each one earns its place by removing a step the desktop workflow used to demand. The full list is on the features page; here are the ones people notice first.

Script

Automatic script parsing

The parser detects speakers, SFX, music cues, scene markers, and chapter markers across colon (`HOST:`), bracket (`[Host]`), parenthesis (`(Host)`), standalone-name, `[SCENE:]`, `[CHAPTER:]`, `[ACT:]`, and `[PART:]` formats.

It reads the format you already write in, not the other way around.

Voices

Auto-Cast

Assign voices to every unvoiced speaker in one tap. flexVox analyzes each speaker's role, generates a search query from an AI archetype suggestion, and selects the best-match voice while preserving manual assignments.

Start with a plausible cast, then change only what needs changing.

Generation

Background music generation

Add a background music layer that spans the whole episode. Describe the mood and style; flexVox generates a track sized to the total dialogue duration and gives the project its own volume control.

A soundtrack that fits the episode instead of the other way around.

Generation

Demo mode

With no API key configured, a mock TTS service returns silent WAV audio with realistic durations. Every screen — import, review, voices, generate, edit, mix, export — works end to end.

Learn the entire workflow before spending a dollar.

Post-production

Single-turn regeneration

Swipe a turn or use its context menu to regenerate it. The new take is saved as an additional variant — your previous take is never overwritten.

Fix one line. Leave everything else exactly where it was.

Playback & export

Follow-along playback

The playback screen shows the script with timestamps, speaker badges, active-turn highlighting, auto-scroll, tap-to-seek, and word-level highlighting when alignment data is available.

Find a mispronunciation by reading along, not scrubbing blindly.

Shows & series

Shows and series

Create ongoing productions with persistent cast, format, tone, narrator mode, episode numbering, and reusable audio identity.

Define the show once. Start each episode with the bones already in place.

AI script writing

AI script generation

Generate scripts in-app with OpenAI or Claude. Configure format, tone, speaker count, scenes, chapters, expression tags, SFX, music, and provider before generation.

Go from premise to production-ready script without leaving the project.

Studio

flexVox Studio

Studio unlocks unlimited projects, Shows and Series, Auto-Cast, dialogue generation mode, background music, underlay and auto-ducking, export presets, AI writing, Sound Library, pronunciation dictionary, templates, and pacing reports.

The upgrade is for scale and polish, not for making the app usable.

See all 73 features →

Where flexVox fits

Not a DAW. Not a single-voice TTS app. Something narrower.

flexVox does one thing well: turn a multi-speaker script into produced audio through a guided workflow. If you need waveform editing, beat matching, or multi-track mixing, a desktop DAW is the right tool. If you need a single voice reading a single block of text, almost anything will do.

What it does	flexVox	Generic TTS apps	Desktop DAWs	Web AI audio tools
Multi-speaker in one project	Yes — each character maps to its own voice	Usually single-voice	No built-in TTS	Varies
Script parsing	Automatic, with confidence scoring	Manual text entry	N/A	Varies
SFX & music generation	Inline tags generate audio automatically	Not available	Manual import	Some offer SFX separately
Post-production editing	Per-turn regen, variants, pause control	Not available	Full waveform editing (complex)	Limited
Platform	Native iOS — iPhone & iPad	Mixed (web, desktop, mobile)	Desktop or iPad	Browser-based
Offline exploration	Demo mode runs the full workflow offline	Usually requires login	Fully offline	Requires internet
Price model	Free tier + optional Studio subscription; BYO ElevenLabs key	Subscription or per-character	Free / one-time purchase	Subscription

flexVox depends on ElevenLabs for voice generation. Audio quality and available voices are determined by that service. The app adds value through script intelligence, workflow structure, post-production controls, and a native mobile experience — not by training its own voice models.

Where the app is, right now

1.0 · pre-release. The product is documented, pre-release, and not public yet.

The current docs cover the full 1.0 product surface: script import and review, scene and chapter support, Auto-Cast, voice tuning, speech, SFX, music, background tracks, underlay mixing, follow-along playback, export presets, transcripts, shows and series, AI writing, and flexVox Studio.

Pricing: free to start with up to three projects. flexVox Studio is a monthly or yearly StoreKit 2 subscription for unlimited projects and advanced production features.

Free

Three projects, full script workflow, voice browsing, BYO-key generation, basic post-production, follow-along playback, Podcast export, and demo mode.
Studio

Unlimited projects, Auto-Cast, Shows and Series, AI writing, Sound Library, background music, underlay, export presets, and transcripts.
Privacy

No analytics, no tracking SDKs, no server-side data collection. Scripts, audio, and projects stay on device.

Questions we get a lot

The short version of every email we answer.

What is flexVox? +

An iOS app that turns multi-speaker scripts into produced podcast audio. Paste a dialogue script, assign AI voices to each character, generate speech and sound effects, then mix and export the result — all on iPhone or iPad.

Do I need an ElevenLabs account? +

For real audio generation, yes — flexVox brings your own ElevenLabs API key (a free tier is available at elevenlabs.io). Demo mode works with no account, generating silent placeholder audio so you can explore every feature first.

Does flexVox work offline? +

Script import, parsing, and review work offline. Audio generation requires an internet connection to reach the ElevenLabs API. Demo mode works fully offline.

Can I regenerate just one line? +

Yes. Swipe a turn in post-production or use its context menu. The new take is saved as a variant — your previous take isn't overwritten.

What format is the exported audio? +

The final mix is exported as an M4A (AAC) file. The export sheet includes platform-specific loudness presets and can also export transcripts or captions as SRT, VTT, JSON, or plain text.

What is a show/production? +

A show, also called a production, is a container for related episodes. It stores a persistent cast bible with character profiles and voice assignments, a default format and tone, intro / outro or transition sounds, and an episode template.

How is my API key stored? +

Your ElevenLabs API key is stored in the iOS Keychain — the same secure storage iOS uses for passwords. It is never written to a plain file or sent anywhere other than the ElevenLabs API.

Read every question →

Pre-release. Beta opens soon. Want a key when it does?

Send a note — tell us what you're trying to make, what stopped you last time, and whether you're a podcaster, a writer, or something we haven't named. We'll add you to the list and reply to anything that's not "thanks."

Ask about the beta Read the feature list →

No waitlist form. No tracking. Bring your own ElevenLabs key; we never see your scripts or your audio.

Also from the studio