How Oteria works, and what to do when it doesn’t.
A practice tool for anyone who reads scripted dialogue out loud — actors learning lines, kids rehearsing for plays, anyone recording an audition tape, even off-label uses like turning a textbook into a mini audiobook or prototyping a scripted podcast.
Paste a script, choose which character the AI plays, then read your lines aloud. The app listens through your microphone, advances when you’ve delivered the line, and plays the AI’s response.
Works best in Chrome (Web Speech API). Safari is okay. Firefox and Edge are limited.
Standard screenplay format and CHARACTER: line format both work. The parser also handles:
Parentheticals like (soft, chilling) are auto-converted to v3 audio tags.
Square-bracket tags like [whispers] are passed straight to the AI voice as delivery cues.
Scene headings (INT./EXT.) and transitions (CUT TO) are stripped.
If your script is messy or PDF-extracted, click AI cleanup to have Claude rewrite it into the clean format.
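The parsing rules above can be illustrated with a minimal Python sketch. It is not Oteria's actual parser: the patterns, the function names, and the choice to turn each comma-separated word of a parenthetical into its own bracket tag are all assumptions made for illustration, and it only covers the CHARACTER: line format.

```python
import re
from typing import List, Tuple

# Hypothetical patterns; the real parser's rules are not documented here.
CUE_LINE = re.compile(r"^([A-Z][A-Z0-9 .'\-]*):\s*(.*)$")   # CHARACTER: line
SCENE_HEADING = re.compile(r"^(INT\.|EXT\.)")                # INT. / EXT. / INT./EXT.
TRANSITION = re.compile(r"^[A-Z ]+ TO:?$")                   # CUT TO, DISSOLVE TO:
PARENTHETICAL = re.compile(r"\(([^)]*)\)")


def parenthetical_to_tags(text: str) -> str:
    """Turn "(soft, chilling)" into "[soft] [chilling]"; existing [tags] are untouched."""
    def repl(match: "re.Match[str]") -> str:
        words = [w.strip() for w in match.group(1).split(",") if w.strip()]
        return " ".join(f"[{w}]" for w in words)
    return PARENTHETICAL.sub(repl, text)


def parse_script(script: str) -> List[Tuple[str, str]]:
    """Return (character, line) pairs, dropping scene headings and transitions."""
    parsed = []
    for raw in script.splitlines():
        line = raw.strip()
        # Transitions are checked before cue lines, because "CUT TO:" also looks like a cue.
        if not line or SCENE_HEADING.match(line) or TRANSITION.match(line):
            continue
        cue = CUE_LINE.match(line)
        if cue:
            parsed.append((cue.group(1).strip(), parenthetical_to_tags(cue.group(2))))
    return parsed
```

Checking transitions before cue lines matters in this sketch, since a line such as CUT TO: would otherwise be read as a character named CUT TO with an empty line.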
Drop a standalone [pause:Xs] on its own line to override the beat before the next line.
THERAPIST: How could you keep that from me?
[pause:3s]
JACK: I... I don't know.
Variants accepted:
[pause:2s] - 2 seconds
[pause:1.5s] - 1 and a half seconds
[pause:500ms] - half a second
[pause] - defaults to 1 second
Works in standard, memorisation, and performance modes. An inline [pause] inside a dialogue line still passes through to ElevenLabs as a v3 audio tag. The standalone marker is only recognised on its own line, so the two don’t collide.
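The accepted variants could be recognised with a single regular expression. The following is a minimal Python sketch, not Oteria's actual implementation; PAUSE_MARKER and parse_pause_marker are hypothetical names. It returns None for anything that is not a standalone marker, so an inline [pause] inside a dialogue line is left alone.

```python
import re
from typing import Optional

# A line that is nothing but a pause marker:
# [pause], [pause:2s], [pause:1.5s], [pause:500ms]
PAUSE_MARKER = re.compile(r"^\[pause(?::(\d+(?:\.\d+)?)(ms|s))?\]$")
DEFAULT_PAUSE_SECONDS = 1.0  # bare [pause] defaults to 1 second


def parse_pause_marker(line: str) -> Optional[float]:
    """Return the pause length in seconds, or None if the line is not a standalone marker."""
    match = PAUSE_MARKER.match(line.strip())
    if match is None:
        return None
    value, unit = match.groups()
    if value is None:
        return DEFAULT_PAUSE_SECONDS
    seconds = float(value)
    return seconds / 1000 if unit == "ms" else seconds
```

For example, parse_pause_marker("[pause:500ms]") gives 0.5, while a dialogue line that merely contains [pause] gives None.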
Performance mode lets you choose the length of the silence between the end of an AI line and the start of the next line.
Per-line [pause:Xs] markers override the scene-wide setting for that one transition.
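The override rule can be sketched in Python as follows. This is an illustration under stated assumptions, not the app's code: the script is assumed to be already parsed into a list where a tuple is a dialogue line and a float is a standalone pause marker in seconds, and transition_gaps is a hypothetical name.

```python
from typing import List, Tuple, Union

Line = Tuple[str, str]        # (character, text)
Item = Union[Line, float]     # a float stands for a parsed standalone pause marker, in seconds


def transition_gaps(items: List[Item], scene_gap_seconds: float) -> List[float]:
    """Return the silence before each line after the first.

    A pause marker applies only to the one transition it precedes;
    every other transition falls back to the scene-wide setting.
    """
    gaps: List[float] = []
    pending = None       # marker waiting for the next line, if any
    seen_line = False
    for item in items:
        if isinstance(item, float):
            pending = item
            continue
        if seen_line:
            gaps.append(scene_gap_seconds if pending is None else pending)
        pending = None   # the marker is consumed by this transition
        seen_line = True
    return gaps
```

With a scene-wide gap of 1.5 seconds and a 3-second marker before the second line, the first transition uses 3.0 and the following one falls back to 1.5.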
Each AI character can be sped up or slowed down independently — 50% to 200% in 10% steps. Click the percentage between the +/− buttons to reset to the voice’s default. Some voices struggle below 70% or above 150% — if the audio sounds bad, dial back toward 100%.
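The stepping behaviour of the +/− buttons can be sketched in Python like this. The constant and function names are hypothetical; only the 50% to 200% range and the 10% step come from the description above.

```python
MIN_SPEED = 50    # percent
MAX_SPEED = 200   # percent
STEP = 10         # percent per button press


def step_speed(current_percent: int, direction: int) -> int:
    """Move one step up (direction=+1) or down (direction=-1), staying within the allowed range."""
    return max(MIN_SPEED, min(MAX_SPEED, current_percent + direction * STEP))
```

Pressing + at 200% or − at 50% leaves the value unchanged in this sketch.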
The voice list is your live ElevenLabs library — every voice you’ve added there shows up here automatically. Curated archetypes appear first; the rest of your library follows. Star a voice to favourite it for quick re-use across scenes.
Inside a dialogue line, square brackets are passed to ElevenLabs v3 as delivery cues. Common ones:
[whispers], [shouting], [softly], [chilling], [unsettling]
[laughs], [sighs], [crying]
[pause] - inline pause within a line (different from the standalone pause marker)
Paste a paragraph, a chapter, or a textbook section instead of a script. If no characters are detected, the app switches to Reader mode. Pick a voice, hit Start, and you’ve got a mini audiobook.
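One way the switch to Reader mode could be decided is sketched below in Python. How Oteria actually detects characters is not documented here; this sketch assumes a character is any line in CHARACTER: line format and ignores standard screenplay layout, and choose_mode is a hypothetical name.

```python
import re

# Assumption: a character cue is an upper-case name, a colon, then some text.
CUE_LINE = re.compile(r"^[A-Z][A-Z0-9 .'\-]*:\s*\S")


def choose_mode(text: str) -> str:
    """Return "script" if any line looks like a character cue, otherwise "reader"."""
    has_characters = any(CUE_LINE.match(line.strip()) for line in text.splitlines())
    return "script" if has_characters else "reader"
```

Plain prose such as a textbook paragraph contains no cue lines, so it falls through to "reader".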
Use the ↓ Download MP3 button on the Begin or Finished screens to save the audio file.
Add [pause:Xs] markers in the script for any line that needs a specific beat.
The screen stays dark with just the orb visible, so there are no reflections off your face on tape.
Hit Save scene on the setup page to keep a scene in your library. Your scenes are private to your account — no one else can see them.
The Scenes page lists everything you’ve saved. Click a scene to load it, or hover and tap Delete to remove it.
Pre-rendered AI lines are cached in your browser (IndexedDB), so re-runs cost nothing and work offline. The cache grows over time. To free space, head to Scenes and tap Clear cached audio at the bottom. Pre-rendering will run again the next time you open a scene.
Your scripts and scene settings are stored in your account on the server. Audio files are pre-rendered and cached locally in your browser via IndexedDB — they don’t leave your device. You can clear the audio cache any time from the Scenes page.
Sign in is via Google OAuth. We store your email, name, and avatar — no passwords. Sign out from the dashboard to end your session.