Lispr ← All posts
Craft

Editing your text by voice

April 26, 2026 · 6 min read

Dictation is wonderful for getting words out. Editing those words is a different job, and it is worth being honest about how well voice handles it. The short version: voice is excellent for drafting and only sometimes good for editing. Knowing where the line falls will save you a lot of frustration.

This post lays out what voice editing realistically can and cannot do, and suggests a workflow that plays to the strengths of both your voice and your hands.

Why editing by voice is hard

Drafting and editing are not the same activity. Drafting is generative — you produce a stream of words and your voice is a fast, natural way to do that. Editing is positional and precise. It means putting a cursor between two specific characters, selecting exactly the right span of text, and making a small surgical change.

Speech is bad at precision of that kind. To delete one word in the middle of a sentence by voice, you have to describe which word, unambiguously, in language the tool can parse. Your hand can do the same thing with one click. The mouse and the arrow keys are pointing devices, and pointing is exactly what editing needs.

So the honest framing is this: voice editing is not a replacement for hands-and-eyes editing. It is a partial tool with a few genuine strengths and several real limits.

What voice editing actually does well

There are cases where speaking a change is genuinely faster or more comfortable than reaching for the keyboard.

Notice the pattern. Voice editing works best when it does not have to locate anything precisely, or when your hand has already done the locating.

What voice editing struggles with

Be realistic about the rest.

If you find yourself constructing an elaborate sentence just to describe a one-character fix, that is the signal to stop and use your hand.

A sane workflow: draft by voice, edit by hand

The most productive approach treats the two modes as separate phases rather than trying to do everything by voice.

Phase one — draft by voice, fast and loose. Speak the whole thing through without stopping to fix anything. Misheard word? Leave it. Clumsy sentence? Leave it. The goal of this phase is volume and momentum. Stopping to correct each error kills the flow that makes dictation fast in the first place, and you will revisit everything anyway.

Phase two — edit by hand, slow and careful. Now switch tools. Read the draft on screen, with your hands on the keyboard and mouse. Fix the misrecognitions, tighten the sentences, move things around. This is ordinary editing, and your hands are the right instrument for it.

Phase three — re-dictate, not repair. When a whole sentence or paragraph needs to be rebuilt rather than nudged, do not edit it. Select it, delete it, and dictate a fresh version. Re-dictating a clean sentence is usually faster and produces better writing than patching a broken one.

The one exception worth keeping is the immediate correction. If you misspeak and notice right away, fixing it on the spot — by voice or by a quick keystroke — is fine, because you have not lost your place yet. It is the delayed, hunt-for-the-error edits that belong in phase two.

Catching misrecognitions efficiently

Since editing is mostly hands-and-eyes, the practical skill is reading a dictated draft well.

When to skip voice entirely

Sometimes the right call is not to dictate at all. A dense paragraph that needs three precise edits is faster to fix purely by keyboard. A short reply — a couple of words into a form field — is faster to type. Voice earns its place on the generative work: first drafts, long messages, notes, anything where you have something to say and just need it on the page.

Treating voice as a drafting tool rather than a universal interface is not a limitation to apologize for. It is just using each tool for what it is good at.

The honest summary

Voice editing has a few genuine wins — appending, replacing whole chunks, dictating into a spot your hand has already chosen — and a lot of cases where a click and a keystroke simply win. The productive pattern is to keep the phases separate: draft by voice with momentum, then edit by hand with care, and re-dictate anything that needs more than a nudge.

Lispr is built for the drafting phase. Hold the right Option key, speak, release, and the text lands at your cursor in whatever app you are in. When it is time to edit, put your hands back on the keyboard — that is still the best editing tool you own. Used that way, voice and hands complement each other instead of competing.

Try Lispr

Voice to text in any Mac app — hold a key, talk, let go. Free, no account, ~4 MB.

Download for macOS