Anyone else coding by voice?

I have been developing numerous projects in Python using Cursor AI. However, due to a condition affecting my hands, I seldom use the keyboard and instead rely on Windows Voice Access to transcribe everything.

I commenced this approach approximately five months ago, and it has been life changing in terms of how rapidly I have been able to create projects and solve problems. Does anyone else utilize voice-to-text technology for coding and Cursor AI?

7 Likes

Actually, this message is entirely written by dictation. I’m using Superwhisper and have various prompts depending on the workflow, which will then translate what I’m saying into a series of product manager-type instructions written by a developer.

4 Likes

Interesting. I’ve thought about exploring voice but I’m so used to typing that, in fact, I often start typing when the thoughts are still forming. The act of typing helps me flesh them out.

Because of this, I haven’t really tried voice other than dictation that comes with my operating system.

When you say you have voice prompts that take your spoken words and ā€˜translates’ them, this seems a step up from dictation. Would you mind sharing examples of the sorts of prompts you have used?

Thanks in advance.

But superwhisper is still iOS only rigth? I am on waiting list for windows beta :).

Anyone knows any good alternatives for windows to bridge the time :)?

Windows voice access is built in to Windows 11. I use it all day every day. The transcription isn’t always accurate, but AI like Claude 3.5 easily figures out your intent, even with atrocious word transcription errors.

1 Like

Sim 100% kkkk apenas encosto no teclado para copiar e colar algumas coisas oi clicar m. Objetos todos os comandos são por voz

For those using Ubuntu / Linux, I have been using Speech Note and it works quite well:

Yes — I actually code exclusively by voice now. I write all my prompts via speech. Because Cursor doesn’t offer the feature I needed, I built a small macOS app that uses a shortcut and OpenAI’s transcription model so I can straightforwardly convert speech to prompts. I can now also select and edit text via prompt everywhere on my computer just like I do in Cursor. GitHub - mgsgde/whisper-shortcut: Fast macOS menu bar app for text to speech and GPT-5 context-aware prompts with smart clipboard and hotkeys.

Should Cursor support this natively in-app?

12 Likes

Should Cursor support this natively in-app?

Yes - in open-plan offices it’s impractical, but for remote solo work speech is very useful. It’s the most natural way to communicate, so implement it with minimal friction. Preferably via a shortcut (a button is okay), since that’s the most seamless option. I guess People type mainly from habit - voice is the future.

1 Like

I use VS Code extension Whisper Assistant by MartinOpenSky Whisper Assistant - Visual Studio Marketplace .

I’m on Debian (using AppImage) and the extension is better than ā€œSpeechNoteā€ for Linux, because:

  1. When I stop recording it pastes the transcribed text wherever my cursor is in Cursor (chat, file, UI field, search bar etc.)
  2. Works out of the box very fast and accurate without me picking the right AI model for that.
  3. Everything is completely free.

Cursor should have that natively in-app to add context-aware transcription - replacing the wrongly spelled words with relevant terms from the codebase. For example, I have shopping cart in my app, but often I have the word ā€œcardā€ in my transcription rather than ā€œcartā€. Variable/function names with underscores, etc.

Yep, I don’t type anything anymore. I use Wispr Flo all day.

Yes!

1 Like

definitely!

Yes

Please, we need a Cursor integration to add file names, functions names, variable names in context to the dictionary used by those speech to text software.

They don’t understand my python modules with the name of my company in it or my function names with _ in it etc.

On Linux, there is now a free, user-friendly voice-typing / dictation app:

VOXD - Voice-Type / dictation app for Linux

Completely local, even with ai-rewriting (if needed).
You can set it up as a user daemon:
voxd --autostart true
ready to get triggered by a hotkey, whenever and wherever you want to type.

I recommend Wispr Flow, it supports voice tagging of files in your project, I’ve been using it for months, couldn’t be happier :slight_smile:

I begrudgingly use Wispr Flow as well. The app is extremely buggy, constantly cuts out and just stops dictating your voice at random times. They also have some of the worst customer service I’ve ever experienced. I’ve messaged and emailed them multiple times, and I’ve never heard back.

The only reason I still use them is I managed to get a 50% discount on the subscription.

wow sorry to hear that, so it stops listening despite you still holding the keyboard key? in my case it works flawlessly, even with extremely long dictations! :man_shrugging:t2: