Built a Whisper-powered app that lets you code with your voice via Cursor

Cursor has been a huge productivity boost, and I’m easily writing code 50-100% faster now, to the point where the major bottleneck is how fast I can express my ideas. With typing, that isn’t very fast.

I’ve seen it mentioned several times that people are using speech-to-text options, but most solutions seem to be Mac-only or stuck behind a waiting list. As a Windows dev, I do have Windows Voice Typing (Win+H), but it’s not reliable enough and doesn’t work with my way of thinking out loud, which involves a lot of pauses.

Long story short, I decided (with the help of Cursor) to build myself a minimal desktop app in Python, and was encouraged to open source it after sharing it with others. Features:

  • Uses OpenAI Whisper for high-accuracy transcription
  • Works in ANY text field or editor (including Cursor chat/composer)
  • Lets you navigate to other windows while continuing to record
  • Activates globally with the previously useless Caps Lock key
  • Keeps your recent transcriptions in the system tray
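For anyone curious how the transcription and recent-history pieces could fit together, here is a rough sketch in Python. It is not the app's actual code: `RecentTranscriptions` and `transcribe_file` are hypothetical names, and it assumes the official `openai` package, whose client exposes `audio.transcriptions.create`, with `whisper-1` as the model name. Recording, the Caps Lock hotkey, and typing the text into the active window are left out.

```python
from collections import deque


class RecentTranscriptions:
    """Keeps the last few transcriptions, newest first (e.g. for a tray menu)."""

    def __init__(self, limit=5):
        # A bounded deque silently drops the oldest entry once it is full.
        self._items = deque(maxlen=limit)

    def add(self, text):
        text = text.strip()
        if text:  # ignore empty results, e.g. from a silent recording
            self._items.appendleft(text)

    def items(self):
        return list(self._items)


def transcribe_file(client, wav_path, model="whisper-1"):
    """Send a recorded audio file to the Whisper API and return the text.

    `client` is assumed to be an `openai.OpenAI()` instance, which reads
    the API key from the OPENAI_API_KEY environment variable by default.
    """
    with open(wav_path, "rb") as audio:
        result = client.audio.transcriptions.create(model=model, file=audio)
    return result.text
```

Usage would look something like `history.add(transcribe_file(OpenAI(), "clip.wav"))`, where `clip.wav` is a placeholder for whatever the recorder saved. Passing the client in as an argument also makes the function easy to try out with a stub instead of a real API call.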

No waitlists, no subscriptions, no platform lock-in. Just clone, add your API key, and start coding with your voice.

Check it out on GitHub:
