Speech-to-text extension built with cursor

qwert · April 20, 2025, 1:05pm

function as follows
ctrl +1 shortcut to start, start recognizing
Recognition of hundreds of words is only 2 seconds, recognition accuracy of 95% + (with the API, fill in their own key, the data run locally, no server), recognition of a minute only need about 0.0005 U.S. dollars He is very cheap, and at present, I adapted to Windows, Linux, because I do not have an Apple computer temporarily can not do Macos, support AI optimization prompt word structure output, then AI to structure output at the same time support automatic copying clipboard. He is very cheap, and currently I adapt to Windows, Linux, because I do not have an Apple computer temporarily can not do Macos, support for AI optimization cue word structured output, the recognition of the text to the AI, and then use the AI to structure the output at the same time to support the automatic copy to the clipboard. I spent a dollar a month after using it a few thousand times.

Currently I only made the Chinese one, and I want to go for an English version in the near future. But I am not aware of any good API speech recognition overseas. I’ve looked at Google’s and they are expensive, if say it’s used a few thousand times a month it would cost $20-100 , do you guys have any good suggestions , I’d like to hear them and am currently thinking about accessing Google’s recognition

qwert · April 20, 2025, 1:07pm

opensourcedeveloper · April 20, 2025, 1:11pm

i use

it works and i do not pay any money. Bu i need to select the chat window after stopping recording. (you assign the keyboard shortcuts for starting and stopping recording. it is also fast. )

this is for windows,
there is an alternative software in mac.

qwert · April 20, 2025, 1:14pm

Yes, windows has its own such as win+H, but I mainly use ubuntu development so I made a plugin in the cursor to assist me, for me ctrl+1 this shortcut is very convenient!

qwert · April 20, 2025, 1:18pm

In addition, I’ve added an AI optimization that converts the text from the voice output into cue words Structured output, so far it’s testing ok

soechin · April 21, 2025, 1:55am

you can try Bing Translate API

jokerfool · April 21, 2025, 2:16am

I currently use this with Cursor https://withaqua.com

Aqua Voice uses a fusion transcription architecture + a client context engine to be the most accurate speech-to-text system available. Text is automatically formatted to fit the specific application and document. This enables using voice for entirely new applications like technical prompting. Aqua produces the highest quality output of any voice to text system.

qwert · April 21, 2025, 2:33am

Okay, I’ll try.

qwert · April 21, 2025, 2:38am

It would be great if Cursor could build a built-in speech-to-text. It would increase everyone’s efficiency. Voice talk is far faster than typing.

soorfett · April 21, 2025, 7:00am

I can recommend Whispering - https://whispering.bradenwong.com/

dibun · April 23, 2025, 9:58am

If you are using windows then try Win key + H it opens like this below and if you keep your cursor in any window and just speak. It will convert voice to text. You don’t need any other software. It works in window 11

FreeMeWat · August 23, 2025, 9:26pm

+1 on aqua. nothing comes close and all I need to do is press the right alt key to transcribe. just wish it was on linux…

Topic		Replies	Views
Voice Typing Extensions in Cursor? Discussions	3	128	December 10, 2025
Anyone else coding by voice? Discussions	21	2346	February 10, 2026
Built a Whisper-powered app that lets you code with your voice via Cursor Built for Cursor	4	2906	June 22, 2025
Any good Speech to text? Discussions	9	439	February 6, 2026
Voice typing tool for cursor on macos and windows Built for Cursor	0	73	September 29, 2025

Speech-to-text extension built with cursor

Related topics