Hey, I would be a lot faster if I could just speak to the a composer/chat and it translates the speech to words.
I guess you’re referring to STT (speech-to-text) instead? Being able to speak your prompt instead of typing?
→ Definitely needed! Though, there are external solutions and applications to this, or you could even build your own in the meanwhile - best of those are platform-specific however (Apple)…
Well really, TTS (model-to-user) and STT (user-to-model): I would like to have a conversation with the agent. In a sense, I think a key metric for cursors success is the split between the agent window and the code window. The smaller the code window per unit of time spent, the better cursor works.
For example, I would like to talk to the AI while I am traveling. Plan out the app, check implementation variations, run tests etc. Then when I come back to the laptop, I have a summary of what was discussed: a nicely organized dashboard. So then I can simply hammer out the various approaches on the laptop.
The TTS-STT is needed because a) I can’t type the complex instructions into my mobile when afk and b) I program now in language so no need for the precision - I need speed and language is faster than typing (and more convenient).
Just think of it like the OpenAI real time API with a great code interface and dashboard.