Speech-to-Text Functionality Issues with Image Pasting in Agent Mode

Problem Summary:

Currently, there are two issues limiting the flexibility of speech-to-text in the Agent typing bar:

  1. Speech-to-Text Disabled After Image Paste

    • When users paste an image into the Agent typing bar, speech-to-text becomes unavailable

    • Current workaround: Users must inject text via speech-to-text before pasting images

    • Desired behavior: Speech-to-text should remain active after pasting images

  2. Cannot Continue Speech Input

    • After injecting text via speech-to-text and pressing the microphone button again, users cannot continue adding text

    • Desired behavior: Users should be able to press the microphone multiple times consecutively to keep adding speech-injected text to the Agent typing area

Impact:

Many users rely on speech-to-text as their primary method for prompt injection while working with Agents. These limitations reduce workflow efficiency and flexibility.

Feature Request:

Enable speech-to-text to work seamlessly alongside image pasting and allow multiple consecutive speech-to-text inputs without interruption.

Hey @Ilan_Aviv!

Thanks for the report!

I believe that both of these issues have been solved over in the Agent Window (the Editor Window may be lagging behind). Could you confirm you’re facing these issues in the Editor window?


New to Cursor - looks like the agent window in app. Attaching a screenshot that might explain the issue much better. A part of attaching files that disable the microphone speech to text, also - whenever stopping the speech to text, It’s not possible to keep on. As you can see in the screenshot.

It looks to me that whenever some text injected into the typing area It disable the microphone.

That’s the Editor Window! So it falls into the existing issue we know about.

In the event that the Agents Window suits your needs, you’ll find that both issues are solved there (you can always continue speech-to-text, even if you’ve pasted an image or started/stopped a recording)