Read images from filesystem

ElKristor · March 20, 2025, 9:25am

Hi

I use multiple MCP servers, one of them is to manipulate a Chrome instance.
It exposes a tool to take screenshots.

In Agent mode, this tool is being called to analyze the current state of the page. The issue is that once the screenshot is taken, the Agent doesn’t actually use it. I believe it’s because to pass it back to the LLM, it needs to “attach” it to the chat and doesn’t know how to do it.

See example of the Agent recognizing it should take a screenshot but not using it:

I confirmed that by opening a new chat and asking an Agent: what’s in the image located at path “/tmp/…” and it couldn’t do it.
However, if I “attach” the image to the chat it can successfully describe it.

Is there a way to have tools that write images “cooperate” with the way we pass images back to the LLM for analysis?

Thanks !

pulkitsharma07 · March 23, 2025, 10:15am

Did you find any workarounds ?

Topic		Replies	Views
Support Reading Images from MCP tools Feature Requests	1	186	April 23, 2025
"here is the screenshot" messages Bug Reports	6	66	June 24, 2025
Images in MCP, How to do it? Discussions	3	1787	April 24, 2025
Agent Unable to Save Images from MCP Feature Requests	0	25	June 11, 2025
Enable Autonomous Image Analysis for Agents Feature Requests	2	390	April 23, 2025

Read images from filesystem

Related topics