[Feature Request] GPT-4o vision

GPT-4o is described as having better vision capabilities. Is there a plan to add an image upload feature for it?


I’m curious regarding your anticipated use case for this in an IDE.

It would help me to be able to point to how I want something in my frontend to look and let the model implement it in the code. Currently I'm mostly stuck going back and forth: I explain, and the GPT takes stabs in the dark.

For example, giving the AI a Figma screenshot plus a description to implement a UI component, or attaching a screenshot of an issue in the UI.
With the upcoming video capabilities, it might even be possible to automate the debugging and development loop: iteratively feed the AI a video of the UI and let it edit the code with continuous feedback, with access to the console and possibly the Network/Elements tabs (via screenshots or video as well, if the quality is good enough).
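Whether done manually today or automated later, the screenshot-to-model step boils down to pairing an image with a text prompt in one request. Here is a minimal sketch of how such a request could be assembled for the OpenAI chat API; the `image_message` helper name and the choice of a base64 data URL are my assumptions for illustration, not how Cursor actually does it internally:

```python
import base64

def image_message(prompt: str, image_bytes: bytes, mime: str = "image/png") -> dict:
    """Build a chat message that pairs a text prompt with an inline image.

    The image is embedded as a base64 data URL, one of the formats the
    OpenAI chat API accepts for vision-capable models such as gpt-4o.
    """
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url",
             "image_url": {"url": f"data:{mime};base64,{b64}"}},
        ],
    }

# Hypothetical usage with the OpenAI Python SDK (needs an API key):
# from openai import OpenAI
# client = OpenAI()
# with open("ui_bug.png", "rb") as f:
#     msg = image_message("Why is this button misaligned?", f.read())
# resp = client.chat.completions.create(model="gpt-4o", messages=[msg])
# print(resp.choices[0].message.content)
```

An automated loop would just repeat this: capture a fresh screenshot, send it with the console output, apply the suggested edit, and capture again.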


ah, right. sounds fun

When the ‘gpt-4o’ model is selected you can use images in the chat. Are you referring to another use, or does it look like images are being handled differently under the covers?

When I select gpt-4o, the ‘Image’ button disappears.

Ah, ok - I realized I don’t usually use that button and just copy-paste an image into the chat (or you can mention the @imagefilename and it will load in the chat). Not sure why the Image button is missing, but those ways currently let you add and process images, presumably using gpt-4o.


Wow, I didn’t know about that, thanks for the tip!


I use it to do rapid Jupyter Notebook debugging. The amount of context you can give a vision model with a single screenshot is amazing, and it's so fast!