Are you considering supporting the Gemini 1.5 API as soon as it becomes available? For large codebases it seems better than GPT-4 or the open-source models. (When we run Cursor without our own OpenAI API key, you use a model comparable in quality to GPT-4 but it's not the OpenAI model behind the scenes, correct? Something like Mixtral/Llama/that kind of thing? Was there a setting for this, or am I hallucinating lol.)
It would be a surreal experience to code with the power of your embeddings feature plus Gemini 1.5's navigation/reasoning across the content supplied to it by your current underlying tech. For example, the current AIs would find and summarize the 'relevant' information from the embeddings/docs/codebase, then feed it to Gemini to make the final decisions and reply to the prompt with refactoring, bug fixing, or whatever is needed. What do you think of this idea?
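Roughly this kind of two-stage pipeline is what I mean: a cheap embeddings pass to pick the relevant chunks, then one big call to the long-context model. Just a toy sketch using an OpenAI-compatible client; the "gemini-1.5-pro" model name, the prompts, and the chunking are all placeholders, not how Cursor actually works:

```python
# Toy sketch: retrieve relevant chunks via embeddings, then send them to a
# long-context "reasoning" model. Model name below is a placeholder.
from openai import OpenAI
import numpy as np

client = OpenAI()  # assumes OPENAI_API_KEY is set

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

def top_k_chunks(question: str, chunks: list[str], k: int = 20) -> list[str]:
    q = embed([question])[0]
    c = embed(chunks)
    scores = c @ q / (np.linalg.norm(c, axis=1) * np.linalg.norm(q))
    return [chunks[i] for i in np.argsort(-scores)[:k]]

def answer(question: str, chunks: list[str]) -> str:
    context = "\n\n".join(top_k_chunks(question, chunks))
    resp = client.chat.completions.create(
        model="gemini-1.5-pro",  # placeholder: whatever long-context endpoint is available
        messages=[
            {"role": "system", "content": "Answer using only the provided code context."},
            {"role": "user", "content": f"{context}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content
```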
From the three.js YouTube video example, it seems like working with medium/large projects will become 10-20x faster if the model can hold 1M tokens in memory during the process. And if you add dynamic arrangement of what's in context, using local lightweight models that index all classes/functions and know how to quickly assemble a 'relevant current context' from the IDE and send it to Gemini (or from the backend, if all the indexing/embedding happens server-side, as I understand from the bit of docs I've read), that would turn the 1M of space into a 'highly relevant 1M', which would be an amazing experience. I saw people on Twitter commenting that they're in a hurry to buy Google stock after seeing this, haha. Good point: Google has the manpower, brainpower, and datacenter capacity to give OpenAI a fight for a slice of that AI pie.
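To show what I mean by "index all classes/functions and assemble a relevant context", here's a deliberately dumb sketch: walk the project with Python's `ast` module, pull out class/function definitions, and pack the ones that look relevant into a token budget. The keyword-overlap scoring and the 4-chars-per-token estimate are just illustrations, nothing to do with Cursor's real indexing:

```python
# Toy "local lightweight indexing": extract class/function definitions,
# rank them by crude keyword overlap with the query, and pack the best
# ones into a token budget before sending to a long-context model.
import ast, pathlib

def index_project(root: str) -> list[dict]:
    symbols = []
    for path in pathlib.Path(root).rglob("*.py"):
        text = path.read_text(encoding="utf-8")
        try:
            tree = ast.parse(text)
        except SyntaxError:
            continue
        for node in ast.walk(tree):
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
                symbols.append({
                    "name": node.name,
                    "file": str(path),
                    "source": ast.get_source_segment(text, node) or "",
                })
    return symbols

def assemble_context(query: str, symbols: list[dict], budget_tokens: int = 1_000_000) -> str:
    words = set(query.lower().split())
    ranked = sorted(symbols, key=lambda s: -len(words & set(s["name"].lower().split("_"))))
    parts, used = [], 0
    for s in ranked:
        est = len(s["source"]) // 4  # rough "4 chars per token" estimate
        if used + est > budget_tokens:
            break
        parts.append(f"# {s['file']}:{s['name']}\n{s['source']}")
        used += est
    return "\n\n".join(parts)
```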
Can we use it in Cursor? Honestly, I browsed the settings and didn't find where to put an OpenAI API key or anything. I think I'm such a dummy that even the AI won't help me do what I want to do.
As long as GPT-4 Turbo in Cursor only uses a 10k context, I'm not sure Gemini will make any difference regardless of the model itself; we still won't get a considerably larger context…
Maybe one day the Cursor editor will allow redirecting queries to any custom API endpoint for the LLM part of the request? I would probably run a local LLM that uses the embeddings synced from my Cursor account into a local folder. That way everything would run on my machine and require less GPU processing on Cursor's backends. Their engine would then only be responsible for scanning web pages and making embeddings, plus the vector DB where all that is stored, plus the workspace scanning and storing in the DB. That's still a lot for the service to take care of, but it means less GPU cost across all customers, since $20/mo can burn rather quickly with heavy usage from some customers and the company wouldn't be profitable in that case. Local LLMs for 'power users' are a good solution.
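Just to make the "custom endpoint" idea concrete: most local servers (Ollama, llama.cpp's server, etc.) already speak the OpenAI-compatible API, so redirecting would basically mean swapping the base URL. A minimal sketch, assuming Ollama is running locally; the URL and model name depend on your setup:

```python
# Point an OpenAI-compatible client at a local LLM server instead of OpenAI.
from openai import OpenAI

local = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible endpoint
    api_key="not-needed-locally",          # most local servers ignore the key
)

resp = local.chat.completions.create(
    model="llama3",  # whatever model you have pulled locally
    messages=[{"role": "user", "content": "Explain this function: def add(a, b): return a + b"}],
)
print(resp.choices[0].message.content)
```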
But how can I use it? In the settings there's only a key field for OpenAI or Azure. How did you use a model from OpenRouter, and where do these models actually run?
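For what it's worth, OpenRouter exposes an OpenAI-compatible API, so outside of Cursor you can call it like this; the model ID below is just an example (check their model list for current names and pricing), and the models run on the upstream providers' servers, not on your machine:

```python
# Sketch of calling an OpenRouter-hosted model through its OpenAI-compatible API.
from openai import OpenAI

openrouter = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="sk-or-...",  # your OpenRouter key
)

resp = openrouter.chat.completions.create(
    model="anthropic/claude-3-opus",  # example model ID
    messages=[{"role": "user", "content": "Summarize what this repo does."}],
)
print(resp.choices[0].message.content)
```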
I see that Claude 3 on OpenRouter shows up to a 200k context window, but how can we control what exactly, and how much, gets sent to the LLMs from Cursor? With @docs and @codebase, for example, if we send 200k each time it would be easy to go bankrupt at Claude's pricing; I'm afraid to even try it lol. What system decides how much context the current prompt needs? (The context means 'all chat history', but if we refer to @docs and the backend pulls some embeddings and sends them to the LLM, we can't know how much of the context it will use, correct? And long chats will quickly fill up the 200k, but what happens after that? Do the LLM APIs automatically forget the oldest message content on their side?)
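As far as I know the chat APIs don't silently forget old messages; if the prompt exceeds the window you just get an error, so the caller (Cursor's backend in this case) has to decide what to drop. A minimal sketch of the usual client-side trimming, using a crude 4-chars-per-token estimate instead of a real tokenizer:

```python
# Drop the oldest non-system messages until the conversation fits a token budget.
# A real implementation would use the model's actual tokenizer, not this estimate.

def estimate_tokens(message: dict) -> int:
    return len(message["content"]) // 4 + 4  # +4 for role/formatting overhead

def trim_history(messages: list[dict], budget: int = 200_000) -> list[dict]:
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    total = sum(estimate_tokens(m) for m in system + rest)
    while rest and total > budget:
        total -= estimate_tokens(rest.pop(0))  # forget the oldest message first
    return system + rest

history = [
    {"role": "system", "content": "You are a coding assistant."},
    {"role": "user", "content": "Refactor this function..."},
    {"role": "assistant", "content": "Sure, here is a refactor..."},
]
print(trim_history(history, budget=50))
```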