Embedding Models for Different LLM Versions (GPT, Claude, etc.) in Cursor

litecode · October 2, 2024, 11:31am

I don’t have an answer for your specific questions, but I have seen some bits and pieces around the forum and in the docs that may be of interest.

Based on these bits of information, my assumption is that:

In all interactions (Ctrl + K, Ctrl + L, Ctrl + I, Cursor Tab and Apply), Cursor does not just embed input using the same model as the selected LLM, send it to the LLM and return a response from the LLM
Rather, I imagine there is a more sophisticated play of deconstructing inputs and outputs, using different functions for different tasks and optimisations, with different parts of the data

But that is a guess.

On Cursor Tab and custom models:

Cursor Tab is our native autocomplete feature…powered by a custom model, Cursor Tab can: Suggest edits around your cursor, not just insertions of additional code; Modify multiple lines at once; Make suggestions based on your recent changes and linter errors.

Source: https://docs.cursor.com/tab/overview

Our custom models are hosted with Fireworks…

Source: https://www.cursor.com/security#infrastructure

On prompt building:

Are requests always routed through the Cursor backend?
Yes! Even if you use your API key, your requests will still go through our backend! That’s where we do our final prompt building.

Source: https://docs.cursor.com/privacy/privacy

On inference, embedding and codebase context (when enabled):

At inference time, we compute an embedding, let Turbopuffer do the nearest neighbor search, send back the obfuscated file path and line range to the client, and read those file chunks on the client locally. We then send those chunks back up to the server to answer the user’s question.

Source: https://www.cursor.com/security#indexing

If you choose to index your codebase, Cursor will upload your codebase in small chunks to our server to compute embeddings, but all plaintext code ceases to exist after the life of the request. The embeddings and metadata about your codebase (hashes, obfuscated file names) are stored in our database, but none of your code is.

Source: https://docs.cursor.com/privacy/privacy#does-indexing-the-codebase-require-storing-code

Related posts:

Note these posts are just provided for reference, for the most up to date details refer to the security and privacy pages and the docs.

Topic		Replies	Views
Inquiry: AI Model Utilization in Cursor's Different Modes Discussions	2	245	November 1, 2024
Are all models cursor uses just APIs? Or do they host any of their own? Discussions	2	340	August 1, 2025
Question about Cursor's Tab Completion Model Discussions	3	1131	April 18, 2025
How Does Cursor Use Pinecone for Public Web Document Embeddings? Help	0	113	May 22, 2025
"Local mode" is misleading, even with BYO OpenAI Key Feedback	10	3140	January 18, 2024

Embedding Models for Different LLM Versions (GPT, Claude, etc.) in Cursor

Related topics