Using deepseek-v4-pro:cloud in Cursor via local Ollama and GitLens — lessons from the reasoning_content mess

Outbad · April 28, 2026, 8:27am

DeepSeek V4 (`deepseek-v4-pro:cloud`) in Cursor via local Ollama + GitLens

What / why: Pointing Cursor straight at the DeepSeek API often breaks on V4 + thinking + tools, because callers must replay reasoning_content correctly; Cursor’s own OpenAI-compatible path isn’t fully there yet. Workaround: run deepseek-v4-pro:cloud through local Ollama, point GitLens → Ollama, then add the same model id under Cursor → Settings → Models → Add Custom Model.

My experience: Agent mode works — tokens billed on the DeepSeek side. Subagents / nested agent routing do not work — that chain is different and longer; this bridge doesn’t land there.

Prerequisites (short)

Install Ollama — get it from ollama.com → Downloads (Windows: PowerShell one-liner on the same page; Linux/macOS: install script linked there too).
Ollama account in browser — sign-up, then device keys / API keys where the dashboard asks (cloud usage).
DeepSeek — sign up at platform.deepseek.com, issue an API key, add a small balance.
Cursor + GitLens extension (Extensions sidebar → search GitLens → GitKraken).
Accept local CPU/RAM churn — Ollama still runs as a process on your box.

Steps

A — Ollama + cloud model

Put DeepSeek’s API key where your Ollama version asks when you use deepseek-v4-pro:cloud (exact labels change — follow in-app prompts / Ollama help).
Terminal 1:
```
ollama run deepseek-v4-pro:cloud
```
(Often run from pwsh on Windows — easier lifecycle.)
Terminal 2:
```
ollama ps
```
Copy the exact model name printed (often deepseek-v4-pro:cloud).

B — GitLens ↔ Ollama in Cursor

Command Palette (Ctrl+Shift+P): run GitLens: Switch AI Provider / Model (wording may differ).
Provider: Ollama.
Base URL GitLens expects (OpenAI-compatible):
http://localhost:11434/v1
Paste Ollama API key when asked.
Pick the model id identical to ollama ps (including the :cloud tag if shown).

C — Cursor model list

Cursor Settings (gear) → sidebar Models → Add Custom Model → paste the same string → confirm / restart Cursor if picker is stale.

D — Optional one-liner “prime” (experimental)

Paste once in the shell where ollama run is alive:

Keep reasoning on; across tool-heavy turns preserve full assistant payloads so thinking + tools chains stay coherent.

(YMMV — not a substitute for correct host URL / keys.)

Honest limits

Yes	No
Agent in Cursor with billed usage	Cursor subagent UX / routing
Heavier chats vs flaky direct DeepSeek hostname in Cursor for some setups	Guaranteed on every Cursor/Ollama update — retest after upgrades

Disclaimer

Not affiliated with Cursor, Ollama, or DeepSeek. Blur API keys in any screenshot. If it breaks, check ollama ps, GitLens version, Cursor → Help → About.

Good luck.

Sgroove · May 15, 2026, 5:21pm

you can this do 100% free also with litellm and a clouflare tunnel.

Topic		Replies	Views
How to use DeepSeek Code in Cursor — Updated February 2026 Guides byok	1	2876	March 4, 2026
How to use MiniMax-M2.7 in Cursor Guides byok	1	254	May 25, 2026
Cursor ignoring your custom provider? Here's how to fix it Built for Cursor byok	0	96	June 25, 2026
Forcing Opus to use subagents instead of reading files itself: no quality loss, ~4× token reduction Guides context , large-codebases , anthropic , subagents	0	158	April 24, 2026
How can I use a local LLM on my desktop/AI computer? Help byok	9	2766	July 7, 2026