Sub-agents are not using custom OpenAI base URLs

Where does the bug appear (feature/product)?

Cursor IDE

Describe the Bug

I’ve currently reached my usage limit, so I’m using a custom OpenAI base URL while waiting for my usage to refresh.

However, sub-agents seem to be malfunctioning: they report that the model is unavailable in the slow pool. This message usually only appears when I don’t have a custom OpenAI base URL enabled.

Note that the main agent is working fine; only the sub-agents are failing. Could you please tell me whether this is a bug, or an intentional Cursor restriction to minimize the use of custom endpoints?

Steps to Reproduce

Enable a custom OpenAI base URL and instruct the agent to call a sub-agent.

Expected Behavior

Sub-agents should use the custom base URL and function normally, as in previous versions.

Screenshots / Screen Recordings

Operating System

macOS

Version Information

Version: 2.5.20
VSCode Version: 1.105.1
Commit: 511523af765daeb1fa69500ab0df5b6524424610
Date: 2026-02-19T20:41:31.942Z
Build Type: Stable
Release Track: Early Access
Electron: 39.4.0
Chromium: 142.0.7444.265
Node.js: 22.22.0
V8: 14.2.231.22-electron.0
OS: Darwin arm64 25.3.0

For AI issues: which model did you use?

Opus 4.6

Does this stop you from using Cursor

Yes - Cursor is unusable

Hey, thanks for the report. Looks like this is a bug. Sub-agents aren’t inheriting your custom OpenAI Base URL settings, so they fall back to Cursor’s servers and hit the “slow pool” error because the fast requests have already been used up.
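To make the failure mode concrete, here is a minimal sketch of that kind of inheritance bug. All names here are hypothetical and illustrative only, not Cursor’s actual implementation: the point is that a sub-agent spawned without copying the parent’s provider override silently falls back to the default (metered) endpoint.

```python
# Hypothetical sketch of the inheritance bug described above.
# None of these names come from Cursor's codebase.
from dataclasses import dataclass
from typing import Optional

DEFAULT_BASE_URL = "https://api.cursor.example/v1"  # placeholder default endpoint


@dataclass
class AgentConfig:
    model: str
    base_url: Optional[str] = None  # custom OpenAI base URL override, if any

    def effective_base_url(self) -> str:
        # Fall back to the default endpoint when no override is set.
        return self.base_url or DEFAULT_BASE_URL


def spawn_subagent_buggy(parent: AgentConfig) -> AgentConfig:
    # Bug: the override is dropped, so the sub-agent silently
    # falls back to the default (metered) endpoint.
    return AgentConfig(model=parent.model)


def spawn_subagent_fixed(parent: AgentConfig) -> AgentConfig:
    # Expected behavior: the sub-agent inherits the parent's override.
    return AgentConfig(model=parent.model, base_url=parent.base_url)


parent = AgentConfig(model="my-model", base_url="http://litellm.internal:4000/v1")
assert spawn_subagent_buggy(parent).effective_base_url() == DEFAULT_BASE_URL
assert spawn_subagent_fixed(parent).effective_base_url() == parent.base_url
```

This also matches the symptom later in the thread: requests that should go to the user’s own endpoint instead consume Cursor-side (on-demand/slow-pool) usage.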

Could you please grab the Request ID from the failed sub-agent chat? Top-right chat menu > Copy Request ID. That’ll help us investigate.

Hi, we’ve been running into the same problem. Have any workarounds been discovered? Is there an estimated timeline for a fix?

Hey, unfortunately there’s no workaround for this yet. Sub-agents don’t inherit custom API key or base URL settings, and there’s no way for the user to bypass this on their side.

I’ll update the thread when there’s a fix.

Getting the same issue. I was wondering why my On Demand usage goes up when I’ve only been using my own custom API keys (OpenAI override).

So this is 99% likely the cause: sub-agents consume On Demand usage instead of inheriting the agent’s custom model settings.

Any update on this?

Hey, no updates on this bug yet. Sub-agents still don’t inherit custom API keys or the base URL settings. There isn’t a workaround yet either.

The team is aware of the issue. I’ll update this thread when there’s progress.

Where does the bug appear (feature/product)?

Cursor CLI

Describe the Bug

When using Cursor Agent, I get ERROR_NETWORK_ERROR with [resource_exhausted] in the stack. The stack traces runAgentLoop → streamFromAgentBackend → getAgentStreamResponse. I want OpenAI-compatible traffic to go to my LiteLLM gateway (self-hosted behind AWS ELB). curl from my environment to the same base URL works (GET /v1/models, POST /v1/chat/completions with stream: true). LiteLLM pod logs during Cursor failures often show only /health/liveliness and /health/readiness, not POST /v1/chat/completions, so it appears Cursor is not calling my gateway for this flow (or the failure happens before any request reaches LiteLLM).
Environment:
Cursor: 3.0.4 (Stable), VS Code 1.105.1, macOS Darwin arm64

Model: custom OpenAI-compatible (e.g. LiteLLM alias), base URL like http://:4000/v1 (HTTP)
Steps to reproduce:
1. Configure OpenAI API + Override OpenAI Base URL (or an OpenAI-compatible provider) to point at LiteLLM.
2. Add/select a custom model (e.g. my-sonnet).
3. Open Agent and use it until the error appears.
Expected:
Chat/Agent requests should hit LiteLLM (visible as POST /v1/… in LiteLLM access logs) when BYOK / custom base URL is configured.
Actual:
ERROR_NETWORK_ERROR / resource_exhausted; LiteLLM often does not log POST /v1/… when the error occurs.
Request IDs (examples):
a570bab3-81b9-4734-b2e1-ea1795ca4ce6

(add others from Copy Request ID in the chat menu if you have more)
Evidence:

Same base URL and model work with curl to /v1/models and /v1/chat/completions (streaming).
kubectl logs on LiteLLM during failure: no matching POST /v1/ lines.
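For reference, the two curl checks above can be reproduced without sending any traffic by composing the same requests a client should make against a configured base URL. The host below is a placeholder, not the reporter’s actual gateway; the helper also shows why `.../v1` and `.../v1/` should resolve to identical endpoints.

```python
# Compose (without sending) the two requests used to verify the gateway:
# GET {base}/models and POST {base}/chat/completions with stream=true.
# The host is a placeholder; substitute your own LiteLLM endpoint.
import json

BASE_URL = "http://litellm.internal:4000/v1"  # hypothetical gateway


def endpoint(base: str, path: str) -> str:
    # Ensure exactly one slash between base URL and path, so a
    # trailing slash on the configured base URL changes nothing.
    return base.rstrip("/") + "/" + path.lstrip("/")


models_url = endpoint(BASE_URL, "models")
chat_url = endpoint(BASE_URL, "chat/completions")

# Minimal streaming chat-completions payload, as used in the curl test.
payload = json.dumps({
    "model": "my-sonnet",  # LiteLLM alias from the report
    "stream": True,
    "messages": [{"role": "user", "content": "ping"}],
})

assert models_url == "http://litellm.internal:4000/v1/models"
assert chat_url == "http://litellm.internal:4000/v1/chat/completions"
```

If Cursor were hitting the gateway, these are exactly the `POST /v1/chat/completions` lines that should appear in the LiteLLM access logs but don’t.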
Questions:
Does Agent use the cursor streamFromAgentBackend path for all model calls, or should OpenAI-compatible BYOK traffic go directly to my URL?
Are sub-agents or other Agent steps known not to inherit custom base URL (see forum thread on sub-agents)?

Is resource_exhausted here from Cursor’s side (quota) rather than my provider?
Request ID: a570bab3-81b9-4734-b2e1-ea1795ca4ce6
{"error":"ERROR_NETWORK_ERROR","details":{"title":"Network Error","detail":"We're having trouble connecting to the model provider. This might be temporary - please try again in a moment.","isRetryable":true,"additionalInfo":{},"buttons":[],"planChoices":[]},"isExpected":true}
[resource_exhausted] Error
rae: [resource_exhausted] Error
at Ntw (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:43958:24479)
at Ptw (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:43958:23385)
at Htw (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:43959:6355)
at j5u.run (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:43959:11154)
at async qIn.runAgentLoop (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:56301:11753)
at async s0d.streamFromAgentBackend (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:56371:11057)
at async s0d.getAgentStreamResponse (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:56371:17161)
at async s3e.submitChatMaybeAbortCurrent (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:44070:19892)
at async mu (vscode-file://vscode-app/Applications/Cursor.app/Contents/Resources/app/out/vs/workbench/workbench.desktop.main.js:55354:4887)

Steps to Reproduce

Choosing my own custom model.

Expected Behavior

I should get an output.

Operating System

macOS

Version Information

Version: 3.0.4
VSCode Version: 1.105.1
Commit: 63715ffc1807793ce209e935e5c3ab9b79fddc80
Date: 2026-04-02T09:36:23.265Z
Layout: editor
Build Type: Stable
Release Track: Default
Electron: 39.8.1
Chromium: 142.0.7444.265
Node.js: 22.22.1
V8: 14.2.231.22-electron.0
OS: Darwin arm64 24.6.0

For AI issues: which model did you use?

Custom model option pointing to LiteLLM

Does this stop you from using Cursor

Yes - Cursor is unusable

Is this fixed in Cursor 3? We really want to maximize use of our custom API keys without incurring further costs on our plan.