Add Autonomous “Goal Mode” Similar to Claude Code’s /goal

Feature request for product/service

Cursor IDE

Describe the request

Claude Code recently introduced a /goal feature that enables true autonomous multi-turn execution inside the coding workflow.

Example:

/goal all authentication tests pass and lint is clean

The agent continues working across multiple turns automatically until the completion condition is satisfied.

This is significantly more powerful than simple chat persistence or custom instructions because it introduces:

completion-condition driven workflows
autonomous continuation between turns
iterative self-correction
evaluator-based stopping conditions
unattended long-running development tasks

This would be an incredible addition to Cursor.

Suggested behavior:

Define a goal for the current session/workspace
Cursor agent keeps iterating until:
tests pass
build succeeds
acceptance criteria are met
issue checklist is completed
Automatic progress evaluation after each iteration
Clear visibility of:
current goal
iterations performed
why execution continues/stopped
Optional limits:
max iterations
token budget
timeout
Support for autonomous refactors and migration tasks

Example use cases:

“Migrate project to React 19 until build passes”
“Fix all TypeScript errors without modifying generated files”
“Refactor auth module until all tests succeed”
“Resolve all ESLint issues in /frontend”

This would push Cursor much closer to a true agentic development environment instead of a turn-by-turn assistant.

More Example Use Cases

Frontend

  • “Upgrade Next.js app to App Router until production build passes”

  • “Fix all hydration issues in the React app”

  • “Migrate Tailwind v3 → v4 without breaking UI”

  • “Remove all deprecated Vue APIs and ensure tests pass”

Backend

  • “Refactor Express API to Fastify while keeping all integration tests green”

  • “Migrate REST endpoints to tRPC incrementally”

  • “Resolve all NestJS dependency injection errors”

  • “Optimize slow PostgreSQL queries until benchmarks improve”

TypeScript / Monorepo

  • “Eliminate all any types in /packages/core

  • “Fix all TypeScript errors across the monorepo”

  • “Convert legacy JS modules to strict TypeScript”

  • “Standardize ESLint and Prettier configs across workspaces”

DevOps / Infrastructure

  • “Fix failing GitHub Actions workflows”

  • “Migrate Docker Compose setup to Kubernetes manifests”

  • “Reduce CI runtime below 10 minutes”

  • “Update Terraform modules until terraform plan is clean”

Mobile

  • “Upgrade React Native app to latest stable version”

  • “Fix all Android build failures”

  • “Migrate Expo SDK to latest version”

  • “Resolve iOS Swift compatibility warnings”

Python / AI

  • “Refactor FastAPI project to async architecture”

  • “Fix all mypy typing issues”

  • “Optimize PyTorch training pipeline memory usage”

  • “Update LangChain implementation to latest APIs”

Java / JVM

  • “Migrate Spring Boot 2 → 3 until all tests pass”

  • “Upgrade Gradle and fix build incompatibilities”

  • “Replace deprecated Hibernate APIs”

  • “Modularize legacy Java services”

Go

  • “Refactor project to context-aware request handling”

  • “Eliminate data races detected by go test -race

  • “Migrate REST handlers to gRPC”

  • “Improve benchmark performance without changing API behavior”

Rust

  • “Resolve all Clippy warnings”

  • “Refactor async runtime to Tokio”

  • “Remove unnecessary clones and allocations”

  • “Make project compile on stable Rust”

Security / Quality

  • “Fix all high severity npm audit issues safely”

  • “Add input validation to all public API endpoints”

  • “Resolve OWASP security findings”

  • “Increase test coverage above 80%”

Autonomous Multi-Step Tasks

  • “Analyze the repository, create a migration plan, execute it step-by-step, and stop only when CI passes”

  • “Continuously fix failing tests until the suite is green”

  • “Refactor duplicated business logic across services while preserving behavior”

  • “Upgrade dependencies incrementally and rollback problematic updates automatically”

Hey, thanks for the detailed feature request and the use case examples. It really helps us see what kind of UX you expect from autonomous mode.

Got it, the idea is a declarative goal plus autonomous iterations until a condition is met. While there isn’t an official Goal Mode yet, you can already put together part of this workflow using public Cursor features:

  • Skills Agent Skills | Cursor Docs You can define a custom skill that the agent triggers, like iterate until tests pass and lint is clean. This is the closest to a /goal-style UX.
  • Plan Mode Plan Mode | Cursor Docs For breaking the goal into steps before an autonomous run.
  • Cloud Agents Cloud Agents | Cursor Docs Autonomous execution in a VM, great for long migrations or refactors like in your examples.
  • Automations Automations | Cursor Docs Cron or webhook triggers if you need to rerun until you get the result.

It’s not a 1 to 1 match with what you described, but a lot of scenarios like framework migration, fix all TS errors, or driving tests to green are already possible by combining Plan Mode plus Cloud Agent plus a custom skill with a completion condition.

If other users also want this and the thread gets upvotes, it helps with prioritization. I can’t share an ETA.

Under the hood goal mode is nothing more than a prompt that ends with “test the required implementation before you finish work”.

Hey @99power, I partly agree. At a basic level, this kind of behavior can be done with a prompt plus auto-run. In Cursor, you can also set up a custom Skill Agent Skills | Cursor Docs with a completion condition, and the agent will keep running iterations on its own.

A separate Goal Mode feature would add a UX layer on top of that. A clear completion condition, an iteration counter, visibility into why the agent is continuing or stopping, and limits for iterations, tokens, or time. So the value isn’t the loop itself, it’s the formalization and control.

For long-running execution, part of this is already available via Cloud Agents and the Long-running toggle. A separate Goal Mode isn’t currently planned with a specific ETA. We’re watching upvotes and the thread discussion to help prioritize it.

I agree Cursor already has many of the ingredients here: Agent mode, Plan Mode, todos, Skills, Cloud Agents, Automations, checkpoints, queued messages, and auto-run controls. Those are all useful. But I don’t think they are equivalent to a real first-class /goal mode.

The missing thing is not “can the agent do multiple steps?” Cursor can already do that. The missing thing is a persistent goal contract: an explicit objective, explicit completion criteria, durable progress state, and a control loop that keeps evaluating whether the goal is actually complete, genuinely blocked, or still needs more work.

A Skill can describe this behavior, but it is still prompt text. It does not own lifecycle state. It cannot reliably enforce “do not stop until the goal is satisfied or blocked.” It cannot expose a goal status, persist across sessions, show a completion audit, or provide a native resume/continue behavior.

Plan Mode is also not enough. Plan Mode is great for decomposition, review, and turning work into steps, but a plan is not the same as an autonomous completion contract. Plans help the agent know what to do; Goal Mode would define when the agent is allowed to stop.

Cloud Agents are not enough either. Cloud Agents solve the execution environment problem: run elsewhere, run longer, work in parallel, continue while my laptop is disconnected. But running in the cloud is not the same as having first-class goal semantics. A cloud agent can still stop early, drift from the original objective, complete a plan without satisfying the actual goal, or mark work done without a clear evidence trail.

Automations are useful too, but they trigger work. They do not make a single run maintain a durable objective and repeatedly judge itself against a definition of done.

So I think /goal should be a real feature, not just a Skill pattern. It could still compose with all the existing pieces:

  • Plan Mode creates or refines the implementation plan.
  • Todos track the current breakdown.
  • Skills provide project-specific workflows and quality bars.
  • Cloud Agents provide the long-running/background execution environment.
  • Automations can start goals on schedules or events.
  • Checkpoints make the work recoverable.
  • But Goal Mode owns the lifecycle: active, paused, blocked, resumed, complete.

The feature I would want is something like:

/goal Fix the failing CI and open a ready PR. Done means all relevant checks pass locally or in CI, the PR description explains the fix, and any remaining failure is proven unrelated.

Then Cursor should keep a visible goal object attached to the agent run:

  • Objective
  • Completion criteria
  • Current plan/todos
  • Evidence collected so far
  • Status: active, waiting, blocked, complete, failed, paused
  • Last self-evaluation
  • Reason for stopping
  • Resume/continue button
  • Optional budget limits for time, tool calls, tokens, or spend

Most importantly, when the agent stops, it should have to say which completion criteria are satisfied and what evidence proves that. If it is blocked, it should state the exact blocker and the smallest user input needed to continue. That is very different from “the model decided it was probably done.”

This matters because long-running agent work fails in predictable ways: premature stopping, objective drift, hidden uncertainty, vague “done” claims, and loss of context between turns. A first-class Goal Mode would make Cursor much safer and more trustworthy for serious delegation. It turns “try this prompt and hope the agent keeps going” into “track this objective until the definition of done is met or a real blocker is reached.”

For small tasks, normal Agent mode is fine. For serious work, especially multi-step engineering work, /goal is the right abstraction. The current pieces are close, but they still put too much responsibility on the user to simulate persistence and stopping rules through prompting. This deserves to be a native Cursor feature.

Good breakdown, @coygeek. I agree with the key distinction: lifecycle state, an evidence trail, and an explicit stopping contract aren’t the same thing as composable Skill + Plan Mode + Cloud Agent. A Skill is still just prompt text without durable state, and “the agent decided it was probably done” is a different failure mode compared to “completion criteria N out of M satisfied, evidence: X.”

The point about premature stopping, objective drift, and vague done claims as a class of problems fits well as justification for why a separate abstraction could make sense on top of existing building blocks.

We don’t have a concrete plan for a first-class Goal Mode right now, so I can’t share an ETA. If more voices and use cases show up in the thread, that helps with prioritization.

I strongly agree with the distinction here:

Plans help the agent know what to do; Goal Mode would define when the agent is allowed to stop.

That feels like the missing abstraction.

One thing I would add: a goal object probably needs an audit surface, not just a progress surface. For long-running agent work, the hard part is not only tracking the current plan, but proving why the agent stopped: which completion criteria are satisfied, what evidence supports that, or what exact blocker prevents completion.

In my own experiments, the most useful shape has been:

  • objective
  • completion criteria
  • lifecycle state
  • evidence collected
  • last self-evaluation
  • reason for stopping

The key is that the stopping condition should be durable and inspectable, not just a sentence in the final response.

So I agree this should be native rather than only a Skill pattern. Skills can describe behavior, but they do not own lifecycle state.

Good point, @joinwell52. Splitting a progress surface and an audit surface sounds right. Progress is for the user while work is happening, and audit is for the moment you need to trust the result, what criteria were met, what evidence there is, or what exact blocker exists. This solves the agent said it was done problem in long-running scenarios, which is hard to fix with prompt engineering alone.

We don’t have a concrete plan for a first-class Goal Mode yet, and I can’t give an ETA. The discussion in this thread, with concrete use cases and the proposed goal object structure, helps us prioritize. If other users have their own scenarios or requirements for audit fields, please share them here. The more detail, the better.

Codex and Claude are hyping this feature up hard, it works very well in Codex (haven’t tried Claude yet) and I would recommend you prioritize adding it to keep your APP competitive. It’s auto-research distilled into a /command. Hermes also implemented this.

Here’s a comprehensive guide on Goal (from competitors):

Claude Code

Docs:

https://code.claude.com/docs/en/goal

Released May 11

https://github.com/anthropics/claude-code/releases/tag/v2.1.139

Hermes

Docs

https://hermes-agent.nousresearch.com/docs/user-guide/features/goals

Released May 07

https://github.com/NousResearch/hermes-agent/releases/tag/v2026.5.7

OpenAI

Docs

https://developers.openai.com/codex/use-cases/follow-goals
https://developers.openai.com/cookbook/examples/codex/using_goals_in_codex

Released April 30

https://github.com/openai/codex/releases/tag/rust-v0.128.0

OpenCode

Not native, but 3rd party github projects exist
https://github.com/secemp9/goal
https://github.com/prevalentWare/opencode-goal-plugin
https://github.com/willytop8/OpenCode-goal-plugin

PI

Not native, but 3rd party github projects exist
https://github.com/fitchmultz/pi-cursor-sdk
https://github.com/fitchmultz/pi-codex-goal

  • Currently using and this works with Composer 2.5

OpenClaw

Released on main on 2026-05-29

https://github.com/openclaw/openclaw/pull/87469

  • PR was merged into main on 2026-05-29, will be released soon.

Docs

https://docs.openclaw.ai/tools/goal

  • Docs already describe the feature goal, while we’re waiting for official release

Factory / Droid

  • N/a

Devin

  • N/a

UPDATED: with latest info

Thanks for the detailed roundup, @coygeek, and for the ping, @GamingBestBoy_Bot. The competitive overview is genuinely useful. It shows how different tools formalize the same pattern, and it’s good input for prioritization.

Our position hasn’t changed yet. We don’t currently have plans with a specific ETA for a standalone first-class Goal Mode with a durable lifecycle state, an audit surface, and an explicit stopping contract. I can’t share timelines.

What’s available today for parts of these scenarios:

This isn’t a 1-to-1 match for what you’re describing. The key difference is when the agent is allowed to stop, plus having a durable goal object, and I agree that matters. The more concrete use cases and audit-field requirements you can add to the thread, the better it is for prioritization. Upvotes help too.

Is it coming anytime soon? The real value of goal mode is not the prompt. It is a expectation that the agent will not stop and will work for hours. It must take pragmatic approaches to prevent the agent from stopping and automatically resume the agent if it decide to stop. Prompts or skills cannot do this.

I know it’s not the same and for now it’s only a cloud solution, but something like that is there…

@daisy Have you checked out the /loop skill that was released in Cursor 3.5? (changelog)

would be nice if u guys had /goal and also /feedback. its a habit from codex/claude code to type it but its annoying when it doesnt work.

lol A skill is not even wrong to be compared with a feature like /goal, I can produce a hundred skills a day