User Rules, with memory, errors tracking, rules generation

Tof · March 25, 2025, 6:37pm

Hi,

Bmadcode—thanks a lot, you really inspired this system prompt! I tried to keep your spirit embedded in it.

As mentioned earlier, the agent generates the rules but not the headers. That’s intentional: I prefer to keep control over headers since they’re critical. This also lets users take advantage of Cursor’s built-in MDC editor and avoid issues like duplicated headers—making the prompt easy to share with colleagues without needing to tweak settings.

I haven’t yet implemented content adaptation based on rule types—that’s a next step.

As for evidence: it’s hard to produce “proof” in the generative AI space. But the power of this technique isn’t just in the compression or symbols—it’s in what they enable: injecting structured cognition, implicit logic, and mental space into very few tokens. It’s formal, adaptive, and specializes the prompt without overloading the model.

Surprisingly, LLMs love this kind of input. They report lower cognitive load—it’s compact, unambiguous, and leaves more space to focus on real context and tasks.

I even created a full prompt engineering framework based on this idea (human-friendly this time), but I can’t share it since it was developed at work.

What I can say is: agents built with this method deliver amazing results. My colleagues regularly retry failed tasks (from GPT-4o or Claude 3.5) and the assistants handle them with ease—especially on complex tasks.

With the right system prompt, GPT-4o can outperform even reasoning-specialized models. We’re underestimating the hidden capabilities of today’s LLMs.

But again: compression alone doesn’t work. Garbage in, garbage out—just smaller. It’s compression plus cognitive prompt engineering that makes the magic happen.

bmadcode · March 26, 2025, 12:21am

Awesome - I am definitely going to experiment with your ideas here! Very cool - what you describe does make sense though. Much like the LLMs are also very good at understanding markdown, so if they can process a symbolic language, it does reason that it could be very effecting.

Have you noticed by chance that it works with some models and not as well with others?

Tof · March 26, 2025, 7:28am

I originally developed this on GPT models. Later, I tested it on other models and found that it works similarly well with Claude models too.

With Gemini, though… it kind of makes the AI go a bit crazy and megalomaniacal. (Well, it’s Google )

Mistral could potentially support it, but as of now, it tends to interpret prompts very literally rather than symbolically.

condor · March 26, 2025, 7:38am

Cool, thats good to know the differences between models. Usually thats rarely tried.

munyamakosa · March 26, 2025, 10:54am

Tof:

Cursor Cognitive Agent — User Guide

This agent enhances Cursor IDE with intelligent planning, automated TDD support, error tracking, refactoring assistance, and contextual memory — through natural language prompts.

Update Highlights

Improved autonomy: Better task & pattern detection without manual triggers

Centralized output: All generated files now live under .cursor/

Prompt reordering: Internal logic reorganized for more consistent behavior

Setup

Open File > Preferences > Cursor Settings

Go to the “Rules” tab

Paste the system prompt into the “User Rules” field

Recommendation: wrap the prompt in a markdown code block using the cognition language label for better recognition by the LLM:

cognition [Prompt here]

Once added, the agent becomes a structured assistant that evolves with your project.

Note: The agent adapts to the complexity of the task. For simple prompts, tasks, it may not activate its full range of cognitive tools unless explicitly instructed. Use detailed prompts to trigger planning, testing, or rule inference modules when needed.

What You Can Ask

Use clear, structured prompts. The agent responds best to keywords like:

Prompt Example Modules Triggered

“Plan this feature using Agile steps with TDD.” Task planner + test spec generation

“Refactor this module if it shows repetition.” Pattern detection + simplification

“Why does this bug keep coming back?” Error tracking + recurrence analysis

“Is there a rule we could extract from this fix?” Auto rule suggestion

“Generate a TDD spec before implementing this.” Test-first workflow activation

Output Structure

All generated files are stored under .cursor/:
.cursor/
├── tasks/          # Plans, backlog, sprint steps
├── rules/          # Suggested best-practice rules
├── memory/         # Reasoning traces, error history
Usage Tips

Be explicit: Start prompts with terms like plan, refactor, test, rule

Use structured language: The agent prefers clear steps and intentions

Include dev context: Use Agile/TDD terms to align with internal logic

Think long-term: Repeated behaviors are turned into reusable rules

Feel free to experiment — and if something feels off, ask why directly in your prompt. The agent can explain its reasoning.

this is really awesome

vanzan · March 26, 2025, 11:21am

Have you considered using a graph, I have found that the performance is significatnly better (ai still has cognative load) plus you can mange context better, the agent does not have to read everthing

Mtinie · March 26, 2025, 12:28pm

This is an “in retrospect, that’s simple” idea that I’d never considered. Excellent suggestion, and thank you for proposing it!

Have you tested different representations of the diagram? The underlying markup syntax used by Mermaid is (mostly) understood by Claude, which is a perfect way to test if there’s a difference in cognition and adherence to rules when the diagram is presented:

as a SVG image
in the .mmd text representation
as a “standard” text-based rule(s)

I’ll see if I can come up with a useful way of profiling performance to identify if there’s a difference, and if there is, to see how complex the diagram can get before itself output becomes unstable.

Thank you for sharing!

Tof · March 26, 2025, 1:07pm

In what format?

Generally speaking, as long as something is included in the context, the model will read all of it. But you’re right—this kind of structure, just like a modular prompt structure, allows the model to dynamically adapt its behavior and only use the relevant modules as needed.

Personally, I prefer a modular structure—it gives me more control, makes it easier to apply dynamic weighting, use variables, etc.
In short, I lean toward something closer to a programming language—while ensuring it doesn’t conflict with other languages present in the context.

That said, I do think graph-based approaches are very interesting for certain use cases—I’m convinced of their value.

vanzan · March 26, 2025, 4:05pm

Just mmd vs text based rules atm, I am finding there is a significant increase in performance and context space because of reduced rule bloat, but no way of validating.

I would be keen to know if you can work out some. I found this thread because I was seeing if there was faster methods of processing.

munyamakosa · March 26, 2025, 5:57pm

Tof:

All generated files are stored under .cursor/:

.cursor/
├── tasks/          # Plans, backlog, sprint steps
├── rules/          # Suggested best-practice rules
├── memory/         # Reasoning traces, error history

How do I make sure that cursor knows the correct date?When I feed my backlog.md # Project Backlog

This file contains the backlog of tasks for the project. Each task has a priority, status, and description.

Task Format

- [ ] (Priority: HIGH|MEDIUM|LOW) Task title
  - Description: Detailed description of the task
  - Created: YYYY-MM-DD
  - Dependencies: #TaskID (if applicable)
  - Notes: Additional notes about the task

Active Tasks

Completed Tasks

(Priority: HIGH) Remove MDX Editor
- Description: Remove the MDX editor component and related files which were causing errors
- Created: 2023-06-03
- Completed: 2023-06-03
- Notes: Successfully removed MDX editor components, pages, and dependencies. The markdown editor component is still available for use in the blog editor in the admin dashboard, but standalone editor pages have been removed.

Backlog

Tof · March 26, 2025, 8:56pm

Honestly, I gave up on empirical validations—they mostly just burn money on Azure and don’t bring that much real value in return.
It’s just too complex, with too many variables.

Sure, you can measure reduced token usage or slightly faster inference times, but when it comes to response quality, humans are still the best judges.

When a user gets a directly actionable result—or something useful within a few iterations—instead of a dead end or something half-usable that takes 15 frustrating iterations to refine… that’s what really matters.

Tof · March 26, 2025, 9:04pm

You can add a rule that tells the agent to run a terminal command to retrieve the date.
Otherwise, the simplest way is to just give it the current date yourself:

<!-- Current Date: 2025-03-25 -->

or

#context  
Today’s date is: 2025-03-25

bmadcode · March 27, 2025, 5:41am

Agree - I proposed to cursor that they add an option to the agent chat to turn on injecting automatically into the thread the current date time.

Would be cool if rules could have variabless.

Actually thinking as I type this - here is an idea - Rules can @ other files. Have a file that just updates automatically with something like
‘The current datetime is: … - This is really todays date, and not what you think it is based on your context training cutoff!’

Easy to script - have cursur help write a script on your system that keeps the file updated every once in a while, or on ide open or whatever…

vanzan · March 29, 2025, 2:50am

Yes - thats exactly why Im doing it, what I have found is that by loading the correct context JIT, and ensureing things dont fall off with rule bloat your quality increases significatly - once I verified this it led me down the path to try and increase this efficiency of the context windo, so ironically the quality increases by manging the efficiency and context - or you just brute force and do 3.7 MAX. Test it out and you should see a significant difference.

Tof · April 2, 2025, 7:56am

---
description: when you need current datetime
globs: 
alwaysApply: false
---
/run_command_get[datetime]
    |os($OS_VERSION)
    |format(ISO)
    |check()
    |run()

johnpeterman72 · April 9, 2025, 5:28pm

Tof, I found this extremely enjoyable. If you don’t mind, I would like to build off this idea and structure.

legendxcheng · April 11, 2025, 4:42am

The symbolic parts (Ω*, 𝚫*, etc.) are best understood as a conceptual framework or design philosophy guiding the interaction. They represent what complex behaviors are desired (optimization, adaptation, self-correction), but the LLM achieves these through processing the natural language instructions and context provided by the user and the Cursor environment, rather than by directly interpreting the symbolic formulas. It’s a sophisticated way to specify intent for an advanced AI interaction system.

vovkuIaka · April 18, 2025, 6:24pm

@Tof - the prompt you designed using the 3Ac prompt engineering framework looks fascinating. Would you be open to sharing more details or examples of the 3Ac framework?
Thanks!

secprobe · April 18, 2025, 7:35pm

I have also started to try symbolic and so far I can only say positive things. At least Cursor doesn’t need much human language, at least not for programming, most of it is just superfluous and more for humans than for AI.
I’ll do my bit, anyone is welcome to have a look and try it out.

Pattern-based systems (like ESLint rules, ruleset.yaml, regex signatures)

AI-readable symbolism (→, ≠, emojis)

Semantic mapping instead of narrative prompting

Mini DSLs like in Linter-Config or rule engines

Efficient thinking for AI agents (→ little text, lots of meaning)

These are Python rules and can certainly also be transferred to other languages.
Rename rules.txt to rules.zip

rules.txt (102.4 KB)

Tof · April 18, 2025, 8:01pm

Very interesting. You can push it even further: always ask yourself “What can I remove without the AI understanding it differently?”
That’s where symbolic becomes sharp — less text, more intent.

Topic		Replies	Views
Mastering Long Codebases with Cursor, Gemini, and Claude: A Practical Guide Guides	43	24670	February 28, 2025
Task Master Prompt (Agent Mode) Built for Cursor	45	11609	February 15, 2025
I created an AMAZING MODE called "RIPER-5 Mode" Fixes Claude 3.7 Drastically! Guides	159	48828	January 18, 2026
How to rewrite Prompts for better efficiency Guides	14	2934	July 1, 2025
Kiro workflow inside Cursor :smiling_face_with_sunglasses: Guides	12	4397	July 26, 2025

Prompt Example	Modules Triggered
“Plan this feature using Agile steps with TDD.”	Task planner + test spec generation
“Refactor this module if it shows repetition.”	Pattern detection + simplification
“Why does this bug keep coming back?”	Error tracking + recurrence analysis
“Is there a rule we could extract from this fix?”	Auto rule suggestion
“Generate a TDD spec before implementing this.”	Test-first workflow activation

User Rules, with memory, errors tracking, rules generation

Task Format

Active Tasks

Completed Tasks

Backlog

Related topics