Is there a way to make Cursor rules more reliable, similar to how Promptfoo handles prompt testing?

may relate to #Please make "thinking toggle" visible