BugBot raising Invalid Bug Reports

Where does the bug appear (feature/product)?

BugBot

Describe the Bug

Starting this week, we have been experiencing high volumes of completely invalid bug reports on branches being analyzed by BugBot. The invalid reports cover a range of issues:

  • referencing code from old commits or MAIN that no longer exists in HEAD of the branch (previously did not experience this at all)
  • similarly to above, re-raising the same bug report even after that section of code is changed and the originally reported bug is fixed.
  • complete misunderstanding of the function of the piece of code a bug is raised for (previously for the same code, seemed to have no issues)
  • in packages with multiple files where different files are being worked on in separate branches… bugbot is raising bugs in code that was not changed in files that have no changes in 1 branch, and not in other branches… a very misleading situation where the same exact script has a magical (often invalid) bug in 1 branch despite no changes, and in another is perfectly fine.

This seems to primarily (for us at least) be an issue with Python and BASH scripts.

Steps to Reproduce

For us, we run bugbot automatically on an open GitHub PR that is marked ready for review, and it re-runs on commit. Essentially, all you do is run the bugbot on a branch of code to reproduce this behavior.

Expected Behavior

I expect the bugbot to raise valid reports, not repeat itself, reference the HEAD of branch, and have a contextual awareness of the section of code it suspects a bug may exist in (so as to determine if that’s true or not based on the use case of the snippet it’s targeting)

Screenshots / Screen Recordings

Operating System

Windows 10/11
MacOS

Current Cursor Version (Menu → About Cursor → Copy)

Version: 1.7.38 (user setup)
VSCode Version: 1.99.3
Commit: fe5d1728063e86edeeda5bebd2c8e14bf4d0f960
Date: 2025-10-06T18:18:58.523Z
Electron: 34.5.8
Chromium: 132.0.6834.210
Node.js: 20.19.1
V8: 13.2.152.41-electron.0
OS: Windows_NT x64 10.0.26100

For AI issues: which model did you use?

BugBot

Does this stop you from using Cursor

No - Cursor works, but with this issue

Hey @Deastham

Sorry for the bad experience! We just deployed the improved version, which should surface relevant bugs more often. Could you pls check if that helps?

Also if you see this problem again, could you pls share bugbot request id? It’s available in Github Checks (press on Cursor Bugbot: Details)

I invoked Bugbot again in GitHub on one of my open PR’s and observed:

Bugbot Review

Bugbot Analysis Progress (2m 54s elapsed)

:white_check_mark: Gathered PR context (3s)
:white_check_mark: Analyzed code changes (1s)
:white_check_mark: Completed bug detection — 4 potential bugs found (2m 28s)
:white_check_mark: Validation and filtering completed (0s)
:white_check_mark: Posted analysis results — 3 bugs reported (21s)
:white_check_mark: Analysis completed successfully (0s)

Final Result: Bugbot completed review and found 3 potential issues

Request ID: serverGenReqId_37ff7597-897e-470e-8700-3d890a3f0ec5

of the 3 detected issues:

  • The first bug it reported:
    • No such reference _validate_check_health exists in the code anymore, not around like 317 or within the function defined therein, this class attribute was originally named _validate_check_health and was then replaced by an upgraded version _validate_check_health_threaded but the bugbot continuously threw bugs claiming we were calling the original name, even though that had been fully replaced to calls and class definitions for the new name.
      • in response, I adjusted the naming convention to attempt to get the bot to “see” a fully different function, to no effect, it still seems to be expecting _validate_check_health which does not exist in code anywhere at HEAD of branch
  • The second bug it reported did turn out to be a valid report.
  • The third bug it reported was valid in some use cases, just not in our particular use case.

While the second two bugs reported are new, which represents new behavior from this morning; the first is still the exact same problem we’ve had for the past 24 to 48 hours with reports that seem to originate in a “hallucination”

Additionally, in pushing a fix for the 1 valid bug, it found a new bug in un-changed code between commits… that… is completely invalid.

Here’s the request details:
Bugbot Review

Bugbot Analysis Progress (2m 52s elapsed)

:white_check_mark: Gathered PR context (3s)
:white_check_mark: Analyzed code changes (1s)
:white_check_mark: Completed bug detection — 4 potential bugs found (2m 27s)
:white_check_mark: Validation and filtering completed (0s)
:white_check_mark: Posted analysis results — 3 bugs reported (20s)
:white_check_mark: Analysis completed successfully (0s)

Final Result: Bugbot completed review and found 3 potential issues

Request ID: serverGenReqId_5fd85236-90cf-44c5-a92e-cfa2c6c181e7

Context:

  • the second bug was valid but not relevant to the use case presented
  • the third bug was valid and a fix was identified

Here is another example. It provided a summary of the fix as a PR comment / bug instead of in the description. serverGenReqId_e3bd5749-904c-4dfd-bcc1-83366050ecf3

Did an update get pushed to bugbot since yesterday’s comments?

The latest few evaluations bugbot has done seem to be working as expected, and not raising the invalid bug(s) repeatedly.

→ they also didn’t re-raise reports for bugs that could be valid in some circumstances but were not valid in our particular implementation.

A successful recent evaluation was on:

  • Request ID: serverGenReqId_b8dc97e9-05e7-4ecf-bd28-9643e78ca92d

Hey! Yes, we detected a regression that may have affected a few users for a few days. Should be all resolved now :slight_smile: Thank you for raising this issue and definitely feel free to tag me if something seems wonky in the future (pls not for like one false positive but if something feels largely different)

1 Like

This topic was automatically closed 22 days after the last reply. New replies are no longer allowed.