I have a 200k HTML file that includes a table. When I ask DeepSeek or GPT-4o about the table, or to extract contents from it, they are unable to find it in the file.
It's very unusual to have a 200k file. Even a 10k file is huge. Since Cursor reads files in small increments, it's not likely to read it all at once.
Can the content be processed by a script?
Even browsers may have issues with a file that large, in my experience.
This is a file I'd been working on for a few weeks; with the recent update it seems none of the LLMs are able to parse it.
Agreed, it likely won't happen soon.
Usually websites and web apps contain tens to hundreds of smaller files to prevent exactly this case.
Can you share a bit why the file is so big?
Yeah, it’s big because it’s from a very old website I’m converting.
OK, that's why I asked if it can be split.
For example, if you know the structure or what parts are inside, AI could write a script to extract that data.
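To illustrate the script idea: here is a minimal sketch, assuming a plain HTML `<table>` with `<tr>`/`<td>` rows and no nested tables, that pulls the first table's cells out of the file using only the standard library. The class and function names are made up for the example; adapt the tag handling to whatever structure your old site actually uses.

```python
from html.parser import HTMLParser

class TableExtractor(HTMLParser):
    """Collects cell text from the first <table> in an HTML document."""

    def __init__(self):
        super().__init__()
        self.in_table = False
        self.in_cell = False
        self.done = False      # set once the first table has closed
        self.rows = []
        self._row = []
        self._cell = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if tag == "table":
            self.in_table = True
        elif self.in_table and tag == "tr":
            self._row = []
        elif self.in_table and tag in ("td", "th"):
            self.in_cell = True
            self._cell = []

    def handle_endtag(self, tag):
        if self.done or not self.in_table:
            return
        if tag in ("td", "th"):
            self.in_cell = False
            self._row.append("".join(self._cell).strip())
        elif tag == "tr":
            self.rows.append(self._row)
        elif tag == "table":
            self.in_table = False
            self.done = True   # ignore any later tables

    def handle_data(self, data):
        if self.in_cell:
            self._cell.append(data)

def extract_first_table(html_text):
    """Return the first table as a list of rows (lists of cell strings)."""
    parser = TableExtractor()
    parser.feed(html_text)
    return parser.rows
```

Because the parser is fed incrementally, you can also stream a huge file through it chunk by chunk (`parser.feed(chunk)` in a loop) instead of reading it all at once, then write `parser.rows` out as CSV for the AI to work with.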
Cursor changed recently, right? It used to be able to parse this…
You can download a previous version from the Cursor website if you believe this is a version issue.
It would also be good to know which version you used, which model, whether you had large context enabled in settings, and any other details you can find in the Cursor docs: Common Issues & Troubleshooting Guide.
Even if Cursor hasn't changed, continuing to add content to a large file will eventually hit the context limit, at which point it becomes impossible to process. If the chat session was long, it helps to start a new session, as that clears the context of past messages that can also reach the limit.
Overall, if the structure is the same for the whole table, extracting the data with a script is the long-term solution that will work, and it's what I'd recommend.
Fellow Cursor user here.
Try copying and pasting everything from the file directly into the chat box together with your instructions, and don't reference any other files there at all. If the contents of that prompt exceed the model's context window, it will probably refuse to process it and Cursor will tell you so.
If this doesn't work, go to Gemini on the web and use 2.5 Pro; it should handle it just fine.
I had success going back to version 0.45.
0.48 was unable to parse the file at all; it acted as if it wasn't there. Perhaps it was trying to poke around my enormous codebase, even when I didn't want it to. (I only use "ask" mode, and only work on one or two files at a time.)
Yes, that's why I suggested it in this specific case, though I wouldn't suggest it otherwise, due to other issues that were fixed later. Since you didn't report other issues with the previous version, it's a reasonable temporary workaround.
The Cursor editor itself doesn't parse any files at all; the difference is that later versions use more advanced capabilities of the AIs, plus fixes for other issues users had. That likely means context is managed differently over a thread, especially in Agent mode.
One option I didn't mention yet is the model Gemini 2.5 Pro (exp). It's still labeled experimental by Google, though it has a 1M-token context in MAX mode. MAX mode is more complex and has usage-based pricing, and Google is still working on many fixes, but the Cursor team mentioned in other threads that they have reported issues to Google so it can be properly supported.
I'm glad it worked for you. You're right to use Ask mode.
With anything over 1200 lines, my agent can't edit the file. Anyone else got the same issue?
Yes, install 0.45 from here:
Then go to Settings and turn off VS Code updates.
Sorry, are you talking to me?
This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.