Document Indexing Fails After Latest 0.43.5 Update

Thank you for looking into this issue and helping resolve it!

I just attempted to re-index some doc sources, and it looks like the indexing is working now.

Log here:

2024-12-16 01:55:11.168 [info] Creating Indexing Repo client:  https://repo42.cursor.sh
2024-12-16 01:55:11.337 [info] InternalRepoInfo constructor with preferredEmbeddingModel 0
2024-12-16 01:55:11.347 [info] Creating merkle client.
2024-12-16 01:55:11.347 [info] Done creating merkle client.
2024-12-16 01:55:11.347 [info] Doing a startup handshake.
2024-12-16 01:55:11.362 [info] Finished initializing merkle tree in 4.890480000000025 ms.
2024-12-16 01:55:11.363 [info] Doing the initial handshake with hash: 1a0d9f57918cb7b3c26bf89ba78555f0a04e4450ebab899e6efe8efbfb3438cc
2024-12-16 01:55:11.366 [info] Handshake start
2024-12-16 01:55:11.710 [info] Handshake timing: 347.172544
2024-12-16 01:55:11.711 [info] Handshake result: {"status":"STATUS_UP_TO_DATE","repoName":"b4203cac-67ca-4257-a907-12a6cd3c4727"}
2024-12-16 01:55:11.711 [info] Starting fast remote sync.
2024-12-16 01:55:11.711 [info] Total num embeddable files: 1
2024-12-16 01:55:11.716 [info] Root hash: 1a0d9f57918cb7b3c26bf89ba78555f0a04e4450ebab899e6efe8efbfb3438cc
2024-12-16 01:55:11.716 [info] Finished indexing repository.
2024-12-16 01:55:11.716 [info] Indexing job successfully done or aborted.
2024-12-16 01:55:12.968 [info] Creating Indexing Repo client:  https://repo42.cursor.sh
2024-12-16 01:55:12.970 [info] InternalRepoInfo constructor with preferredEmbeddingModel 0
2024-12-16 01:55:12.973 [info] Creating merkle client.
2024-12-16 01:55:12.973 [info] Done creating merkle client.
2024-12-16 01:55:12.973 [info] Doing a startup handshake.
2024-12-16 01:55:12.980 [info] Finished initializing merkle tree in 0.6869209999999839 ms.
2024-12-16 01:55:12.981 [info] Doing the initial handshake with hash: 1a0d9f57918cb7b3c26bf89ba78555f0a04e4450ebab899e6efe8efbfb3438cc
2024-12-16 01:55:12.983 [info] Handshake start
2024-12-16 01:55:13.251 [info] Handshake timing: 270.34418300000016
2024-12-16 01:55:13.252 [info] Handshake result: {"status":"STATUS_UP_TO_DATE","repoName":"b4203cac-67ca-4257-a907-12a6cd3c4727"}
2024-12-16 01:55:13.252 [info] Starting fast remote sync.
2024-12-16 01:55:13.252 [info] Total num embeddable files: 1
2024-12-16 01:55:13.252 [info] Root hash: 1a0d9f57918cb7b3c26bf89ba78555f0a04e4450ebab899e6efe8efbfb3438cc
2024-12-16 01:55:13.252 [info] Finished indexing repository.
2024-12-16 01:55:13.252 [info] Indexing job successfully done or aborted.
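
For anyone comparing their own logs against the one above, here is a minimal sketch that condenses a log in this format into one summary per indexing run. The function name `summarize_indexing_log` and the summary keys are hypothetical, and the parsing assumes the line layout seen above (timestamp, `[level]`, message); it is not part of Cursor itself.

```python
from __future__ import annotations

import json
import re

# Matches lines such as:
# 2024-12-16 01:55:11.711 [info] Handshake result: {...}
LOG_LINE = re.compile(r"^(\S+ \S+) \[(\w+)\] (.*)$")


def summarize_indexing_log(text: str) -> list[dict]:
    """Return one summary dict per indexing run found in the log text.

    A run is assumed to start at each "Creating Indexing Repo client" line.
    """
    runs: list[dict] = []
    current = None
    for raw in text.splitlines():
        match = LOG_LINE.match(raw.strip())
        if not match:
            continue  # skip blank lines or lines in another format
        timestamp, _level, message = match.groups()
        if message.startswith("Creating Indexing Repo client"):
            current = {
                "started": timestamp,
                "status": None,
                "handshake_timing": None,  # unit is not stated in the log
                "finished": False,
            }
            runs.append(current)
        elif current is None:
            continue  # ignore lines that appear before the first run starts
        elif message.startswith("Handshake timing:"):
            current["handshake_timing"] = float(message.split(":", 1)[1])
        elif message.startswith("Handshake result:"):
            payload = json.loads(message.split(":", 1)[1])
            current["status"] = payload.get("status")
        elif message.startswith("Finished indexing repository"):
            current["finished"] = True
    return runs
```

Passing the full log text to `summarize_indexing_log` and printing the result gives a quick view of whether each run reached a handshake status and finished.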

Some sources that were failing before have completed indexing now, and some other sources are still indexing (for me), but this seems to be working now! :partying_face:

Thank you for your help restoring this functionality @ericzakariasson !!

Hey, thank you so much. I am now able to index the Drizzle docs.
Also, there are many docs that we sometimes can’t index because of this problem, so we hope this will be fixed in the upcoming update.

Also, is it possible to index the docs of any GitHub repo, or any GitHub repo itself?

glad to hear it’s working now!

yes, github docs should be indexable for sure. let me know if you run into any that don’t work

Yeah, there is this doc from Brevo (formerly SendInBlue) for TypeScript Node: https://github.com/getbrevo/brevo-node

I wasn’t able to index it: it keeps showing as indexing but never finishes, and no failed status appears. The only status shown is ‘Indexing [some-page]’.

I left it indexing these docs for 7 hours, with no luck.

So there are still some problems. I’ll share any docs indexing issues here whenever I encounter them.

Cursor is my favorite, and I’ll keep sharing bugs and errors here to help you (the development team) make it the best :100: (not just better).

Thank You!

Best Wishes,
Sujay

thank you! just looked into the docs, is it this url?

or are you referring to indexing complete github repositories (code)?

Actually, they have docs for each currently supported language (navigate to the ‘Supported Programming Languages’ section from their menu).

When you go to https://developers.brevo.com/docs/api-clients and select ‘Typescript Node’ or any other programming language, you are redirected to the GitHub repo for the language you selected.

I know their docs are not well organised.

So, this is the only way to index their Typescript Node docs.

I only want to index this one: https://github.com/getbrevo/brevo-node

afaik we don’t support indexing of documentation code, just READMEs and other websites. i’ll make sure to pass this along though, thanks for the feedback!

My goal was only to use this repo’s README, but I wasn’t able to.

Can you help me index only the README for the repo?

Why not just copy/paste the raw README.md from the Github Repo into your .cursorrules file for your project? :wink:
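
If you would rather script that copy/paste, here is a rough sketch. It assumes the repo's default branch is `main` and that the file is named `README.md` (neither is confirmed in this thread), and the helper names `raw_readme_url`, `fetch_text`, and `append_to_cursorrules` are made up for illustration.

```python
from pathlib import Path
from urllib.request import urlopen


def raw_readme_url(owner: str, repo: str, branch: str = "main",
                   path: str = "README.md") -> str:
    """Build the raw.githubusercontent.com URL for a file in a public repo.

    The default branch name "main" is an assumption; pass the real one if it differs.
    """
    return f"https://raw.githubusercontent.com/{owner}/{repo}/{branch}/{path}"


def fetch_text(url: str) -> str:
    """Download a URL and decode the response body as UTF-8 text."""
    with urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8")


def append_to_cursorrules(project_dir: str, text: str, heading: str) -> Path:
    """Append text under a heading to .cursorrules, creating the file if needed."""
    rules = Path(project_dir) / ".cursorrules"
    existing = rules.read_text(encoding="utf-8") if rules.exists() else ""
    # Keep exactly one blank line between existing content and the new section.
    prefix = existing.rstrip("\n") + "\n\n" if existing.strip() else ""
    rules.write_text(f"{prefix}# {heading}\n\n{text.rstrip()}\n", encoding="utf-8")
    return rules


# Example usage (performs a network request, so it is left commented out):
# url = raw_readme_url("getbrevo", "brevo-node")
# append_to_cursorrules(".", fetch_text(url), "brevo-node README")
```

Appending rather than overwriting keeps any rules you already have in the file.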

Yup! That’s actually a good point.
Thank you :+1:

Thank you so much, getting this fixed has been a huge lifesaver! What a great team and happy to see the urgency!

I think I can confirm this on version 0.43.6: it shows indexing as failed, but it did successfully index all pages.

Hey, download the latest version of Cursor from this link.

Updated to 0.44.8, but I’m still getting the mismatch.

Could you share the link to the documentation?

Here you go: Preface
