it’s been 2 years already. is there any timeline?
Hey, not yet, unfortunately. We have not seen a huge demand for this feature yet, as remote PDFs should work by pasting the URL in - it’s currently only local PDFs that do not have good AI integration right now!
Hey,
Just adding an upvote here that I badly need this feature too!
Upvote here. I think this is crutial for many niche domains other than SDE.
Bump and upvote
I wrote a little tool bib4llm that converts PDFs into Markdown + PNGs. Either from a directory of PDF files or a Zotero BibTex file. Its currently using PyMuPDF4LLM for the conversion and leaves any RAG / indexing to Cursor.
I just drop a bib
file synced with my Zotero collection (BetterBibTex + keep bib
file updated during export) into my project folder, run bib4llm
on it and start chatting with the papers.
Maybe you find it useful.
bump and upvote
Bumping this again. I deal with a lot of academic papers and expecting users to do imperfect PDF conversions to markdown just to regain a feature already built into these LLMs is crazy at this point in the product lifecycle.
Interesting project. Major use case issue for me personally when trying to use it is very limited efficacy converting equations back to markdown. Not familiar enough with OCR to know how much there is that one can do about this (other than ironically feed the PDFs into an LLM and have them fix rendering the equations…)