How does @Web work?

Love the @Web feature! How does it work under the hood? What APIs are you calling? Just curious :grin:

You have probably already seen this and are looking for more detailed information, but I'm adding it to the topic for reference:

https://docs.cursor.com/context/@-symbols/@-web

Also, for reference, I asked Cursor (using gpt-4o):

Hello - I am using Cursor (a fork of VS Code).  

It has a feature known as `@` symbols.  

One of the symbols that can be called is `@Web`.  

There is some documentation about it here:

@Web https://docs.cursor.com/context/@-symbols/@-web

Do you know how it works 'under the hood'?  

In other words, what sort of technologies it would be using to get the information?  

If you don't know, would you be able to have a guess?  

It replied with:

The @Web symbol in Cursor (a fork of VS Code) is designed to fetch and display web content directly within the editor. While the exact implementation details are not publicly documented, we can make an educated guess about the technologies and methodologies it might be using based on common practices in similar tools.

Possible Technologies and Methodologies

  1. Web Scraping:
  • Libraries: It might use libraries like axios or the built-in fetch API to make HTTP requests to web pages (a minimal fetch-and-parse sketch follows this list).

  • Parsing: Libraries like cheerio (for Node.js) could be used to parse HTML and extract relevant content.

  2. APIs:
  • Public APIs: It could leverage public APIs provided by websites to fetch structured data.

  • Custom APIs: Cursor might have its own backend service that interacts with various web services and APIs to fetch and process data.

  3. Browser Automation:
  • Puppeteer: A headless browser automation tool like Puppeteer could be used to render web pages and extract content dynamically.

  • Selenium: Another option could be Selenium, which is often used for web testing but can also be used for scraping dynamic content.

  4. Natural Language Processing (NLP):
  • Text Extraction: NLP techniques might be used to extract and summarize relevant information from the fetched web content.

  • Entity Recognition: Tools like spaCy or NLTK could be used to identify and extract key entities from the text.

  5. Integration with Search Engines:
  • Search APIs: It might use search engine APIs (like the Google Custom Search API) to fetch search results and then process the returned data (see the search API sketch below).
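To make the web-scraping guess (item 1) concrete, here is a minimal TypeScript sketch using axios and cheerio. It is purely illustrative: the function name, the selectors, and the cleanup steps are my assumptions, not anything Cursor is documented to use.

```typescript
// Hypothetical sketch: fetch a page and pull out readable text with cheerio.
// None of these names come from Cursor; they are assumptions for illustration.
import axios from "axios";
import * as cheerio from "cheerio";

async function fetchPageText(url: string): Promise<string> {
  // Plain HTTP GET for the raw HTML of the page.
  const response = await axios.get<string>(url, { responseType: "text" });

  // Parse the HTML and strip elements that rarely carry useful content.
  const $ = cheerio.load(response.data);
  $("script, style, nav, footer").remove();

  // Collapse whitespace so the result is friendlier to a language model prompt.
  return $("body").text().replace(/\s+/g, " ").trim();
}

// Example usage (the URL is just the docs page mentioned in this thread):
fetchPageText("https://docs.cursor.com/context/@-symbols/@-web")
  .then((text) => console.log(text.slice(0, 500)));
```

Note that a static fetch like this only sees server-rendered HTML; pages that build their content with JavaScript would need the headless-browser approach from item 3 (e.g. Puppeteer) instead.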

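The search-engine idea (item 5) could look something like the following sketch against Google's Custom Search JSON API. The key and search-engine ID are placeholders, and there is no indication this is the API Cursor actually calls.

```typescript
// Hypothetical sketch of the "search API" idea: query a search engine, then
// hand the top result URLs to a scraper/summarizer.
interface SearchResult {
  title: string;
  link: string;
  snippet: string;
}

async function webSearch(query: string, apiKey: string, cx: string): Promise<SearchResult[]> {
  const url = new URL("https://www.googleapis.com/customsearch/v1");
  url.searchParams.set("key", apiKey); // your API key (placeholder)
  url.searchParams.set("cx", cx);      // your search engine ID (placeholder)
  url.searchParams.set("q", query);

  const response = await fetch(url.toString());
  if (!response.ok) {
    throw new Error(`Search request failed: ${response.status}`);
  }

  // The API returns richer objects; we only keep the fields we need.
  const data = (await response.json()) as { items?: SearchResult[] };
  return (data.items ?? []).map(({ title, link, snippet }) => ({ title, link, snippet }));
}
```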
Example Workflow

  1. User Input: The user invokes the @Web symbol with a query.

  2. HTTP Request: The tool makes an HTTP request to a relevant web page or API.

  3. Content Extraction: The HTML content is parsed, and relevant data is extracted using a library like cheerio.

  4. Data Processing: The extracted data is processed, possibly using NLP techniques to summarize or highlight key information.

  5. Display: The processed information is displayed within the editor (a rough end-to-end sketch follows).
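Tying the five steps together, a crude pipeline might look like the sketch below. Every function name is an assumption (webSearch and fetchPageText refer to the earlier sketches), and a real system would chunk, rank, and summarize rather than simply truncate.

```typescript
// Hypothetical end-to-end sketch: search, fetch, extract, trim to fit a model's
// context window, and return text the editor could display.
async function answerWithWebContext(query: string, apiKey: string, cx: string): Promise<string> {
  // 1–2. User query → search request for candidate pages.
  const results = await webSearch(query, apiKey, cx);

  // 3. Content extraction from the top few results; a failed fetch yields "".
  const pages = await Promise.all(
    results.slice(0, 3).map((r) => fetchPageText(r.link).catch(() => "")),
  );

  // 4. Very crude "processing": concatenate and truncate so the combined text
  //    fits in a prompt.
  const context = pages.join("\n\n").slice(0, 8000);

  // 5. Display: here we just return the context; in an editor it would be fed
  //    to the model alongside the user's question and rendered in the chat pane.
  return context;
}
```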

Note

This is a speculative overview based on common practices in web content extraction and processing. The actual implementation in Cursor might differ.


"spaCy’!? what a throwback!

Secret sauce it is, I suppose… ty for this :slightly_smiling_face:
