LLMS Central - The Robots.txt for AI

Show HN: DR Web Engine – JSON-based web scraping that doesn't break on change

Github.com · 1 min read

Original Article Summary

Built this during my PhD research after getting frustrated with traditional web scrapers breaking every time sites updated their HTML. DR Web Engine uses declarative JSON5 queries instead of imperative scraping code. Key features: - JSON5 query language (can…
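The summary above is truncated, and DR Web Engine's actual query schema is not shown here. As a purely hypothetical illustration of the declarative idea, the Python sketch below stores selectors as JSON data and feeds them to a generic extractor, so a change in site layout means editing the query, not the scraping code. The query format and the simple `tag.class` selector syntax are invented for this example and are not DR Web Engine's real format.

```python
# Hypothetical sketch: scraping rules as declarative data, not imperative code.
# The JSON query schema below is invented for illustration and is NOT
# DR Web Engine's actual format.
import json
from html.parser import HTMLParser

query = json.loads("""
{
  "url": "https://example.com/products",
  "extract": {"title": "h2", "price": "span.price"}
}
""")

class FieldExtractor(HTMLParser):
    """Collects text from tags matching simple 'tag' or 'tag.class' selectors."""
    def __init__(self, fields):
        super().__init__()
        self.fields = fields                            # field name -> selector
        self.results = {name: [] for name in fields}
        self._active = []                               # fields currently open

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for name, sel in self.fields.items():
            sel_tag, _, sel_cls = sel.partition(".")
            if tag == sel_tag and (not sel_cls or sel_cls in classes):
                self._active.append(name)

    def handle_endtag(self, tag):
        # Simplistic: close the most recently opened matching field.
        if self._active:
            self._active.pop()

    def handle_data(self, data):
        for name in self._active:
            if data.strip():
                self.results[name].append(data.strip())

html = '<h2>Widget</h2><span class="price">$9.99</span>'
parser = FieldExtractor(query["extract"])
parser.feed(html)
print(parser.results)  # {'title': ['Widget'], 'price': ['$9.99']}
```

If the site renames `span.price` to `span.cost`, only the JSON query changes; the extractor code is untouched, which is the resilience property the project claims.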

Read full article at Github.com

Our Analysis

DR Web Engine's JSON-based scraping engine, designed not to break when page markup changes, marks a notable advance in web scraping technology. For website owners, it means scraping attempts are likely to become more resilient and adaptable, which may translate into increased AI bot traffic. Owners may therefore need to reassess their content protection strategies and consider more sophisticated defenses against unauthorized data extraction. Actionable steps to prepare for this shift: monitor site traffic for unusual patterns that suggest automated extraction; update llms.txt files to include specific rules for declarative scraping engines like DR Web Engine; and implement robust protections such as rate limiting and IP blocking.
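Of the mitigations mentioned above, rate limiting is the most mechanical to sketch. The Python example below implements a minimal per-IP sliding-window limiter; the limit, window size, and IP addresses are illustrative values, not a recommendation for any particular site.

```python
# Minimal sketch of per-IP rate limiting, one of the mitigations
# discussed above. Limits and window sizes here are illustrative.
import time
from collections import defaultdict, deque

class SlidingWindowLimiter:
    """Allow at most `limit` requests per `window` seconds per client IP."""
    def __init__(self, limit=60, window=60.0):
        self.limit = limit
        self.window = window
        self.hits = defaultdict(deque)   # ip -> timestamps of recent requests

    def allow(self, ip, now=None):
        now = time.monotonic() if now is None else now
        q = self.hits[ip]
        # Drop timestamps that have fallen out of the window.
        while q and now - q[0] >= self.window:
            q.popleft()
        if len(q) < self.limit:
            q.append(now)
            return True
        return False

limiter = SlidingWindowLimiter(limit=3, window=10.0)
print([limiter.allow("198.51.100.7", now=t) for t in (0, 1, 2, 3)])
# [True, True, True, False]
```

In production this logic usually lives in a reverse proxy or CDN rule rather than application code, but the sliding-window idea is the same: requests beyond the threshold within the window are rejected until old timestamps age out.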

