Hacker News

I'm hoping we see a move to allow the rendering of the webpage to be entirely up to the users. Just provide the data, and let me decide how I want to interact with it. But that would ruin SEO and Ads, so we're gonna get in a buncha legal battles about web scrapers instead.


“Reader Mode” is a successful example. I’m actually shocked it exists because of how it impedes the things you mention.


But reader mode is mostly a bunch of heuristics with tons of ad-hoc special cases and hacks, instead of relying on documents being well-structured. So in many ways it is the opposite of a successful example.

https://github.com/mozilla/readability/blob/main/Readability...


Oh, true. Which kind of demonstrates the penalty for abusing HTML so much that it’s no longer semantically reliable.


How long can it be called abuse if that is how HTML has been used for almost the entirety of its lifetime?


By then AI will have disrupted the ad-revenue model, so fingers crossed we get the clean data!



