AI Internet Culture News Non-Google

Internet Archive's Mark Graham says blocking access is not the solution to AI scraping

Dwayne Cubbins

Feb 18, 2026 2 Min Read

Mark Graham, the director of the Wayback Machine at the Internet Archive, has published a post on TechDirt, regarding the recent concerns about AI companies scraping the internet using the Wayback Machine.

He emphasizes that these concerns are understandable, but they’re targeting the wrong problem. Graham states that the Internet Archive does not function as a bulk data scraping pipeline for AI companies. The service is only built for human access and research use.

There are built-in technical controls that block abusive automation from AI companies. The Wayback Machine consists of rate limits, traffic monitoring, and filtering. These already stop AI companies from large-scale data extraction. According to him, treating the Internet Archive as a scraping tool ignores how the system actually operates.

Graham also argues that online libraries are being blamed for an economic conflict created by AI platforms. The archive does not sell datasets or license news content for the training of AI models. The whole point of the Internet Archive is to preserve web pages, so journalists, courts, and researchers can cite records. Blocking this by not allowing the Wayback Machine to access websites will result in the removal of a long-standing layer of accountability that the open web has depended on for decades.

He further argues that news sites have coexisted with web archiving for almost 3 decades, with no major disruption. The current wave of restrictions is due to commercial AI model training, and it’s not because of any shifts in the behavior of the Internet Archive.

Replacing the open preservation of the internet will result in the past disappearing through paywalls, or worse, shutdowns. Publishers want leverage in licensing and tighter control over how their work enters AI systems.

It’s not just publishers pushing back on the Internet Archive. Microblogging platforms such as Reddit have restricted a large portion of its content from Wayback Machine access. Posts, comments, and profiles can no longer be archived, and it can only access the homepage. As a result, the web is only easily visible in the present, but it will be harder to study its past a few years later.

The Internet Archive is a non-profit, public library of information, ruling out any financial motives. He concludes by saying that systems can always be improved, but urges the rest of the internet to continue working together to preserve its past.

We’ve used the Wayback Machine to bust myths recently, like the controversy surrounding TikTok’s privacy terms. So it does serve a very important role on the internet in its own right.

Share your thoughts about this situation in the comments below.

We stand out from the tech-media crowd because we break news stories; we mainly bring you stuff that you won’t find anywhere in the mainstream tech media. Our stories have been picked up by some of the world’s most popular websites and media outlets—more info is available here.

Dwayne Cubbins
2752 Posts

I cover fast-moving stories across apps, online platforms, and everyday tech — phones, wearables, consoles, and whatever else people are fighting with this week. Bugs, rollouts, scams, policy enforcement, and the occasional internet-culture rabbit hole are all fair game. My goal is simple — make confusing tech news readable. When I'm not working, I'm working out or chilling with my dog. Got a tip? You can find me on X @dcubbins.

Next article View Article

Google Gemini users report chat history suddenly disappearing from the sidebar [U: Acknowledged]

Update 20/02/26 - 09:09 am (IST): Google has confirmed that it's working on a fix for the problem. The acknowledgment was made on X, as well as...

Feb 19, 2026 1 Min Read

Internet Archive's Mark Graham says blocking access is not the solution to AI scraping

Dwayne Cubbins 2752 Posts

Next article View Article

Google Gemini users report chat history suddenly disappearing from the sidebar [U: Acknowledged]

Dwayne Cubbins
2752 Posts