Date: August 12, 2025
Reddit says AI firms exploited archived content via the Internet Archive—now only its homepage remains archivable.
Starting August 11, 2025, Reddit revealed that it will block the Internet Archive’s Wayback Machine from harvesting most of its site. This ban will specifically focus on post pages, comments, and user profiles. From now on, only the Reddit.com homepage will be archived, leaving much of Reddit’s history off‑limits.
This decision comes after Reddit says it discovered that AI companies had been scraping its data via archived snapshots, bypassing Reddit’s own licensing rules and access controls. Reddit spokesperson Tim Rathschmidt told The Verge,
“Internet Archive provides a service to the open web, but we’ve been made aware of instances where AI companies violate platform policies, including ours, and scrape data from the Wayback Machine.”
Reddit’s action isn’t just about controlling content—it’s about safeguarding user privacy and deleted content. This keeps posts that have been removed for moderation or privacy reasons out of reach of archiving systems. Rathschmidt further added,
“Until they’re able to defend their site and comply with platform policies (e.g., respecting user privacy, re: deleting removed content), we’re limiting some of their access to Reddit data to protect redditors,”
The move reflects a broader effort by Reddit to tighten data access across the board. That includes recent changes to its APIs, paid licensing deals with Google and OpenAI, and even legal challenges, like its lawsuit against Anthropic for unauthorized data scraping.
Though Reddit reached out to the Internet Archive in advance, the Archive’s Wayback Machine director, Mark Graham, described the two organizations’ relationship as longstanding, saying discussions are still ongoing.
Still, this move raises broader concerns about the future of web preservation. By limiting archives, Reddit is effectively erasing a portion of public record and making it harder to trace how discussions, communities, or web pages evolved over time.
By Sakshi Kaushik
Sakshi Kaushik is a curious storyteller who brings clarity to the chaos of innovation. She dives into artificial intelligence, blockchain, fintech, and healthtech, turning complex concepts into content that's both insightful and easy to follow. With a knack for making tough topics feel approachable, Sakshi writes for readers who want to stay informed without getting overwhelmed. Her work is where smart meets simple—blending curiosity with clarity, and delivering tech stories that resonate in a world that never stands still.
Pinterest Follows Amazon in Layoffs Trend, Shares Fall by 9%
AI-driven restructuring fuels Pinterest layoffs, mirroring Amazon’s strategy, as investors react sharply and question short-term growth and advertising momentum.
Clawdbot Rebrands to "Moltbot" After Anthropic Trademark Pressure: The Viral AI Agent That’s Selling Mac Minis
Clawdbot is now Moltbot. The open-source AI agent was renamed after Anthropic cited trademark concerns regarding its similarity to their Claude models.
Amazon Bungles 'Project Dawn' Layoff Launch With Premature Internal Email Leak
"Project Dawn" leaks trigger widespread panic as an accidental email leaves thousands of Amazon employees bracing for a corporate cull.
OpenAI Launches Prism, an AI-Native Workspace to Shake Up Scientific Research
Prism transforms the scientific workflow by automating LaTeX, citing literature, and turning raw research into publication-ready papers with GPT-5.2 precision.