Reddit — Policy Change
Executive Summary
Reddit's SEC filing ahead of its IPO revealed data licensing arrangements worth $203 million over 2–3 years, with a $60 million/year deal later confirmed to be with Google. The filing disclosed that Reddit's content had already been used by AI companies and that data licensing was expected to become a significant revenue stream. Users had not been directly consulted about the commercial sale of their contributions.
What Happened
Reddit disclosed in its February 2024 SEC filing for its IPO that it had entered into data licensing arrangements worth $203 million over two to three years, with at least $66.4 million expected in 2024. A $60 million annual deal with Google was announced, allowing Google to use Reddit posts for training AI models and improving services like Google Search. Reddit had previously provided free access to its data but reversed course in 2023, deciding to charge AI companies for access to its over 1 billion posts and more than 16 billion comments.
Who Is Affected
All Reddit users who have contributed posts and comments to the platform are affected, as their publicly shared content is now being licensed to AI companies for commercial purposes. The arrangements apply to Reddit's entire corpus of user-generated content, including ongoing and future contributions. Users were not directly consulted about the commercial licensing of their contributions before these deals were implemented.
Why It Matters
This event sets a precedent for how tech companies obtain publicly available user-generated content for AI training purposes, occurring amid multiple lawsuits testing whether such practices involving copyrighted material are permissible. The scale is significant given Reddit's massive dataset of human-written conversational content that AI companies consider valuable for training large language models. The arrangement demonstrates how platforms can monetize user contributions without explicit user consent for such commercial purposes, though the content remains publicly accessible.
What You Should Do
Review Reddit's current user terms and privacy policy to understand how your content may be used. If you want to limit future contributions from being used for AI training, consider deleting posts and comments you don't want included, as Reddit states it deletes content everywhere when users remove it. Be mindful that any new content you post to Reddit may be licensed to AI companies under these ongoing arrangements.
AI-Assisted
Event summaries are generated by Claude AI from verified sources and reviewed by humans before publication.
Sources