Reddit Went Public, Sold Its Data to Google, and Quietly Became the Most Important Website on the Internet
$34 IPO. $282 all-time high. A $203M data licensing business. The number-one most cited domain in AI search results. 1.21 billion monthly users. And 14.7% of posts are now AI-generated, threatening the very thing that makes Reddit valuable. Inside the most unlikely transformation in tech.
Two years ago, Reddit was a money-losing message board that had spent 18 years failing to figure out its business model. It had gone through multiple CEO changes, a near-death experience during the 2023 API pricing revolt, and a private valuation that had cratered from $10 billion to $6.5 billion. The conventional wisdom was that Reddit was the internet's most chronically underperforming asset -- a site that everyone used but no one could monetize.
Today, Reddit is a $27.57 billion public company that generated $2.2 billion in revenue in 2025, posted $529.7 million in net income, and has become the single most important data source for the AI industry. It is the number-one most cited domain in Google AI Overviews, the number-one source in Perplexity results, and the foundation upon which the largest language models in the world are trained.
This is the story of how a website built on anonymous human conversation became the most strategically important property on the internet -- and why the thing that makes it valuable might also be the thing that destroys it.
The IPO That Shouldn't Have Worked
Reddit priced its IPO at $34 per share on March 21, 2024, raising $519 million at a roughly $6.5 billion valuation. The stock opened at $47 and closed its first day at $50.44, a 48% pop that suggested the market saw something that most tech analysts had been missing for years.
What they saw was the data.
Reddit's S-1 filing disclosed something that reframed the entire business: $203 million in aggregate data licensing contracts, spanning two-to-three-year terms with AI companies. This was not incremental SaaS revenue. This was a new category of monetization that did not exist 18 months earlier, built on a corpus of human-generated content that Reddit had been accumulating for two decades without fully understanding its value.
The stock hit an all-time high of $282.95 intraday on September 18, 2025. Reddit was briefly worth more than $50 billion. That number has since corrected -- shares trade at approximately $139.39 as of March 2026, roughly 51% below the peak -- but the correction has been about broader market conditions and AI sentiment shifts, not about the fundamentals of the business. Revenue grew 69% year-over-year in 2025. Net income swung from a $90.8 million loss in 2023 to $529.7 million in profit. The number of active advertisers grew 75%. Needham named Reddit its "Top Pick" for 2026.
The turnaround has been faster and more complete than almost anyone predicted.
How Reddit Became the AI Industry's Most Valuable Data Source
The transformation began with a confrontation. In April 2023, Reddit announced it would begin charging for API access that had been free since 2008. Steve Huffman, Reddit's CEO, framed the decision in characteristically blunt terms: "The Reddit corpus of data is really valuable, but we don't need to give all of that value to some of the largest companies in the world for free."
The pricing he set -- $12,000 per 50 million API requests -- was designed to be affordable for academic researchers and small developers but punishing for large-scale commercial scraping. Christian Selig, the developer behind Apollo, the most popular third-party Reddit app, calculated that he would need to pay $20 million per year to keep his app running. He shut down Apollo on June 30, 2023. Sync for Reddit, BaconReader, and Boost for Reddit followed.
The community revolt was immediate and enormous. Approximately 8,500 subreddits went dark between June 12 and 14, 2023, as moderators made their communities private in protest. It was the largest organized user protest in Reddit's history, and it looked like it might be the beginning of the end.
Instead, it was the beginning of a data licensing empire.
Within months, Reddit had converted the controversy into commercial leverage. The API pricing change established a clear principle: Reddit's data had commercial value, and companies that wanted to use it for AI training would pay for access. The deals that followed were historic.
Google signed a $60 million per year deal announced in February 2024, giving it access to Reddit's real-time, structured content to train Gemini and its Vertex AI products. The deal also gave Google exclusive rights to surface Reddit content via the data API -- a provision that meant other search engines effectively lost access to Reddit's real-time data.
OpenAI signed a deal estimated at $70 million per year, announced in May 2024. Reddit content was integrated directly into ChatGPT. The conflict of interest embedded in this deal is extraordinary: Sam Altman, OpenAI's CEO, owns 8.7% of Reddit as its third-largest shareholder. He is simultaneously the buyer and a major beneficiary of the sale.
Anthropic did not get a deal. In June 2025, Reddit sued Anthropic in Northern California court for allegedly scraping Reddit more than 100,000 times after claiming to have blocked its bots. The lawsuit claimed Anthropic took "millions, if not billions" of pieces of user-generated content without authorization. The case moved to mediation in August 2025 and remains unresolved. The message to the rest of the AI industry was unambiguous: pay for the data or face legal consequences.
Combined, Reddit's data licensing revenue reached an estimated $143 million in 2025, approximately 10% of total revenue according to COO Jen Wong. That is a meaningful but not yet dominant revenue stream. What makes it strategically significant is the leverage it provides: Reddit is now negotiating from a position of proven legal willingness to sue and proven market demand for its data.
The Google Flywheel: 1,328% Visibility and Counting
The Google deal did not just generate $60 million in annual licensing revenue. It created one of the most powerful distribution flywheels in the history of the internet.
Here is how it works mechanically. Google gets Reddit's real-time content to train its AI models. In return, Reddit gets dramatically increased visibility in Google Search results. More visibility drives more traffic. More traffic drives more users. More users create more content. More content makes the data licensing deals more valuable. The cycle repeats.
The numbers tell the story of just how dramatically the Google deal reshaped Reddit's search presence:
| Metric | Before Deal | After Deal | Change |
|---|---|---|---|
| SEO visibility | Baseline (July 2023) | April 2024 | +1,328% |
| US organic search rank | 68th most visible domain | 5th most visible domain | +63 positions |
| Share of Voice rank | 29th place | 3rd place | +26 positions |
| Mobile SERP rank | 20th place (July 2024) | 2nd place (June 2025) | +18 positions |
| Search result presence | Baseline (2023) | 2024 | +191% |
| Top 3 rankings | Baseline | 2024 | +446% |
Reddit became the number-one most cited domain by Google AI Overviews, the number-one source cited by Perplexity with a 46.7% share, and the number-two source cited by ChatGPT with a 21.0% share. When someone asks an AI system a question, there is a high probability that the answer draws directly from a Reddit thread.
This creates an almost paradoxical dynamic. Google is paying Reddit $60 million per year for data to train an AI system that, in many cases, replaces the need for users to click through to Reddit at all. But that same AI system cites Reddit as its primary source, driving brand awareness and credibility that pulls new users back to the platform. Reddit's organic traffic rose from 160 million in August 2023 to 420 million in February 2024 -- a 162% increase in six months.
1.21 Billion Users and the International Arbitrage
Reddit now has 1.21 billion monthly active users, 108.1 million daily active unique visitors, and 379 million weekly active users. Average time on platform is 20 minutes per day. Users generated 550 million posts and 2.72 billion interactions in 2024, a 17.27% year-over-year increase. Median comment thread length increased from 7.4 to 8.1 comments per post, meaning conversations are getting deeper, not just more frequent.
But the most strategically important user metric is geographic. Over half of Reddit's audience is now outside the United States. International daily active users reached 60.1 million in Q2 2025, growing 32% year-over-year compared to just 11% for US users. Brazil saw nearly 80% DAUq growth. India, the UK, the Philippines, and France are all emerging as significant growth markets. Reddit now supports 23 languages with machine translation, up from 8 in the prior quarter, with plans for 30 more.
Here is the arbitrage: despite over 50% of users living outside the US, international revenue accounts for only 18% of total revenue. International revenues grew 71.7% year-over-year to $91 million in Q2 2025, but the monetization gap remains enormous. If Reddit can close even a fraction of this gap -- bringing international average revenue per user closer to US levels -- the revenue implications are measured in billions, not millions.
This is the bull case for Reddit at its current $139 share price. The US advertising business is already performing at scale ($2.06 billion in ad revenue in 2025). International growth represents a second curve that has barely begun to inflect.
The Advertising Machine Nobody Talks About
Data licensing gets the headlines. But advertising is 93% of Reddit's business, and its growth trajectory has been quietly spectacular.
| Year | Ad Revenue | YoY Growth |
|---|---|---|
| 2023 | ~$788M | -- |
| 2024 | ~$1.19B | +51% |
| 2025 | ~$2.06B | +74% |
Reddit's ad revenue topped $2 billion for the first time in 2025. Active advertiser count grew 75% year-over-year. Revenue from small and medium businesses doubled. These are not vanity metrics -- they represent a fundamental broadening of Reddit's advertiser base beyond the tech and gaming companies that historically dominated its ad inventory.
The product innovations driving this growth include AMA-style ads that let brands host fully integrated "Ask Me Anything" threads, a Pro Trends tool that surfaces trending conversations for advertisers, and Reddit Max, a campaign optimization solution that has delivered 17% lower cost per acquisition and 27% higher conversion volume in early testing. Dynamic Product Ads showed over 90% higher return on ad spend compared to traditional digital ads.
What makes Reddit's advertising position structurally different from Meta or Google is the nature of user intent. People come to Reddit to research purchases, ask for recommendations, and compare products. A user asking r/headphones for advice on noise-cancelling headphones is in a fundamentally different mental state than someone scrolling Instagram. That purchase-intent signal is what advertisers pay a premium for, and Reddit has 20 years of it organized into 100,000 active communities.
The Human Moat and the Poisoning Problem
Every strategic advantage Reddit has -- its data licensing deals, its Google visibility, its advertiser value proposition -- rests on a single premise: the content on Reddit is authentically human.
Reddit's corpus represents 40.1% of LLM training data sources in 2025, surpassing Wikipedia as the single largest input into how large language models understand the world. The platform has accumulated more than 1 billion posts and 16 billion comments across 20 years of operation. That corpus is unique because it is conversational, opinionated, community-vetted through upvotes and downvotes, and covers virtually every topic that humans discuss.
This is what investors call the "human moat." As AI-generated content floods the internet, making most of the web's text synthetic and unreliable, the value of a corpus that is provably human-generated increases. Reddit's data is not just big -- it is trustworthy, which is an increasingly rare quality in training data.
But the moat has a crack in it. An Originality.AI study found that 14.7% of Reddit posts are now AI-generated, up from 13% in 2024. That means nearly one in seven posts on the platform that AI companies pay $130 million per year to access because of its human authenticity is, in fact, not human at all.
The implications are recursive and uncomfortable. AI companies train models on Reddit data because it is human. Those models generate content that humans post back to Reddit. That AI-generated content then becomes part of the training data for the next generation of models. Each cycle dilutes the authenticity of the corpus. Taken to its logical conclusion, AI companies could end up paying Reddit for the privilege of training on their own models' outputs.
Reddit has not publicly disclosed a comprehensive strategy for detecting and removing AI-generated content at scale. The platform's moderation system -- a layered approach combining platform-wide rules, subreddit-specific policies, 60,000 volunteer moderators, and community voting -- was designed for a world where all content was human-generated. Adapting that system to a world where 14.7% of content is synthetic is a challenge that no social platform has solved.
This is the central tension in Reddit's long-term thesis: the thing that makes Reddit valuable to AI companies is the same thing that AI is slowly eroding.
The Stack Overflow Warning
The cautionary tale sits just across the hall. Stack Overflow, the programming Q&A site that was once as essential to developers as Reddit is to the broader internet, signed its own AI data licensing deals in 2024 -- with OpenAI in May 2024, plus partnerships with Google and GitHub.
But Stack Overflow's community did not survive the AI transition the way Reddit's has. Question volume on the platform collapsed 76%, from 108,000 questions per month in November 2022 to 25,000 by December 2024. By December 2025, only 3,862 questions were posted -- a 78% decline from the prior year. Developers who used to post questions on Stack Overflow now ask ChatGPT or Copilot instead.
Stack Overflow has managed to grow revenue despite the engagement collapse -- from $89 million in 2022 to $125 million in 2024 -- by pivoting to enterprise products. But the Q&A community that made Stack Overflow's data valuable in the first place is effectively dead. The platform survived as a business. It died as a community.
The difference between Stack Overflow and Reddit comes down to scope. Stack Overflow served one use case: programming questions with definitive answers. AI could replicate that use case almost perfectly. Reddit serves a fundamentally different function: open-ended conversation, subjective opinion, cultural commentary, product recommendations, community belonging. These are things AI can simulate but not replace. When someone posts on r/relationship_advice or r/personalfinance, they are not looking for a technically correct answer from a model. They are looking for a human perspective from someone who has been in their situation.
That distinction is what has allowed Reddit's engagement to grow while Stack Overflow's has collapsed. But it depends on users continuing to believe that the perspectives they are reading are human -- which brings the conversation back to the 14.7% problem.
Revenue: From $804 Million to $2.2 Billion in Two Years
The financial transformation is worth examining in full because it illustrates how quickly a platform business can inflect when multiple growth vectors align simultaneously.
| Year | Total Revenue | YoY Growth | Net Income |
|---|---|---|---|
| 2023 | $804M | +20% | -$90.8M |
| 2024 | $1.3B | +62% | -$484.3M* |
| 2025 | $2.2B | +69% | +$529.7M |
*2024 net loss driven by IPO-related stock-based compensation, not operating deterioration.
Reddit achieved its first profitable quarter in Q4 2024 with $71 million in net income, a 16.6% margin. It then posted its first full profitable year in 2025 with net income of $529.7 million. Q4 2025 alone generated $252 million in profit.
The company announced a $1 billion share repurchase program alongside its Q4 2025 results. That is not a decision a management team makes when they are uncertain about future cash flows. It is a declaration that Reddit believes its current profitability is sustainable and that the stock is undervalued.
Analyst projections for 2026 put revenue at approximately $2.95 billion, implying roughly 40% growth. Longer-term models project $3.8 billion in revenue and $1.0 billion in earnings by 2028. Reddit was added to the S&P 500 index, a milestone that brings automatic inflows from index funds and validates the company's position as a large-cap public company.
The Licensing Precedent and the Future of AI Training Data
Reddit's approach to data licensing is not just a revenue strategy. It is an attempt to establish the legal and commercial framework for how AI companies pay for the content they train on.
Three elements of Reddit's strategy are shaping the broader market.
First, aggressive litigation. The Anthropic lawsuit is not primarily about recovering damages from one company. It is about establishing legal precedent that scraping user-generated content without a license constitutes breach of contract, unjust enrichment, and trespass to chattels. If Reddit prevails, the ruling would force every AI company to negotiate licensing deals or risk similar lawsuits from every major content platform on the internet.
Second, collective action. Reddit, Quora, and Yahoo are backing a new standard called RSL (Responsible Sharing of Language) to create a unified framework for how AI companies pay for web content. This is an attempt to prevent AI companies from playing content platforms against each other and to establish industry-standard pricing.
Third, dynamic pricing. Bloomberg reported in September 2025 that Reddit was in early talks to renegotiate its deals with Google and OpenAI for better terms. The initial deals were flat-rate annual fees. Reddit now wants variable compensation that increases as its content becomes more integral to AI outputs. Given that Reddit is the number-one cited source in AI search results, the argument for performance-based pricing is strong.
If Reddit succeeds in establishing dynamic pricing tied to AI output citations, it would fundamentally change the economics of AI training. Instead of paying a fixed annual fee for a static dataset, AI companies would pay a variable fee that scales with usage -- effectively turning Reddit into an ongoing royalty business rather than a one-time data supplier.
What Reddit Is Actually Worth
The bull case for Reddit at $139 per share rests on four pillars, each of which is independently verifiable.
Advertising growth with international upside. The US ad business is scaling at 74% year-over-year. International users represent over 50% of the audience but only 18% of revenue. Closing the international monetization gap alone could add billions in annual revenue.
Data licensing as a recurring and growing revenue stream. Current data licensing revenue of approximately $143 million has clear room to expand as additional AI companies negotiate licenses and as Reddit shifts to dynamic pricing. If Anthropic and Perplexity eventually sign deals, analysts suggest this revenue could double.
Engagement depth that resists AI substitution. Reddit users spend an average of 20 minutes per day on the platform. Conversations are getting longer and deeper. Unlike Stack Overflow, Reddit's community is growing, not contracting, because its use cases are fundamentally harder for AI to replace.
The human content moat. Twenty years of authentic human conversation, organized into 100,000 communities, vetted by community voting, covering every topic on the planet. No AI company can generate this corpus synthetically. No competitor can replicate it. It is a one-of-one asset.
The bear case rests on one question: what happens when the 14.7% becomes 25%, then 40%, then a majority? If Reddit cannot solve the AI content contamination problem, the human moat drains, the data licensing premium erodes, and the advertising value proposition weakens. Every bull thesis depends on the content remaining authentically human.
The Paradox at the Center
Reddit is profiting from AI companies that need human data to build systems that are slowly filling Reddit with non-human data. The company is simultaneously the most important supplier of AI training data and the most visible victim of AI's effects on content authenticity. It is both the coal mine and the canary.
The next twelve months will determine whether Reddit can solve this paradox or whether it becomes another cautionary tale about platforms that extracted value from a resource they failed to protect. The financial momentum is undeniable. The strategic position is unprecedented. But the 14.7% number is rising, and no amount of data licensing revenue changes the math of a corpus that is slowly losing the quality that made it worth licensing in the first place.
Reddit has become the most important website on the internet. The question is whether it can stay that way.
Frequently Asked Questions
How much is Reddit's data licensing business worth?
Reddit disclosed $203 million in aggregate data licensing contract value in its January 2024 S-1 filing, spanning 2-3 year terms. The company's two largest deals are with Google ($60 million per year for real-time content to train Gemini) and OpenAI (estimated $70 million per year for ChatGPT training data). Reddit COO Jen Wong stated in February 2025 that AI licensing deals make up approximately 10% of Reddit's total revenue, which would place data licensing revenue at roughly $143 million for 2025 based on $2.2 billion in total revenue. Reddit is actively pursuing additional licensing deals and suing companies like Anthropic that scrape without paying.
What was Reddit's IPO performance and stock price history?
Reddit went public on the NYSE under ticker RDDT on March 21, 2024, at an IPO price of $34 per share. The stock opened at $47, a 38% pop, and closed its first day at $50.44. Reddit raised $519 million at a roughly $6.5 billion valuation, a significant discount from its $10 billion private valuation in 2021. The stock hit an all-time high of $282.95 intraday on September 18, 2025, giving Reddit a market cap above $50 billion at its peak. As of March 2026, shares trade at approximately $139.39 with a market cap of $27.57 billion, roughly 51% below the all-time high.
Why is Reddit important for AI training and AI search results?
Reddit is critically important for AI for two reasons. First, Reddit content makes up 40.1% of LLM training data sources in 2025, surpassing Wikipedia as the single largest source. Its 20 years of accumulated human discourse -- over 1 billion posts and 16 billion comments -- provide the conversational, opinion-rich, community-vetted content that AI models need. Second, Reddit is the number-one most cited domain by Google AI Overviews and Perplexity, and the number-two most cited by ChatGPT. Reddit's SEO visibility surged 1,328% between July 2023 and April 2024, and its Share of Voice jumped from 29th to 3rd place in US organic search.
What is the Google-Reddit data deal and how does it work?
Google signed a $60 million per year data licensing deal with Reddit, announced in February 2024 ahead of Reddit's IPO. The deal gives Google access to Reddit's real-time, structured user-generated content to train its Vertex AI and Gemini models. Google also gained exclusive rights to surface Reddit content via the data API, meaning other search engines lost direct access to Reddit's real-time data. In return, Reddit received a massive boost in Google Search visibility: a 1,328% increase in SEO visibility and a jump from 68th to 5th most visible domain in US organic search. The deal created a powerful flywheel where Google gets training data, Reddit gets search traffic, more traffic drives more users, and more users create more valuable data.
Is AI-generated content threatening Reddit's value?
Yes, AI-generated content is an emerging threat to Reddit. An Originality.AI study found that 14.7% of Reddit posts are likely AI-generated as of 2025, up from 13% in 2024. This is concerning because Reddit's core value proposition -- to both AI companies licensing its data and to users seeking authentic human perspectives -- depends on the authenticity of its content. If AI-generated posts proliferate further, they risk creating a data poisoning problem where AI models train on synthetic content rather than genuine human discourse. This paradox -- Reddit's data is valuable because it is human-generated, but AI tools are making it increasingly synthetic -- is the central tension in Reddit's long-term strategy.
What is Reddit's revenue breakdown and is the company profitable?
Reddit's total revenue grew from $804 million in 2023 to $1.3 billion in 2024 (62% growth) to $2.2 billion in 2025 (69% growth). Advertising accounts for the vast majority of revenue at approximately $2.06 billion in 2025 (about 93% of total). Data licensing and other revenue contributed roughly $143 million (about 10% of revenue). Reddit achieved its first profitable quarter in Q4 2024 with $71 million in net income, and its first full profitable year in 2025 with $529.7 million in net income. The company announced a $1 billion share repurchase program alongside its Q4 2025 results, signaling confidence in sustained profitability.