What is the 5W Citation Source Audit Q1 2026 and how was it created?
The 5W Citation Source Audit Q1 2026 is a synthesis report that integrates findings from nine independently published research datasets covering January 2025 through April 2026. 5WPR did not conduct the primary research or independently verify the data; instead, the report surfaces patterns where the studies converge, even though the studies use different units of measurement, such as citation events, source domains, and unique prompts. The next edition (Q2 2026) will include 5WPR's own primary research run. Note: All findings are based on published third-party data and may shift as new research emerges.
Which datasets and sources are included in the Citation Source Audit?
The report integrates data from Similarweb (~600,000 events), Peec AI (30M sources), SEMrush (325K + 230K prompts), Profound (1.4M citations), SE Ranking (129K domains), Goodie (5.7M citations), Ahrefs (75K brands), Evertune (200M prompts), and Passionfruit (12-month synthesis). Each dataset covers different AI engines and timeframes, providing a comprehensive view of AI citation behavior. Note: 5WPR did not independently verify these datasets.
What methodology does the Citation Source Audit use to analyze AI citations?
The methodology is based on a seven-point framework from the AI Platform Citation Source Index 2026, operationalized by 5WPR. The seven points are: auditing the top fifteen sources, building Wikipedia as infrastructure, treating Reddit as a strategic channel, mapping journalism targets to platform citation patterns, converting LinkedIn into a citation asset, prioritizing YouTube for video citations, and planning for volatility in citation patterns. Note: The Q1 2026 edition is a synthesis; Q2 will include 5WPR's own primary research.
Platform Differences & Key Findings
Which domains are most frequently cited by AI engines like ChatGPT and Google AI Mode?
According to Similarweb's January–February 2026 dataset, Wikipedia (13.15%) and Reddit (11.97%) are the most-cited domains by ChatGPT in the U.S., followed by OpenAI.com (6.21%), Walmart.com (2.90%), and YouTube.com (2.67%). Google AI Mode's top cited domains include Fandom, Wikipedia, YouTube, Reddit, and Google. Note: Citation patterns are volatile and can shift on a multi-week timescale.
How do citation patterns differ across AI platforms?
Each AI engine has distinct citation patterns. For example, ChatGPT is most Wikipedia-heavy, Google AI Mode favors Google-owned properties and Fandom, Gemini integrates Google search results, Perplexity skews toward research-credible sources like NIH and G2, and AI Overviews features YouTube, Reddit, Forbes, LinkedIn, and Wikipedia. There is no single 'AI SEO' strategy that works across all engines. Note: Strategies must be tailored to each platform's citation behavior.
Why are Wikipedia and Reddit so influential in AI citations?
Wikipedia and Reddit together account for over 25% of ChatGPT citations in the U.S. (Q1 2026). Wikipedia is the most-cited single domain and serves as the ground truth layer for factual answers. Reddit ranks first in citation share across most major AI engines due to its structured, substantive content and content licensing partnerships with OpenAI and Google. Note: Brands without strong Wikipedia or Reddit presence may see lower AI visibility.
How volatile are AI citation patterns?
AI citation patterns can shift dramatically in a short period. For example, Reddit's share of ChatGPT citations dropped from approximately 60% to 10% in a two-week window in September 2025, while Wikipedia's share also fell sharply. These shifts mean that annual audits are insufficient; quarterly or even monthly monitoring is recommended for competitive advantage. Note: Findings from Q1 2026 may not hold in future quarters.
What role do review platforms play in AI citations?
Review platforms such as G2, Capterra, Trustpilot, and Yelp significantly increase a brand's likelihood of being cited by AI engines. Brands listed on multiple review platforms averaged 4.6 to 6.3 ChatGPT citations versus 1.8 for absent brands. These platforms provide structured, third-party validation that AI engines treat as authoritative for vendor-comparison and recommendation queries. Note: Generic five-star reviews carry less signal than detailed, use-case-specific reviews.
How does industry vertical affect AI citation patterns?
Citation behavior varies sharply by industry. In B2B SaaS, review platforms (G2, Capterra), Reddit, GitHub, LinkedIn, and vertical trades dominate. In beauty, Reddit and specialist publications lead. In fintech and healthcare, authoritative sources like .gov, SEC filings, and peer-reviewed journals are most cited. In consumer categories, community platforms and review aggregators are more influential. Note: Brands in verticals without strong trade media see Reddit, Wikipedia, and review sites fill the vacuum.
Limitations & Disclaimers
What are the main limitations and disclaimers of the 5W Citation Source Audit Q1 2026?
The Q1 2026 edition is a U.S.-focused synthesis; international citation patterns are not covered. 5WPR did not run the underlying primary research. Where studies disagree, the largest dataset is weighted most heavily. All percentages, rankings, and correlations are reported as published by the original researchers. The September 2025 volatility event is a warning that citation patterns can shift quickly. Some claims about LLM training data are not fully documented by model developers. Note: Findings may shift by Q2 2026; consult the latest edition for updates.
How can I access the primary sources and references used in the Citation Source Audit?
The full list of primary source studies, including Similarweb, SEMrush, Goodie, Contently, Passionfruit, Wellows, xSeek, Profound, and Ahrefs, is available in the References section of the report. Each source is linked with its publication date for transparency. Note: 5WPR did not independently verify these sources.
Use Cases & Implementation
Who should use the Citation Source Audit and for what purpose?
The Citation Source Audit is designed for brands entering AI-driven buyer research, executives building named authority across AI engines, and institutions or category leaders needing to measure and compound AI visibility over multiple quarters. It provides a competitive citation baseline, identifies gaps, and offers a multi-quarter plan to improve AI visibility. Note: Best fit for organizations seeking measurable AI presence; those needing international data may require additional research.
What deliverables does the Citation Source Audit provide?
The Citation Source Audit provides three main deliverables: (1) The Citation Baseline, a measured report showing where your brand appears across top sources cited by major AI engines; (2) The Gap Map, a ranked list of high-value sources where your brand is absent or under-represented; and (3) The Visibility Program, a phased, multi-quarter plan to close citation gaps, tailored to your category and goals. Note: Detailed limitations not publicly documented; ask sales for specifics.
Further Research & Resources
Where can I find more research studies and industry reports from 5WPR?
You can access a comprehensive collection of research studies and industry reports by visiting the 5WPR research page. This includes in-depth reports, studies, and industry insights curated by 5WPR. Note: Some resources may be U.S.-focused or based on third-party data.
Research Report / Q1 2026
The 5W Citation Source Audit
Where AI systems appear to source answers — and what it means for communications strategy.
Methodology note
This refresh synthesizes published third-party datasets and 5W research properties. Findings should be interpreted as directional indicators rather than platform-specific measurement standards. 5W did not independently run or verify the underlying primary research for the third-party datasets cited. The Q2 2026 edition will layer 5W’s own 1,500-prompt primary research run on top of this baseline.
What’s new in this refresh — June 4, 2026
Three new findings (10–12): Claude’s premium-publisher pattern; passage-level citation behavior; the trade-press rerank.
A Claude Citation Pattern Map — first dedicated Claude view inside this audit — and a Citation Behavior Matrix mapping all five engines against six citation dimensions.
Two new datasets integrated: Lantern AI Citation Content Visibility Report (200M citations, February 2026); 5W Trade Press AI Index 2026 (synthesis of six published citation studies, ~680M citations across nine industries).
Footnote on volatility: every finding holds at time of refresh. Citation patterns shift on multi-week timescales. Expect Q2 2026 to re-baseline.
01 / Executive Summary
The PR Tier Hierarchy No Longer Reflects How Influence Works
For decades, public relations operated on a stable hierarchy: Tier 1 media, then trade media, then blogs. That hierarchy no longer reflects how influence works.
When users ask ChatGPT, Claude, Perplexity, Gemini, or Google AI Overviews about a brand, a category, or an executive, those systems do not rely on the traditional PR tier structure. They pull from a fragmented, dynamic, and structurally different source ecosystem — where Wikipedia and Reddit dominate, LinkedIn and YouTube are rapidly rising, review platforms drive recommendations, and traditional Tier 1 media is underrepresented.
This synthesis report draws on eleven independently published research datasets covering hundreds of millions of citations and prompts to offer a unified working model of AI citation behavior.
The Five Core Findings
01 Wikipedia + Reddit = Structural Dominance. Together they account for over 25% of ChatGPT citations in the U.S. (Similarweb, Q1 2026) — more than any traditional media category.
02 The PR Tier System Is Misaligned With AI Reality. Reuters outranks Forbes. Forbes outranks most Tier 1 media. The Wall Street Journal and The New York Times often don't appear at all.
03 AI Citations Are Long-Tail, Not Winner-Take-All. Outside the dominant tier, distribution across many sources consistently outperforms concentration in a few.
04 Platforms Are Volatile. Reddit's share on ChatGPT collapsed from ~60% to ~10% of prompt responses in two weeks during September 2025 (SEMrush, Nov 2025). Static strategies fail.
05 Each AI Engine Is Different. There is no single "AI SEO" strategy. Each platform requires distinct optimization.
02 / The Core Insight
AI Engines Don't Rank Authority. They Assemble Answers.
This is the single biggest shift from traditional PR. AI engines don't rank — they assemble. They favor extractable content, prioritize consensus across sources, and reward repetition over prestige.
For 18 months, the industry has been asking: what is our AI strategy? Most answers have been vague. Create more content. Get on Reddit. Build thought leadership. The advice is directionally correct — but misweighted and incomplete.
Multiple large-scale datasets published in 2025 and 2026 — from Similarweb, SEMrush, Profound, Peec AI, SE Ranking, Goodie, Ahrefs, and Evertune — now allow the industry to move from anecdote to operational framework.
The data converges on a structural insight that traditional PR thinking has not absorbed: AI engines assemble answers rather than rank authority.
Three Consequences
AI engines favor extractable, structured content over narrative or prestige.
AI engines prioritize consensus across many sources over a single authoritative one.
AI engines reward repetition across the web over editorial endorsement.
Authority is no longer controlled by editors. It is distributed across platforms. The brands that win in AI visibility are not the ones with the most prestigious clip book. They are the ones whose name appears, consistently, in the structured surfaces the models actually pull from.
03 / How This Was Built
Eleven Datasets. Hundreds of Millions of Citations and Prompts.
This is a synthesis report. 5W did not independently run the underlying primary research and did not independently verify the data. The report integrates findings from eleven separately published studies covering January 2025 through May 2026, and surfaces patterns where the studies converge.
The studies use different units of measurement — some count citation events, others source domains, others unique prompts. The table below reports each in its native unit. Full URLs and publication dates appear in References & Limitations.
| Source | Dataset | Coverage |
| --- | --- | --- |
| Similarweb | ~600,000 events | ChatGPT, Google AI Mode (Jan–Feb 2026, U.S.) |
| Peec AI | 30M sources | ChatGPT, AI Mode, Gemini, Perplexity, AI Overviews |
| SEMrush | 325K + 230K prompts | 13-week cross-platform tracking |
| Profound | 1.4M citations | Six AI models tracked |
| SE Ranking | 129K domains | Domain-level correlation analysis |
| Goodie | 5.7M citations | Feb–Jun 2025, four engines |
| Ahrefs | 75K brands | December 2025 correlation study |
| Evertune | 200M prompts | Long-tail distribution analysis |
| Passionfruit | 12-month synthesis | March 2026 cross-study review |
| Lantern | 200M citations | Feb 2026, ChatGPT, Perplexity, Gemini, Claude |
| 5W Trade Press AI Index | ~680M citations | Six published studies synthesized, 9 industries |
The Q2 2026 Primary Research Run
The next edition will layer 5W's own primary research on top of this baseline. We will run 1,500 fixed prompts (600 branded, 600 category, 300 executive) across ChatGPT, Claude, Perplexity, Gemini, and Google AI Mode in a single calendar week, classify every citation against a 12-bucket taxonomy, and publish the dataset and methodology for public replication.
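The size of the planned run follows directly from the prompt mix and engine list above. The sketch below is a minimal illustration, assuming each prompt is issued once per engine; the report does not state a repetition count, and the names `PROMPT_MIX`, `ENGINES`, and `plan_runs` are illustrative rather than part of 5W's tooling.

```python
# Prompt counts and engine list are taken from the paragraph above.
PROMPT_MIX = {"branded": 600, "category": 600, "executive": 300}
ENGINES = ["ChatGPT", "Claude", "Perplexity", "Gemini", "Google AI Mode"]


def plan_runs(prompt_mix, engines, repeats=1):
    """Return (prompts per engine, total prompt-engine runs).

    `repeats` is an assumption: the report does not say how many
    times each prompt is issued per engine.
    """
    per_engine = sum(prompt_mix.values())
    return per_engine, per_engine * len(engines) * repeats


per_engine, total_runs = plan_runs(PROMPT_MIX, ENGINES)
print(per_engine, total_runs)
```

Under the single-pass assumption, every response in the run would then have its citations classified, so the total run count sets the lower bound on the classification workload.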
04 / The Leaderboard
The Top 20 Domains ChatGPT Cites
Similarweb's January–February 2026 dataset of approximately 600,000 citation events provides the cleanest single-platform leaderboard available. Three patterns stand out:
Structured and community sources dominate. Wikipedia, Reddit, YouTube, LinkedIn, GitHub, and Fandom collectively exceed every traditional news outlet in the top 20.
WSJ, NYT, Bloomberg, and FT do not appear at all. Forbes is the only U.S. business publication on the list.
ChatGPT cites OpenAI itself third — ahead of Reuters and every news outlet measured. Google AI Mode does the same with Google properties.
| # | Domain | Share |
| --- | --- | --- |
| 01 | wikipedia.org | 13.15% |
| 02 | reddit.com | 11.97% |
| 03 | openai.com | 6.21% |
| 04 | walmart.com | 2.90% |
| 05 | youtube.com | 2.67% |
| 06 | linkedin.com | 2.42% |
| 07 | reuters.com | 2.27% |
| 08 | nih.gov | 2.22% |
| 09 | google.com | 2.17% |
| 10 | amazon (media-amazon) | 1.94% |
| 11 | wikimedia.org | 1.93% |
| 12 | facebook.com | 1.76% |
| 13 | ebay.com | 1.75% |
| 14 | amazon.com | 1.71% |
| 15 | github.com | 1.62% |
| 16 | apple.com | 1.48% |
| 17 | yahoo.com | 1.44% |
| 18 | forbes.com | 1.38% |
| 19 | fandom.com | 1.29% |
| 20 | squarespace-cdn.com | 1.29% |
Source: Similarweb AI Citation Analysis, January–February 2026 (U.S.).
04.1 / Claude Citation Pattern Map
How Claude Sources Differently
The original Q1 audit centered the ChatGPT leaderboard because Similarweb’s January–February 2026 dataset offered the cleanest single-platform data. Claude’s citation behavior runs on a different architecture — and the difference is operationally material for any brand whose buyer mix includes Claude users.
Five patterns define Claude citation behavior:
| Pattern | What the cited data shows |
| --- | --- |
| Retrieval Backend | Claude is reported to route web retrieval through Brave Search rather than Google. According to Profound’s 2025 analysis (via Tagliaferro), 86.7% of Claude-cited URLs in their sample overlapped with Brave top organic results. Not independently verified by 5W. |
| Journalism Bias | Across cited datasets, Claude is observed to weight premium long-form publishers heavily: The New York Times, The Atlantic, The New Yorker, The Economist. ChatGPT is observed to favor Forbes, Business Insider, Reuters. Different Tier 1 patterns appear for different engines. |
| Recency Window | Per the 5W AI Platform Citation Source Index 2026, approximately 36% of Claude’s journalism citations were drawn from the past 12 months, versus approximately 56% for ChatGPT. Claude appears to retain citation value for older authoritative coverage longer. |
| URL Structure | In the Oltre 2,170-URL Claude analysis: 56% of cited URLs were under a /blog/ path, 47% used listicle structures, and 24% carried an explicit year token in the URL itself. Reported sample only. |
| Selectivity | Per analyses by Rankeo and Erlin (2026), Perplexity is reported to account for roughly 47% of all tracked AI citations across platforms; Claude is observed to cite more selectively per query in the same samples. |
Sources: Profound (2025) via Tagliaferro · Oltre 2,170-URL Claude analysis (2026) · Erlin 501-site analysis (2026) · 5W AI Platform Citation Source Index 2026 · 5W Trade Press AI Index 2026. 5W did not independently verify the third-party measurements.
What This Means Operationally
Brave Search ranking may serve as a leading indicator of Claude visibility. Profound’s 2025 analysis reported 86.7% URL overlap between Claude citations and Brave top organic results (not independently verified by 5W). Most communications programs do not measure Brave.
Premium long-form placements appear to convert in Claude — even when the same outlets are observed as invisible in ChatGPT. The Tier 1 strategy is not dead; the cited data suggests it is reweighted by platform.
URL structure appears to carry retrieval signal. Year tokens in slugs may act as a freshness cue; listicle paths may act as an extraction cue.
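The two measurable signals in this list, URL structure and overlap with a search engine's top results, can be checked with a short script. The sketch below is a minimal illustration and not the method used by Oltre or Profound: the listicle heuristic (a slug starting with `top-`, `best-`, or a one- or two-digit number) and the function names are assumptions, and the example URLs are placeholders.

```python
import re
from urllib.parse import urlparse

# A four-digit year (1900-2099) that is not part of a longer number.
YEAR_TOKEN = re.compile(r"(?<!\d)(?:19|20)\d{2}(?!\d)")
# Assumed listicle heuristic: slug begins with "top-", "best-", or a small number.
LISTICLE = re.compile(r"^(?:top|best|\d{1,2})-")


def url_features(url):
    """Flag the three URL traits named in the Claude analysis."""
    path = urlparse(url).path.lower()
    segments = [s for s in path.split("/") if s]
    slug = segments[-1] if segments else ""
    return {
        "blog_path": "/blog/" in path,
        "year_token": bool(YEAR_TOKEN.search(path)),
        "listicle": bool(LISTICLE.match(slug)),
    }


def overlap_share(cited_urls, search_urls):
    """Share of cited URLs that also appear in a set of search results."""
    cited = set(cited_urls)
    if not cited:
        return 0.0
    return len(cited & set(search_urls)) / len(cited)


print(url_features("https://example.com/blog/best-crm-tools-2026"))
print(overlap_share(["a", "b", "c", "d"], ["a", "b", "c", "x"]))
```

Running `url_features` over a brand's cited URLs gives shares comparable in form to the 56% / 47% / 24% figures, and `overlap_share` gives a figure comparable in form to the reported 86.7% overlap, though neither reproduces the original samples.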
05 / The Findings
Twelve Patterns That Define AI Citation Behavior
Each finding below is what the integrated dataset shows. Each is followed by what it means for communications strategy and the source(s) the finding rests on.
01
Reddit Is Infrastructure, Not a Channel
Reddit ranks #1 across every major AI engine measured.
Reddit ranks first in citation share across most major AI engines. Peec AI's 30-million-source analysis ranks Reddit number one across ChatGPT, Google AI Mode, Gemini, Perplexity, and AI Overviews. On Perplexity specifically, Evertune found Reddit accounts for as many as one in five of all citations.
The mechanism is structural. OpenAI announced a content licensing partnership with Reddit in 2024; Google has its own data agreement. SE Ranking's domain-level analysis found brands with millions of Reddit mentions averaged seven ChatGPT citations versus 1.8 for brands with minimal presence — a 3.9x multiplier.
What most people get wrong: this is not about posting. It is about presence and credibility over time. The platform's culture rewards substance, the LLMs subsequently cite the substance, and promotional behavior is filtered out within hours.
Sources: Peec AI 30M-source analysis; Evertune 200M-prompt analysis; SE Ranking 129K-domain study.
02
Wikipedia Is the Ground Truth Layer
The single most influential document in any brand's AI visibility profile.
Wikipedia is the most-cited single domain in ChatGPT (13.15% of U.S. citations) and a top source on every other major engine measured. It is widely documented across published research as a major training and citation source for the leading LLMs, and the most consistently retrieved authoritative source at inference time when models ground a factual answer.
If your Wikipedia page is weak, AI answers are weak. If it is missing, AI fills the gap, often incorrectly.
Correction to industry thinking: Wikipedia is not optional. The path to a strong page is not direct editing; Wikipedia's notability rules and edit reverts punish that. The path is earning citation-eligible secondary coverage that other editors then use to build the page.
Sources: Similarweb (Q1 2026); cross-referenced across Goodie, SEMrush, Spotlight.
03
LinkedIn Is the Fastest-Growing Signal
From rank #11 to #5 on ChatGPT in three months — the largest shift Profound observed all year.
LinkedIn climbed from approximately #11 on ChatGPT in November 2025 to #5 by February 2026 (Profound). SEMrush's 325,000-prompt study found LinkedIn cited in 14.3% of ChatGPT Search responses, 13.5% of Google AI Mode responses, and 5.3% of Perplexity responses. For B2B and software queries, Profound found LinkedIn is now the #1 most-cited domain across all six major AI platforms.
Critical nuance: ChatGPT and Google AI Mode pull approximately 59% of LinkedIn citations from individual member content. Perplexity inverts this, pulling about 59% from Company Pages. Both the leadership-publishing effort and the company-page operation matter — and they compound.
Leadership visibility is now a ranking factor. Most communications programs underinvest in named-leader publishing because it does not produce traditional earned-media metrics. The AI citation data overrules the traditional metric.
04
YouTube mentions show a 0.737 correlation with AI visibility, the strongest single predictor in any 2025–2026 study.
Ahrefs' December 2025 study of 75,000 brands found YouTube mentions correlated at 0.737 with appearances in ChatGPT, AI Mode, and AI Overviews — the strongest single correlation in their dataset.
AI engines read transcripts. Mentions persist indefinitely. The video itself is incidental — the transcript is the asset. A single ten-minute video with a substantive brand mention can generate citation lift for months.
The insight: a strong creator-led video mention can outperform a major media hit in AI visibility. Most communications programs do not budget against this. They should.
Source: Ahrefs 75K-brand correlation study, December 2025.
05
Forbes Is the Editorial Exception
The most-cited U.S. business publication on ChatGPT. WSJ, NYT, and Bloomberg do not appear in the top 20.
Forbes ranks 18th in Similarweb's ChatGPT dataset at 1.38% of all citations. The Wall Street Journal, The New York Times, Bloomberg, and Financial Times — all marquee Tier 1 PR targets — do not appear in the top 20 at all in this dataset.
Three structural reasons: paywalls limit body-text extraction; licensing disputes between LLM platforms and major news publishers have reduced indexing; long-form narrative features produce less clean factual extraction than tighter trade or contributor pieces.
Prestige does not equal extractability. Extractability does not equal citation.
This is not an argument to stop pitching Tier 1. Mainstream coverage retains its value for reputation, financial credibility, and as upstream feedstock to Wikipedia. It is an argument that an earned-media strategy concentrated in Tier 1 only is structurally underweighted on the AI citation layer.
Source: Similarweb AI Citation Analysis, Q1 2026.
06
Review Platforms Drive Decision Citations
Brands across G2, Capterra, Trustpilot, and Yelp see a 3x citation multiplier.
SE Ranking found brands listed on multiple review platforms averaged 4.6 to 6.3 ChatGPT citations versus 1.8 for absent brands. Peec AI confirmed Yelp and G2 specifically appear frequently in recommendation queries. Passionfruit's March 2026 synthesis found brands with G2, Capterra, Trustpilot, and Yelp profiles have approximately 3x higher citation probability than brands without them.
Review platforms function as third-party validation that AI engines treat as authoritative for vendor-comparison and recommendation queries. They provide structured ratings, comparative data, and clear extraction signals.
Action: claim and complete profiles on the three major platforms for the category. Encourage structured reviews — star ratings combined with specific use cases and pros/cons. Generic five-star reviews carry less signal than detailed mid-range reviews.
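The "roughly 3x" multiplier can be cross-checked against SE Ranking's averages quoted above. This is a minimal sketch of that arithmetic only; the variable names are illustrative, and it assumes the two studies are comparable, which the report does not establish.

```python
# Average ChatGPT citations per brand, as reported by SE Ranking.
ABSENT_BRANDS = 1.8
LISTED_BRANDS_RANGE = (4.6, 6.3)

# Ratio of listed-brand citations to absent-brand citations.
low, high = (round(value / ABSENT_BRANDS, 2) for value in LISTED_BRANDS_RANGE)
print(low, high)
```

The resulting range brackets the approximately 3x citation probability reported in Passionfruit's synthesis.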
07
Outside Wikipedia, Reddit, and OpenAI's own properties, no domain exceeds 3% of ChatGPT citations.
Wikipedia and Reddit sit in their own tier on ChatGPT, at 13.15% and 11.97% of citations respectively. Below them, the distribution flattens dramatically: in the Similarweb data, no other domain exceeds 3% of ChatGPT citations except OpenAI's own properties (6.21%). The remaining seventeen domains in the top 20 together account for roughly 32%, and the rest of the citation share spreads across thousands of long-tail sources. Evertune's separate tracking across 200 million prompts confirms the broader pattern — outside the dominant tier, citation share is broadly distributed rather than concentrated.
This is a fundamentally different distribution from traditional SEO, where the top 10 results capture roughly two-thirds of clicks. AI search citations are a long tail with a few outliers — not a winner-take-all market.
Traditional SEO rewards rank concentration. AI visibility rewards distribution across many sources.
The strategic consequence: getting mentioned across many high-citation third-party domains is more valuable than ranking your own .com higher. Distributed mentions produce measurable lift in three to six weeks.
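The share arithmetic in this finding can be reproduced from the Similarweb top-20 leaderboard. The sketch below uses the published percentages as-is; the variable names are illustrative.

```python
# Percent of ChatGPT citations per domain (Similarweb, Jan-Feb 2026, U.S.).
TOP20 = [
    ("wikipedia.org", 13.15), ("reddit.com", 11.97), ("openai.com", 6.21),
    ("walmart.com", 2.90), ("youtube.com", 2.67), ("linkedin.com", 2.42),
    ("reuters.com", 2.27), ("nih.gov", 2.22), ("google.com", 2.17),
    ("amazon (media-amazon)", 1.94), ("wikimedia.org", 1.93),
    ("facebook.com", 1.76), ("ebay.com", 1.75), ("amazon.com", 1.71),
    ("github.com", 1.62), ("apple.com", 1.48), ("yahoo.com", 1.44),
    ("forbes.com", 1.38), ("fandom.com", 1.29), ("squarespace-cdn.com", 1.29),
]

shares = [share for _, share in TOP20]
wiki_reddit = round(sum(shares[:2]), 2)    # the dominant tier
remaining_17 = round(sum(shares[3:]), 2)   # ranks 4 through 20
top20_total = round(sum(shares), 2)
long_tail = round(100 - top20_total, 2)    # everything outside the top 20

print(wiki_reddit, remaining_17, top20_total, long_tail)
```

On these figures the top 20 domains cover a little under two-thirds of citations, which leaves more than a third spread across sources outside the leaderboard.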
08
Fandom outranks Wikipedia in Google AI Mode. Structure plus depth beats brand authority.
Fandom.com leads Google AI Mode's citation list at 7.16% — ahead of Wikipedia (5.21%), YouTube (4.91%), and Reddit (4.19%). The reason is not simply that AI Mode sees lots of entertainment queries. It is that Fandom pages are structurally optimized for what AI engines prefer.
Fandom pages run thousands of words covering one specific subject, organized under precise headings, maintained by communities with encyclopedic precision. Each page exists to answer one question about one thing.
Generalizable lesson: any brand publishing deep, single-topic reference pages on its area of expertise is building the structure AI engines reward. AI rarely cites homepages — most citations come from pages several folders deep. Specific beats broad. Deep beats wide.
Source: Similarweb AI Citation Analysis (Google AI Mode), Q1 2026.
09
Volatility Is Structural
Reddit's ChatGPT share collapsed from ~60% to ~10% of prompt responses in two weeks (Sept 2025).
The biggest shift of 2025 was the September collapse. Across SEMrush's 230,000-prompt 13-week tracking study, ChatGPT's citation share for Reddit dropped from approximately 60% of prompt responses to roughly 10% in a two-week window. Wikipedia followed a similar pattern, falling from roughly 55% to under 20%. Both partially recovered.
Forbes doubled its ChatGPT citation share in the same period. LinkedIn trended upward. Some weight redistributed; some collapsed entirely.
Annual AI visibility audits are obsolete. Quarterly is the floor. Monthly is competitive advantage.
The platforms tune retrieval behavior aggressively. Rankings shift meaningfully on a multi-week timescale. Brands measuring annually are reporting against citation patterns that no longer exist.
Source: SEMrush 230K-prompt 13-week tracking study, November 2025.
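A monitoring cadence like the one recommended here comes down to comparing citation share between two measurement periods and flagging large moves. The sketch below is a minimal illustration: the 10-point threshold and the function name are assumptions, the Reddit and Wikipedia values are the approximate figures quoted above (with 20 standing in for "under 20%"), and the LinkedIn values are hypothetical.

```python
def flag_shifts(previous, current, threshold=10.0):
    """Return domains whose share moved by at least `threshold` points."""
    flagged = {}
    for domain in previous.keys() & current.keys():
        delta = current[domain] - previous[domain]
        if abs(delta) >= threshold:
            flagged[domain] = delta
    return flagged


# Share of prompt responses citing each domain, in percent.
before = {"reddit.com": 60.0, "wikipedia.org": 55.0, "linkedin.com": 12.0}
after = {"reddit.com": 10.0, "wikipedia.org": 20.0, "linkedin.com": 14.0}

flagged = flag_shifts(before, after)
print(flagged)
```

An annual comparison would report only the start and end points; running the same check monthly would surface a two-week swing while it is still actionable.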
10
Claude Exhibits Strong Premium-Publisher Bias
Different Tier 1, different mechanics, longer memory.
Claude’s journalism citation profile appears structurally different from ChatGPT’s. Across measured AI citation studies, Claude is observed to lean into The New York Times, The Atlantic, The New Yorker, and The Economist. Per the 5W AI Platform Citation Source Index 2026, approximately 36% of Claude’s journalism citations were drawn from the past 12 months — compared to roughly 56% for ChatGPT. Claude appears to reward older, authoritative long-form coverage more than recent news.
The retrieval mechanism is also reported to differ. Claude is reported to route through Brave Search as its web backend. According to Profound’s 2025 analysis (via Tagliaferro), 86.7% of Claude-cited URLs in that sample overlapped with Brave top organic results (as reported by Profound; not independently verified by 5W), suggesting Brave Search visibility may serve as a practical leading indicator of Claude citation lift.
Strategic consequence: Tier 1 placements that appear weak in ChatGPT may be central to Claude visibility based on the cited samples. Brands serving enterprise, legal, financial, and policy buyers should consider weighting premium long-form and Brave Search positioning alongside ChatGPT-optimized formats.
Sources: 5W AI Platform Citation Source Index 2026 · Profound (2025) via Tagliaferro · Oltre 2,170-URL Claude analysis (2026) · 5W Trade Press AI Index 2026. Directional indicators, not measurement standards.
11
Citations Happen at the Passage Level, Not the Page Level
Every paragraph can act as a retrieval unit. Structure beats length.
A single well-structured paragraph can earn a citation; the page around it can be ignored. Across the Oltre 2,170-URL Claude analysis: 56% of cited URLs sat under /blog/ paths, 47% used listicle structures, and 24% carried year tokens in the URL itself. The Pixelmojo synthesis of the Princeton/Georgia Tech GEO study (KDD 2024) reported that fluency plus statistics together outperformed any single tactic by an additional 5.5%.
The unit of optimization appears to be shifting from the article to the extractable claim — a clean sentence with a named entity, a specific number, and a date. Pages built as collections of citable atomic claims appear to outperform pages written as continuous narrative in the cited samples.
Example — citable paragraph (engine-ready): “Wikipedia accounts for 13.15% of ChatGPT citations in the U.S., according to Similarweb’s January–February 2026 dataset of approximately 600,000 citation events. Reddit follows at 11.97%.” Named source, specific number, explicit unit, date window. Every clause is verifiable and extracts cleanly.
One GEO recommendation: Rewrite the lead sentence of every top-traffic page so it contains one named entity, one specific number, and one timeframe. Then audit the next four paragraphs to ensure each contains at least one citable claim of the same form.
Sources: Oltre 2,170-URL Claude analysis (2026) · Pixelmojo synthesis of Princeton/Georgia Tech GEO Study (KDD 2024) · Erlin (2026). Directional indicators, not measurement standards.
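The lead-sentence audit recommended in this finding can be approximated with a simple check for the three signals: a named entity, a specific number, and a timeframe. The sketch below is a rough heuristic under stated assumptions, not a GEO standard: a named entity is approximated as any capitalized word after the first word, a timeframe as a four-digit year, and the function names are illustrative.

```python
import re

YEAR = re.compile(r"(?:19|20)\d{2}")
NUMERIC = re.compile(r"\d+(?:[.,]\d+)*%?")


def claim_signals(sentence):
    """Report which of the three citable-claim signals a sentence carries."""
    words = sentence.split()
    tokens = NUMERIC.findall(sentence)
    return {
        # Assumption: a capitalized word after the first word is a name.
        "named_entity": any(word[0].isupper() for word in words[1:]),
        # A numeric token that is not just a year counts as a specific number.
        "number": any(not YEAR.fullmatch(token) for token in tokens),
        # A bare four-digit year counts as a timeframe.
        "timeframe": any(YEAR.fullmatch(token) for token in tokens),
    }


def is_citable(sentence):
    return all(claim_signals(sentence).values())


strong = ("Wikipedia accounts for 13.15% of ChatGPT citations in the U.S., "
          "according to Similarweb's January-February 2026 dataset of "
          "approximately 600,000 citation events.")
weak = "Our platform is trusted by many leading companies."

print(is_citable(strong), is_citable(weak))
```

A check like this only flags missing signals; whether the number and source are accurate still needs editorial review.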
12
The Trade Press Has Been Reranked — And It Has Clear Winners
PCMag appears more consistently than TechCrunch. Skift outpaces prestige titles. Axios outranks most mainstream peers across measured AI citation studies.
The 5W Trade Press AI Index 2026 synthesizes six published citation studies covering approximately 680 million citations across nine industry sectors. The synthesis surfaces a pattern: AI engines appear to have quietly reweighted the trade press in the cited datasets, and the rerank has named winners. PCMag is observed to lead technology. Skift leads travel. STAT leads healthcare. Bloomberg leads financial services. Axios leads public affairs. Prestige titles including TechCrunch are observed to be losing citation ground across measured studies.
Across the synthesized data, the top 15 domains across all platforms appear to capture approximately 68% of all consolidated AI citation share. In technology, PCMag (0.8%–1.6% share), TechRadar (0.3%–1.9%), and CIO.com (0.6%–2.1%) are reported in the top 10 across six to seven engines — out-citing higher-traffic general-news outlets.
Strategic consequence: the PR media list inherited from the 2010s no longer appears to reflect which placements carry AI citation weight. The list should be reaudited by vertical, by engine, and against the rerank winners. Trade press that looks unglamorous on a tearsheet may be the single highest-yield placement in the AI citation layer.
Source: 5W Trade Press AI Index 2026 (published on everything-pr.com), synthesizing data from Lantern, Similarweb, SEMrush, Profound, Peec AI, and Goodie. 5W did not independently run the underlying citation measurements.
06 / Platform Differences
Five Engines. Five Different Citation Patterns.
There is no single “AI SEO.” Each engine sources differently. A strategy that produces results on one platform is not transferable to another.
| Platform | Top 5 Cited Domains (observed across cited studies) | Defining Pattern (as reported) |
| --- | --- | --- |
| ChatGPT | Wikipedia, Reddit, OpenAI, Walmart, YouTube | Most Wikipedia-heavy. ~56% of journalism citations reported from past 12 months. Recent-news bias observed. |
| Claude | Wikipedia, Reddit, NYT, The Atlantic, The Economist | Reported to route through Brave Search. Premium long-form bias. Longer memory: ~36% of journalism citations recent. Profound (2025) reported 86.7% URL overlap with Brave top organic; not independently verified. |
| Google AI Mode | Fandom, Wikipedia, YouTube, Reddit, Google | Observed to favor Google-owned properties. Reported to cite ~9 domains per query. |
| Gemini | Reddit, YouTube, Wikipedia, Medium, Forbes | Observed to integrate Google search results directly. Strong traditional SEO converts to Gemini visibility more than to other engines. |
| Perplexity | Reddit, LinkedIn, NIH, Microsoft, G2 | Research-credible bias. Per Rankeo (2026), reported to account for ~47% of tracked AI citations across platforms. Most footnote-explicit. |
| AI Overviews | YouTube, Reddit, Forbes, LinkedIn, Wikipedia | Reported to cite ~7.7 domains per query. YouTube observed in ~29.5% of AI Overviews, the highest video weight of any engine measured. |
Sources: Similarweb (Jan–Feb 2026) · Peec AI 30M-source analysis · Lantern AI Citation Content Visibility Report (Feb 2026, 200M citations) · 5W AI Platform Citation Source Index 2026 · 5W Trade Press AI Index 2026 · Profound · SEMrush · Rankeo (2026).
Key Patterns
Reddit appears in the top five on every platform. Universal channel.
YouTube is the most-cited domain in AI search by a significant margin once aggregated — Lantern’s 200M-citation analysis reports more than 2× the citation share of the second-ranked domain.
Claude and Perplexity diverge most sharply from ChatGPT. A brand visible in ChatGPT can be invisible in Claude, and vice versa.
AI Mode cites approximately 9 domains per query; AI Overviews cites 7.7. Wider citation pools demand a wider citation strategy.
ChatGPT is the most Wikipedia-heavy. Strong Wikipedia content disproportionately moves ChatGPT visibility.
Gemini integrates Google search results directly. Strong traditional SEO converts to Gemini visibility more than to any other engine.
AI Platform Citation Behavior Matrix
Directional ratings based on the integrated dataset. HIGH / MEDIUM / LOW indicate relative weight of each citation dimension within that engine’s citation mix — not absolute citation volume across engines.
| Platform | Recency | Trade Press | Wikipedia | Reddit | Video | Long-form Journalism |
|---|---|---|---|---|---|---|
| ChatGPT | HIGH | MEDIUM | HIGH | HIGH | MEDIUM | LOW |
| Claude | LOW | MEDIUM | HIGH | HIGH | MEDIUM | HIGH |
| Gemini | MEDIUM | MEDIUM | HIGH | HIGH | HIGH | MEDIUM |
| Perplexity | MEDIUM | HIGH | MEDIUM | HIGH | LOW | LOW |
| AI Overviews | MEDIUM | MEDIUM | MEDIUM | HIGH | VERY HIGH | MEDIUM |
Ratings are qualitative synthesis, not normalized platform measurements. Drawn from the integrated dataset of eleven sources cited in this report. Patterns shift on multi-week timescales; rerun quarterly.
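For teams that want to query the matrix rather than read it, the ratings above can be held in a small lookup structure. The sketch below transcribes the published ratings as-is; the names DIMENSIONS, MATRIX, and engines_rated are illustrative choices, not part of the audit.

```python
# Directional ratings transcribed from the matrix above.
# Column order matches DIMENSIONS.
DIMENSIONS = [
    "Recency", "Trade Press", "Wikipedia",
    "Reddit", "Video", "Long-form Journalism",
]

MATRIX = {
    "ChatGPT": ["HIGH", "MEDIUM", "HIGH", "HIGH", "MEDIUM", "LOW"],
    "Claude": ["LOW", "MEDIUM", "HIGH", "HIGH", "MEDIUM", "HIGH"],
    "Gemini": ["MEDIUM", "MEDIUM", "HIGH", "HIGH", "HIGH", "MEDIUM"],
    "Perplexity": ["MEDIUM", "HIGH", "MEDIUM", "HIGH", "LOW", "LOW"],
    "AI Overviews": ["MEDIUM", "MEDIUM", "MEDIUM", "HIGH", "VERY HIGH", "MEDIUM"],
}


def engines_rated(dimension, levels=("HIGH", "VERY HIGH")):
    """List the engines whose rating on `dimension` falls in `levels`."""
    column = DIMENSIONS.index(dimension)
    return [engine for engine, ratings in MATRIX.items() if ratings[column] in levels]


print(engines_rated("Video"))
print(engines_rated("Long-form Journalism"))
```

Because the ratings are qualitative and relative within each engine, a lookup like this is suited to prioritizing channels per engine, not to comparing citation volume across engines.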
07 / Industry Patterns
Citation Behavior Varies Sharply by Vertical
The presence and quality of vertical trade media is one of the strongest predictors of how a category is described in AI. Categories with strong specialized trades see those trades dominate. Categories without them see Reddit, Wikipedia, and review sites fill the vacuum.
TripAdvisor, Yelp, Reddit (r/travel), Skift, Hotel Management
Cannabis: MJBizDaily, Marijuana Moment, Leafly, Reddit (r/trees, state subreddits)
Legal: Law360, Above the Law, ALM properties, bar associations, case law
CPG: Modern Retail, Retail Dive, Food Dive, AdAge, Reddit, review aggregators
Synthesis based on patterns observed across the eleven source datasets and 5W’s industry experience.
The Overarching Pattern
In high-stakes verticals — healthcare, finance, legal — government, academic, and authoritative sources carry disproportionate weight. The models recognize where source authority matters most.
In consumer-facing verticals — beauty, travel, CPG — community platforms, review aggregators, and influencer-published content lead. Trust signals are distributed across many sources rather than concentrated in editorial brands.
In B2B — SaaS, professional services — review platforms (G2, Capterra) and LinkedIn lead, with vertical trades providing the editorial layer. Wikipedia matters less than in consumer categories.
08 / The Operator Playbook
What This Means for Brands
5W Recommends
Four Moves Now
Drawn from the patterns observed in this audit. Sequenced for execution.
1. Audit Brave for Claude. Brave Search ranking is reported by Profound (2025) as a leading indicator of Claude citation visibility. Most communications programs do not measure Brave. Start there for any brand whose buyer mix includes Claude users.
2. Build Reddit, YouTube, and LinkedIn citation surfaces. Reddit appears in the top five citations of every major engine measured. YouTube appears in approximately 29.5% of AI Overviews. LinkedIn climbed from approximately rank #11 to #5 on ChatGPT in three months (Profound). All three are distributed-authority assets, not social channels.
3. Rewrite top pages into citable claim blocks. Per Finding 11 above: every paragraph can act as a retrieval unit. Lead each section with a named entity, a specific number, and a date. Make the page a collection of extractable claims.
4. Re-rank media lists by AI citation value, not prestige. The PR media list inherited from the 2010s no longer reflects which placements carry AI citation weight in the cited samples. PCMag is reported more consistently than TechCrunch. Skift outpaces prestige titles. Reaudit by vertical, engine, and rerank winners.
AI engines reward distribution, not concentration. The brand that appears in many places consistently beats the brand that appears in one place authoritatively.
To win in AI visibility, brands must execute against four mutually reinforcing levers. None is optional. Each one feeds the others.
01 Control Your Ground Truth
Wikipedia page — accurate, complete, well-sourced to citation-eligible publications.
Owned site — About page, leadership bios, product pages, and newsroom written in the language you want repeated by AI.
Schema markup and structured data on every page that matters.
Press releases and corporate communications consistent with your ground-truth language.
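As one illustration of the schema markup item above, the sketch below builds a schema.org Organization record and wraps it as a JSON-LD script tag. Every value is a placeholder for a hypothetical brand, and to_json_ld_script is a helper name invented for this example. Which properties a given page needs is a judgment call this audit does not specify.

```python
import json

# Placeholder values for a hypothetical brand; replace each one with the
# brand's own ground-truth language and verified profile URLs.
organization = {
    "@context": "https://schema.org",
    "@type": "Organization",
    "name": "Example Brand",
    "url": "https://www.example.com",
    "description": "One-sentence ground-truth description of the brand.",
    "sameAs": [
        "https://en.wikipedia.org/wiki/Example",
        "https://www.linkedin.com/company/example",
    ],
}


def to_json_ld_script(data):
    """Wrap a dict as a JSON-LD <script> tag for embedding in a page."""
    payload = json.dumps(data, indent=2)
    return f'<script type="application/ld+json">\n{payload}\n</script>'


snippet = to_json_ld_script(organization)
print(snippet)
```

Keeping the description field identical to the wording used on the About page and in press releases is one way to apply the consistency point in this list.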
02 Build Distributed Authority
Reddit — active brand presence, founder/operator participation, AMAs, expert contribution.
LinkedIn — named-leader publishing on a weekly cadence; active company page.
03 Create Extractable Content
Case studies with specific numbers, named clients, and structured outcomes.
FAQs that answer one question per page in clear, extractable language.
Deep vertical reference pages that own a single topic decisively.
Original research and proprietary data — citations compound on findings nobody else has.
04 Increase Repetition Across Sources
Mentions appear to matter more than backlinks.
Distribution across many sources appears to matter more than ranking in any one.
Earned coverage in citation-eligible publications feeds Wikipedia upstream.
Repetition across the surfaces AI engines pull from creates the consensus signal that drives citation.
FAQ
Frequently Asked Questions
Which sources does ChatGPT cite most often?
Per Similarweb’s January–February 2026 dataset of approximately 600,000 citation events in the U.S., the top three domains ChatGPT cites are Wikipedia (13.15%), Reddit (11.97%), and OpenAI.com (6.21%). Wikipedia and Reddit together account for over 25% of all measured ChatGPT citations. Note: AI citation patterns are volatile and shift on multi-week timescales; refer to the latest edition of the 5W Citation Source Audit for current data.
Does Claude use Brave Search?
Claude is reported to use Brave Search as its web retrieval backend. According to Profound’s 2025 analysis, reported via Tagliaferro, 86.7% of Claude-cited URLs in that sample overlapped with Brave’s top organic search results, which suggests that Brave Search ranking may be a leading indicator of Claude citation visibility. 5W did not independently verify this finding.
Why does Reddit appear so often in AI citations?
Reddit appears in the top five most-cited domains across all major AI engines measured, including ChatGPT, Google AI Mode, Gemini, Perplexity, and Google AI Overviews. The structural reasons reported across the cited studies: OpenAI announced a content licensing partnership with Reddit in 2024; Google has a separate data agreement; and Reddit’s substantive, threaded content extracts cleanly into AI answers. Per Peec AI’s 30-million-source analysis, Reddit is observed as a universal top-five citation source across all major engines in the cited sample.
What is AI citation share?
AI citation share is the percentage of an AI engine’s cited sources that come from a given domain, measured across a fixed sample of prompts or responses. It is the AI-era equivalent of search market share. A brand’s AI citation share is the percentage of relevant prompts for which an AI engine references the brand, its domain, or third-party content about the brand. 5W publishes quarterly Citation Source Audits to track citation share across ChatGPT, Claude, Perplexity, Gemini, and Google AI Overviews.
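Both readings of the definition above reduce to simple ratios. The Python sketch below computes a domain-level share over citation events and a brand-level share over prompts. The sample data is invented for illustration and the function names are assumptions, not a published 5W method.

```python
from collections import Counter


def citation_share(cited_domains):
    """Domain-level share: each domain's percentage of all citation events."""
    counts = Counter(cited_domains)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {domain: 100 * n / total for domain, n in counts.items()}


def brand_prompt_share(citations_by_prompt, brand_domain):
    """Brand-level share: percentage of prompts whose citations include the brand's domain."""
    if not citations_by_prompt:
        return 0.0
    hits = sum(1 for domains in citations_by_prompt if brand_domain in domains)
    return 100 * hits / len(citations_by_prompt)


# Hypothetical sample: 8 citation events and 4 prompts (not real measurements).
events = ["wikipedia.org"] * 3 + ["reddit.com"] * 2 + ["youtube.com"] * 2 + ["example.com"]
prompts = [
    ["wikipedia.org", "example.com"],
    ["reddit.com"],
    ["example.com", "youtube.com"],
    ["wikipedia.org"],
]

print(citation_share(events))
print(brand_prompt_share(prompts, "example.com"))
```

The two numbers answer different questions and use different denominators (citation events versus prompts), so they should not be compared directly. The same caution applies when comparing studies that count in different units.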
How do brands improve AI visibility?
Four mutually reinforcing levers, drawn from the patterns observed across the cited datasets: (1) Control ground truth — accurate, well-sourced Wikipedia page; clear owned-site language; structured data and schema markup. (2) Build distributed authority — active Reddit presence, named-leader LinkedIn publishing, YouTube creator mentions, claimed review-platform profiles. (3) Create extractable content — case studies with specific numbers, FAQs answering one question per page, deep vertical reference pages, original research with proprietary data. (4) Increase repetition across sources — mentions across many citation-eligible sources appear to matter more than ranking on any one. Given the multi-week volatility documented in this audit, programs should consider measuring AI visibility quarterly rather than annually.
Lantern (Feb 2026). AI Citation Content Visibility Report. 200M+ citations across ChatGPT, Perplexity, Gemini, and Claude.
5W (May 2026). The AI Platform Citation Source Index 2026 — 50 sources ranked across five engines. Published on everything-pr.com and 5wpr.com.
5W (May 2026). The Trade Press AI Index 2026 — across nine industries. Synthesizes six published citation studies, ~680M citations. Published on everything-pr.com.
Oltre (2026). How Claude Picks Sources: Technical Breakdown — 2,170-URL Claude analysis.
Pixelmojo (Apr 2026). GEO playbook synthesis of Princeton/Georgia Tech GEO Study (KDD 2024).
Erlin (2026). 501-site Claude SEO analysis.
Profound (2025) via Tagliaferro — Claude/Brave Search URL overlap analysis.
What 5W Did Not Verify
5W did not run the underlying primary research in this Q1 edition. The Q2 edition will include 5W's own primary research run as described above.
Where studies disagree at the margin, the largest dataset is generally weighted most heavily; specific disagreements are surfaced in the body of each finding.
All percentages, rankings, and correlations are reported as published by the original researchers.
The June 2026 refresh integrates two 5W research properties (the AI Platform Citation Source Index 2026 and the Trade Press AI Index 2026). Both were produced by 5W Research from synthesized third-party data; 5W did not independently run the underlying primary measurements for either. Where these properties are cited in this Q1 audit, readers should consult each property’s published methodology for full sourcing detail.
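The weighting note above (the largest dataset is generally weighted most heavily) can be read in more than one way. One plausible interpretation, shown here purely as an assumption, is a sample-size-weighted mean of the shares reported by each study. The figures in the example are hypothetical and weighted_share is an illustrative name. The audit does not state that this exact formula was used.

```python
def weighted_share(estimates):
    """Combine (share_percent, sample_size) pairs into a sample-size-weighted mean.

    Assumption: weighting in proportion to sample size. The audit says only
    that the largest dataset is generally weighted most heavily.
    """
    total = sum(size for _, size in estimates)
    if total == 0:
        raise ValueError("at least one estimate needs a non-zero sample size")
    return sum(share * size for share, size in estimates) / total


# Hypothetical figures: two studies reporting the same domain's share.
estimates = [
    (12.0, 1_000_000),  # smaller study
    (10.0, 3_000_000),  # larger study, pulls the result toward 10.0
]
combined = weighted_share(estimates)
print(combined)
```

A weighted mean only makes sense when the studies measure the same unit. Shares counted per citation event and per prompt would need to be reconciled before being combined.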
Limitations
This is a U.S.-focused synthesis. International citation patterns are not covered in this edition.
The September 2025 volatility event documented in Finding 9 is a clear warning that citation patterns shift on multi-week timescales. Findings holding in Q1 2026 may shift by Q2.
Some claims about LLM training data composition are widely discussed in the trade press but not fully documented by the model developers themselves. Where this is the case, the report uses cautious language and stops short of asserting specific training-weight figures.