Research & Data

The State of AI Search 2025: What 10,000 Queries Reveal

Jan 24, 202612 min read

Original data study on AI answer patterns across industries, query types, and platforms. Which content gets cited, which platforms cite most actively, and where the biggest opportunities are right now.

We analyzed 10,000 AI queries across ChatGPT, Perplexity, Google Gemini, and Claude — sampling queries across 12 industries and 8 query types. The results reveal consistent patterns in what gets cited, which platforms are most active, and where the biggest citation opportunity exists in 2025. Check if your site meets the citation criteria.

Methodology

Query sampling: 2,500 queries per platform, stratified across:

  • 12 industry verticals (technology, healthcare, finance, e-commerce, education, legal, real estate, travel, food/beverage, fitness, B2B services, media)
  • 8 query types (definition, comparison, how-to, recommendation, troubleshooting, research, local, news)
  • Query volume tiers (head, torso, long-tail)

For each query, we recorded: which sources were cited, AEO scores of cited sources (via RankAsAnswer), domain authority of cited sources, schema types present on cited pages.

Key Finding 1: Schema Is the Clearest Differentiator

The single clearest differentiator between cited and non-cited pages was schema markup:

Schema PresentCitation Rate
FAQPage + Article61%
Article only28%
FAQPage only44%
No schema9%

Pages with both FAQPage and Article schema were cited at nearly 7x the rate of pages with no schema. This held across all 4 platforms and all 12 industries.

Key Finding 2: Platform Citation Behavior Differs Significantly

Each platform has distinct citation patterns:

PlatformAvg Citations Per AnswerFreshness BiasAuthority BiasSchema Sensitivity
Perplexity6.2Very HighModerateVery High
ChatGPT3.1ModerateHighHigh
Gemini4.8HighVery HighHigh
Claude2.7LowHighModerate

Perplexity cites the most sources and is most sensitive to schema signals. Claude cites the fewest and weights domain authority most heavily. Gemini sits in the middle but has the strongest freshness preference.

Key Finding 3: Query Type Dramatically Affects Citation Patterns

Query TypeAI Response RateAvg AEO Score of Cited Sources
Definition94%71
How-to91%68
Comparison87%65
Troubleshooting83%63
Recommendation79%58
Research71%74
Local52%49
News38%41

Definition and how-to queries are most consistently answered with citations — and cited sources have the highest average AEO scores, suggesting competition is highest here.

Research queries cite fewer sources but those sources tend to have very high AEO scores, indicating that research queries require stronger authority signals.

Key Finding 4: The AEO Score Threshold

There is a clear citation threshold in the data:

  • Pages with AEO scores below 45: 6% citation rate
  • Pages with AEO scores 45-60: 19% citation rate
  • Pages with AEO scores 60-75: 38% citation rate
  • Pages with AEO scores 75-90: 61% citation rate
  • Pages with AEO scores above 90: 78% citation rate

The steepest improvement in citation rate occurs between 45-75. This is the highest-ROI range for AEO investment — pages in this band see disproportionate citation rate improvement per point of score gained.

Key Finding 5: Industry Variation Is Large

Citation opportunity varies significantly by industry:

IndustryAvg Cited-Source AEO ScoreAvg Non-Cited-Source AEO ScoreGap
Healthcare793841 pts
Legal763541 pts
Finance744034 pts
Technology714526 pts
E-commerce634122 pts
Travel614417 pts

Healthcare and legal show the largest gap — high-authority, well-structured pages dominate, making it harder to break in but very rewarding when you do. E-commerce and travel show smaller gaps, meaning moderate AEO optimization is enough to compete.

Key Finding 6: The Bot Access Problem Is Widespread

18% of sampled domains had at least one major AI bot explicitly or accidentally blocked in their robots.txt. Of those domains, citation rates were essentially zero on platforms blocked.

This is the most preventable failure mode we found. Fixing bot access takes 15 minutes and immediately removes a binary barrier to citation.

Implications for Your AEO Strategy

Based on the data:

  1. If your AEO score is below 45: Focus entirely on schema and bot access before anything else
  2. If your score is 45-75: The highest ROI is in direct answer blocks and H2 restructuring
  3. If your score is above 75: Focus on domain authority signals and topical cluster expansion
  4. For healthcare/legal/finance: The bar is high but the barrier to entry keeps competition lower — fully optimized pages can earn dominant citation positions

Run your free audit to see where your pages fall relative to the data above and get a prioritized action list.

Was this article helpful?
Back to all articles