GEO Score: How to Measure Your Website's AI Visibility

You cannot improve what you do not measure. And in GEO (generative engine optimization), measurement is the main obstacle. There is no Google Search Console for ChatGPT. No rank report in Perplexity. No click data from Gemini.

Yet AI visibility can be measured. Not with the same tools or metrics as SEO, but with a structured approach that gives you a clear diagnostic and actionable priorities.

This article explains what a GEO Score is, how it is calculated, how to interpret your results, and, most importantly, where to start to improve your visibility in AI answers.

Why measuring AI visibility is hard

In SEO, the measurement ecosystem is mature. Google Search Console tells you which queries lead to your site, which pages are indexed, what your CTR is. Tools like Ahrefs, Semrush, or Moz track your rankings daily.

In GEO, none of that exists — for three fundamental reasons.

AI answers are dynamic. Unlike Google SERPs, which are relatively stable, ChatGPT or Perplexity answers can change with every query. Asking the same question twice can produce different answers, citing different sources. There is no fixed "position 1."

AI engines do not share their data. OpenAI, Google, and Anthropic do not provide an analytics console for site publishers. You cannot know how many times ChatGPT cited your site, or for which queries.

Citations are contextual. An AI may cite your site for one query and not for another very similar one. The citation depends on the conversation context, the question wording, and sometimes even the user's profile.

Despite these challenges, two complementary approaches let you measure your AI visibility.

Approach 1: manual testing

This is the most direct method. It consists of querying AI engines the way a potential customer would, and observing whether your site appears in the answers.

How to proceed

Step 1 — Define your target queries

List 10 to 15 questions your customers actually ask. Think in three categories:

Recommendation queries: "What is the best [service] in [city]?", "What solution for [problem]?"
Expertise queries: "How do I [do X]?", "What's the difference between [A] and [B]?"
Brand queries: "What does [your company] do?", "Reviews of [your product]"

Step 2 — Query 3 AI engines

Ask each question on ChatGPT, Gemini, and Perplexity. For each answer, note:

Is your site cited as a source? (yes/no)
Is your brand mentioned? (yes/no)
Which competitors are cited?
Does the answer reference content from your site?

Step 3 — Analyze the results

Calculate your presence rate: across your 10-15 queries × 3 engines (30 to 45 answers in total), in what percentage of answers do you appear? Below 20%, you have a significant AI visibility problem.
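The presence-rate calculation can be sketched in a few lines of Python. This is a minimal illustration, not Detekia's method: the `Observation` record, its field names, and the sample queries are hypothetical, and it assumes that "appearing" means either your site is cited or your brand is mentioned.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    """One answer from one engine for one query (hypothetical record layout)."""
    query: str
    engine: str
    site_cited: bool       # your site is listed as a source
    brand_mentioned: bool  # your brand is named in the answer text


def presence_rate(observations):
    """Return the percentage of answers in which the site or brand appears."""
    if not observations:
        raise ValueError("no observations recorded")
    # Assumption: a citation OR a brand mention counts as "appearing".
    hits = sum(1 for o in observations if o.site_cited or o.brand_mentioned)
    return 100.0 * hits / len(observations)


# Illustrative data only: 2 queries x 3 engines.
observations = [
    Observation("best CRM for small agencies", "ChatGPT", True, True),
    Observation("best CRM for small agencies", "Gemini", False, False),
    Observation("best CRM for small agencies", "Perplexity", False, True),
    Observation("how to migrate CRM data", "ChatGPT", False, False),
    Observation("how to migrate CRM data", "Gemini", False, False),
    Observation("how to migrate CRM data", "Perplexity", False, False),
]

rate = presence_rate(observations)
print(f"Presence rate: {rate:.1f}%")
print("Significant visibility problem" if rate < 20 else "Above the 20% threshold")
```

Keeping one record per query and engine also makes it easy to rerun the same list later and compare the two snapshots.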

Limitations of manual testing

This approach gives you a snapshot at a single point in time, not ongoing tracking. It is also subjective: results vary based on wording and context. It is a good starting point, but it is not enough to drive a GEO strategy.

Approach 2: technical audit

The technical audit analyzes your site on objective criteria that determine AI citation-worthiness. Unlike manual testing, it measures citation potential — the technical and editorial conditions that maximize your chances of being selected as a source.

This is the approach Detekia uses: an automated audit that evaluates your site on 8 weighted criteria and assigns a score out of 100.

The 8 criteria of the Detekia GEO Score

Each criterion is measured by analyzing the actual DOM of your page — not by estimation or approximation.

1. Extractability & direct answer (25 points)

This criterion measures whether your content contains answers ready to be extracted by an AI. The analysis checks:

The presence of a substantial introduction in the first 100 words
Informational density of paragraphs (facts, figures, definitions vs. vague text)
Use of lists and tables that structure information
Presence of direct answers to the page's implicit questions

Typical score observed: most sites score between 8 and 15 out of 25. Sites with vague commercial introductions or homepages without informational content fall below 8.

2. Verifiability & evidence (20 points)

This criterion measures the density of verifiable evidence in your content: precise statistics and figures, named sources (studies, reports, institutions), reference dates, concrete examples and use cases.

Typical score observed: B2B sites with expert content score between 12 and 18. Sites with purely commercial messaging fall to 4-8.

3. Authority & E-E-A-T (15 points)

This criterion evaluates expertise and credibility signals: a complete About page with verifiable information, identified authors on content, complete legal pages, trust signals (certifications, client references, partner logos).

Typical score observed: 6 to 10 out of 15 for SMBs. Freelancers without an About page fall to 2-4. Companies with identified authors and a complete institutional page reach 12-15.

4. AI crawlability (15 points)

This criterion verifies whether AI bots can access your content:

Analysis of the robots.txt file for AI user-agents (GPTBot, ClaudeBot, PerplexityBot, Google-Extended)
Presence of an llms.txt file
Content accessibility without mandatory JavaScript
Accessible XML sitemap

Typical score observed: highly variable. Sites blocking AI bots (often unknowingly) score 0 to 3. Those not blocking but without an llms.txt score 8-10. Fully optimized sites reach 13-15.
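The robots.txt part of this check can be approximated with Python's standard library. The sketch below is not Detekia's implementation: the function name, the sample robots.txt, and the `https://example.com/` URL are placeholders, and it only tests whether each of the four user-agents named above may fetch the homepage.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens taken from the criterion description above.
AI_USER_AGENTS = ["GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended"]


def blocked_ai_agents(robots_txt, url="https://example.com/"):
    """Return the AI user-agents that the given robots.txt forbids from fetching url."""
    parser = RobotFileParser()
    # parse() accepts the file content as a list of lines, so no network call is needed.
    parser.parse(robots_txt.splitlines())
    return [ua for ua in AI_USER_AGENTS if not parser.can_fetch(ua, url)]


# Illustrative robots.txt: GPTBot is blocked, every other agent is allowed.
sample = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""

print(blocked_ai_agents(sample))  # ['GPTBot']
```

To run it against a live site, download the site's /robots.txt first and pass its text to the function. An empty list means none of the four agents is blocked for that URL.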

→ To fix crawlability issues, read "llms.txt, robots.txt, and AI crawlability."

5. Structured data (10 points)

This criterion analyzes the presence and quality of Schema.org markup: Organization on the homepage, Article or BlogPosting on editorial content, FAQPage on Q&A pages, markup validity (no errors).

Typical score observed: 0 to 3 for sites without structured data (roughly 23% of the web). 5-7 for those with basic markup. 8-10 for sites with complete, valid markup.
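As an illustration of basic markup, the following Python sketch builds an Organization JSON-LD block for a homepage. The company name, URLs, and the helper name `organization_jsonld` are placeholders; `name`, `url`, `logo`, and `sameAs` are standard Schema.org Organization properties, but which properties an audit rewards is not specified here.

```python
import json


def organization_jsonld(name, url, logo_url, same_as):
    """Return a <script> tag containing Schema.org Organization markup as JSON-LD."""
    data = {
        "@context": "https://schema.org",
        "@type": "Organization",
        "name": name,
        "url": url,
        "logo": logo_url,
        "sameAs": list(same_as),  # profiles on other platforms
    }
    payload = json.dumps(data, indent=2)
    # Escape "</" so a stray "</script>" inside a value cannot close the tag early.
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


# Placeholder values for illustration only.
print(organization_jsonld(
    name="Example Co",
    url="https://www.example.com",
    logo_url="https://www.example.com/logo.png",
    same_as=["https://www.linkedin.com/company/example-co"],
))
```

The resulting tag goes in the homepage HTML. Running the output through a Schema.org validator before publishing helps catch the markup errors this criterion penalizes.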

→ Implementation guide in "Schema.org and AI: the practical guide."

6. Editorial neutrality (10 points)

This criterion evaluates your content's tone, as analyzed by an AI model: absence of unsourced superlatives ("the best," "the leader," "unmatched"), factual and informative tone rather than promotional, presence of balanced comparisons, absence of emotional manipulation.

Typical score observed: 4 to 7 for most sites. Sites with a very commercial tone fall to 2-3. Editorial and technical sites reach 8-10.

7. External presence (5 points)

This criterion evaluates your visibility outside your own site: mentions on third-party sites (press, forums, directories), active profiles on recognized platforms (LinkedIn, Reddit), citations in other publishers' content.

Typical score observed: 1 to 3 for SMBs and startups. Companies with an active PR strategy reach 4-5.

8. Freshness & maintenance (5 points)

This criterion checks how current your content is: visible last-modified date, update frequency, absence of obsolete content (expired dates, old stats).

Typical score observed: 2 to 3 for sites updated regularly. 0-1 for sites whose last content is over a year old.
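A simple way to spot content at risk on this criterion is to list pages whose last-modified date is more than a year old. The sketch below assumes you already have a mapping of page paths to dates (for example, exported from your CMS). The paths, dates, and the 365-day cutoff are illustrative choices, not Detekia's scoring rule.

```python
from datetime import date


def stale_pages(last_modified_by_url, today, max_age_days=365):
    """Return the URLs whose last-modified date is more than max_age_days old."""
    return sorted(
        url
        for url, modified in last_modified_by_url.items()
        if (today - modified).days > max_age_days
    )


# Hypothetical pages and dates.
pages = {
    "/blog/geo-basics": date(2025, 3, 10),
    "/blog/old-study": date(2023, 1, 5),
    "/about": date(2024, 11, 20),
}

print(stale_pages(pages, today=date(2025, 6, 1)))  # ['/blog/old-study']
```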

How to interpret your score

Thresholds

75 – 100: Excellent

Your site is well-positioned to be cited by AI engines. Focus on criteria that are not yet maxed out and on tracking over time.

45 – 74: Average

You have foundations, but significant gaps prevent your site from being cited regularly. Identify the 2-3 weakest criteria and fix them first.

0 – 44: Low

Your site has little chance of being cited by AI in its current state. Structural corrections are needed. The good news: the margin for improvement is large and early gains will be fast.
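The three bands translate directly into a small lookup. This is only a sketch of the thresholds listed above; the function name is illustrative.

```python
def interpret_score(score):
    """Map a GEO Score out of 100 to the band names used in this article."""
    if not 0 <= score <= 100:
        raise ValueError("score must be between 0 and 100")
    if score >= 75:
        return "Excellent"
    if score >= 45:
        return "Average"
    return "Low"


for example in (38, 60, 82):
    print(example, interpret_score(example))
```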

The average score observed

Across all sites analyzed by Detekia, the average score is around 38/100. Most sites are not at all optimized for generative engines. That is bad news for them, but good news for you: by optimizing now, you gain a considerable head start over your competitors.

The most commonly failing criterion

Extractability is the criterion most frequently under-optimized. Most sites have content, but that content is not structured to be extracted and cited by an AI. It is also the most heavily weighted criterion (25 points), so it offers the biggest potential gains.

How to read the thematic groups

Detekia organizes the 8 criteria into 3 thematic groups that make reading and prioritization easier:

AI Readability (50 pts max)

Criteria: Extractability + Crawlability + Structured data

This group measures whether AI engines can access your content and understand it. This is the technical baseline. If this group is weak, AI cannot cite you — even if your content is excellent.

PRIORITY: High — fixes are often quick (robots.txt, Schema.org, intro restructuring)

Credibility (45 pts max)

Criteria: Verifiability + Authority + Neutrality + External presence

This group measures whether AI engines trust your content. Even if they can read it, they will not cite it if it is not perceived as credible.

PRIORITY: Medium — requires more work (sourcing claims, building external presence, adjusting tone)

Freshness (5 pts max)

Criteria: Freshness & maintenance

This group measures whether your content is current and maintained. It is a weak but constant signal: AI engines favor recent content.

PRIORITY: Ongoing — not a one-time project but a regular discipline
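To see how per-criterion results roll up into the three groups, here is a minimal sketch. The criterion keys and the sample scores are hypothetical. Only the grouping itself comes from the descriptions above, and the code sums earned points without enforcing any group cap.

```python
# Grouping as described above; the keys are illustrative names for the 8 criteria.
GROUPS = {
    "AI Readability": ["extractability", "crawlability", "structured_data"],
    "Credibility": ["verifiability", "authority", "neutrality", "external_presence"],
    "Freshness": ["freshness"],
}


def group_subtotals(scores):
    """Sum the earned points of each thematic group."""
    return {
        group: sum(scores[criterion] for criterion in criteria)
        for group, criteria in GROUPS.items()
    }


# Hypothetical audit result (earned points per criterion).
scores = {
    "extractability": 10,
    "crawlability": 9,
    "structured_data": 2,
    "verifiability": 6,
    "authority": 7,
    "neutrality": 5,
    "external_presence": 2,
    "freshness": 1,
}

for group, points in group_subtotals(scores).items():
    print(f"{group}: {points} pts")
```

With these sample values, AI Readability comes to 21 points, Credibility to 20, and Freshness to 1.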

Prioritize your actions by impact

Not all criteria carry the same weight, and not all fixes have the same effort-to-impact ratio. Here is how to prioritize:

Immediate-impact actions (this week)

Unblock AI crawlers: if your robots.txt blocks GPTBot or ClaudeBot, removing a few lines can move you from 0-3 points in crawlability to 8-10, and up to 13-15 once an llms.txt file is also in place. Effort: 15 minutes. Impact: up to +15 points.
Rewrite introductions on your key pages: add a direct answer in the first 100 words of your 5 main pages. Effort: 1-2 hours. Impact: up to +10 points in extractability.
Add Schema.org Organization markup: a single JSON-LD block on your homepage. Effort: 30 minutes. Impact: +2-3 points in structured data.

Medium-term impact actions (this month)

Source your claims — review your key pages and replace every vague assertion with evidence. Effort: half a day. Impact: up to +8 points in verifiability.
Complete your structured data — add Article on your editorial content and FAQPage on your FAQ pages. Effort: 2-3 hours. Impact: +4-6 points in structured data.
Create or complete your About page — photo, bio, background, expertise, contact info. Effort: 2 hours. Impact: +3-5 points in authority.

Long-term impact actions (this quarter)

Build your external presence — Reddit, industry press, guest posts. Effort: ongoing. Impact: +2-4 points in external presence, plus indirect effects on authority.
Adjust editorial tone — rewrite overly commercial content in a factual, expert tone. Effort: varies. Impact: +3-5 points in neutrality.

Track progress over time

A single score is useful for diagnostics. But tracking over time is what lets you steer a GEO strategy.

What to track

Overall score — is it increasing after each optimization wave?
Per-criterion scores — which criterion is progressing, which is stalling?
Score vs. competitors — analyze your competitors' sites on Detekia to benchmark yourself
Manual test results: redo the manual test on your top queries every month, covering the three categories (recommendation, expertise, brand)
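Comparing two audits criterion by criterion shows what is progressing and what is stalling. The sketch below assumes each audit is stored as a dictionary of earned points. The keys, the values, and the rule that "stalling" means no gain are illustrative choices.

```python
def score_deltas(previous, current):
    """Return the change in points for every criterion present in both audits."""
    return {c: current[c] - previous[c] for c in current if c in previous}


def stalled_criteria(previous, current):
    """Return the criteria that did not gain any points between the two audits."""
    deltas = score_deltas(previous, current)
    return sorted(c for c, delta in deltas.items() if delta <= 0)


# Hypothetical audits before and after an optimization wave.
previous = {"extractability": 10, "crawlability": 2, "structured_data": 2, "verifiability": 6}
current = {"extractability": 16, "crawlability": 13, "structured_data": 2, "verifiability": 6}

print(score_deltas(previous, current))
print("Stalling:", stalled_criteria(previous, current))
```

In this example, extractability and crawlability improve, while structured data and verifiability show no movement and become the next priorities.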

Recommended frequency

Technical audit: after each optimization wave, then monthly
Manual test: monthly on your top 5 queries
Editorial review: quarterly for content updates

Correlations to observe

After a technical optimization (robots.txt, Schema.org), check the manual test 2-4 weeks later. You should see an improvement in your presence in AI answers.

After a content optimization (extractability, verifiability), allow 4-8 weeks before seeing an impact on citations.

After external presence work (Reddit, press), allow 8-12 weeks. This is the slowest lever but also the most durable.

What the score does not measure

The GEO Score is an indicator of potential, not a guarantee of citation. A few important nuances:

The score does not predict exact queries. A good score means your site meets the technical and editorial conditions for being cited. But AI engines decide case by case for each query. A score of 85/100 does not mean you will be cited in 85% of answers.

The score does not measure topical relevance. If your site is perfectly optimized technically but your content does not address the topic a user is asking about, the AI will not cite you. GEO optimizes the conditions for citation, not content relevance.

The score evolves with the market. A score of 60/100 can be excellent in an industry where the average is 25, and insufficient in an industry where your competitors are at 75. Always interpret your score in context.

Frequently asked questions

Does a good GEO Score guarantee being cited by AI?

No. The score measures citation potential — the conditions your site meets for being selected as a source. AI engines remain unpredictable in their selection. But a site at 80/100 statistically has far better chances of being cited than a site at 30/100.

What score should I aim for?

Above 65/100, you are well ahead of the market (average score: 38/100), even though the "Excellent" band only starts at 75. Above 80/100, your site is in the top 5% for AI citation-worthiness. The goal is not 100/100 but to be significantly above your competitors.

Does the score change if I do nothing?

It can decline, yes. If your competitors optimize and you do not, your relative position degrades. And if your content ages without updates, the freshness criterion will naturally decrease.

Can I compare my score to my competitors'?

Yes. Run a Detekia audit on the sites of your 3-5 main competitors. Compare overall scores and per-criterion scores. This tells you exactly where you lag behind — and where you have an edge.

How do I go from 30 to 60/100?

The three most impactful actions: 1) unblock AI crawlers in robots.txt (+10-15 potential pts), 2) rewrite introductions to make them extractable (+8-12 pts), 3) add Organization + FAQPage structured data (+5-8 pts). In one focused week of work, a gain of 25-30 points is realistic.

Run your audit

The first step is knowing your current score. No guessing, no gut feeling — an objective measurement across 8 GEO criteria.

Analyze your site for free on Detekia — score out of 100, 8 detailed criteria, recommendations prioritized by impact. In under 60 seconds, no signup required.