AEO Guide · Measurement

How to Track AI Visibility Across AI Engines

Victor Xu

Founder, SIGNALS · AI Visibility Intelligence

Updated June 2026

TL;DR

AI visibility is not a single score — it is share of voice across four engines (ChatGPT, Google AI Overview, Perplexity, Claude), measured per buyer query, and the four numbers rarely agree. Tracking it means running the questions your buyers actually ask, on each engine's default settings, multiple times, and recording which firms get named and how often. A single run is unreliable because the same query returns a partly different answer next time — which is exactly why share of voice is measured across repeated runs, not read from one response.

What "AI visibility" actually means

AI visibility is how often, and how prominently, an AI engine names your company when a buyer asks a question your company could answer. It is the AI-era equivalent of being on the shortlist — except the shortlist is now assembled by the engine before the buyer contacts anyone, and there is no second page to climb to.

The unit that makes it measurable is share of voice: across repeated runs of a given buyer query, the percentage of those runs in which an engine names your firm. A firm named in four of five runs has 80% share of voice on that query and engine; a firm named in zero has 0% — a position no competitor owns, and one the firm could claim.

Why your AI visibility is four numbers, not one

The four engines read from different indexes and weight sources differently, so a firm dominant on one is frequently absent from another. This is not noise — it is structural and stable across repeated runs. In a SIGNALS study of 27 buyer queries, no firm held a strong position across all four engines at once in 20 of them (SIGNALS, 2026). A firm can lead Google, Perplexity, and Claude and still be missing from ChatGPT entirely.

The consequence for measurement: a single blended "AI visibility score" hides the thing that matters. You need the number per engine, because the gap is almost always engine-specific — and the fix usually is too.

Step 1 — Build the query set buyers actually use

Start from the questions a buyer types when sourcing what you sell — not your product names, but the buyer's phrasing: "best [category] for [application]," "who builds [thing]," specialist and standard-specific variants. Specialist phrasings and generic phrasings return very different rosters, so include both to see where you can compete and where the category consolidates around large incumbents. This query set is the measurement surface; everything else is run against it.

Step 2 — Run each query on all four engines, on default settings

Run every query on ChatGPT, Google AI Overview, Perplexity, and Claude, using each engine's default consumer configuration — because that is what a real buyer experiences. Capture the full response, including the sources each engine cites. Note that public APIs do not reliably reproduce the consumer surfaces; the answer a buyer sees comes from the default interface, so that is what has to be measured.

Step 3 — Run each query multiple times

A single run is unreliable: the same query, run again, returns a partly different roster. Run each query several times per engine and measure share of voice as the percentage of runs in which a firm is named. This is what turns a noisy one-off answer into a stable, comparable number — and it is why credible AI visibility measurement is repeated, not snapshot.

Step 4 — Record share of voice and parse the citations

For each query and engine, record which firms were named and how often (their share of voice), and capture which sources the engine cited. The citation source matters: across SIGNALS' industrial studies, 73–89% of citations pointed to firms' own pages rather than directories or listicles (SIGNALS, 2026) — which tells you the lever for improving the number is the firms' own content, not third-party placement.

Step 5 — Track movement over time against named competitors

A number is only useful against a baseline and a competitor set. Establish where you stand today across the four engines, identify the queries where a competitor is named and you are not, then re-measure after you change anything. AI citation takes two to six weeks to shift after structural changes, so monthly re-measurement during active work shows what is actually moving — and which engine recovered.

How SIGNALS tracks this

This is what PULSE measures. It runs your buyer queries across ChatGPT, Google AI Overview, Perplexity, and Claude on default settings, captures the full responses, and reports your share of voice query by query and engine by engine — against the competitors winning your category. It is built for exactly the behavior above: four numbers, measured across repeated runs, not one blended score from a single pass.

Frequently asked questions

What is AI share of voice?

The percentage of runs of a given buyer query in which an AI engine names your company. Measured per query and per engine across repeated runs, it quantifies how often you appear in the answers buyers actually receive.

Can I track AI visibility with one tool or score?

You can report it in one place, but it cannot honestly be one number. Visibility differs by engine because each reads from a different index — so a single blended score hides the engine-specific gaps that are the whole point of measuring. Track four numbers, one per engine.

Why does the same AI query give different answers each time?

AI engines introduce variation between runs, so a single query returns a partly different roster each time. This is why share of voice is measured across multiple runs rather than read from one response — repetition is what makes the number reliable.

Can I use the AI engines' APIs to track visibility?

APIs do not reliably reproduce the default consumer surfaces buyers actually see, so they are a poor proxy for real visibility. Measurement has to reflect the default interface a buyer uses.

How often should I re-measure AI visibility?

Monthly during active optimization, since AI citation takes two to six weeks to shift after changes; quarterly once stable. Always re-measure immediately after structural changes to confirm they registered.

How do I get a baseline measurement of where I stand?

Request a free PULSE visibility assessment. It establishes your share of voice across all four engines, query by query, against your named competitors — the baseline everything else is measured against.

Related guides

What is AI visibility? — the state this measures (vs. the practice that improves it).
How to get cited by ChatGPT — now fix what the measurement reveals.
How to appear in Google AI Overviews — the engine-specific fix.
The 27-query deep cut — share-of-voice measurement worked end to end.
The research methodology behind share of voice, and the 7 SIGNALS dimensions.