AEO

How to get cited by Perplexity AI.

Perplexity AI is one of the fastest-growing search products in the world, with hundreds of millions of queries processed each month. Unlike ChatGPT — which blends training data with optional retrieval — Perplexity is built as a real-time answer engine. Every response is grounded in live web search. That makes Perplexity different from other AI systems in a critical way: your optimization efforts have immediate impact.

If you get cited by Perplexity, your page is shown directly to users as a numbered source alongside the generated answer. It is visible, attributable, and — unlike a buried Google result — seen by everyone who reads the answer. Here is how to become one of those sources.

How PerplexityBot works

Perplexity's crawler is called PerplexityBot. It crawls the web continuously to build its index, and it also performs live fetches at query time to retrieve current information. This dual approach means your content needs to be both indexable in advance and immediately fetchable when a relevant query arrives.

Like most AI crawlers, PerplexityBot does not execute JavaScript. It sends an HTTP request, receives the raw HTML response, and extracts text from that response. A page that requires JavaScript to display its content will appear empty or nearly empty to PerplexityBot. This is the single most common reason sites fail to get cited despite having excellent content.

PerplexityBot identifies itself in the User-Agent header. Check your robots.txt to confirm it is not blocked — many sites inadvertently block AI crawlers with blanket disallow rules written for other purposes.

What makes a page citable

Freshness and accuracy

Perplexity places a strong premium on current, accurate information. Answers are generated in real time, and stale or factually wrong content gets surfaced alongside its recency metadata. Pages with clear publication dates, recent updates, and accurate facts are favored over evergreen content that has not been touched in years. Add a visible dateModified to your schema markup and keep key pages updated.

Direct, self-contained answers

Perplexity's interface shows a generated answer with numbered citations alongside it. For your page to earn one of those citation slots, it needs to contain a direct, self-contained answer to the query. Content that requires five pages of context before reaching the point does not get cited. Content that leads with a clear answer in the first 100 words does.

Source authority signals

Perplexity's ranking logic incorporates source authority, similar to traditional SEO. Pages from recognized publications, established brands, and frequently cited domains earn higher trust. If your domain is new or thin on content, building broader web presence — through guest publications, PR, and consistent content — improves your authority over time.

Formatting for citations

The way you structure your content directly affects the likelihood of citation. Perplexity pulls excerpts to display as evidence for its answers. Those excerpts need to be clean, coherent sentences that work out of context.

  • Write quotable sentences. Each paragraph should contain at least one sentence that is a complete, accurate statement of fact. Avoid compound sentences that only make sense in the context of the surrounding paragraphs.
  • Use clear section headings. H2 and H3 headings that match natural question phrasing help Perplexity match page sections to specific queries. "What is churn rate?" is a better heading than "Understanding the Basics."
  • Keep paragraphs short. Short paragraphs are easier to extract and display as citation evidence. Three to five sentences per paragraph is ideal.
  • Avoid tables for key facts. Table data is harder to extract and summarize. If a table contains critical information, mirror it in prose form as well.

How Perplexity picks sources for a given query

When a query arrives, Perplexity runs a web search and retrieves the top results. It then fetches the content from those pages and uses its model to synthesize an answer, selecting the most relevant excerpts as citations. The page selection happens in two stages:

  • Stage 1 — search ranking. Perplexity's underlying search layer uses signals similar to traditional search ranking — relevance, authority, freshness. Pages that rank well for the query terms are candidates for retrieval.
  • Stage 2 — content extraction quality. Among retrieved pages, those with clean, extractable content perform better than those with rendering issues, heavy boilerplate, or thin text. The model is selecting the most useful excerpt it can find — give it something worth selecting.

Practical page optimization tips

  • Serve pre-rendered HTML. Use server-side rendering or static generation for all public content pages. PerplexityBot cannot execute JavaScript.
  • Allow PerplexityBot in robots.txt. Add User-agent: PerplexityBot / Allow: / explicitly, or verify your current rules do not block it.
  • Add Article schema with datePublished and dateModified. These signals help Perplexity assess freshness.
  • Lead with your answer. Put the most important, directly quotable statement in the first paragraph of your page and the first sentence of each section.
  • Include specific data. Statistics, named examples, and precise figures make your content more citable than vague generalizations.
  • Build external mentions. Being cited in third-party articles, news pieces, and industry publications builds the authority signals that affect Perplexity's underlying search ranking.
  • Test your pages as PerplexityBot sees them. Fetch your URLs with curl -A "PerplexityBot" and verify the content you want cited is visible in the raw HTML response.

Perplexity is one of the most actionable AI visibility opportunities available right now because its citation pipeline is fully grounded in live web retrieval. Fix your rendering, structure your answers, and keep your content current — and the results are measurable within days, not months.

See exactly what PerplexityBot fetches from your site right now.