Technical

Structured data for AI visibility: the complete guide.

Structured data — machine-readable markup added to web pages using the schema.org vocabulary — has long been associated with Google. Rich results, featured snippets, knowledge panels: these are all powered by schema markup, and SEOs have used them for years to gain visibility in traditional search.

But structured data is now more important than ever, for a different reason: AI systems use it to understand your content without needing to parse your prose. When ChatGPT, Perplexity, Gemini, or Claude retrieves your page, structured data gives them a pre-interpreted summary of who you are, what the page is about, and how your content fits into broader knowledge relationships. It is the clearest signal you can send to any automated system reading your site.

Why structured data matters for AI (beyond Google)

AI language models are pattern-matching systems. They are very good at reading and understanding natural language prose, but they are even better at processing structured, labeled data. When you add JSON-LD to a page, you are essentially pre-answering the questions an AI system would otherwise have to infer:

  • Who published this content?
  • What is this page about?
  • When was it written?
  • What questions does it answer?
  • Is this a product, a company, an article, or something else?

The AI doesn't have to guess — you've told it. This reduces ambiguity, improves attribution accuracy, and makes your content more citable. It is the difference between an AI confidently quoting you by name versus paraphrasing your content without attribution.

Schema types that matter for AI visibility

Organization

Every site should have an Organization schema in the sitewide header or homepage. This declares your entity to AI systems: your legal name, URL, logo, description, and social profiles. Without it, AI systems have to infer your identity from context — and they may get it wrong, conflate you with a competitor, or fail to attribute content to you correctly.

Example:

{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "Appear",
  "url": "https://appear.so",
  "logo": "https://appear.so/logo.png",
  "description": "AI visibility monitoring for modern businesses.",
  "sameAs": [
    "https://twitter.com/appear_so",
    "https://linkedin.com/company/appear-so"
  ]
}

Article

Every blog post, guide, and long-form page should have Article (or BlogPosting) schema. At minimum, include headline, description, author, datePublished, dateModified, and publisher. The dateModified field is particularly important for AI systems that factor freshness into their source selection.

FAQPage

FAQPage schema is one of the highest-impact types for AI visibility. It pre-extracts question-and-answer pairs in machine-readable form, giving AI systems perfectly formatted answer units to cite. If your page answers five common questions, mark them all up. The AI can reference individual Q&A pairs without needing to parse your full prose.

Use FAQPage on: product pages with common questions, support documentation, pricing pages ("Is there a free trial?"), and any page that has a natural Q&A structure.

HowTo

HowTo schema marks up step-by-step processes with named steps, descriptions, and optionally images and time estimates. AI systems love this schema because it maps directly to "how do I..." queries, which are among the most common AI search patterns. If your content walks users through a process, mark it up with HowTo.

Product

If you sell something, Product schema is essential. It declares your product name, description, price, availability, and reviews. AI systems increasingly answer product comparison queries by synthesizing product data from multiple sources. Without Product schema, your product's attributes are inferred from unstructured text — and inferences are less reliable than explicit declarations.

SoftwareApplication

SaaS companies and app developers should use SoftwareApplication schema on their product and feature pages. It captures applicationCategory, operatingSystem, offers, and aggregateRating. When users ask AI systems to recommend tools in a category, well-marked-up SoftwareApplication pages give AI systems the structured data they need to accurately represent your product.

Implementation best practices

  • Use JSON-LD, not Microdata. JSON-LD is a script tag in the document head. It is cleaner, easier to maintain, and the format all major AI systems prefer. Avoid inline Microdata attributes.
  • Place JSON-LD in the document head. This ensures AI crawlers find it even if body content parsing is incomplete.
  • Do not mark up content that is not on the page. Schema that describes content absent from the visible page is considered spam and can undermine trust in your markup. Keep schema and page content in sync.
  • Use the most specific type available. BlogPosting is more precise than Article, which is more precise than CreativeWork. Specificity helps AI systems understand your content better.
  • Test with Google's Rich Results Test. It catches syntax errors and validates your markup, even though the goal is AI visibility beyond Google.

Common mistakes

  • No schema at all. Still the most common issue. Many sites have zero JSON-LD markup. Start with Organization on the homepage today.
  • Homepage-only schema. Structured data on the homepage alone is not enough. Each page — blog post, product page, FAQ — should have its own page-specific schema.
  • Missing dateModified. Publishing a date but never updating it makes your content look stale to AI systems even if you've kept it current.
  • Schema injected by JavaScript only. If your schema is added by a client-side script, AI crawlers that do not execute JavaScript will not see it. Embed JSON-LD in the server-rendered HTML.

Structured data is one of the highest-leverage investments you can make in AI visibility. It requires a one-time implementation effort and delivers ongoing dividends as AI systems grow in usage. Start with the types most relevant to your business and expand from there.

Check whether your schema markup is visible to AI crawlers — not just Google.