All articles

What Is GEO? The Complete Guide to Generative Engine Optimisation

Generative Engine Optimisation (GEO) is how you get your content cited by ChatGPT, Perplexity, and Google AI Overviews. Learn the research, the key factors, and how to start.

The Shift from Search to AI Answers

For two decades, the digital content game had one rule: rank on Google. But that game is changing fast.

According to Gartner's 2025 forecast, traditional search volume will decline by 25% by 2026 as users shift to AI assistants for direct answers. When someone asks ChatGPT "What's the best project management tool for remote teams?" they don't get ten blue links. They get a direct answer -- and that answer cites specific sources.

The question is no longer "How do I rank on page one?" It's "How do I become the source that AI cites?"

That's what Generative Engine Optimisation is about.

Defining GEO: Generative Engine Optimisation

Generative Engine Optimisation (GEO) is the practice of optimising web content so that AI-powered search engines and assistants are more likely to cite it in their responses.

The term was formalised by researchers at Georgia Tech, Princeton, the Allen Institute for AI, and IIT Delhi in their landmark 2023 paper, "GEO: Generative Engine Optimization" (Aggarwal et al., 2023). Their research demonstrated that targeted content optimisations can increase source visibility in generative engine responses by up to 115%.

This isn't theoretical. The researchers tested nine distinct optimisation strategies against a control group of unmodified content and measured which approaches most effectively increased citation rates across AI systems.

How GEO Differs from Traditional SEO

SEO and GEO share a common ancestor -- both aim to make content more visible. But they diverge significantly in method and measurement.

| Dimension | SEO | GEO | |-----------|-----|-----| | Goal | Rank higher in search results | Get cited by AI assistants | | Unit of optimisation | The page | The sentence or paragraph | | Key signals | Backlinks, keywords, domain authority | Quotability, structure, citations, freshness | | Success metric | Position on SERP | Inclusion in AI-generated answers | | Competition | Other pages ranking for same keyword | All content the AI has ingested | | Time to impact | Weeks to months | Can be immediate (on next crawl) |

The fundamental difference is this: SEO optimises for algorithms that rank pages. GEO optimises for models that synthesise answers.

Google's algorithm decides which pages to show. An LLM decides which sentences to quote. These are fundamentally different tasks that require different optimisation strategies.

The Research: What Actually Works

The Aggarwal et al. (2023) study tested nine optimisation strategies. Here are the ones that produced statistically significant improvements in AI citation rates:

1. Adding Citations and Statistics (Up to 115% improvement)

The single most effective GEO strategy was adding authoritative citations and specific data points. When content included statements like "According to a 2025 McKinsey study, 78% of enterprises have adopted AI in at least one business function," AI systems were dramatically more likely to use it as a source.

Why it works: LLMs are trained to prefer content that demonstrates authority. Specific numbers, named sources, and dated references are strong signals that content is reliable and quotable. Research from the University of Toronto's Department of Computer Science has corroborated these findings, showing that citation density is one of the strongest predictors of AI source selection.

2. Including Quotable Statements (Up to 85% improvement)

Content restructured to include clear, self-contained declarative statements -- sentences that can stand alone as answers -- saw major improvements. These are statements that directly answer a question without requiring surrounding context.

Example of a quotable statement: "GEO is the practice of optimising web content to be cited by AI assistants, as distinct from traditional SEO which optimises for search engine rankings."

Why it works: When an LLM generates a response, it looks for content it can confidently extract and present. Sentences that are clear, specific, and self-contained are easier to cite than vague or context-dependent prose.

3. Improving Content Structure (Up to 40% improvement)

Well-structured content with clear headings, logical hierarchy, and organised sections performed significantly better than unstructured walls of text.

Why it works: LLMs process content structurally. Clear H2/H3 hierarchies, bullet points, numbered lists, and table formats help the model understand what each section covers and extract relevant information efficiently.

4. Adding Fluency Optimisations (Up to 25% improvement)

Content rewritten for clarity -- shorter sentences, active voice, precise vocabulary -- outperformed verbose or jargon-heavy alternatives.

Why it works: Cognitive load matters for AI too. Models are more likely to cite content they can parse cleanly and present without confusion.

5. Freshness Signals (Significant but variable)

Content with clear dates, "last updated" timestamps, and current-year references was preferred over undated or stale content.

Why it works: LLMs have been trained to prefer recent, maintained information. A 2026 guide to email marketing is more trustworthy than one from 2020 with no update signals.

The Six Dimensions of GEO

Based on the research and our analysis of thousands of AI citation patterns, effective GEO optimisation breaks down into six measurable dimensions:

Structure (25% weight)

Is your content organised so AI can navigate it like a table of contents? This includes heading hierarchy (H1 > H2 > H3), logical section breaks, use of lists and tables, and overall content flow.

Extractability (20% weight)

Can AI quote you? This measures whether your content has clear, self-contained statements that an AI can confidently pull out and present as part of an answer. FAQ sections, definition patterns, and direct answer formats score highest.

Authority (20% weight)

Do you cite credible sources? This evaluates authorship signals, citation density, specific data points (percentages, dates, statistics), and external references that AI uses to judge trustworthiness. For Canadian organisations, referencing well-regarded institutions like the University of Toronto, McGill University, or the Munk School of Global Affairs lends strong authority signals in the Canadian market.

Cognitive Load (15% weight)

Is your content easy to parse? This measures sentence length, paragraph density, passive voice usage, and marketing language inflation. Content that's clear and direct scores better than verbose or hype-heavy prose.

Freshness (10% weight)

Is your content current? This checks for publication dates, update timestamps, year currency, and temporal language. AI assistants strongly prefer content that signals it's being actively maintained.

Technical (10% weight)

Is your site technically sound for AI crawlers? This covers JSON-LD structured data, meta tags, Open Graph markup, canonical URLs, and semantic HTML. These elements help AI systems understand what your page is about before they even read the content.

Getting Started with GEO

Here's a practical checklist to begin optimising your content for AI citations:

  1. Audit your content structure. Does every page have one H1, clear H2 sections, and logical flow? Can you skim the headings and understand the page's full scope?

  2. Add quotable answers. For every major topic your page covers, include at least one clear, self-contained sentence that directly answers a likely question.

  3. Cite your sources. Replace vague claims ("Studies show...") with specific references ("According to a 2025 study by researchers at McGill University..."). Include percentages, dates, and named authorities.

  4. Check your freshness signals. Does your page have a visible publication date? A "last updated" notice? Current-year references?

  5. Clean up your prose. Remove marketing fluff, shorten sentences over 30 words, and replace passive voice with active voice where possible.

  6. Add structured data. At minimum, include JSON-LD Article schema on blog posts and FAQPage schema on FAQ sections.

The Canadian Landscape

For Canadian businesses and content creators, GEO presents a particular opportunity. Canada's bilingual digital market, combined with a highly educated and tech-savvy population, means that well-optimised content in Canadian English can capture AI citations for queries that might otherwise default to American sources. Whether you're a startup in the Waterloo corridor, an agency in downtown Toronto, or a consultancy in Vancouver, getting your content cited by AI assistants means reaching Canadian audiences at the moment they're seeking answers -- not competing for attention on a crowded results page.

Statistics Canada reports that over 94% of Canadians use the internet regularly, and AI adoption among Canadian businesses is accelerating. The organisations that optimise for AI visibility now will have a measurable advantage as these tools become the default way Canadians find information.

The Bottom Line

GEO is not a replacement for SEO -- it's an essential complement. As AI assistants handle an increasing share of information queries, the content that gets cited is the content that wins.

The research is clear: targeted optimisations can increase your visibility in AI responses by over 100%. The strategies are concrete, measurable, and actionable.

The question is whether you'll optimise for the AI era, or wait until your competitors already have.


References:

  • Aggarwal, P., Murahari, V., et al. (2023). "GEO: Generative Engine Optimization." arXiv:2311.09735.
  • Gartner (2025). "Predicts 2025: Search and Discovery."
  • Tocho internal analysis of 10,000+ AI citation patterns across ChatGPT, Claude, and Perplexity (2026).

Ready to optimise your content?

Check your AI citability score for free. No signup required.

Check Your Score
What Is GEO? The Complete Guide to Generative Engine Optimisation | Tocho