The Content Optimization Glossary is a collection of terms used in 📝search engine optimization and 📝generative engine optimization — the practices of preparing content to be found and cited by search engines and AI answer assistants.
Glossary
A
- AI Overviews — Google's generative-answer feature appearing above traditional search results, synthesizing source citations into a direct response.
- Anchor text — The visible clickable text in a hyperlink, used by search engines as a relevance signal for the linked page.
- Anonymized queries — Search terms Google Search Console suppresses for privacy when query volume falls below a threshold, often accounting for 20-50% of total impression volume on a property.
- 📝Answer Engine Optimization (AEO) — A digital marketing strategy focused on optimizing content so it is easily understood, extracted, and cited by AI-powered answer engines.
B
- Brand citation — A mention of a brand name in third-party content that AI assistants weight when generating answers and source attributions.
C
- 📝canonical link — The preferred URL for a piece of content when duplicate or variant URLs exist, declared via the
rel="canonical"tag. - Click-through rate (CTR) — The ratio of clicks to impressions for a query or page, expressed as a percentage.
- Content cluster — A set of interlinked pages organized around a central pillar topic to build topical authority.
- Core Web Vitals — Google's page-experience signals measuring loading (LCP), interactivity (INP), and visual stability (CLS).
- Crawl budget — The number of URLs a search engine bot will crawl on a site within a given timeframe, allocated based on site authority and freshness.
- CTR ceiling — A diagnostic class where a page ranks at position 3 or better but underperforms expected click-through rate for that position, indicating a title or snippet problem rather than a ranking problem.
D
- Domain authority — A third-party score predicting a domain's ability to rank in search results, originating with Moz and adopted by Ahrefs and others; not a metric Google itself uses.
E
- E-E-A-T — Google's content quality framework: Experience, Expertise, Authoritativeness, and Trustworthiness.
- Entity — A distinct thing (person, place, concept) that search engines recognize as a node in their knowledge graph and treat as a unit of meaning.
F
- Featured snippet — An extracted answer Google displays above the traditional results, drawn from a high-ranking page deemed to directly answer the query.
G
- 📝Generative Engine Optimization (GEO) — A paradigm for improving content visibility within generative search engine responses powered by LLMs.
- 📝Google Search Console — Google's free dashboard reporting how a verified site property performs in Google Search, including queries, pages, impressions, clicks, and position.
I
- Impressions — The count of times a URL appeared in search results for a query during the measurement window.
K
- Keyword cannibalization — A situation where multiple pages on the same site compete for the same query, splitting ranking signal and depressing each page's individual rank.
- Knowledge graph — A search engine's structured database of entities and the relationships between them, used to disambiguate queries and surface direct answers.
L
- 📝LLMs.txt — A proposed file standard analogous to robots.txt for declaring how LLM crawlers should access and prioritize a site's content.
- 📝Long-tail keyword — A search query with low volume but typically high specificity and conversion intent, often a multi-word phrase.
P
- Phantom page — A URL appearing in GSC reports that does not actually exist on the live site, often a stale-indexed URL, redirect target, or variant artifact.
- Pillar page — The central, comprehensive page in a content cluster, linked outward to every cluster page and inward from each.
- Position — The average ranking of a URL in search results for a query over a time period, with position 1 being the top result.
Q
- Query intent — The goal behind a search query, classified as informational, navigational, transactional, or commercial investigation.
- Query-to-page mismatch — A situation where a search query ranks a non-canonical or unintended page on a site, indicating an internal-link or content-clarity problem.
R
- 📝Retrieval-Augmented Generation (RAG) — An AI architecture that retrieves relevant documents from a corpus before generating an answer, grounding model responses in fetched content rather than parametric memory alone.
- Robots.txt — A text file at the root of a site declaring which paths web crawlers may or may not access, by user-agent.
S
- Schema markup — Structured data (typically JSON-LD) embedded in a page to help search engines and AI assistants understand its content type and entity attributes.
- 📝Search Engine Optimization (SEO) — The process of improving website visibility and ranking in organic search results through keyword research, link building, and technical optimization.
- 📝Search Engine Results Pages (SERPs) — Search Engine Results Page, the page a search engine returns for a query, including organic results, ads, featured snippets, and AI overviews.
- SGE (Search Generative Experience) — Google's framework for AI-generated answers inside Search, rebranded and rolled out broadly as AI Overviews.
- 📝Sitemap — An XML file at the site root listing canonical URLs to help crawlers discover and prioritize content.
- 📝Striking Distance — A class of search queries where a site already ranks in positions 4-20 with meaningful impressions, making them the highest-leverage ranking-improvement opportunities.
T
- TF-IDF — Term Frequency-Inverse Document Frequency, a statistical measure of how distinctive a word is to a document within a corpus, foundational to lexical retrieval.
