glossary

Plain-language definitions for SEO and AI-search terminology.

A

Anchor text
The clickable text of a hyperlink. Used by search engines as a relevance signal for the linked page; over-optimised exact-match anchors can trigger penalties.
AI Overview
A generative AI answer shown at the top of Google's search results, synthesised from multiple sources with citations. Often produces zero-click outcomes for informational queries.
Algorithm update
A change to how a search engine ranks results. Google deploys broad core updates several times per year; smaller updates are continuous.
Alt text
The text alternative for an image, used by screen readers and search engines. Should describe the image accurately and concisely.
AS (Authority Score)
Semrush's proprietary score (0-100) estimating a domain's overall SEO authority, combining link, traffic, and quality signals.

B

Backlink
A hyperlink from another website pointing to yours. One of the strongest ranking signals; quality matters more than quantity.
Black hat SEO
Tactics that violate search engine guidelines: cloaking, sneaky redirects, automated content generation, link schemes. Carries penalty risk.
Bounce rate
The percentage of visits where the user leaves without further interaction. A weak SEO signal in isolation; context dependent.
Breadcrumbs
A navigation aid showing the user's location within a site hierarchy. Often marked up with BreadcrumbList schema for SERP display.

C

Canonical tag
An HTML element (`<link rel="canonical">`) declaring the preferred URL when multiple URLs serve substantially the same content.
Citation (local)
An online mention of a business including its name, address, and phone number (NAP). Important for local pack ranking.
CLS (Cumulative Layout Shift)
A Core Web Vitals metric measuring unexpected layout shifts during page load. Good: under 0.1.
Cluster (content cluster)
A group of pages covering related sub-topics, linked back to a central pillar page. Builds topical authority.
Core Web Vitals
Google's set of three user experience metrics used as ranking signals: LCP (load speed), INP (interactivity), and CLS (visual stability). Introduced as a ranking factor in 2021.
Crawl budget
The number of URLs a search engine crawler will fetch on a site within a given period. A constraint primarily for very large sites.
Crawlability
The degree to which search engine bots can access and fetch the pages of a site.
CTR (Click-Through Rate)
The percentage of impressions that result in a click. A primary measure of SERP performance.

D

DA (Domain Authority)
Moz's proprietary score (0-100) estimating a domain's link-based authority. Not used by Google directly.
Disavow file
A file uploaded via Search Console listing backlinks Google should ignore. Useful for manual action recovery; widely overused otherwise.
Dofollow
A link without rel="nofollow". Passes link equity normally. The default state of any link.
DR (Domain Rating)
Ahrefs' proprietary score (0-100) for a domain's link profile strength. Logarithmic; not used by Google directly.
Duplicate content
Substantially identical or very similar content appearing at multiple URLs. Dilutes ranking signals across pages; resolved with canonical tags, redirects, or consolidation.

E

E-E-A-T
Experience, Expertise, Authoritativeness, Trustworthiness. Google's quality framework, particularly stringent for YMYL content.
Entity SEO
Optimising for search engines and AI systems that reason about real-world entities (people, organisations, products) rather than keywords.

F

Featured snippet
A search result format where Google extracts and displays a passage from a ranking page directly in the SERP, above the standard organic results.
FID (First Input Delay)
A former Core Web Vital measuring responsiveness to the first user interaction. Replaced by INP in March 2024.

G

GBP (Google Business Profile)
Google's free local business listing, the primary asset for local pack and Google Maps visibility. Formerly Google My Business.
GEO (Generative Engine Optimisation)
The practice of structuring content for retrieval and citation by AI answer engines. Overlaps with traditional SEO; emphasises entity clarity and direct-answer formats.
Googlebot
Google's web crawler. Operates in two-pass indexing: initial HTML processing, then JavaScript rendering.
Google-Extended
Google's user agent for AI training data collection, separate from Googlebot. Can be blocked in robots.txt without affecting search indexing.
GSC (Google Search Console)
Google's free webmaster tool providing data on indexing status, search performance, crawl errors, and manual actions. The primary diagnostic interface between site owners and Google.

H

Heading tags
HTML elements (H1 through H6) marking the structural hierarchy of page content. The H1 signals the primary topic; lower levels organise sub-sections. A modest on-page ranking signal.
Hreflang
An HTML attribute (or sitemap element) declaring the language and regional targeting of a page. Manages multilingual and multi-regional sites.
HSTS (HTTP Strict Transport Security)
An HTTP header that forces browsers to use HTTPS for the domain. Strong commitment; cannot be quickly reversed.

I

Indexing
The process by which a search engine stores and organises crawled pages, making them eligible to appear in search results.
INP (Interaction to Next Paint)
A Core Web Vitals metric measuring overall page responsiveness to user interactions. Good: under 200ms.
Internal link
A hyperlink from one page on a site to another page on the same site. Distributes PageRank and signals topical structure.

J

JavaScript SEO
The discipline of ensuring search engines can crawl, render, and index content delivered via JavaScript.

K

KD (Keyword Difficulty)
A score (0-100) estimating how hard it is to rank for a query, typically based on the link profiles of currently ranking pages.
Keyword stuffing
Repetitive, unnatural use of target keywords in content. Damages readability; flagged by quality systems.
Knowledge graph
A database of entities and their relationships used by Google (and AI systems) to answer entity-based queries.
Knowledge panel
An information box about an entity (person, organisation, place) shown alongside search results, drawn from Google's knowledge graph.

L

LCP (Largest Contentful Paint)
A Core Web Vitals metric measuring perceived load speed via the largest visible element. Good: under 2.5 seconds.
Link equity (PageRank)
The authority a page passes through its outbound links. The basis of Google's original ranking algorithm; still influential.
Link velocity
The rate at which a site acquires new backlinks over time. Sudden spikes can appear unnatural and attract algorithmic or manual scrutiny.
llms.txt
A proposed Markdown file at the root of a domain providing an AI-readable index of a site's most important content. Not yet a formal standard.
Log file analysis
The process of parsing server access logs to understand exactly which URLs search engine bots crawled, at what frequency, and which were skipped. More reliable than crawl simulation tools.
LocalBusiness schema
Schema.org structured data declaring a business's local entity properties: name, address, phone, hours, services.
Local pack
The map and three business listings shown at the top of search results for queries with local intent.
Long-tail keyword
A specific, often longer search query with low individual volume but typically clearer intent and lower competition.

M

Manual action
A penalty issued by Google's spam team after human review. Reported in Search Console; requires reconsideration request to lift.
Meta description
An HTML attribute summarising a page for search engines and link previews. Not a direct ranking factor; influences CTR.
Mobile-first indexing
Google's policy of using the mobile version of a site as the primary version for indexing and ranking. Standard since 2019.
Mojibake
Text corrupted by character encoding mismatches, typically appearing as garbled symbols (e.g. â€" instead of an em dash).

N

NAP
Name, Address, Phone. The core business information that must be consistent across local citations for local SEO.
Noindex
A meta tag (or HTTP header) instructing search engines not to index a page. Pages must be crawlable for the directive to be seen.
Nofollow
A link attribute (`rel="nofollow"`) telling search engines not to pass link equity through the link. Used for sponsored, user-generated, or untrusted links.

O

Off-page SEO
Optimisation activities that happen outside a site: link building, digital PR, brand mentions, citations.
On-page SEO
Optimisation of elements on individual pages: title tags, headings, content, internal linking, schema.
Open Graph
Meta tags (og:title, og:description, og:image) controlling how a URL appears when shared on social media and messaging platforms.
Organic search
Search results not paid for by advertising. Organic traffic is the audience SEO is built around.
Orphan page
A page with no internal links pointing to it. Difficult for crawlers to discover and receives no internal link equity.
Outbound link
A hyperlink from a page on your site to a page on a different domain. Passes link equity if dofollow, and signals topical associations to search engines.

P

PageRank
Google's original algorithm for valuing pages based on the quantity and quality of inbound links. Still influential, though heavily evolved.
Pagination
Splitting long content or lists across multiple pages. Affects crawl, indexing, and how authority flows through a site.
Passage indexing
Google's ability to index and rank individual passages within a page independently of the page's overall topic, allowing relevant sections to surface for specific queries.
PBN (Private Blog Network)
A network of sites created specifically to manipulate link equity. Violates Google's guidelines.
Pillar page
A comprehensive overview page on a topic, linking to deeper cluster pages on specific sub-topics.

Q

Query
The words or phrase a user types into a search engine. Distinct from 'keyword' in that a query is what the user typed; a keyword is what the SEO targets.

R

Redirect (301)
An HTTP status code indicating a permanent URL change. Transfers ranking signals to the destination URL.
Redirect (302)
An HTTP status code indicating a temporary URL change. Does not transfer ranking signals.
Redirect chain
A sequence of redirects (URL A → URL B → URL C). Reduces signal transfer and adds latency. Best avoided.
Rendering
The process of executing JavaScript and producing the final DOM. Googlebot renders pages in a separate pass after initial HTML processing.
Rich result
An enhanced search result with visual elements (star ratings, FAQ accordions, recipe cards, product information) driven by structured data.
Robots.txt
A plain text file at the root of a domain declaring which crawlers can access which paths. A request, not enforcement.

S

Schema markup
Structured data using the schema.org vocabulary, declaring the meaning of page content explicitly to search engines and AI systems.
Search intent
The underlying goal behind a query: informational, navigational, commercial investigation, or transactional.
Search volume
The estimated number of times a keyword is searched per month within a given market. A primary input in keyword research alongside keyword difficulty.
SEO (Search Engine Optimisation)
The practice of improving a site's visibility in unpaid search results.
SERP (Search Engine Results Page)
The page returned in response to a search query, including organic results, ads, and SERP features.
Sitemap
An XML file listing the URLs of a site, providing search engines with a catalogue for crawling.
Sitelinks
Additional links to sub-pages shown beneath a brand's main search result. Algorithmically generated; not directly controllable but influenced by site structure and internal linking.
Soft 404
A page returning a 200 OK status code but containing content that signals it should be a 404. Confuses indexing.
Structured data
Machine-readable metadata embedded in a webpage. Most commonly delivered as JSON-LD using schema.org vocabulary.

T

Thin content
Pages with little or no unique value: doorway pages, scraped content, auto-generated text, or pages with minimal original material. A quality signal targeted since the Panda era.
Title tag
The HTML element (`<title>`) defining a page's title. Appears in browser tabs, SERPs, and link previews. One of the most influential on-page ranking factors.
Topic cluster
A group of related pages organised around a pillar page, used to build topical authority.
Topical authority
A measure of how comprehensively and credibly a site covers a subject area, built through depth of content and internal linking structure. Influences rankings across a topic regardless of individual page signals.

U

UGC (rel=ugc)
A link attribute (`rel="ugc"`) indicating the link was created by a user, such as in comments or forum posts. Introduced alongside rel=sponsored in 2019 as a refinement of nofollow.
URL (Uniform Resource Locator)
The address of a webpage. Clean, descriptive URLs reinforce topical relevance and improve usability.

V

Viewport meta tag
An HTML meta tag (`<meta name="viewport">`) controlling how a page scales on mobile devices. Essential for passing mobile-friendliness signals.
Voice search
Searches performed by speaking to a device rather than typing. Tends towards conversational, longer-tail, and question-format queries.

W

WebP
An image format with better compression than JPEG and PNG. Recommended for most web image use cases.
White hat SEO
SEO tactics that comply with search engine guidelines. Sustainable; carries no penalty risk.

X

XML sitemap
An XML-formatted sitemap providing search engines with a list of canonical URLs to crawl. See also: Sitemap.

Y

YMYL (Your Money or Your Life)
Content categories Google scrutinises with stricter E-E-A-T standards: medical, financial, legal, safety, civic.

Z

Zero-click search
A search where the user finds the answer in the SERP itself (via featured snippet, knowledge panel, or AI Overview) without clicking through.