What is XML Sitemap?
An XML sitemap is a file that lists all the URLs on your website that you want search engines to crawl and index. It acts as a roadmap for Googlebot, telling it which pages exist, when they were last updated, and how important they are relative to each other. Submitting a sitemap to Google Search Console speeds up indexation and helps Google discover pages it might otherwise miss.
- Only include canonical, indexable URLs in your sitemap — no noindex pages, no redirects.
- Submit your sitemap to Google Search Console on day one of going live.
- Keep your sitemap accurate — stale or broken sitemaps can confuse Googlebot.
- Large sites should use sitemap index files to split into multiple sitemaps (50,000 URL limit per file).
- Next.js and most CMSs can auto-generate sitemaps — don't maintain them manually.
What Goes in an XML Sitemap?
A sitemap is an XML file listing your URLs with optional metadata: last modification date (lastmod), change frequency (changefreq), and priority (0.0–1.0).
Only include pages you want indexed. Exclude: noindex pages, redirect URLs, paginated duplicates (keep only canonical), and admin or login pages.
The priority attribute is relative — Google mostly ignores changefreq. The lastmod date is the most useful signal: it tells Google a page was updated and is worth recrawling.
How to Submit Your Sitemap to Google
In Google Search Console, go to Sitemaps in the left nav. Enter your sitemap URL (usually /sitemap.xml or /sitemap_index.xml) and click Submit.
Google will show you how many URLs were submitted vs how many were indexed. If the numbers diverge significantly, investigate why — it usually means you have noindex tags on some URLs, canonical issues, or duplicate content.
Resubmit whenever you make significant structural changes to your site.
Most guides are already outdated.
One email a week. The search stuff that actually matters — what shifted, what died, and what to do about it.
Subscribe free →| XML Sitemap | HTML Sitemap |
|---|---|
| Intended audience: search engine crawlers | Intended audience: human visitors |
| Machine-readable XML format | Human-readable HTML page |
| Submitted directly to Google Search Console | Linked from site navigation or footer |
| Includes metadata: lastmod, priority, changefreq | No metadata — just hyperlinks |
| Not visible or useful to regular site visitors | Helps users navigate large sites |
| Critical for crawl efficiency on large sites | Modest SEO benefit; mainly UX-focused |
How does your site score on SEO?
Paste your URL. Get a score and a fix list across all three disciplines. No form, no email.
Run Free Audit →Frequently Asked Questions
Not directly. A sitemap helps Google find and index your pages faster, but it doesn't influence ranking signals. The SEO value is in faster indexation — especially for new content and large sites where discovery via crawling alone would be slow.
An XML sitemap is a machine-readable file for search engines. An HTML sitemap is a human-readable page linking to your site structure — it's less common now but can help users navigate large sites. For SEO, focus on the XML sitemap.
- 1.Google Search Central — Sitemaps documentation
- 2.Ahrefs Blog — Sitemap Best Practices
Read next
robots.txt
A robots.txt file is a plain text file at the root of your domain that instructs web crawlers which pages or s…
Canonical Tags
A canonical tag (rel="canonical") is an HTML element that tells search engines which version of a URL is the '…
Crawl Budget
Crawl budget is the number of pages Googlebot will crawl on your website within a given timeframe. It's determ…