How to use the XML Sitemap Generator
A sitemap.xml is your formal list of URLs you want indexed. It's the single most reliable way to surface URLs to Google — sitemap submissions in Search Console get crawled within hours, even on low-authority domains.
Paste your URL list
One URL per line. Include only canonical URLs you actually want indexed — no UTM-tagged links, no noindex pages, no redirects.
Set lastmod dates (optional)
lastmod tells Google when each URL was last meaningfully updated. Honest dates improve crawl prioritization; faking lastmod by setting it to today on every URL hurts trust.
Skip changefreq and priority
Google has publicly stated it ignores both directives. Including them is harmless but adds noise. Better to skip and keep the sitemap clean.
Save as sitemap.xml at your domain root
Upload to https://yoursite.com/sitemap.xml. Add Sitemap: https://yoursite.com/sitemap.xml to your robots.txt. Submit in Google Search Console for fastest crawl.
Why sitemap.xml is your fastest path to indexation
On a low-authority domain, the sitemap is the single most important crawl signal. Google's crawlers read your sitemap before they decide what to fetch — every URL you list raises its odds of getting crawled and indexed.
What belongs in a sitemap
- Canonical URLs only — never list non-canonical or redirected URLs.
- Indexable pages only — no noindex, no auth-walled, no 404s.
- 200 OK pages — every URL must return HTTP 200; redirected URLs hurt crawl trust.
- HTTPS pages — never mix HTTP and HTTPS in the same sitemap.
- Pages on the same domain as the sitemap (cross-domain sitemaps are valid but require Search Console verification of both).
What to skip from the sitemap
- Pages you've blocked in robots.txt — Google can't crawl them anyway.
- Pages with noindex meta tags.
- Duplicate URLs (Google dedupes automatically but it's noise).
- Low-value pages — search results, faceted nav, login pages, terms/privacy pages.
- Image and video URLs (unless using image-sitemap or video-sitemap extensions).
Sitemap size limits
A single sitemap.xml can hold up to 50,000 URLs or 50 MB uncompressed. For sites larger than that, use a sitemap index — a parent sitemap that lists multiple child sitemaps. Most CMS platforms auto-generate this structure (e.g., sitemap_index.xml linking to post-sitemap.xml, page-sitemap.xml, etc.).
Frequently asked questions
Do I need a sitemap.xml?
Yes — especially on a low-authority or new domain. Google will eventually find your URLs through internal links and external backlinks, but a sitemap is dramatically faster. Sitemap submission in Search Console typically gets URLs crawled within 24–72 hours, versus weeks via organic discovery.
Where should I put my sitemap?
Convention is at the root of your domain: https://yoursite.com/sitemap.xml. Reference it in your robots.txt with a Sitemap: directive. Submit it in Google Search Console under Sitemaps. Bing Webmaster Tools also supports sitemap submission.
How often should I update my sitemap?
Whenever you add, remove, or significantly update a page. Most sites auto-generate the sitemap on every CMS change. Static-site builders rebuild the sitemap on every deploy. The lastmod date should reflect the actual last meaningful update; faking it hurts crawl trust.
Should I include lastmod, changefreq, and priority?
Include lastmod with honest dates — Google uses it for crawl prioritization. Skip changefreq and priority — Google has publicly stated it ignores both. Including them isn't harmful but adds bytes for no benefit.
What's the maximum size for a sitemap?
50,000 URLs or 50 MB uncompressed (whichever comes first). For larger sites, use a sitemap index file that references multiple smaller sitemaps. Most CMSes auto-split sitemaps when a site grows past the limits.