Critical: This Hint requires immediate attention, as the issue may have a serious impact upon crawling, indexing or ranking.

Issue: This Hint represents an error or problem that needs to be fixed.

Not Found (4XX) URL in XML Sitemaps

This means that the URL in question returns an HTTP status of 4XX, yet is included in an XML Sitemap.

Why is this important?

Your XML Sitemap should only contain URLs you wish for search engines to index. URLs in your sitemaps should be clean - i.e. sitemaps should only include URLs that are HTTP status 200 (OK), indexable, canonical and unique.

If search engines find 'dirt' in sitemaps, such as 404 pages, they may stop trusting the sitemaps for crawling and indexing signals.

Eric Enge once interviewed Duane Forrester, who was at Bing at the time:

Duane Forrester quote

What does the Hint check?

This Hint will trigger for any internal URL that returns an HTTP status of 4XX and is included in one of the submitted XML Sitemaps.

Examples that trigger this Hint:

Consider the URL: https://example.com/page-a, which is included in a submitted XML Sitemap.

The Hint would trigger for this URL if it had a 404 (Not Found) header response:

HTTP/... 404 Not Found

...
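
For readers who want to reproduce this kind of check outside a crawler, the following is a minimal sketch in Python, not Sitebulb's actual implementation. It assumes a standard `<urlset>` sitemap using the sitemaps.org namespace. The function names `sitemap_urls`, `fetch_status` and `find_4xx_urls` are hypothetical, chosen for illustration only.

```python
import urllib.error
import urllib.request
import xml.etree.ElementTree as ET

# Namespace used by standard XML Sitemaps (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(sitemap_xml):
    """Return every <loc> value listed in a <urlset> sitemap."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


def fetch_status(url):
    """Return the HTTP status for a URL using a HEAD request.

    Assumption: the server answers HEAD requests the same way as GET.
    urlopen follows redirects, so this is the status of the final URL.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4XX and 5XX responses are raised as HTTPError; the code is the status.
        return error.code


def find_4xx_urls(sitemap_xml, get_status=fetch_status):
    """Return (url, status) pairs for sitemap URLs that return a 4XX status."""
    flagged = []
    for url in sitemap_urls(sitemap_xml):
        status = get_status(url)
        if 400 <= status < 500:
            flagged.append((url, status))
    return flagged
```

The `get_status` parameter is injectable so the check can be run against recorded crawl data, or tested, without making live requests.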

How do you resolve this issue?

This Hint is marked 'Critical' as it represents a fundamentally breaking issue, which may have a serious adverse impact upon organic search traffic. It is strongly recommended that Critical issues are dealt with as a matter of high priority.

To resolve this issue, simply remove any URLs that return 4XX from all XML Sitemaps.
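
If the sitemap is maintained as a static file rather than generated by a CMS or plugin, the removal could be scripted. The sketch below is one possible approach, assuming a standard `<urlset>` sitemap and a set of URLs already known to return 4XX; the function name `remove_urls` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard XML Sitemaps (sitemaps.org protocol).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

# Serialize the sitemap namespace as the default namespace (no prefix).
ET.register_namespace("", SITEMAP_NS)


def remove_urls(sitemap_xml, urls_to_remove):
    """Return the sitemap XML with the given URLs' <url> entries removed."""
    root = ET.fromstring(sitemap_xml)
    # Copy the list first so entries can be removed while iterating.
    for url_element in list(root.findall(f"{{{SITEMAP_NS}}}url")):
        loc = url_element.find(f"{{{SITEMAP_NS}}}loc")
        if loc is not None and loc.text and loc.text.strip() in urls_to_remove:
            root.remove(url_element)
    return ET.tostring(root, encoding="unicode")
```

For sitemaps generated automatically, it is usually better to fix the generator so that URLs returning 4XX are never included, rather than editing its output.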
