Skip to main content

Extract sitemap: FAQs

  1. What is the expected user input to extract a sitemap?
    • A valid URL is all that is required to extract URLs from a sitemap. Some examples include:
      • XML sitemap URL (example.com/sitemap.xml)
      • Domain URL (example.com)
      • Subdomain URL (subdomain.example.com)
      • File path (example.com/subfolder)
      • Any page on the domain / subdomain (example.com/page.html)
  2. How will URLs be extracted?
    • If you input an XML sitemap URL, all URLs present within the sitemap file would be extracted.
    • For other input formats, URLs would be extracted in the following manner:
      • Base domain / subdomain URL would be extracted from the input.
      • Within the base domain / subdomain the extractor would search for robots.txt file contains sitemap URL(s).
      • URLs would then be extracted from the sitemap(s) fetched above.
    • Please note that the count of URLs extracted would be capped at 10k to avoid any computation implications.

We're sorry to hear that. Please share your feedback so we can do better

Contact our Support team for immediate help while we work on improving our docs.

We're continuously improving our docs. We'd love to know what you liked






Thank you for your valuable feedback

Is this page helping you?

Yes
No

We're sorry to hear that. Please share your feedback so we can do better

Contact our Support team for immediate help while we work on improving our docs.

We're continuously improving our docs. We'd love to know what you liked






Thank you for your valuable feedback!

Talk to an Expert
Download Copy