
extract_urls_from_sitemap_index

Scrapes all the URLs from a sitemap index or from a single sitemap.xml. The only parameter is the URL of the sitemap index (or sitemap). Only sitemaps in XML format are supported. The script outputs an Excel file with three columns:

  • ID
  • Sitemap: the sitemap in which the URL was found
  • Url: each URL that appears in the sitemap(s)
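
The extraction step described above can be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the names `fetch_xml` and `collect_rows` are hypothetical, it assumes the sitemaps use the standard sitemaps.org namespace, and it only builds the three-column rows without writing the Excel file.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol (assumed to be used by the site).
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def fetch_xml(url):
    """Download a URL and return its body as bytes (hypothetical helper)."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def collect_rows(sitemap_url, fetch=fetch_xml):
    """Return (ID, Sitemap, Url) rows for a sitemap index or a single sitemap."""
    root = ET.fromstring(fetch(sitemap_url))

    if root.tag.endswith("sitemapindex"):
        # A sitemap index lists its child sitemaps in <sitemap><loc> elements.
        children = [loc.text.strip() for loc in root.findall("sm:sitemap/sm:loc", NS)]
        documents = [(child, ET.fromstring(fetch(child))) for child in children]
    else:
        # A plain sitemap.xml is treated as its own and only source.
        documents = [(sitemap_url, root)]

    rows = []
    for sitemap, document in documents:
        # Each page URL lives in a <url><loc> element.
        for loc in document.findall("sm:url/sm:loc", NS):
            rows.append((len(rows) + 1, sitemap, loc.text.strip()))
    return rows
```

Calling `collect_rows("https://example.com/sitemap_index.xml")` (a placeholder URL) would return a list of tuples matching the ID, Sitemap and Url columns. Writing those rows to an Excel file needs a third-party library such as openpyxl or pandas, which is left out of this sketch.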