Skip to content

0.10

Closed Jun 7, 2018 100% complete

Release 0.10 (2018-06-05)

  • Add JAX-B dependencies to POM (jnioche) #207
  • [Sitemaps] Add method to parse and iterate sitemap SiteMapParser#walkSiteMap(URL,Consumer) (Luc Boruta) #190
  • [Sitemaps] Sitemap file location to ignore query part of URL (sebastian-nagel) #202
  • [RSS sitemaps] Link extraction from RSS feeds fails on XML entities (sebastian-nagel) #204
  • […

Release 0.10 (2018-06-05)

  • Add JAX-B dependencies to POM (jnioche) #207
  • [Sitemaps] Add method to parse and iterate sitemap SiteMapParser#walkSiteMap(URL,Consumer) (Luc Boruta) #190
  • [Sitemaps] Sitemap file location to ignore query part of URL (sebastian-nagel) #202
  • [RSS sitemaps] Link extraction from RSS feeds fails on XML entities (sebastian-nagel) #204
  • [RSS sitemaps] Resolve relative links in RSS feeds (sebastian-nagel) #203
  • [RSS sitemaps] Extract links from elements (sebastian-nagel) #201
  • [Sitemaps] Limit on "bad url" log messages (sebastian-nagel) #145
  • EffectiveTldFinder to parse Internationalized Domain Names (sebastian-nagel) #179
  • Add main() to EffectiveTldFinder (sebastian-nagel) #187
  • Handle new suffixes in PaidLevelDomain (kkrugler) #183
  • Remove Tika dependency (kkrugler) #199
  • Improve MIME detection for sitemaps (sebastian-nagel) #200
  • Make RobotRules accessible (aecio via kkrugler) #134
  • SimpleRobotRulesParser: Expose MAX_WARNINGS and MAX_CRAWL_DELAY (aecio via kkrugler) #194
  • Added main to SimpleRobotRulesParser for testing (sebastian-nagel) #193
  • Allow for legacy URIs when checking sitemap namespaces (sebastian-nagel) #211

This milestone is closed.

No open issues remain. View closed issues or see open milestones in this repository.