- ignore query part of URL to determine sitemap location prefix
  for URL validation, fixes crawler-commons#202
- resolve relative links in RSS feeds, fixes crawler-commons#203
- allow non-continuous content (containing XML entities or CDATA)
  when parsing links in RSS feeds, fixes crawler-commons#204
- extract links from <guid> elements in RSS feeds, fixes crawler-commons#201
RSS feeds may contain relative links:
- <link>/news/428087/</link>
  (https://www.yuga.ru/ingushetia.rss)
- <link>/oa/darticle.aspx?type=view&id=201712001</link>
  (http://www.zjepc.com/oa/rss.aspx)

The sitemap parser should resolve them.
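
To illustrate the expected behavior, here is a minimal Python sketch that resolves the two relative links above against the URL of the feed they came from. It is not the crawler-commons implementation (which is written in Java); the helper name `resolve_feed_link` is hypothetical, and using the feed URL as the base is an assumption made for this example.

```python
from urllib.parse import urljoin


def resolve_feed_link(feed_url: str, link_text: str) -> str:
    """Resolve a possibly relative <link> value against the feed URL.

    Hypothetical helper for illustration only. Absolute links are
    returned unchanged by urljoin; relative ones inherit the scheme
    and host of the feed URL.
    """
    return urljoin(feed_url, link_text.strip())


# The two (feed URL, <link> content) pairs reported in this issue.
feed_examples = [
    ("https://www.yuga.ru/ingushetia.rss", "/news/428087/"),
    ("http://www.zjepc.com/oa/rss.aspx", "/oa/darticle.aspx?type=view&id=201712001"),
]

resolved = [resolve_feed_link(feed, link) for feed, link in feed_examples]
for url in resolved:
    print(url)
```

Under these assumptions the links resolve to https://www.yuga.ru/news/428087/ and http://www.zjepc.com/oa/darticle.aspx?type=view&id=201712001.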