You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If the content of elements in RSS feeds contain XML entities the link extraction does not the correct link: eg. <link>http://get2ch.net/?category=it&pickup_id=2426594</link> (from http://get2ch.net/feed/news/?category=it). Need a solution same as in 8467517.
The text was updated successfully, but these errors were encountered:
- ignore query part of URL to determine sitemap location prefix
for URL validation, fixescrawler-commons#202
- resolve relative links in RSS feeds, fixescrawler-commons#203
- allow non-continuous content (containing XML entities or CDATA)
when parsing links in RSS feeds, fixescrawler-commons#204
- extract links from <guid> elements in RSS feeds, fixescrawler-commons#201
- ignore query part of URL to determine sitemap location prefix
for URL validation, fixescrawler-commons#202
- resolve relative links in RSS feeds, fixescrawler-commons#203
- allow non-continuous content (containing XML entities or CDATA)
when parsing links in RSS feeds, fixescrawler-commons#204
- extract links from <guid> elements in RSS feeds, fixescrawler-commons#201
If the content of elements in RSS feeds contain XML entities the link extraction does not the correct link: eg.
<link>http://get2ch.net/?category=it&pickup_id=2426594</link>
(from http://get2ch.net/feed/news/?category=it). Need a solution same as in 8467517.The text was updated successfully, but these errors were encountered: