… content/url mismatches.
1. Reworked parser so that only RSS parsing is performed. 2. Switched to Goose text extractor. 3. Grabbing images as well as text. 4. Exporting to XML for import to Drupal site.
Curata as a crawling source, other small bug fixes.