
Question: IN/ACTIVE status on NewsWebsite? #91

Closed
senoadiw opened this issue Jun 9, 2017 · 4 comments

Comments


senoadiw commented Jun 9, 2017

Hello,

Quick newbie question: I have a use case with three NewsWebsite entries that all scrape the same domain URL, with only the keyword differentiating them, like the following:

NewsWebsite 1 url is "http://www.somewebsite.com/?q=keyword1"
NewsWebsite 2 url is "http://www.somewebsite.com/?q=keyword2"
etc.

This way I can filter by keyword in the Article admin and only need to create one scraper for all of them. However, I notice the IN/ACTIVE status sits on the scraper, so setting the scraper to INACTIVE stops scraping for all NewsWebsite entries when I actually only need to disable scraping for one keyword. Is there a way to accomplish this in DDS?
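
For reference, the setup roughly follows the open_news example from the DDS docs, with all entries pointing at the same scraper (a minimal sketch; names and fields here are just illustrative, not my exact code):

```python
from django.db import models
from dynamic_scraper.models import Scraper, SchedulerRuntime


class NewsWebsite(models.Model):
    # One row per keyword; all rows reference the *same* Scraper instance,
    # only the url (and name) differ.
    name = models.CharField(max_length=200)
    url = models.URLField()
    scraper = models.ForeignKey(
        Scraper, blank=True, null=True, on_delete=models.SET_NULL)
    scraper_runtime = models.ForeignKey(
        SchedulerRuntime, blank=True, null=True, on_delete=models.SET_NULL)

    def __str__(self):
        return self.name
```

So the three entries only differ in their url, while the scraper foreign key is identical for all of them.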

Cheers

holgerd77 (Owner) commented

No, there's no way to achieve this the way you described it. Either you have to use different scrapers (you can use the CLONE action from the Django admin scraper overview page), or you have to live with the fact that scraping stops for all entries on the status change.
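
To illustrate the first option (just a rough sketch, the object and app names are made up): after cloning, each NewsWebsite entry gets its own scraper, so the status can then be toggled per keyword:

```python
# Rough sketch (names are made up, adapt to your project): after using the
# CLONE admin action you have one Scraper per keyword and can attach each
# clone to its corresponding NewsWebsite entry.
from dynamic_scraper.models import Scraper
from myapp.models import NewsWebsite  # your own reference-model app

for kw in ('keyword1', 'keyword2', 'keyword3'):
    scraper = Scraper.objects.get(name='Somewebsite Scraper %s' % kw)
    website = NewsWebsite.objects.get(url__contains='q=%s' % kw)
    website.scraper = scraper
    website.save()

# Switching off a single keyword then only affects its own scraper; normally
# you would just change the status in the admin. ('I' for INACTIVE is an
# assumption here, check the status choices in your DDS version.)
Scraper.objects.filter(name='Somewebsite Scraper keyword2').update(status='I')
```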

P.S.: This kind of question is better suited for the mailing list:
https://groups.google.com/forum/#!forum/django-dynamic-scraper

Cheers
Holger


senoadiw commented Jun 9, 2017

Ah yes, I hadn't noticed there was a CLONE scraper action in the admin. Looks like this will do. I'll make sure to post questions to the list next time. Much appreciated for the answer.

Cheers

holgerd77 (Owner) commented

You might also want to check whether pagination does the job for you (e.g. with a FREE_LIST containing your keywords), but this depends on the specific use case.
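
Roughly along these lines (only a sketch; the field names follow the pagination section of the DDS docs, shown here via the Django shell instead of the admin form, and the exact choice value for FREE_LIST should be checked against your DDS version):

```python
# Sketch: let one NewsWebsite entry cover all keywords via FREE_LIST pagination.
# Normally this is configured on the scraper form in the Django admin.
from dynamic_scraper.models import Scraper

scraper = Scraper.objects.get(name='Somewebsite Scraper')   # name is made up
scraper.pagination_type = 'FREE_LIST'                       # choice value assumed, verify in your version
scraper.pagination_append_str = '/?q={page}'                # '{page}' is replaced by each list entry
scraper.pagination_page_replace = "'keyword1', 'keyword2', 'keyword3'"
scraper.save()
```

The single NewsWebsite entry would then just carry the base URL (http://www.somewebsite.com), and the keyword gets appended per pagination step.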

Cheers
Holger


senoadiw commented Jun 9, 2017

@holgerd77 nice tip, that could come in handy instead of creating multiple NewsWebsite entries. Just put every keyword in the scraper's FREE_LIST.
