Question: IN/ACTIVE status on NewsWebsite? #91
No, there's no way to achieve this the way you described it. Either you have to use different scrapers (you can use the CLONE action from the Django admin scraper overview site), or you have to live with the fact that all scrapers will stop on the status change. P.S.: Questions of this kind are better suited for the mailing list. Cheers
Ah yes, I didn't notice there was a clone scraper option in the admin actions. Looks like this will do. I'll make sure to post questions to the list next time. Thanks a lot for the answer. Cheers
You might also want to check whether pagination does the job for you (e.g. with a FREE_LIST containing your keywords), but this depends on the specific use case. Cheers
@holgerd77 nice tip, that could come in handy instead of creating multiple NewsWebsite entries. Just put every keyword in the scraper's FREE_LIST.
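To illustrate the FREE_LIST idea from the comments above, here is a minimal sketch of how a list of keywords could expand into one URL per keyword. It is only an illustration and not DDS code: the function `expand_free_list`, its parameters, and the `{page}` placeholder convention are assumptions made for this example. In DDS itself the pagination options are configured on the scraper in the Django admin.

```python
def expand_free_list(base_url, append_str, replacements):
    """Return one URL per replacement value.

    Hypothetical helper (not part of DDS): each value from the free list
    is substituted for the "{page}" placeholder in append_str, and the
    result is appended to base_url.
    """
    return [base_url + append_str.replace("{page}", value) for value in replacements]


# Example values taken from the question in this thread.
urls = expand_free_list(
    "http://www.somewebsite.com/",
    "?q={page}",
    ["keyword1", "keyword2"],
)

for url in urls:
    print(url)
```

With this approach a single NewsWebsite entry and a single scraper cover every keyword, so dropping one keyword means removing it from the list rather than changing the scraper status.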
Hello,
Quick newbie question: I have a use case with 3 NewsWebsite entries that all scrape the same domain URL, with only the keyword differentiating them, like the following:
NewsWebsite 1 url is "http://www.somewebsite.com/?q=keyword1"
NewsWebsite 2 url is "http://www.somewebsite.com/?q=keyword2"
etc
This way I can filter by keyword in the Article admin and only need to create one scraper for all of them. However, I noticed that the IN/ACTIVE status is on the scraper, so setting the scraper to INACTIVE stops scraping for every NewsWebsite, when I actually only need to disable scraping for one keyword. Is there a way to accomplish this in DDS?
Cheers