Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scraper stalls on random websites #280

Open
m1cr0man opened this issue May 21, 2016 · 5 comments
Open

Scraper stalls on random websites #280

m1cr0man opened this issue May 21, 2016 · 5 comments

Comments

@m1cr0man
Copy link
Member

Occasionally the scraper will stall indefinitely on a website. If I go into the screen session and hit ctrl+c it kills something associated with the requests library and then continues to the next website.

My guess would be that to reproduce this all you would need to do is open 100 or so random URLs in series and watch for one of them not to respond.

This is in turn stopping the stream page updating and also stalling the email summaries. It needs to be fixed

@CianLR
Copy link
Member

CianLR commented May 21, 2016

Can you check the logs and let us know what feed?
On 21 May 2016 6:17 p.m., "Lucas Savva" notifications@github.com wrote:

Occasionally the scraper will stall indefinitely on a website. If I go
into the screen session and hit ctrl+c it kills something associated with
the requests library and then continues to the next website.

My guess would be that to reproduce this all you would need to do is open
100 or so random URLs in series and watch for one of them not to respond.

This is in turn stopping the stream page updating and also stalling the
email summaries. It needs to be fixed


You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#280

@m1cr0man
Copy link
Member Author

The last time it ran it crashed on http://erikbern.com/atom.xml, but it seems since then it hasn't run as someone actually killed scraper (double ctrl+c). I saw it freeze on Reddit once too. Like I said it's random (IE any feed).

@CianLR
Copy link
Member

CianLR commented May 21, 2016

Hm, I'll try and reproduce this evening
On 21 May 2016 6:21 p.m., "Lucas Savva" notifications@github.com wrote:

The last time it ran it crashed on http://erikbern.com/atom.xml, but it
seems since then it hasn't run as someone actually killed scraper (double
ctrl+c). I saw it freeze on Reddit once too. Like I said it's random (IE
any feed).


You are receiving this because you commented.
Reply to this email directly or view it on GitHub
#280 (comment)

@CianLR
Copy link
Member

CianLR commented May 21, 2016 via email

@devoxel
Copy link
Contributor

devoxel commented May 24, 2016

It seems to be dead 100%. Can't ssh into it. @m1cr0man, is m1cr0man.com down

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants