njcourts.gov scrape responds with HTTP 403 Forbidden Imperva page #999

Open
sentry-io bot opened this issue Apr 12, 2024 · 1 comment

Comments

sentry-io bot commented Apr 12, 2024

This is not a common 404 or a temporary error page, but a 403 Forbidden response triggered by Imperva, an anti-scraping service. So far there are only 2 events, but we should keep track of this in case it escalates. It is reported to Sentry as an UnexpectedContentTypeError.

I was able to reproduce this from my Peruvian IP. It is not triggered when going through a VPN.
image

Sentry Issue: COURTLISTENER-71S

UnexpectedContentTypeError: https://www.njcourts.gov/system/files/court-opinions/2024/a2137-22.pdf
'"text/html" not in ['application/pdf']
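
To keep these events separate from genuine content-type errors, one option is to classify the response before raising. The following is only a minimal sketch, not the project's code: `FetchResult`, `classify_response`, and the `BLOCK_MARKERS` strings are hypothetical names, and the marker strings are an assumption about what an Imperva block page contains, which should be checked against the actual HTML returned.

```python
from dataclasses import dataclass

# Assumption: substrings that may appear in an Imperva (Incapsula) block page.
# Verify against the real 403 body before relying on them.
BLOCK_MARKERS = ("_incapsula_resource", "incapsula incident id", "imperva")


@dataclass
class FetchResult:
    """Hypothetical container for the parts of an HTTP response we inspect."""
    status_code: int
    content_type: str
    body: str


def classify_response(result, expected_types=("application/pdf",)):
    """Return 'ok', 'blocked', or 'unexpected_content_type'."""
    # Drop parameters such as "; charset=utf-8" before comparing.
    content_type = result.content_type.split(";")[0].strip().lower()
    if content_type in expected_types:
        return "ok"
    body = result.body.lower()
    # A 403 whose body carries a block-page marker is treated as a block,
    # not as an ordinary wrong-content-type response.
    if result.status_code == 403 and any(m in body for m in BLOCK_MARKERS):
        return "blocked"
    return "unexpected_content_type"
```

With something like this, a "blocked" result could be logged under its own error type so the number of Imperva events is easy to track over time.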

Filed by @grossir

sentry-io bot commented Apr 13, 2024

Sentry Issue: COURTLISTENER-71T
