Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AU-5 data issue #323

Closed
siemvaessen opened this issue Feb 15, 2021 · 6 comments
Closed

AU-5 data issue #323

siemvaessen opened this issue Feb 15, 2021 · 6 comments
Assignees
Labels
Needs Testing Needs testing label for IATI team

Comments

@siemvaessen
Copy link

Both Datastore and D-Portal are unable to retrieve AU-5 data from the originating AU-5 source. Humans are able to download the files, but it's very likely some machine crawling not-allowed of kinds is in place on the originating location making it impossible to download files in an automated fashion. As this is not a Datastore nor D-Portal issue, we have closed the issue on the Datastore end have added it to this issue tracker as it seems a more appropriate place to log this issue.

@andylolz
Copy link
Contributor

Same problem raised here: codeforIATI/iati-data-bugtracker#9

I contacted the email addresses I found in DFAT’s metadata, but didn’t hear back.

@andylolz
Copy link
Contributor

andylolz commented Mar 4, 2021

The registry archiver is now also showing issues fetching ausgov data:

Screenshot 2021-03-04 at 08 32 30

@ss-bhat
Copy link
Contributor

ss-bhat commented Mar 12, 2021

@andylolz

This is giving timeout

        with requests.get(url, headers=req_headers, timeout=url_timeout, verify=False, stream=True) as res:
            resp_headers = res.headers

requests.get(url) works not sure if it has to do with headers or is a bug in the requests package.

It's a header. They do not allow the user agent to 'Mozilla/5.0' (this is the user-agent we are using in the archiver)

ss-bhat added a commit that referenced this issue Mar 25, 2021
@ss-bhat
Copy link
Contributor

ss-bhat commented Mar 31, 2021

@andreaszenasidi @andylolz

I have changed the user agent, theoretically, this should be fixed when the archiver runs for the next time.

@ss-bhat ss-bhat added the Needs Testing Needs testing label for IATI team label Mar 31, 2021
@ss-bhat ss-bhat self-assigned this Mar 31, 2021
@andylolz
Copy link
Contributor

andylolz commented Apr 1, 2021

I have changed the user agent, theoretically, this should be fixed when the archiver runs for the next time.

Nice – looks like it worked! 👏

@andreaszenasidi
Copy link
Contributor

Looks good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Needs Testing Needs testing label for IATI team
Projects
None yet
Development

No branches or pull requests

4 participants