Expand WASAPI Limits #11

Closed
amhanson9 opened this issue Mar 31, 2023 · 3 comments
Labels
warc_metadata Specific to warc_metadata_report.py (originally warc_csv.py)

Comments

@amhanson9
Contributor

WASAPI calls are currently limited to 1000 WARCs, and we now have more than that. Test whether a limit of -1 returns everything (it does with the Partner API); if not, increase the current limit.

@amhanson9 amhanson9 added the warc_metadata Specific to warc_metadata_report.py (originally warc_csv.py) label Apr 4, 2023
@amhanson9 amhanson9 self-assigned this Apr 10, 2023
@amhanson9
Contributor Author

Considered whether this is actually a problem, since we usually get the WARCs for a single quarter, which is still well under 1000. But there may be times we want all our metadata in one CSV, so we do want the ability to get everything.

@amhanson9
Contributor Author

amhanson9 commented Apr 14, 2023

A page limit of -1 does not work with WASAPI; it falls back to the default limit of 100. The page limit has to be set to a larger number explicitly: see the WASAPI help page.

@amhanson9
Contributor Author

Solution: use a standard page limit, and if the count returned by WASAPI is larger than that limit, call WASAPI again using the count as the page limit. Using 500 as the standard limit, which is plenty for our quarterly downloads.
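
A minimal sketch of that retry logic, not the actual code in warc_metadata_report.py. It assumes the endpoint URL shown, that the page limit is passed as a `page_size` query parameter, and that the JSON response has `count` and `files` keys; the function names `wasapi_fetch` and `get_all_files` are hypothetical.

```python
import json
import urllib.parse
import urllib.request
from base64 import b64encode

# Assumed endpoint; check the WASAPI help page for the real one.
WASAPI_URL = "https://partner.archive-it.org/wasapi/v1/webdata"
STANDARD_PAGE_SIZE = 500  # the standard limit chosen in this issue


def wasapi_fetch(page_size, username, password, url=WASAPI_URL):
    """Hypothetical helper: make one WASAPI call and return the parsed JSON.

    Assumes HTTP basic auth and a 'page_size' query parameter.
    """
    query = urllib.parse.urlencode({"page_size": page_size})
    request = urllib.request.Request(f"{url}?{query}")
    token = b64encode(f"{username}:{password}".encode()).decode()
    request.add_header("Authorization", f"Basic {token}")
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def get_all_files(fetch, page_size=STANDARD_PAGE_SIZE):
    """Return every file record, calling WASAPI a second time only if needed.

    'fetch' is any callable that takes a page size and returns the parsed
    response, assumed to contain 'count' (total matches) and 'files'.
    """
    data = fetch(page_size)
    if data["count"] > page_size:
        # The first page was truncated, so ask again with the full count.
        data = fetch(data["count"])
    return data["files"]
```

It could be called as `get_all_files(lambda size: wasapi_fetch(size, username, password))`. Passing the fetch step in as a callable keeps the count check separate from the network call, so the logic can be tested without credentials.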
