Skip to content

[Bug]: Problems when crawling ipres2022.scot #1750

@anjackson

Description

@anjackson

Browsertrix Version

v1.10.0-beta.5-a3911f6

What did you expect to happen? What happened instead?

During the IIPC session, I tried to archive https://ipres2022.scot/ via a seeded crawl:

  • That homepage crawl failed with a 'Page Worker Timeout'.
  • The sitemap did not appear to be accessed
  • Adding sitemap URL as an additional URL didn't seem to work.

Reproduction instructions

  1. Make a new seeded crawl
  2. Use https://ipres2022.scot/ as the seed
  3. Watch the crawl hang until timeout

Screenshots / Video

No response

Environment

IIPC Browsertrix Instance

Additional details

No response

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

Status

Done!

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions