genealogics.org partially scraped #18

@marfox

Out of the claimed 668,261 items:

  • the first run yielded 447,045 items, then timed out;
  • the second run yielded 266,492 items, then timed out as well.

For the Development Corpus milestone, let's move on and combine the outputs of the two runs.

Then, we can leave this issue open, assign it to the Production Corpus milestone, and re-run the scraper until we have all the data.
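
A note on combining: 447,045 + 266,492 = 713,537, which exceeds the claimed 668,261 by 45,276. If the claimed total is accurate and neither run contains internal duplicates, the two outputs must overlap, so a plain concatenation would double-count items. Below is a minimal sketch of a de-duplicating merge. It assumes each run was dumped as JSON Lines and that every item carries a unique `url` field; both the format and the field name are assumptions, not something confirmed by the scraper, and `merge_runs` is a hypothetical helper.

```python
import json
from typing import Iterable, Iterator


def merge_runs(runs: Iterable[Iterable[str]], key: str = "url") -> Iterator[dict]:
    """Yield each item once, keeping the first occurrence across all runs.

    `runs` is an iterable of line iterables (e.g. open file handles).
    `key` is the field assumed to identify an item uniquely.
    """
    seen = set()
    for lines in runs:
        for line in lines:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            item = json.loads(line)
            identifier = item[key]
            if identifier in seen:
                continue  # already emitted from an earlier line or run
            seen.add(identifier)
            yield item


# Tiny in-memory stand-ins for the two run outputs (made-up data).
run_1 = ['{"url": "a", "name": "x"}', '{"url": "b", "name": "y"}']
run_2 = ['{"url": "b", "name": "y"}', '{"url": "c", "name": "z"}']
merged = list(merge_runs([run_1, run_2]))
```

With real dumps, the same function can be fed open file handles instead of lists. Comparing the merged count against 668,261 would then show how many items are still missing for the Production Corpus milestone.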
