[bug] stale images being treated as new content, causing duplicates #1

Closed
nberlette opened this issue Aug 1, 2023 · 0 comments · Fixed by #4
Assignees: nberlette
Labels: bug, enhancement, good first issue

@nberlette (Owner):

I've noticed a couple of instances of the oxblue.com photo stream being down recently. During these periods, the response contains an outdated image paired with inaccurate header metadata, making it seem as if it's new content.

I need to add a deduplication step that checks the incoming bytes themselves and only proceeds with the scrape if they differ from the most recent scrape. I've manually disabled the workflow until the server returns to normal operation; in the meantime, I'll work on a fix. I'll also remove all of the duplicates that have accumulated during the last couple of outages.
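
For illustration, a byte-level check along those lines might look like the sketch below. It is not the project's actual fix: the function names `digest` and `is_new_image`, the choice of SHA-256, and the idea of persisting the previous digest between runs are all assumptions made for the example.

```python
import hashlib
from typing import Optional


def digest(image_bytes: bytes) -> str:
    """Return the SHA-256 hex digest of the raw image bytes."""
    return hashlib.sha256(image_bytes).hexdigest()


def is_new_image(image_bytes: bytes, last_digest: Optional[str]) -> bool:
    """Return True when the bytes differ from the most recent scrape.

    `last_digest` is the digest stored after the previous scrape
    (hypothetical storage), or None if nothing has been scraped yet.
    """
    return last_digest is None or digest(image_bytes) != last_digest


# Hypothetical usage: compare an incoming frame against the stored digest.
last = digest(b"frame-from-previous-scrape")
print(is_new_image(b"frame-from-previous-scrape", last))  # False: stale duplicate
print(is_new_image(b"a-different-frame", last))           # True: proceed with scrape
```

Comparing a hash of the body rather than the response headers means a stale image is caught even when the server reports misleading metadata, which is the failure described above.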

@nberlette added the bug, enhancement, and good first issue labels on Aug 1, 2023
@nberlette self-assigned this on Aug 1, 2023