Unaligned Dates #6

Closed
tyliec opened this issue Aug 4, 2021 · 1 comment · Fixed by #12
Labels
bug Something isn't working

Comments

@tyliec
Collaborator

tyliec commented Aug 4, 2021

Context

Currently, we deal with three types of dates in the Arrest Log Reports.

  1. The date of the arrest
  2. The date the arrest log was published
  3. The date our script scraped the arrest log

We currently use date 2 (the publication date) as the source of truth and store it in the database, but it is not always accurate. For example, an arrest that happened on 7-12-2021 might be included in the report published on 7-13-2021. This may be due to processing time, or because that day's report had already been published, so the remaining arrests from that day are rolled into the next report.

Potential Solutions

  1. Parse the actual arrest date from the PDF
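
A minimal sketch of what solution 1 could look like, assuming the text extracted from the PDF for each entry contains the arrest date in the `M-D-YYYY` (or `M/D/YYYY`) shape used in the example above. The function name `parse_arrest_date` and the fallback to the publication date are hypothetical, not what the scraper currently does:

```python
import re
from datetime import date

# Assumed date shape: M-D-YYYY or M/D/YYYY, as in the example dates above.
_DATE_RE = re.compile(r"\b(\d{1,2})[-/](\d{1,2})[-/](\d{4})\b")


def parse_arrest_date(entry_text: str, published: date) -> date:
    """Return the first date found in entry_text, else fall back to published.

    entry_text is the (hypothetical) text extracted for one arrest entry;
    published is the report's publication date (date 2 in the list above).
    """
    match = _DATE_RE.search(entry_text)
    if match is None:
        return published
    month, day, year = (int(group) for group in match.groups())
    try:
        parsed = date(year, month, day)
    except ValueError:
        # Digits matched but do not form a real calendar date (e.g. bad OCR).
        return published
    # An arrest cannot happen after the report that lists it was published.
    return parsed if parsed <= published else published
```

With this, an entry containing `7-12-2021` in a report published on 7-13-2021 would be stored under 7-12-2021, while an entry with no readable date keeps the current behaviour.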
@tyliec tyliec added the bug Something isn't working label Aug 4, 2021
@tyliec
Collaborator Author

tyliec commented Nov 17, 2021

This issue is also the root cause of a duplicate record problem, where multiple records overlap over the course of a few days.
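
To illustrate why the actual arrest date matters here: once each record carries its own arrest date, entries repeated across reports could be collapsed on a stable key. This is only a sketch; the record shape (dicts with `name` and `arrest_date` fields) and the helper `dedupe_records` are assumptions, not the project's schema:

```python
from datetime import date
from typing import Dict, Iterable, List


def dedupe_records(records: Iterable[Dict]) -> List[Dict]:
    """Keep the first record seen for each (name, arrest_date) pair.

    The field names are hypothetical. Keying on the publication date instead
    would treat the same arrest in two reports as two different records.
    """
    seen = set()
    unique = []
    for record in records:
        key = (record["name"], record["arrest_date"])
        if key in seen:
            continue
        seen.add(key)
        unique.append(record)
    return unique
```

For example, the same arrest appearing in the reports published on 7-12-2021 and 7-13-2021 would be kept once, provided both copies parse to the same arrest date.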

@tyliec tyliec linked a pull request Jan 3, 2022 that will close this issue
@tyliec tyliec closed this as completed in #12 Jan 4, 2022
tyliec added a commit that referenced this issue Jan 4, 2022
* feat: improve ocr recognitions (#3, #6)