Ingest posted results and analyze/compare them #4

Open
boredzo opened this issue Feb 10, 2021 · 3 comments
Labels: help wanted

Comments

@boredzo
Owner

boredzo commented Feb 10, 2021

If you have a copy of all the Assembly District pages with the posted results (including vote counts!) on them, it should be possible to compare those vote counts to our own counts of votes found in valid ballots.
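The comparison itself could be sketched roughly as follows, assuming both the posted results and our own counts have already been reduced to plain dictionaries mapping a candidate name to a vote count. The function name `compare_counts` and the dictionary shape are assumptions made for illustration, not something the project defines.

```python
def compare_counts(posted, counted):
    """Return {candidate: (posted, counted, difference)} for every mismatch.

    `posted` and `counted` are hypothetical dicts of candidate -> votes.
    A candidate missing from one side is treated as having 0 votes there.
    """
    mismatches = {}
    for candidate in sorted(set(posted) | set(counted)):
        posted_votes = posted.get(candidate, 0)
        counted_votes = counted.get(candidate, 0)
        if posted_votes != counted_votes:
            # Difference is our count minus the posted count.
            mismatches[candidate] = (
                posted_votes,
                counted_votes,
                counted_votes - posted_votes,
            )
    return mismatches
```

An empty result would mean the two sources agree for every candidate in that Assembly District.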

This will likely involve using Beautiful Soup to scrape the HTML in each file.

boredzo added the help wanted label Feb 10, 2021
@boredzo
Owner Author

boredzo commented Feb 10, 2021

As described in the README, these pages are expected at ./posted-results/ad-99.html (replace 99 with each AD number).

I would say this step should be optional; if posted-results is missing or empty, no biggie.
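One way to make the step optional is to have file discovery return nothing when the directory is absent or empty, so the caller can simply skip the comparison. This is only a sketch: the function name `find_posted_results` is hypothetical, and the filename pattern is inferred from the `ad-99.html` convention above.

```python
import re
from pathlib import Path

# Matches names such as "ad-17.html"; the pattern is an assumption
# based on the ./posted-results/ad-99.html convention.
AD_FILE_PATTERN = re.compile(r"^ad-(\d+)\.html$")


def find_posted_results(directory="posted-results"):
    """Map AD number -> Path for each posted-results page found.

    Returns an empty dict when the directory is missing or holds no
    matching files, so the caller can treat the step as optional.
    """
    root = Path(directory)
    if not root.is_dir():
        return {}
    found = {}
    for path in root.iterdir():
        match = AD_FILE_PATTERN.match(path.name)
        if match and path.is_file():
            found[int(match.group(1))] = path
    return found
```

A caller could then write `pages = find_posted_results()` and skip the whole comparison when `pages` is empty.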

@boredzo
Owner Author

boredzo commented Feb 10, 2021

It may also be a good idea to have a separate tool for converting scraped HTML to CSV.
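A standalone converter might look something like the sketch below. It assumes the results sit in an ordinary HTML table with one row per candidate, which may not match the real pages. To stay dependency-free it uses the standard library's `html.parser` rather than Beautiful Soup; the names `TableRowParser` and `html_table_to_csv` are made up for this example.

```python
import csv
import io
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:  # skip rows with no cells
                self.rows.append(self._row)
            self._row = None


def html_table_to_csv(html_text):
    """Return CSV text with one line per table row found in html_text."""
    parser = TableRowParser()
    parser.feed(html_text)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()
```

Keeping this separate from the analysis would let the CSV be inspected or diffed by hand before any comparison runs.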

@boredzo
Owner Author

boredzo commented Feb 12, 2021

One step toward this happened in a1768e9: analyze.py ingests the HTML pages and builds a CSV of the posted results. No comparison to independent scans yet, though.
