Skip to content

as130/algos_final_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

104 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Algorithms For the People Final Project: Atlas of Surveillance

The Atlas of Surveillance (AoS) is an EFF database of the surveillance technology owned and operated by local law enforcement agencies. The DB can be downloaded as a csv (it’s about 3MB). In this project you will use the data in AoS to investigate how surveillance equipment is distributed across the US as a function of the racial and ethnic composition of a county, the median household income of a county, and any other characteristics you find interesting. To do this you will need other sources of data like the Census.

We are going to use the US Census demographic data in order to make conclusions regarding demographics, gender, and surveillance technologies.

Link to CLI (please only use from this link):

https://github.com/as130/algos_final_project_cli

Report:

https://www.overleaf.com/3262343194kdkyqcnrncwy

Presentation

http://brown.edu/go/atlas-presentation

Open Questions

  • How do we make sure that we are talking about these numbers/findings in an ethical and respecftul way in our final report/presentation?

Data

us-census-demographic-data: https://www.kaggle.com/muonneutrino/us-census-demographic-data This dataset includes demographic data from throughout the US from 2015-2017. This works well with the Atlas of Surveillance dataset because it includes data from before 2015 as recent as 2020. It contains gender and race for different counties and states. It is saved as a csv file in the us-census-data directory.

atlas-of-surveillance: csv file: https://atlasofsurveillance.org/

Methods for Analysis

CLI (reach goal)

  • Have a python script that runs in the command line that allows people to clone our repo and run our k-nearest neighbors algorithm with their own information and play around with it with different county information.

Schedule

  • Oct 12: methods & proposal
  • Oct 19: have database cleaned and loaded into python (pandas), develop research question/hypothesis and determine which tests will be used
  • - Oct 26: statistical methods run
  • Nov 2: analyze results
  • Nov 9: works on data visualization/cli(time permitting)
  • Nov 16: Have presentation prepared and rehearsed
  • Nov 19: Present! :)

TODO

  • Clean Atlas CSV dataset (turn into .db file), clean census data (all)
  • Load data into python files (Jupyter notebook setup) (all)
  • Investigate census data
  • Investigate what methods we want to use for analysis
  • Write more detailed proposal
  • Add figure numbers in report

Environment & Dependencies

About

Algorithms For the People Final Project: Atlas of Surveillance

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors