Skip to content

Create 311 data CSV files that can be accessed through a Jupyter notebook #177

@akhaleghi

Description

@akhaleghi

Overview

We want to download 311 data and split by year, then month, so each is under 100MB and we can host tan append-only data warehouse on GitHub.

Action Items

  • Get cleaning rules from the 311-data repo and add a link to the rules to Resources below.
  • Get city data
  • Split by year, then by month
  • Outline what you did to clean the data in a comment below
  • Create Jupyter notebook to access the data and add notes explaining the cleaning rules
  • Create a website (ideally ghpages) that can display the jupyter notebook so that people don't have to know how to download and install one.

Resources/Instructions

Cleaning Rules: https://github.com/hackforla/data-science/blob/main/311-data/CSV_files/Docs/CleaningRules.txt
City Data:: https://data.lacity.org/browse?q=311%20data%20%2C%202024&sortBy=relevance (Please update the filter for the year 2024 based on the requirements.)
Website (ghpages): https://hackforla.github.io/311-data-jupyter-notebooks/lab (navigate to folder : 311_Data_CleaningScript)
Google Colab: Implemented an alternative using Google Colab, allowing easy execution of the notebook and direct access to raw and monthly CSV files without relying on GitHub Pages.
Link to Colab notebook: https://colab.research.google.com/drive/1_HFqnSOIDqDCtF3jmslmzkZ82eho10lY?usp=sharing

Metadata

Metadata

Assignees

Type

No type

Projects

Status

Needs Peer Review

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions