Skip to content

Latest commit

 

History

History
 
 

Week02

Week 2: Data in Urban Studies

October 4, 2021

Agenda

  • This class will be recorded (remind me if I forget!)
  • Log in to the class JupyterHub using the UP206A Git Puller to avoid the bottleneck
  • Welcome to week 2
    • Concerns? Questions?
    • Office hours - we're here for you!
  • Group creation
  • Break
  • Lab session
  • A discussion on a selection from last week's readings

Hands on

First, grab the course material, and "pull" it into your JupyterHub space:

  • UP206A Git Puller (This link will automatically launch JupyterHub and clone the course material into your directory)

Topics:

Assignments (due 23:59 Sunday, the day before class)

Group Assignment #1: Project Proposal

The course will largely be guided by your final project, which will be conducted in pairs. Consider that your final project will be a representation of what you learn in this course, and how you apply it on a project level. Consider also that your final project can serve to showcase your data science and mapping skills, which may become a valuable asset for your career moving forward.

Meet with your partner, discuss and identify a research question, identify and collect data, articulate how you intend to use and analyze it, and begin to speculate how the data can answer your research inquiry. Understand that this may change later, especially as we learn more about what we can do with our methods, and also, as you find the data sources that can or cannot support your research.

Next, identify and download at least two datasets that you believe can guide your research. Make sure that at least one of them has a spatial component. For example, if one of your datasets comes from the census, identify which survey you will use (e.g. Decennial or American Community Survey), and which variables you will select. Note that we will be covering census data in detail in Week 3. Your second dataset should come from a separate source. For example, you may want to do a crime analysis and obtain data from the LA Data Portal.

First, create a group repo.

  1. Select a member of the group who will be the main account holder of the repo
  2. Create a new repository for your group project
  3. Go to Settings, Manage Access, and add your partner as a collaborator
  4. Create a Group Assignments folder (hint: click on Add File, Create new File, and enter "Group Assignments/readme.md" in the text box)
  5. Create a markdown file for your project proposal

Your project proposal should include the following:

  • An introduction of your research question
  • An explanation of why it is important to you, why it matters to others, and what is at stake
  • A description of the spatial scope (e.g. Boyle Heights or Hong Kong), and why space and/or time matters for your project
  • A preliminary but definitive description of data sources (at least two) that you will use
    • Include datasource with links
  • A scope that explains the intended analysis and resulting visualizations for your project
  • A concluding paragraph of what insights you expect to gain from your research

Individual assignment: Data Exploration

Create a token

In order to pull and push content to GitHub, you must first create a token, which will serve as your password. Refer to this tutorial to create your token:

Clone your repo

This is your first code assignment submission. Before you begin, create a clone of your repo in JupyterHub.

Submission guidelines:

Find and download a dataset of your choice. This can be a shapefile, csv file, or json file.

Launch JupyterHub, go to your up206a repo folder, and create a week01 folder.

Load the dataset to the up206a/week01 folder.

Create a new python notebook (do not work on a copy of the lab notebook):

Right click on the Untitled.ipynb tab and rename the notebook to week 1 assignment.ipynb

Add an introductory markdown cell with a title (header) and paragraph that describes what you are doing.

Import the data, and conduct data exploration, making sure to document your steps and your preliminary findings. At minimum, run the following commands:

  • .shape
  • .info
  • .head()
  • .plot()
  • .value_counts()
  • run a query on the data that filters it in some way

For each code cell, add a markdown cell that explains what you are doing.

Add markdown cells that describe the output of each operation.

Save your notebook.

Commit your changes to your GitHub class repo.

Commit your changes to your GitHub repo by following these instructions:

Submit your assignment

The last step is to submit your assignment to the class repo discussion section here.

Reading assignment:

Silvia

Next week we begin our first episode of our "Conversation Series." Our guest will be Silvia Gonzales, Director of Climate, Environmental Justice and Health Research for the UCLA Latino Policy and Politics Initiative.

Read the following articles that she has been affiliated with, and come prepared to ask her questions next week.