### Table of Contents
To return to the table of contents, click on the number at any major section heading.

[1. Project Title](#1.-Project-Title)

[2. Team members](#2.-Team-members)

[3. Question(s) you addressed, why it is important](#3.-Question(s)-you-addressed,-why-it-is-important)

[4. Background and literature](#4.-Background-and-literature)

[5. Python packages you used and why](#5.-Python-packages-you-used-and-why)

[6. Data sources](#6.-Data-sources)

[7. Data cleaning you have done](#7.-Data-cleaning-you-have-done)

[8. Descriptive statistics for the data](#8.-Descriptive-statistics-for-the-data)

[9. Analysis](#9.-Analysis)

[10. Summary of products and results](#10.-Summary-of-products-and-results)

[11. Discussion](#11.-Discussion)

[12. Conclusions and future work](#12.-Conclusions-and-future-work)

### [1.](#Table-of-Contents) Project Title

Exploring the Impact of Computer Ownership on Unemployment in San Diego, CA

### [2.](#Table-of-Contents) Team members

By: Michael Garcia-Perez (A16366187) & Haoyu Fu (A16633278)

### [3.](#Table-of-Contents) Question(s) you addressed, why it is important

We aim to examine the relationship between computer ownership and unemployment in San Diego, California, and to consider additional factors if time allows. The primary audience for our research is political officials and policymakers in San Diego. If a significant relationship exists between computer ownership and unemployment, it would suggest that improving residents' access to computers could help raise the employment rate.

San Diego, CA, is a thriving city; however, homelessness has become a growing concern among residents, evident in headlines such as "Downtown San Diego homeless population reaches record high for the sixth month in a row" (Warth, LA Times). Homelessness stems from various causes, including a lack of affordable housing, poverty, and unemployment. Despite common misconceptions associating homelessness with mental health issues, a comprehensive survey revealed that poverty is the primary culprit (The Coronado Times). The cost of living in San Diego is a driving factor, but so is "inadequate employment," such as the "inability to find gainful employment..." (The Coronado Times). Hence, homelessness can be linked to unemployment, prompting us to explore additional relationships, including those involving computer ownership.

We believe that computer ownership could significantly affect unemployment in San Diego. We were initially unaware of how significant the issue might be, and we were surprised to find a dataset on computer ownership before starting our research. If time permits, our analysis will also examine the number of broadband providers per San Diego census tract, compliance with the Federal Broadband Threshold, income, ethnic populations, and more, using broadband data from SANDAG and visualizing the results by census tract.

We expect to identify a strong relationship between computer ownership and unemployment rates across San Diego census tracts. We anticipate that individuals without access to computers face challenges when applying for jobs, potentially leading to higher unemployment rates. Ultimately, we hope our findings offer a new perspective on the factors influencing unemployment in certain San Diego areas and give policymakers insight into using tax money to fund initiatives that increase computer ownership, should ownership prove to improve the employment rate.

### [4.](#Table-of-Contents) Background and literature

1. https://www.utwente.nl/en/bms/vandijk/publications/digital_divide_impact_access.pdf
2. https://www.sciencedirect.com/science/article/pii/S0040162521007903
3. https://files.eric.ed.gov/fulltext/ED463391.pdf
4. https://www.sciencedirect.com/science/article/pii/S026427512031252X

These references provide the background literature needed to understand our topic: the influence of computer ownership on unemployment in San Diego. The articles explore subjects such as the digital divide and broadband access. The digital divide is an inequality in which more privileged individuals have access to digital resources like the internet, along with the skills and hardware, such as laptops or phones, needed to use them. These references have been instrumental in refining our focus on computer ownership and its impact on unemployment.

As highlighted in these articles, the digital divide significantly affects employment. Without a computer or internet access, applying for jobs or acquiring new skills becomes far more difficult. Job seekers rely on these resources to submit applications, connect with recruiters, participate in interviews, respond to important emails, learn about job positions, and more. Libraries can help mitigate the impact of the digital divide, but managing these tasks there is still harder than with a computer and reliable internet at home. Finding a job is already difficult in a competitive job market and economy, and the articles indicate that the digital divide exacerbates these difficulties.

Another aspect of these articles that shapes our research is broadband access. As the articles describe, broadband access determines whether a household has reliable internet. A high-speed connection must transmit large amounts of data quickly, which in turn affects a person's ability to use a computer for job searching and other employment-related activities. We consider computer ownership and broadband access to be interconnected, since a computer without internet access is of little use for seeking employment. Furthermore, census tracts in San Diego differ in the quality and speed of their broadband internet, which leads us to narrow our focus to a specific hypothesis. We aim to explore how these variations across tracts in San Diego relate to unemployment.

### [5.](#Table-of-Contents) Python packages you used and why

Will do once we finish implementation...

In [2]:
# Importing necessary packages
import pandas as pd
import geopandas as gpd

### [6.](#Table-of-Contents) Data sources

1.  https://opendata.sandag.org/Sustainable-Development-Goals/SDG-Indicator-5-b-1-Data-Computer-Ownership/tsaq-axm2/abo

    - This URL provides data regarding computer ownership in San Diego. Specifically, the data includes information on the number of households with one or more types of computing devices, such as desktops, laptops, smartphones, tablets, or other portable wireless computers, and other computer types. There are no concerns related to data quality or constraints. The geographic granularity of this data is census tracts, making it ideal for use with the other data source we have identified. Our choice of sources has evolved since the proposal phase, as we considered whether to join/merge data by census tracts instead of zip codes. We decided that analyzing census tracts would yield more comprehensive results, given the greater number of census tracts in San Diego compared to zip codes. Although we cannot use the zip code data, there is still ample information available for analysis from this URL.
    
    
2. https://opendata.sandag.org/stories/s/Digital-Divide-2-0-Broadband-Access-and-More/9vde-3c89/

    - This URL contains data about broadband providers per census block code, including bandwidth performance/accessibility and geographic information. Specifically, this URL provides three different dataframes that we can utilize in our analysis. We aim to convert the census block code to census tracts and use the data accordingly. While there might be a need for significant data cleaning, there are no concerns related to data quality or constraints. Once again, our choices of sources have evolved, and we have decided to focus solely on data associated with census tracts instead of zip codes.
    
    
3. We will obtain unemployment data via Geoenrichment from ArcGIS.
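
The conversion from census block codes to census tracts mentioned for source 2 can rely on the structure of Census GEOIDs: a 15-digit block GEOID is made up of a 2-digit state code, a 3-digit county code, a 6-digit tract code, and a 4-digit block code, so its first 11 digits identify the tract. The following is a minimal sketch of that conversion followed by a merge on tract. The column names (`block_geoid`, `tract_geoid`, `providers`, `households_with_computer`) and all values are hypothetical placeholders, not fields or figures from the SANDAG files.

```python
import pandas as pd


def block_to_tract(block_geoid) -> str:
    """Return the 11-digit tract GEOID embedded in a 15-digit block GEOID."""
    # Zero-pad in case the leading 0 of California's state code (06)
    # was lost when the value was parsed as a number.
    return str(block_geoid).strip().zfill(15)[:11]


# Made-up block-level broadband rows (placeholder values, not SANDAG data).
blocks = pd.DataFrame({
    "block_geoid": ["060730001001000", "060730001001001", "060730002011000"],
    "providers": [3, 5, 2],
})
blocks["tract_geoid"] = blocks["block_geoid"].map(block_to_tract)

# Collapse blocks to one row per tract; the mean is only one possible summary.
tract_broadband = blocks.groupby("tract_geoid", as_index=False)["providers"].mean()

# Made-up tract-level computer ownership rows.
ownership = pd.DataFrame({
    "tract_geoid": ["06073000100", "06073000201"],
    "households_with_computer": [1200, 950],
})

# A left merge keeps every ownership tract even if broadband data is missing.
merged = ownership.merge(tract_broadband, on="tract_geoid", how="left")
print(merged)
```

Keeping the GEOIDs as strings throughout avoids losing the leading zero, which would otherwise cause the merge keys to mismatch silently.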

In [3]:
# Reading all CSV files into GeoDataFrames
employment_centers = gpd.read_file('Employment_Centers_V2_Landing_Page_Data_20240226.csv') # no census tract field
computer_ownership = gpd.read_file('SDG_Indicator_5.b.1_Data_-_Computer_Ownership_20240226.csv') # has census tract field
employee_resident_survey = gpd.read_file('Telework_-_SANDAG_Employee_Resident_Study_2023_20240226.csv') # no census tract field
unemployment = gpd.read_file('Unemployment_20240226.csv') # no census tract field
business_tax_certificates = gpd.read_file('sd_businesses_active_datasd.csv') # no census tract field

broadband1 = gpd.read_file('Digital_Divide_-_Broadband_Cost_with_Coastline_20240307.csv') # no census tract field
broadband2 = gpd.read_file('Digital_Divide_-_Broadband_Providers_Meeting_Federal_Threshold_20240307.csv') # keyed by census block
broadband3 = gpd.read_file('Digital_Divide_-_Fixed_Broadband_Meets_Federal_Threshold_20240307.csv') # keyed by census block
broadband4 = gpd.read_file('Digital_Divide_-_Household_Broadband_Adoption__Low-Income_Households_20240307.csv') # no census tract field
broadband5 = gpd.read_file('Digital_Divide_-_Household_Broadband_Adoption__Minority_Populations_20240307.csv') # no census tract field
broadband6 = gpd.read_file('Digital_Divide_2.0_Scoring_Data_20240307.csv') # keyed by tract and GEOID
broadband7 = gpd.read_file('Digital_Divide_-_Household_Broadband_Adoption__Seniors_20240308.csv') # no census tract field

county_borders = gpd.read_file('san_diego_boundary_datasd.shx') # may not be needed

In [4]:
# Geoenrichment: unemployment data will be retrieved from ArcGIS

### [7.](#Table-of-Contents) Data cleaning you have done

In [8]:
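# Drop the empty geometry column that read_file can add to CSV-based tables;
# errors='ignore' keeps each call safe if the column is absent
employment_centers = employment_centers.drop(columns=['geometry'], errors='ignore')
computer_ownership = computer_ownership.drop(columns=['geometry'], errors='ignore')
employee_resident_survey = employee_resident_survey.drop(columns=['geometry'], errors='ignore')
unemployment = unemployment.drop(columns=['geometry'], errors='ignore')
business_tax_certificates = business_tax_certificates.drop(columns=['geometry'], errors='ignore')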
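
A likely follow-up cleaning step, sketched here under assumptions: GDAL's CSV driver, which `gpd.read_file` relies on, reads every field as text by default, so count and rate columns would need converting to numbers before any statistics are computed. The helper name `to_numeric_columns`, the column names, and the sample values below are hypothetical and do not come from the project files.

```python
import pandas as pd


def to_numeric_columns(df: pd.DataFrame, columns: list[str]) -> pd.DataFrame:
    """Return a copy of df with the given text columns converted to numbers."""
    out = df.copy()
    for col in columns:
        # Remove thousands separators and stray whitespace before parsing;
        # values that still cannot be parsed become NaN instead of raising.
        cleaned = out[col].astype(str).str.replace(",", "", regex=False).str.strip()
        out[col] = pd.to_numeric(cleaned, errors="coerce")
    return out


# Made-up rows standing in for a table that was read from CSV as text.
raw = pd.DataFrame({
    "tract": ["100", "201", "202"],
    "households": ["1,200", " 950", "n/a"],
})
clean = to_numeric_columns(raw, ["households"])
print(clean.dtypes)
```

Identifier columns such as tract codes are deliberately left as text so that leading zeros are preserved for later merges.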

### [8.](#Table-of-Contents) Descriptive statistics for the data

### [9.](#Table-of-Contents) Analysis

### [10.](#Table-of-Contents) Summary of products and results

### [11.](#Table-of-Contents) Discussion 

### [12.](#Table-of-Contents) Conclusions and future work

##### Citations 
- Warth, LA Times: https://www.latimes.com/california/story/2023-02-07/downtown-homeless-population-reaches-another-high
- The Coronado Times: https://coronadotimes.com/news/2023/08/29/is-addiction-driving-the-homeless-epidemic-in-san-diego-2/