As a professional chef coming into Data Science; my main goal for my career is to fuse my two passions in order to aid in tackling food accessbility and sustanibilty. My first step towards that journey is with this Module 3 project. I've downloaded data that includes information on farmers markets in the United States as well as financial and population information for counties in the U.S.
My goal with the project is to craft and hone into my own personal work flow; while also starting to get into the mind set that will allow me to succeed in moving on towards tackling deeper issues in this broad category of food sustainability.
The original data set can be found on Kaggle here.
List of Files:
| farmers_markets_from_usda.csv
| county_info.csv
Question 1: {Does product variety correlate to alternative payment option?}
| farmers_markets_from_usda.csv
When cleaning this dataframe; in the interest of time I chose to drop zipcodes, social media, and seasonal information.
While the distribution of product availbility appears relatively normal, I was curious as to whether or not that was an indicator that a market accepted assisted forms of payment outside of cash or credit.
After cleaning and visualizing, I found that typically a market with a higher product variety does tend to accept more forms of alternative payement.
Most interestingly though; I found that the presence of a website was actually a better indicator of a market accepting alternative payment!
After cleaning and exploring the data, I found that typically a market that offers a higher variety of products does have a higher tendency to accept alternative payment forms.
Question 2: {What is the market availability by State?}
| farmers_markets_from_usda.csv
| county_info.csv
To start, I broke down all of the states by count of farmers market in descending order.
I then followed by doing the same method, except by mean product availbility by state:
While New York and California have the highest number of Farmers Markets; Oregon, Washington, Florida, Vermont, and Arizona have the highest mean product count for their markets.
Question 3: {How do urban areas compare to rural areas in terms of accessbility (both to the product itself and to alternative payment methods)?}
| farmers_markets_from_usda.csv
| county_info.csv
It seems at first glance that higher populated states and urban areas have more access:
However, I did not have enough time to fully develop the answers to this question and look forward to coming back in the future to finish up some of the more in-depth analysis.
More future work needs to be done in order to appropriately determine the answer to this question.
- Utilize Gradient Boosting
- Break information down into smaller areas and display in heat maps
- Look into grocery store versus convenience store availability
- Align political parties and counties to availability
- Investigate seasonal distribution and effect
- Start investigating the farmers side of the issue