Skip to content

kailastone/dsc-mod-3-project-v2-1-onl01-dtsc-ft-041320

 
 

Repository files navigation

Farmers Markets in the United States:

Prediciting Markets by County

Medium Blog Post

Presentation PDF

Video Walkthrough

Purpose

As a professional chef coming into Data Science; my main goal for my career is to fuse my two passions in order to aid in tackling food accessbility and sustanibilty. My first step towards that journey is with this Module 3 project. I've downloaded data that includes information on farmers markets in the United States as well as financial and population information for counties in the U.S.

My goal with the project is to craft and hone into my own personal work flow; while also starting to get into the mind set that will allow me to succeed in moving on towards tackling deeper issues in this broad category of food sustainability.

The original data set can be found on Kaggle here.

Data Description

List of Files:
| farmers_markets_from_usda.csv
| county_info.csv

Main Questions:

Question 1 Notebook Link

Question 1: {Does product variety correlate to alternative payment option?}

Tables Used:

| farmers_markets_from_usda.csv

EDA

When cleaning this dataframe; in the interest of time I chose to drop zipcodes, social media, and seasonal information.

While the distribution of product availbility appears relatively normal, I was curious as to whether or not that was an indicator that a market accepted assisted forms of payment outside of cash or credit.

After cleaning and visualizing, I found that typically a market with a higher product variety does tend to accept more forms of alternative payement.

Most interestingly though; I found that the presence of a website was actually a better indicator of a market accepting alternative payment!

Conclusion

After cleaning and exploring the data, I found that typically a market that offers a higher variety of products does have a higher tendency to accept alternative payment forms.

Question 2 Notebook Link

Question 2: {What is the market availability by State?}

Tables Used:

| farmers_markets_from_usda.csv
| county_info.csv

EDA

To start, I broke down all of the states by count of farmers market in descending order.

I then followed by doing the same method, except by mean product availbility by state:

Conclusion

While New York and California have the highest number of Farmers Markets; Oregon, Washington, Florida, Vermont, and Arizona have the highest mean product count for their markets.

Question 3 Notebook Link

Question 3: {How do urban areas compare to rural areas in terms of accessbility (both to the product itself and to alternative payment methods)?}

Tables Used:

| farmers_markets_from_usda.csv
| county_info.csv

EDA

It seems at first glance that higher populated states and urban areas have more access:

However, I did not have enough time to fully develop the answers to this question and look forward to coming back in the future to finish up some of the more in-depth analysis.

Conclusion

More future work needs to be done in order to appropriately determine the answer to this question.

Wrap Up

Future Work

  • Utilize Gradient Boosting
  • Break information down into smaller areas and display in heat maps
  • Look into grocery store versus convenience store availability
  • Align political parties and counties to availability
  • Investigate seasonal distribution and effect
  • Start investigating the farmers side of the issue

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%