# Project title: Where to sell your organic food production on the Paris marketplaces?

## Introduction:

This project aims to give useful insights into the marketplaces of inner Paris so that small and medium-sized farmers can choose the best location to sell organic products directly to customers.

<br>
Note: this project can also help tourists discover the different kinds of food marketplaces and their surroundings, and so experience the Parisian way of life when visiting the city.

![alt text](https://upload.wikimedia.org/wikipedia/commons/7/7e/Market%2C_Mouffetard%2C_Paris_15_October_2009.jpg)

_Mouffetard Market, Paris 15 October 2009: image under creativecommons.org_

## Business Problem:

More and more small and medium-sized farmers (including newcomers to farming) are deciding to grow organic food and target “conscious” customers.
<br>However, they may face various difficulties, in particular:
<br> - Despite the current trend for organic food, it is hard to reach, and then deliver products to, the customers who are willing to pay more for this kind of food.
<br> - Retailers (both large chains and organic food specialists) exist and account for the majority of the market, but business relations with them may be complicated because of their negotiating power and the quantities they require.

__Marketplaces can be the solution for small/medium farmers.__
<br>These places have several interesting characteristics:
<br> - They are generally in public areas within cities where traders/farmers can directly display and sell their products to customers for a short period of time (a few hours, once or a few times a week)
<br> - They are meant to be cheap for sellers (who pay a fee to the public or private organizer to rent a spot)
<br> - Most marketplaces are food related, but there are dedicated marketplaces for other products such as clothes, books or even birds
<br> - They offer good visibility
<br> - They are friendly places and part of the French tradition
<br>
<br>
__Marketplaces for food are therefore a very efficient channel for small/medium farmers to promote their farm and production processes, and to sell their products directly, hand to hand.__

However, there are 94 marketplaces spread across inner Paris (the 20 city districts only, not counting the suburbs or “banlieue”). This is a lot of data, and it is hard for a newcomer to gain insight into the features of the marketplaces and, more importantly, of their surroundings (district, most common venues…). Here we want to target “conscious” customers who are willing to pay for organic products.

## Data description to solve the Business Problem:

The city hall of Paris provides a dataset of all the marketplaces in inner Paris and its suburbs at the following link:
https://opendata.paris.fr/explore/dataset/liste_des_marches_de_quartier_a_paris/table/

This dataset needs to be prepared and modified before further analysis, but it includes all of the information needed for the project (city district, name and type of the marketplace, address, latitude, longitude…).

We will use Python and its well-known data analysis library, pandas, to prepare the data. This first step precedes the following analysis:
<br> - First, we will leverage the Foursquare API to identify and explore the top venues close to each marketplace in inner Paris.
<br> - Then, we will group the marketplaces into clusters using the k-means clustering algorithm from Scikit-learn, a library dedicated to machine learning.
<br> - Finally, we will use the Folium library to show on a map all of the 83 marketplaces dedicated to food and the cluster each one belongs to.

To give an example, the raw table from opendata.paris.fr before data preparation looks like the following:

```python
import pandas as pd

paris_marketplaces_data = pd.read_csv("liste_des_marches_de_quartier_a_paris.csv", delimiter=";")
paris_marketplaces_data.head()
```

| | Type | Société gestionnaire | Marché | Arrondissement | Localisation | Adresse Complète (POI approchant) | Lundi | Mardi | Mercredi | Jeudi | Vendredi | Samedi | Dimanche | Jour férié | geo_coordinates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Alimentaire découvert | Dadoun | AGUESSEAU | 75008 | Place de la Madeleine, Bd Malherbes | Place de la Madeleine 75008 Paris, France | | 07h00 à 14h30 | | | 07h00 à 14h30 | | | | 48.8708, 2.325134 |
| 1 | Alimentaire découvert | Dadoun | AMIRAL BRUIX | 75016 | Bd de l'Amiral Bruix entre les rues Weber et M... | Bd de l'Amiral Bruix 75016 Paris, France | | | 07h00 à 14h30 | | | 07h00 à 15h00 | | | 48.873657, 2.277084 |
| 2 | Alimentaire découvert | Dadoun | AUTEUIL | 75016 | Place Jean Lorrain | Place Jean Lorrain 75016 Paris, France | | | 07h00 à 14h30 | | | 07h00 à 15h00 | | | 48.848042, 2.264235 |
| 3 | Alimentaire découvert | Dadoun | BARBES | 75018 | Bd de la Chapelle entre rues Guy Patin et Maub... | Bd de la Chapelle 75018 Paris, France | | | 07h00 à 14h30 | | | 07h00 à 15h00 | | | 48.884376, 2.357139 |
| 4 | Alimentaire découvert | Dadoun | BERTHIER | 75017 | Bd de Reims, le long du square A. Ulmann | Bd de Reims 75017 Paris, France | | | 07h00 à 14h30 | | | 07h00 à 15h00 | | | 48.88971, 2.29816 |


## Project Disclaimer:

This project is intended for educational and information purposes only. It is a first study that does not take into account all of the needed parameters for a complete business decision.

Potential sellers/farmers would need to carry out further investigations before renting a spot in a particular marketplace (e.g. study of the competitors present and their prices, schedule of the marketplace, cost of renting a spot…).

## Methodology section

We will describe here the main steps followed when coding the project.

**The plan falls into 8 parts:**
1. Introduction (Note + Importation of the needed libraries)
2. Data Preparation
3. Geographical representation of the data (marketplaces)
4. Foursquare Credentials and Version
5. Data exploration
6. Data (Marketplaces) analysis
7. Cluster of the marketplaces in Paris (5 clusters)
8. Clusters examination


**Introduction (Note + Importation of the needed libraries)**
We start by noting that several parts of the code have been reused from the Coursera IBM Data Science Professional Certificate.
<br>We then import the needed Python libraries (mainly pandas, NumPy, json, matplotlib, sklearn and folium).


**Data Preparation**
During this stage we pre-process the raw data, downloaded from opendata.paris.fr, which covers all of the marketplaces in inner Paris and its suburbs.
<br>The main operations are: translating the key labels, adding relevant columns/information, restricting the city districts to inner Paris only (the 20 city districts), keeping only food-related marketplaces, and converting columns to the right types.
<br>This preliminary step is needed so that the data is accurate and presented in a form that supports the further steps described below.
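
The preparation steps listed above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the English labels (`Marketplace`, `Postal Code`, `District`), the two hypothetical rows, the postal-code range used for inner Paris and the "type starts with `Alimentaire`" rule for food marketplaces are all assumptions made for the example.

```python
import pandas as pd

# Small sample shaped like the raw table. The first two rows reuse values from
# the raw table above; the last two rows are hypothetical, added only to
# exercise the filters.
raw = pd.DataFrame({
    "Type": ["Alimentaire découvert", "Alimentaire découvert",
             "Oiseaux", "Alimentaire découvert"],
    "Marché": ["AGUESSEAU", "AUTEUIL",
               "HYPOTHETICAL BIRD MARKET", "HYPOTHETICAL SUBURB MARKET"],
    "Arrondissement": [75008, 75016, 75004, 93100],
    "geo_coordinates": ["48.8708, 2.325134", "48.848042, 2.264235",
                        "48.8550, 2.3470", "48.8600, 2.4400"],
})


def prepare_marketplaces(raw):
    # Translate the key labels (the English names are an assumption).
    df = raw.rename(columns={"Marché": "Marketplace",
                             "Arrondissement": "Postal Code"})

    # Keep inner Paris only (assumed here to mean postal codes 75001-75020)
    # and food marketplaces only (assumed: type starts with "Alimentaire").
    is_inner_paris = df["Postal Code"].between(75001, 75020)
    is_food = df["Type"].str.startswith("Alimentaire")
    df = df[is_inner_paris & is_food].copy()

    # Split "lat, lon" strings into two numeric columns.
    coords = df["geo_coordinates"].str.split(",", expand=True)
    df["Latitude"] = pd.to_numeric(coords[0].str.strip())
    df["Longitude"] = pd.to_numeric(coords[1].str.strip())

    # Add the district number as an extra column (e.g. 75016 -> 16).
    df["District"] = df["Postal Code"] - 75000

    return df.drop(columns="geo_coordinates").reset_index(drop=True)


markets = prepare_marketplaces(raw)
print(markets)
```

With this sample, only the two real rows survive the filters, and their coordinates become numeric columns that can later be passed to a mapping library.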


**Geographical representation of the data (marketplaces)**
Here we use the Folium library to show on a map all of the 83 marketplaces dedicated to food. It gives a first overview of their location and dispersion.
<br>It also lets us check whether our final dataset is consistent.
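
As a rough sketch of this mapping step, the snippet below centers a Folium map on the mean position of the marketplaces and adds one circle marker per marketplace. The column names (`Marketplace`, `Latitude`, `Longitude`), the helper names and the zoom level are assumptions for illustration; the three coordinates are taken from the raw table shown earlier.

```python
import pandas as pd

# Hypothetical prepared table: one row per food marketplace.
markets = pd.DataFrame({
    "Marketplace": ["AGUESSEAU", "AUTEUIL", "BARBES"],
    "Latitude": [48.8708, 48.848042, 48.884376],
    "Longitude": [2.325134, 2.264235, 2.357139],
})


def map_center(df):
    """Return [lat, lon] of the mean position of the marketplaces."""
    return [df["Latitude"].mean(), df["Longitude"].mean()]


def build_map(df, zoom_start=12):
    """Build a Folium map with one circle marker per marketplace."""
    # Imported here so that map_center() stays usable without folium installed.
    import folium

    fmap = folium.Map(location=map_center(df), zoom_start=zoom_start)
    for row in df.itertuples(index=False):
        folium.CircleMarker(
            location=[row.Latitude, row.Longitude],
            radius=5,
            popup=row.Marketplace,
            fill=True,
            fill_opacity=0.7,
        ).add_to(fmap)
    return fmap


center = map_center(markets)
print(center)
```

Calling `build_map(markets).save("paris_marketplaces.html")` (the file name is arbitrary) would write an interactive map. A marker that lands far outside Paris is a quick visual hint that a coordinate was parsed incorrectly.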


**Foursquare Credentials and Version**
This step registers our credentials and API version, which are required to authenticate with and then use the Foursquare API.


**Data exploration**
We limit the number of venues returned by the Foursquare API to 10 so that the program executes properly. As the center of Paris is both small and very dense, we limit the search radius to 500 meters around each marketplace.
<br>We use the Explore feature of the Foursquare API to find the most popular venues. It is a convenient way to get interesting insights into the marketplaces and, more importantly, their surroundings, since it identifies what is special or trending, which spots are recommended, etc.
<br>We then get a new dataset (paris_venues) that lists, for each marketplace, the top venues (e.g. "Café Pouchkine") and their venue category (e.g. "Dessert Shop"). For the vast majority of the marketplaces we get 10 venues as requested, which suggests that we have enough data for a consistent analysis. We also note that there are 147 unique categories.
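
The request and parsing logic could look roughly like the sketch below. It assumes the legacy Foursquare v2 `venues/explore` endpoint and its usual response layout (`response.groups[0].items[*].venue`); Foursquare has since revised its API, so the endpoint, parameter names and response shape should be treated as assumptions. The credentials, the version string, the helper names and the sample payload are placeholders, and no network call is made.

```python
from urllib.parse import urlencode

# Assumed legacy endpoint; check the current Foursquare documentation before use.
EXPLORE_ENDPOINT = "https://api.foursquare.com/v2/venues/explore"


def build_explore_url(lat, lng, client_id, client_secret, version,
                      radius=500, limit=10):
    """Build the explore URL for one marketplace (500 m radius, 10 venues)."""
    params = {
        "client_id": client_id,
        "client_secret": client_secret,
        "v": version,
        "ll": f"{lat},{lng}",
        "radius": radius,
        "limit": limit,
    }
    return f"{EXPLORE_ENDPOINT}?{urlencode(params)}"


def extract_venues(payload, marketplace):
    """Turn one explore response into (marketplace, venue, category) rows."""
    items = payload["response"]["groups"][0]["items"]
    return [
        (marketplace,
         item["venue"]["name"],
         item["venue"]["categories"][0]["name"])
        for item in items
    ]


# Hand-written payload mimicking the assumed response shape; the pairing of
# this venue with this marketplace is illustrative only.
sample_payload = {"response": {"groups": [{"items": [
    {"venue": {"name": "Café Pouchkine",
               "categories": [{"name": "Dessert Shop"}]}},
]}]}}

url = build_explore_url(48.8708, 2.325134, "MY_ID", "MY_SECRET", "20180605")
rows = extract_venues(sample_payload, "AGUESSEAU")
print(url)
print(rows)
```

In a real run, the URL would be fetched once per marketplace (for example with `requests.get(url).json()`) and the resulting rows concatenated into a table such as `paris_venues`.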


**Data (Marketplaces) analysis**
Here we analyze each marketplace. To do so, we create a new dataframe with dummy variables corresponding to the venue categories. In a second dataframe, we then group rows by marketplace and take the mean of the frequency of occurrence of each category. We end up with a dataframe with 81 rows and 148 columns.
<br>We then print each marketplace along with its top 5 most common venues: this gives us a report by marketplace displaying the top venue categories and their frequencies.
<br>For better data description/visualization, we finally create a last dataframe (Marketplace_venues_sorted) that displays the top 10 venues in columns, with one marketplace per row.
<br>The conclusion of this part is that the results are hard to analyze. Indeed, most of the venue categories are related to food (e.g. "French Restaurant", "Italian Restaurant", "Japanese Restaurant"...) or tourist activities (e.g. "Park", "Cultural Center"...).
<br>Yet we can notice the presence of venue categories that are interesting for a potential business decision, such as "Farmers Market" or "Vegetarian / Vegan Restaurant".
<br>For a better understanding of the results, we need to cluster the marketplaces.
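
The dummy-variable and grouping steps can be illustrated on a tiny, made-up venue table. The column names and the `top_categories` helper are assumptions for the example, and only two top categories are kept instead of five or ten to keep the output small.

```python
import pandas as pd

# Made-up venue table: one row per (marketplace, venue) pair.
paris_venues = pd.DataFrame({
    "Marketplace": ["AGUESSEAU", "AGUESSEAU", "AGUESSEAU",
                    "AUTEUIL", "AUTEUIL"],
    "Venue Category": ["French Restaurant", "French Restaurant",
                       "Dessert Shop", "Bakery", "French Restaurant"],
})

# One dummy (0/1) column per venue category.
onehot = pd.get_dummies(paris_venues["Venue Category"], dtype=float)
onehot.insert(0, "Marketplace", paris_venues["Marketplace"])

# Mean of the dummies per marketplace = frequency of each category.
grouped = onehot.groupby("Marketplace").mean().reset_index()


def top_categories(row, n=2):
    """Return the n most frequent venue categories for one marketplace row."""
    freqs = row.drop("Marketplace").astype(float).sort_values(ascending=False)
    return list(freqs.index[:n])


top = {row["Marketplace"]: top_categories(row)
       for _, row in grouped.iterrows()}
print(grouped)
print(top)
```

The grouped table has one column for the marketplace name plus one per category, which is presumably why 147 categories lead to 148 columns in the project's dataframe.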


**Cluster of the marketplaces in Paris (5 clusters)**
To cluster the marketplaces based on their top 10 venue categories, we use the k-means machine learning algorithm.
<br>Using the sklearn library, we run k-means to cluster the marketplaces into 5 clusters.
<br>Finally, we again use the Folium library to show on a map all of the 83 marketplaces dedicated to food and the cluster each one belongs to.
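
A minimal sketch of the clustering step with scikit-learn is given below. The frequency table is made up, and it uses 2 clusters rather than 5 only because the toy table has four rows; `random_state` and `n_init` are set explicitly so that the result is reproducible, which is a choice made for this example.

```python
import pandas as pd
from sklearn.cluster import KMeans

# Made-up category-frequency table: one row per marketplace, each row sums to 1.
grouped = pd.DataFrame({
    "Marketplace": ["A", "B", "C", "D"],
    "French Restaurant": [0.6, 0.5, 0.0, 0.1],
    "Bakery": [0.3, 0.4, 0.0, 0.0],
    "Art Gallery": [0.1, 0.1, 1.0, 0.9],
})

# k-means works on the numeric frequencies only, so drop the name column.
features = grouped.drop(columns="Marketplace")

# The project uses n_clusters=5; 2 is used here because of the tiny sample.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(features)

# Attach the cluster label of each marketplace to the table.
grouped["Cluster Label"] = kmeans.labels_
print(grouped[["Marketplace", "Cluster Label"]])
```

The food-oriented marketplaces A and B end up in one cluster, and the gallery-oriented C and D in the other. The label numbers themselves are arbitrary, so only the grouping carries meaning.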


**Clusters examination**
This part will be discussed in the "Results section" below.

## Results section

The whole analysis led to 2 results:

<br>**The map of the marketplaces dedicated to food only** for the 20 city districts (inner Paris). It lets us make some observations that sellers may interpret differently but that are worth pointing out:
<br>- transportation facilities differ from one marketplace to another
<br>- some marketplaces are far from the centre

<br>**5 clusters** presenting various venue categories that give some valuable insight into the customer population and so, we hope, into their likelihood of coming to the marketplace and buying organic food:
<br>- **cluster 1:** we can call this cluster **"Various foods and drinks"**. This cluster of 14 marketplaces presents a variety of results, mostly around food & drinks but without much consistency at first sight (e.g. "Bistro", "Restaurant", "Wine Bar"...).
<br>- **cluster 2:** we can call this cluster **"Shopping and other tourist activities"**. Contrary to cluster 1, it has far fewer categories related to food and mostly contains shopping and tourist activities such as "Plaza", "Stadium", "Historic Site", "Art Gallery"... This cluster is composed of 21 marketplaces.
<br>- **cluster 3:** we can call this cluster **"Worldwide cuisine"** as it is composed mostly of food categories with many different labels such as "Argentinian Restaurant", "Creperie", "Chinese Restaurant"... There is no real predominance of French food categories within it. This cluster is composed of 23 marketplaces.
<br>- **cluster 4:** as opposed to cluster 3, we can call this cluster **"French cuisine"** as the first category is always "French Restaurant". It is also interesting to note that we find the following categories: "Bakery", "Gourmet Shop", "Seafood Restaurant", and "Farmers Market". This cluster is composed of 17 marketplaces.
<br>- **cluster 5:** we can call this cluster **"Vegetarian / Vegan"** as we find the "Vegetarian / Vegan Restaurant" category in it. Yet it is quite hard to observe much consistency in it, as various other category labels are present such as "Bookstore", "Gym center" or "Gift Shop". This cluster is composed of 8 marketplaces.

As a result, we would recommend that a farmer **focus on customers who favor local / French and vegetarian / vegan food.**

<br>We therefore consider **clusters 4 and 5** the most promising.
<br>It is probably no coincidence that 4 of the 8 marketplaces in cluster 5 are located in the 16th city district, which is more residential.

<br>Within these 2 clusters, we would particularly recommend the following marketplaces:
<br>-**Batignolles (cluster 4)** where French restaurants and "Farmers Market" stand out.
<br>-**Popincourt and Bastille (cluster 5)** where French restaurants as well as "Vegetarian / Vegan Restaurant" stand out.

<br>Of course, any seller can and should use the detailed results and categories to make up their own mind.

## Discussion section

As stated in the disclaimer part, potential sellers/farmers would need to carry out further investigations before renting a spot in a particular marketplace.

<br>In particular, the following points would need to be taken into account:
<br>- competitors present and their products + prices
<br>- schedule of the marketplace
<br>- cost of renting a spot
<br>- identification of the closest metro station
<br>- customer marketing study of the neighborhood/city district
<br>- transportation facilities such as roads/motorways to bring the products and park the truck/car
<br>- competition from specialized organic food stores near the marketplace

## Conclusion section

This project gave a first overview of the food marketplaces in the inner center of Paris.

<br>As stated previously, many parameters/features need to be added to the analysis for a business decision.

<br>Nonetheless, we hope that the results may help future farmers to sell their products in Paris thanks to the map provided and the detailed description of the top venues by marketplace.