# Eat Safe, Love

## Part 1: Database and Jupyter Notebook Set Up

Import the data provided in the `establishments.json` file from your Terminal. Name the database `uk_food` and the collection `establishments`.

* In the terminal, from the base environment:
    * `conda activate PythonData`
    * `brew services start mongodb-community@6.0`
    * `mongosh`

* The shell is now connected to MongoDB at the `test>` prompt. Create the new database and switch to it:
    * `use uk_food`
        * Example: test> use uk_food

* Create collection:
    * `db.createCollection("establishments")`
        * Example: uk_food> db.createCollection("establishments")

* In a separate terminal, navigate to the Resources folder where the `establishments.json` file is stored and run:
    * `mongoimport --type json -d uk_food -c establishments --drop --jsonArray establishments.json`
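
The import step above can also be scripted. The following is a minimal sketch, assuming `mongoimport` is on the `PATH` and a MongoDB server is running locally; the helper name `build_mongoimport_command` is hypothetical and only assembles the same arguments shown in the bullet.

```python
import shlex


def build_mongoimport_command(json_path, database="uk_food", collection="establishments"):
    """Return the mongoimport argument list used in the step above."""
    return [
        "mongoimport",
        "--type", "json",
        "-d", database,
        "-c", collection,
        "--drop",       # drop the target collection before importing
        "--jsonArray",  # the file holds one JSON array of documents
        json_path,
    ]


command = build_mongoimport_command("establishments.json")
print(shlex.join(command))

# To run the import from Python (needs mongoimport and a running server):
# import subprocess
# subprocess.run(command, check=True)
```

Passing the arguments as a list avoids shell quoting issues if the file path contains spaces.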

* Verify the contents of the imported data in MongoDB:
    * Run `mongosh` from the Terminal to launch the MongoDB shell.
    * Switch to the `uk_food` database created by the import: `use uk_food`

* Find one entry in the establishments collection to ensure imported:
    * `db.establishments.findOne()`
        * Example: uk_food> db.establishments.findOne()

In [2]:
# Import dependencies
import pymongo
from pymongo import MongoClient
from pprint import pprint

In [7]:
# Create an instance of MongoClient
mongo = MongoClient(port=27017)

In [8]:
# Confirm that new database was created
print(mongo.list_database_names())

['admin', 'classDB', 'config', 'epa', 'fruits_db', 'local', 'petsitly_marketing', 'uk_food']


In [9]:
# Assign the database to a variable name
uk_food_db = mongo['uk_food']

In [10]:
# Review the collections in new database
print(uk_food_db.list_collection_names())

['establishments']


In [12]:
# Review a document in the establishments collection
pprint(uk_food_db.establishments.find_one())

{'AddressLine1': 'Wear Bay Road',
 'AddressLine2': 'Folkestone',
 'AddressLine3': 'Kent',
 'AddressLine4': '',
 'BusinessName': 'Wear Bay Bowls Club',
 'BusinessType': 'Pub/bar/nightclub',
 'BusinessTypeID': 7843,
 'ChangesByServerID': 0,
 'Distance': 4591.821311183521,
 'FHRSID': 647177,
 'LocalAuthorityBusinessID': 'PI/000041489',
 'LocalAuthorityCode': '188',
 'LocalAuthorityEmailAddress': 'foodteam@folkestone-hythe.gov.uk',
 'LocalAuthorityName': 'Folkestone and Hythe',
 'LocalAuthorityWebSite': 'http://www.folkestone-hythe.gov.uk',
 'NewRatingPending': False,
 'Phone': '',
 'PostCode': 'CT19 6PY',
 'RatingDate': '2014-03-31T00:00:00',
 'RatingKey': 'fhrs_4_en-gb',
 'RatingValue': '4',
 'RightToReply': '',
 'SchemeType': 'FHRS',
 '_id': ObjectId('64162631a42d2371774d5162'),
 'geocode': {'latitude': 51.086058, 'longitude': 1.196408},
 'links': [{'href': 'https://api.ratings.food.gov.uk/establishments/647177',
            'rel': 'self'}],
 'meta': {'dataSource': None,
          'extr

In [13]:
# Assign the collection to a variable
establishments = uk_food_db['establishments']

## Part 2: Update the Database

1. An exciting new halal restaurant just opened in Greenwich, but hasn't been rated yet. The magazine has asked you to include it in your analysis. Add the following restaurant "Penang Flavours" to the database.

In [34]:
# Create a dictionary for the new restaurant data
new_restaurant = {
    "BusinessName":"Penang Flavours",
    "BusinessType":"Restaurant/Cafe/Canteen",
    "BusinessTypeID":"",
    "AddressLine1":"Penang Flavours",
    "AddressLine2":"146A Plumstead Rd",
    "AddressLine3":"London",
    "AddressLine4":"",
    "PostCode":"SE18 7DY",
    "Phone":"",
    "LocalAuthorityCode":"511",
    "LocalAuthorityName":"Greenwich",
    "LocalAuthorityWebSite":"http://www.royalgreenwich.gov.uk",
    "LocalAuthorityEmailAddress":"health@royalgreenwich.gov.uk",
    "scores":{
        "Hygiene":"",
        "Structural":"",
        "ConfidenceInManagement":""
    },
    "SchemeType":"FHRS",
    "geocode":{
        "longitude":"0.08384000",
        "latitude":"51.49014200"
    },
    "RightToReply":"",
    "Distance":4623.9723280747176,
    "NewRatingPending":True
}



In [35]:
# Insert new restaurant into the collection
establishments.insert_one(new_restaurant)

<pymongo.results.InsertOneResult at 0x7fc8fa99abb0>

In [56]:
# Check to see if new restaurant was inserted into the collection
query_new_restaurant = {"BusinessName": "Penang Flavours"}
results = establishments.find(query_new_restaurant)

for result in results:
    pprint(result)

{'AddressLine1': 'Penang Flavours',
 'AddressLine2': '146A Plumstead Rd',
 'AddressLine3': 'London',
 'AddressLine4': '',
 'BusinessName': 'Penang Flavours',
 'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 'Distance': 4623.972328074718,
 'LocalAuthorityCode': '511',
 'LocalAuthorityEmailAddress': 'health@royalgreenwich.gov.uk',
 'LocalAuthorityName': 'Greenwich',
 'LocalAuthorityWebSite': 'http://www.royalgreenwich.gov.uk',
 'NewRatingPending': True,
 'Phone': '',
 'PostCode': 'SE18 7DY',
 'RightToReply': '',
 'SchemeType': 'FHRS',
 '_id': ObjectId('641627626a75abd4cfd84d89'),
 'geocode': {'latitude': 51.490142, 'longitude': 0.08384},
 'scores': {'ConfidenceInManagement': '', 'Hygiene': '', 'Structural': ''}}
{'AddressLine1': 'Penang Flavours',
 'AddressLine2': '146A Plumstead Rd',
 'AddressLine3': 'London',
 'AddressLine4': '',
 'BusinessName': 'Penang Flavours',
 'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': '',
 'Distance': 4623.972328074718,
 'Loc
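
The output above lists two "Penang Flavours" documents, which suggests the insert cell was run more than once. One way to make the insert safe to re-run is an upsert with `$setOnInsert`. This is a sketch only: the helper name `upsert_arguments` is hypothetical, and it assumes `BusinessName` is unique enough to serve as the lookup key.

```python
def upsert_arguments(document, key="BusinessName"):
    """Build the filter and update for an insert-if-absent upsert."""
    query = {key: document[key]}
    # On insert, MongoDB copies the equality fields from the filter into the
    # new document, so the key field is left out of $setOnInsert.
    fields = {k: v for k, v in document.items() if k != key}
    update = {"$setOnInsert": fields}
    return query, update


# Shortened stand-in for the full new_restaurant dictionary
sample_restaurant = {"BusinessName": "Penang Flavours", "PostCode": "SE18 7DY"}
query, update = upsert_arguments(sample_restaurant)
print(query)
print(update)

# With a live collection (the `establishments` variable from Part 1):
# establishments.update_one(query, update, upsert=True)
```

Re-running the commented `update_one` call leaves an existing document unchanged, because `$setOnInsert` only applies when the upsert creates a new document.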

2. Find the BusinessTypeID for "Restaurant/Cafe/Canteen" and return only the `BusinessTypeID` and `BusinessType` fields.

In [57]:
# Select only the BusinessType and BusinessTypeID fields for "Restaurant/Cafe/Canteen"
query = {'BusinessType': 'Restaurant/Cafe/Canteen'}
fields = {'BusinessType': 1, 'BusinessTypeID': 1}

# Capture the results to a variable
results = establishments.find(query, fields)

# Pretty print the results
for result in results:
    pprint(result)

{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d5169')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d516c')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d5170')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d5173')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d5174')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d5184')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d518f')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d5192')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d

{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d8628')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d8629')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d862b')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d862c')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d862e')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d8632')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d8636')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d863c')}
{'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 '_id': ObjectId('64162631a42d2371774d
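
Printing every matching document produces a very long listing when only the ID value is needed. A shorter alternative is PyMongo's `Collection.distinct`, which returns the unique values of one field for a given filter. The sketch below assumes the `establishments` collection from Part 1; the helper name `distinct_type_ids` is hypothetical.

```python
def distinct_type_ids(collection, business_type):
    """Return the distinct BusinessTypeID values stored for one BusinessType."""
    return collection.distinct("BusinessTypeID", {"BusinessType": business_type})


# With a live collection:
# print(distinct_type_ids(establishments, "Restaurant/Cafe/Canteen"))
```

If more than one value comes back, the collection holds inconsistent IDs for that business type, which is worth checking before updating the new restaurant.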

3. Update the new restaurant with the `BusinessTypeID` you found.

In [38]:
# Found that BusinessTypeID = 1
# Update the new restaurant with the correct BusinessTypeID
establishments.update_one({"BusinessName": "Penang Flavours"},{'$set':{"BusinessTypeID":1}})

<pymongo.results.UpdateResult at 0x7fc8f9fce880>

In [58]:
# Confirm that the new restaurant was updated
updated_new_restaurant = {"BusinessName": "Penang Flavours"}
results = establishments.find(updated_new_restaurant)

for result in results:
    pprint(result)

{'AddressLine1': 'Penang Flavours',
 'AddressLine2': '146A Plumstead Rd',
 'AddressLine3': 'London',
 'AddressLine4': '',
 'BusinessName': 'Penang Flavours',
 'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': 1,
 'Distance': 4623.972328074718,
 'LocalAuthorityCode': '511',
 'LocalAuthorityEmailAddress': 'health@royalgreenwich.gov.uk',
 'LocalAuthorityName': 'Greenwich',
 'LocalAuthorityWebSite': 'http://www.royalgreenwich.gov.uk',
 'NewRatingPending': True,
 'Phone': '',
 'PostCode': 'SE18 7DY',
 'RightToReply': '',
 'SchemeType': 'FHRS',
 '_id': ObjectId('641627626a75abd4cfd84d89'),
 'geocode': {'latitude': 51.490142, 'longitude': 0.08384},
 'scores': {'ConfidenceInManagement': '', 'Hygiene': '', 'Structural': ''}}
{'AddressLine1': 'Penang Flavours',
 'AddressLine2': '146A Plumstead Rd',
 'AddressLine3': 'London',
 'AddressLine4': '',
 'BusinessName': 'Penang Flavours',
 'BusinessType': 'Restaurant/Cafe/Canteen',
 'BusinessTypeID': '',
 'Distance': 4623.972328074718,
 'Loc
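
In the output above, the second "Penang Flavours" document still has an empty `BusinessTypeID`. That is consistent with `update_one`, which modifies at most one matching document. A minimal sketch of the arguments for an `update_many` call that targets only the documents still missing the ID follows; the helper name `missing_type_id_update` is hypothetical.

```python
def missing_type_id_update(name, type_id):
    """Build the filter and update that fill in an empty BusinessTypeID."""
    # Match only documents for this business whose ID is still the empty string
    query = {"BusinessName": name, "BusinessTypeID": ""}
    update = {"$set": {"BusinessTypeID": type_id}}
    return query, update


query, update = missing_type_id_update("Penang Flavours", 1)
print(query)
print(update)

# With a live collection (the `establishments` variable from Part 1):
# result = establishments.update_many(query, update)
# print(result.modified_count)
```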

4. The magazine is not interested in any establishments in Dover, so check how many documents contain the Dover Local Authority. Then, remove any establishments within the Dover Local Authority from the database, and check the number of documents to ensure they were deleted.

In [17]:
# Find how many documents have LocalAuthorityName as "Dover"
dover_query = {"LocalAuthorityName":"Dover"}

# Print the number of results
print("Number of documents containing the 'LocalAuthorityName' as 'Dover' in the result:",
      establishments.count_documents(dover_query))

Number of documents containing the 'LocalAuthorityName' as 'Dover' in the result: 994


In [18]:
# Delete all documents where LocalAuthorityName is "Dover"
dover_query = {"LocalAuthorityName":"Dover"}

establishments.delete_many(dover_query)

<pymongo.results.DeleteResult at 0x7fc8e8853b80>

In [19]:
# Check if any remaining documents include Dover
check_dover_query = {"LocalAuthorityName":"Dover"}

# Print the number of results
print("Number of documents left containing the 'LocalAuthorityName' as 'Dover' in result:",
      establishments.count_documents(check_dover_query))

Number of documents left containing the 'LocalAuthorityName' as 'Dover' in result: 0
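
The count, delete, and re-count steps can be combined, and the `DeleteResult` returned by `delete_many` exposes a `deleted_count` attribute that can be compared against the initial count. This is a sketch under the assumption that `collection` behaves like a PyMongo collection; the helper name `delete_and_report` is hypothetical.

```python
def delete_and_report(collection, query):
    """Delete all documents matching query and report the counts."""
    before = collection.count_documents(query)
    result = collection.delete_many(query)
    remaining = collection.count_documents(query)
    return {
        "before": before,
        "deleted": result.deleted_count,
        "remaining": remaining,
    }


# With a live collection:
# report = delete_and_report(establishments, {"LocalAuthorityName": "Dover"})
# print(report)
```

A successful run should report `deleted` equal to `before` and `remaining` equal to 0.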


In [20]:
# Check that other documents remain with 'find_one'
establishments.find_one()

{'_id': ObjectId('64162631a42d2371774d5162'),
 'FHRSID': 647177,
 'ChangesByServerID': 0,
 'LocalAuthorityBusinessID': 'PI/000041489',
 'BusinessName': 'Wear Bay Bowls Club',
 'BusinessType': 'Pub/bar/nightclub',
 'BusinessTypeID': 7843,
 'AddressLine1': 'Wear Bay Road',
 'AddressLine2': 'Folkestone',
 'AddressLine3': 'Kent',
 'AddressLine4': '',
 'PostCode': 'CT19 6PY',
 'Phone': '',
 'RatingValue': '4',
 'RatingKey': 'fhrs_4_en-gb',
 'RatingDate': '2014-03-31T00:00:00',
 'LocalAuthorityCode': '188',
 'LocalAuthorityName': 'Folkestone and Hythe',
 'LocalAuthorityWebSite': 'http://www.folkestone-hythe.gov.uk',
 'LocalAuthorityEmailAddress': 'foodteam@folkestone-hythe.gov.uk',
 'scores': {'Hygiene': 5, 'Structural': 5, 'ConfidenceInManagement': 10},
 'SchemeType': 'FHRS',
 'geocode': {'longitude': '1.196408', 'latitude': '51.086058'},
 'RightToReply': '',
 'Distance': 4591.821311183521,
 'NewRatingPending': False,
 'meta': {'dataSource': None,
  'extractDate': '0001-01-01T00:00:00',
  '

5. Some of the number values are stored as strings, when they should be stored as numbers. Use `update_many` to convert `latitude` and `longitude` to decimal numbers.

In [21]:
# Change the data type from string to double for longitude and latitude
establishments.update_many({}, [{'$set': {'geocode.longitude': {'$toDouble': '$geocode.longitude'},
                                          'geocode.latitude': {'$toDouble': '$geocode.latitude'}}}])
                                      

<pymongo.results.UpdateResult at 0x7fc8e8853070>
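
`$toDouble` raises an error if a value cannot be converted, for example a non-numeric string. If the data may contain such values, `$convert` with `onError` is a more forgiving variant. The following is a sketch only; the helper name `to_double_pipeline` is hypothetical, and replacing bad values with null is an assumption about the desired behavior.

```python
def to_double_pipeline(fields):
    """Build an update pipeline that converts each listed field to a double."""
    return [{
        "$set": {
            field: {
                "$convert": {
                    "input": f"${field}",
                    "to": "double",
                    "onError": None,  # values that cannot be converted become null
                    "onNull": None,   # null or missing values yield null
                }
            }
            for field in fields
        }
    }]


pipeline = to_double_pipeline(["geocode.longitude", "geocode.latitude"])
print(pipeline)

# With a live collection (the `establishments` variable from Part 1):
# establishments.update_many({}, pipeline)
```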

In [72]:
# Check that the coordinates are now numbers
list_query = [doc for doc in establishments.find({}, ['geocode.longitude', 'geocode.latitude'])]
list_query

[{'_id': ObjectId('64162631a42d2371774d5162'),
  'geocode': {'longitude': 1.196408, 'latitude': 51.086058}},
 {'_id': ObjectId('64162631a42d2371774d5163'),
  'geocode': {'longitude': 1.194762, 'latitude': 51.085797}},
 {'_id': ObjectId('64162631a42d2371774d5166'),
  'geocode': {'longitude': 1.188537, 'latitude': 51.08084}},
 {'_id': ObjectId('64162631a42d2371774d5167'),
  'geocode': {'longitude': 1.188537, 'latitude': 51.08084}},
 {'_id': ObjectId('64162631a42d2371774d5169'),
  'geocode': {'longitude': 1.195625, 'latitude': 51.083812}},
 {'_id': ObjectId('64162631a42d2371774d516b'),
  'geocode': {'longitude': 1.188537, 'latitude': 51.08084}},
 {'_id': ObjectId('64162631a42d2371774d516c'),
  'geocode': {'longitude': 1.18590330311705, 'latitude': 51.0783519967076}},
 {'_id': ObjectId('64162631a42d2371774d516d'),
  'geocode': {'longitude': 1.18590330311705, 'latitude': 51.0783519967076}},
 {'_id': ObjectId('64162631a42d2371774d516e'),
  'geocode': {'longitude': 1.18590330311705, 'latitude

In [78]:
# Now find the type for longitude
type(list_query[0]['geocode']['longitude'])

float

In [79]:
# Now find the type for latitude 
type(list_query[0]['geocode']['latitude'])

float
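
The two checks above inspect only the first document. To cover every document in `list_query`, a small helper can collect the `_id` of any entry whose coordinates are not both floats. This is a sketch; the helper name `non_float_coordinates` is hypothetical, and the sample uses placeholder string IDs in place of real `ObjectId` values.

```python
def non_float_coordinates(docs):
    """Return the _id of every document whose coordinates are not both floats."""
    bad_ids = []
    for doc in docs:
        geocode = doc.get("geocode") or {}
        longitude = geocode.get("longitude")
        latitude = geocode.get("latitude")
        if not (isinstance(longitude, float) and isinstance(latitude, float)):
            bad_ids.append(doc.get("_id"))
    return bad_ids


# Placeholder documents: one converted, one still stored as strings
sample = [
    {"_id": "a", "geocode": {"longitude": 1.196408, "latitude": 51.086058}},
    {"_id": "b", "geocode": {"longitude": "1.194762", "latitude": "51.085797"}},
]
print(non_float_coordinates(sample))  # ['b']

# With the notebook's results:
# print(non_float_coordinates(list_query))
```

An empty list from the last call would indicate that every returned document has float coordinates.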