## Data Exploration
### Simplified Occurrence Dataset

#### Column Variable Info
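- 'id': Unique identifier for each entry (a UUID string)
- 'day': Day of the month of the encounter; missing for some entries
- 'month': Month of the sighting or interaction; missing for some entries
- 'year': Year of the recorded encounter
- 'eventDate': Date of the event at varying precision: yyyy, yyyy-mm, or yyyy-mm-dd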
- 'decimalLatitude': Latitude coordinates of event location (float)
- 'decimalLongitude': Longitude coordinates of event location (float)
- 'footprintWKT': Geospatial footprint in WKT (Well-Known Text) format. The geometry type varies: the first rows include MultiPoint, MultiLineString, and MultiPolygon. Coordinates are written in longitude, latitude order.
- 'bathymetry': Water depth at the location, likely in meters. Values in the first rows are positive, so depth appears to be recorded as positive down.
- 'shoredistance': Distance from shore, likely in meters. Values are unique for most entries.
- 'sss': Likely sea surface salinity
- 'sst': Likely sea surface temperature
- 'occurrenceID': Unique for each entry, but appears to be identical to 'catalogNumber'
- 'associatedMedia': A unique link for each entry, which leads to anavnet and shows what appears to be the mapped location of each sighting/interaction
- 'occurrenceRemarks': Comments about the interaction in Portuguese. Mostly repetitive but could still be helpful.
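
The column notes on 'eventDate' and 'footprintWKT' can be checked with a quick sketch like the one below. It is only an illustration: the `sample` frame is built by hand from the first five rows of the dataset (with the WKT strings left truncated exactly as pandas displays them), and the column names `geom_type` and `date_precision` are hypothetical helpers, not part of the dataset.

```python
import pandas as pd

# Values copied by hand from the first five rows of the dataset.
# The WKT strings are truncated in the display and are kept truncated here;
# that is fine because only the leading geometry keyword is read.
sample = pd.DataFrame({
    "eventDate": ["2022", "2022-09-30", "2022-10", "2022-10", "2022-10"],
    "footprintWKT": [
        "MultiPoint ((-9.04174800000000012 38.289937000...",
        "MultiPoint ((-8.85650000000000048 38.238149999...",
        "MultiLineString ((-8.93188499999999941 37.9311...",
        "MultiPolygon (((-9.01977499999999921 36.908175...",
        "MultiPolygon (((-9.59930400000000006 39.249271...",
    ],
})

# Geometry type = the leading alphabetic keyword of each WKT string.
sample["geom_type"] = sample["footprintWKT"].str.extract(
    r"^\s*([A-Za-z]+)", expand=False
)

# Date precision inferred from the number of '-'-separated parts.
precision = {1: "year", 2: "month", 3: "day"}
sample["date_precision"] = (
    sample["eventDate"].str.split("-").str.len().map(precision)
)

# How often each geometry type occurs in the sample.
geom_counts = sample["geom_type"].value_counts()
print(sample[["eventDate", "date_precision", "geom_type"]])
print(geom_counts)
```

Running the same two derived columns on the full `df` (assuming 'eventDate' is read as text) would show how many entries carry only a year or a year and month, and how many footprints are not simple points.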

In [2]:
# Import Libraries
import pandas as pd
import numpy as np
import plotly as plt

In [4]:
# Load dataset
df = pd.read_csv("OccurrenceDataset_simplified.csv")

In [5]:
df.head()

Unnamed: 0,id,day,month,year,eventDate,decimalLatitude,decimalLongitude,footprintWKT,bathymetry,shoredistance,sss,sst,occurrenceID,associatedMedia,occurrenceRemarks
0,ff3f89a4-518d-4e7c-a2dc-dda812013747,,,2022,2022,38.289937,-9.041748,MultiPoint ((-9.04174800000000012 38.289937000...,743.2,16220.0,35.49,17.1,1020_ANAV_NR_2310/22,https://geoanavnet.hidrografico.pt/coastal-war...,Novos perigos|Animais Marinhos - Interação
1,716b10c8-5520-4577-9969-abd9f15b049c,30.0,9.0,2022,2022-09-30,38.23815,-8.8565,MultiPoint ((-8.85650000000000048 38.238149999...,86.6,7067.0,35.5,17.15,1032_ANAV_NR_2317/22,https://geoanavnet.hidrografico.pt/coastal-war...,Novos perigos|Animais Marinhos - Interação
2,04513443-a748-4db8-97ff-d7feeb05efe6,,10.0,2022,2022-10,38.190013,-9.040319,MultiLineString ((-8.93188499999999941 37.9311...,158.6,22472.0,35.52,17.19,1060_ANAV_NR_2335/22,https://geoanavnet.hidrografico.pt/coastal-war...,Novos perigos|Animais Marinhos - Interação
3,e908ddf8-bfea-416b-bc95-e3014ed35446,,10.0,2022,2022-10,37.022205,-8.73825,MultiPolygon (((-9.01977499999999921 36.908175...,56.0,5857.0,35.79,17.8,1140_ANAV_NR_2385/22,https://geoanavnet.hidrografico.pt/coastal-war...,Requisitos de segurança maritima|Animais Marin...
4,c928f316-1827-4fac-b333-2504a40e8936,,10.0,2022,2022-10,39.362879,-9.441375,MultiPolygon (((-9.59930400000000006 39.249271...,31.2,2824.0,35.27,16.69,1144_ANAV_NR_2387/22,https://geoanavnet.hidrografico.pt/coastal-war...,Requisitos de segurança maritima|Animais Marin...
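
The 'Unnamed: 0' column in this output suggests the CSV was written with its row index included, and the float-typed 'day' and 'month' values (30.0, 9.0) come from missing entries forcing a float column. The sketch below shows one possible way to handle both. It assumes a hypothetical miniature CSV (`csv_text`, with shortened ids) rather than the real file, so the same options would need to be confirmed against `OccurrenceDataset_simplified.csv`.

```python
import io
import pandas as pd

# Hypothetical miniature of the CSV: the first, unnamed column is a saved index.
csv_text = (
    ",id,day,month,year\n"
    "0,ff3f89a4,,,2022\n"
    "1,716b10c8,30.0,9.0,2022\n"
)

# Default read: the saved index shows up as an 'Unnamed: 0' column.
default = pd.read_csv(io.StringIO(csv_text))

# Reading with index_col=0 uses that column as the index instead.
indexed = pd.read_csv(io.StringIO(csv_text), index_col=0)

# Nullable integers keep missing days/months as <NA> rather than forcing floats.
indexed = indexed.astype({"day": "Int64", "month": "Int64"})

print(default.columns.tolist())
print(indexed.dtypes)
```

Whether to drop the saved index or keep it as a column is a judgment call; keeping it is harmless, but it adds a column that carries no information beyond the row position.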
